BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 000548
         (1431 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Vitis vinifera]
          Length = 1442

 Score = 2441 bits (6327), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1166/1457 (80%), Positives = 1283/1457 (88%), Gaps = 41/1457 (2%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI  IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
            TAAN++E+Y+VRVQE+ S+ES+ S ETKR  +M GIS A+LELVC YRLHGNVE++ +L 
Sbjct: 61   TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121  SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
            PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+   SG   SAR+ESS+VI+LRD
Sbjct: 181  PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240

Query: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301  AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360

Query: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
            SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+  SVLTS I  IGN
Sbjct: 361  SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
            SLFFLGSRLGDSLLVQFT     S+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421  SLFFLGSRLGDSLLVQFT-----SILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475

Query: 481  GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
            GEELSLYGSA N+TE++QKTFSF+VRDS +N+GPLKDF+YGLRINAD  ATGI+KQSNYE
Sbjct: 476  GEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYE 535

Query: 541  LV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
            LV                          ELPGCKGIWTVYHK++RGHNADS++MA  DDE
Sbjct: 536  LVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDE 595

Query: 575  YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
            YHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+ RGARIL
Sbjct: 596  YHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARIL 655

Query: 635  DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 694
            DG++MTQDL            SE+STVLSVSIADPYVLL MSDG+I+LLVGDPSTCTVS+
Sbjct: 656  DGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSI 705

Query: 695  QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 754
              PA  ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG   DQGDIY 
Sbjct: 706  NIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYC 765

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            VV YESG LEIFDVPNFNCVF+VDKF+SG  H+VDT + E  +D++  ++ +SEE   QG
Sbjct: 766  VVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQG 825

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
            RKEN H++KVVELAMQRWS  HSRPFLF ILTDGTILCY AYL+EGPE+T K+++ VS  
Sbjct: 826  RKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQ 885

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
             SLS+SNVSASRLRNLRF R PLD YTREE   G    R+T+FKNI G QG FLSGSRP 
Sbjct: 886  NSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPL 945

Query: 935  WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
            W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+YDNYWP
Sbjct: 946  WFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWP 1005

Query: 995  VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1054
            VQKIPLK TPHQ+TYFAEKNLYPLIVSVPVLKPLN VLS L+DQE GHQ++N NLSS +L
Sbjct: 1006 VQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDEL 1065

Query: 1055 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIG 1114
            HR+Y+V+E+EVR+LEP+++G PWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIG
Sbjct: 1066 HRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIG 1125

Query: 1115 TAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
            TAYVQGEDVAARGRVLLFS G+N DN QNLV+E+YSKELKGAISA+ASLQGHLLIASGPK
Sbjct: 1126 TAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPK 1185

Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
            IILHKWTGTELNG+AF+DAPPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQGAQLNLLAK
Sbjct: 1186 IILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAK 1245

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
            DFGSLDCFATEFLIDGSTLSL+VSD+QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV
Sbjct: 1246 DFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1305

Query: 1295 TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
            TKFLRLQML  SSDRT A  GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL
Sbjct: 1306 TKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1365

Query: 1355 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
            VD+VPHVAGLNPRSFRQF SNGKAHRPGPD+IVDCELL HYEMLP EEQLEIA Q GTTR
Sbjct: 1366 VDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTR 1425

Query: 1415 SQILSNLNDLALGTSFL 1431
             QILSNLNDL+LGTSFL
Sbjct: 1426 MQILSNLNDLSLGTSFL 1442


>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
          Length = 1448

 Score = 2436 bits (6313), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1166/1463 (79%), Positives = 1283/1463 (87%), Gaps = 47/1463 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI  IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
            TAAN++E+Y+VRVQE+ S+ES+ S ETKR  +M GIS A+LELVC YRLHGNVE++ +L 
Sbjct: 61   TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121  SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
            PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+   SG   SAR+ESS+VI+LRD
Sbjct: 181  PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240

Query: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301  AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360

Query: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
            SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+  SVLTS I  IGN
Sbjct: 361  SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
            SLFFLGSRLGDSLLVQFT     S+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421  SLFFLGSRLGDSLLVQFT-----SILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475

Query: 481  GEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
            GEELSLYGSA N+TE++Q      KTFSF+VRDS +N+GPLKDF+YGLRINAD  ATGI+
Sbjct: 476  GEELSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIA 535

Query: 535  KQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
            KQSNYELV                          ELPGCKGIWTVYHK++RGHNADS++M
Sbjct: 536  KQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKM 595

Query: 569  AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
            A  DDEYHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+ 
Sbjct: 596  ATKDDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYA 655

Query: 629  RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
            RGARILDG++MTQDL            SE+STVLSVSIADPYVLL MSDG+I+LLVGDPS
Sbjct: 656  RGARILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPS 705

Query: 689  TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
            TCTVS+  PA  ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG   D
Sbjct: 706  TCTVSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQD 765

Query: 749  QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
            QGDIY VV YESG LEIFDVPNFNCVF+VDKF+SG  H+VDT + E  +D++  ++ +SE
Sbjct: 766  QGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSE 825

Query: 809  EGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSD 868
            E   QGRKEN H++KVVELAMQRWS  HSRPFLF ILTDGTILCY AYL+EGPE+T K++
Sbjct: 826  EEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTE 885

Query: 869  DPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFL 928
            + VS   SLS+SNVSASRLRNLRF R PLD YTREE   G    R+T+FKNI G QG FL
Sbjct: 886  EAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFL 945

Query: 929  SGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
            SGSRP W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+
Sbjct: 946  SGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSS 1005

Query: 989  YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
            YDNYWPVQKIPLK TPHQ+TYFAEKNLYPLIVSVPVLKPLN VLS L+DQE GHQ++N N
Sbjct: 1006 YDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDN 1065

Query: 1049 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1108
            LSS +LHR+Y+V+E+EVR+LEP+++G PWQTRATIPMQSSENALTVRVVTLFNTTTKENE
Sbjct: 1066 LSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1125

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
            TLLAIGTAYVQGEDVAARGRVLLFS G+N DN QNLV+E+YSKELKGAISA+ASLQGHLL
Sbjct: 1126 TLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLL 1185

Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
            IASGPKIILHKWTGTELNG+AF+DAPPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQGAQ
Sbjct: 1186 IASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQ 1245

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            LNLLAKDFGSLDCFATEFLIDGSTLSL+VSD+QKNIQIFYYAPKMSESWKGQKLLSRAEF
Sbjct: 1246 LNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEF 1305

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1348
            HVGAHVTKFLRLQML  SSDRT A  GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ
Sbjct: 1306 HVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1365

Query: 1349 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1408
            SLQKKLVD+VPHVAGLNPRSFRQF SNGKAHRPGPD+IVDCELL HYEMLP EEQLEIA 
Sbjct: 1366 SLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQ 1425

Query: 1409 QTGTTRSQILSNLNDLALGTSFL 1431
            Q GTTR QILSNLNDL+LGTSFL
Sbjct: 1426 QIGTTRMQILSNLNDLSLGTSFL 1448


>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
 gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
          Length = 1461

 Score = 2425 bits (6285), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1175/1461 (80%), Positives = 1296/1461 (88%), Gaps = 30/1461 (2%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELP-SKRGIGPVPNLV 59
            MS+AAYKM+HWPTGI +C SG+ITHSRAD+VPQIP IQT+ LDSE P SKRGIGP+PNL+
Sbjct: 1    MSYAAYKMLHWPTGIESCASGYITHSRADFVPQIPPIQTDNLDSEWPPSKRGIGPMPNLI 60

Query: 60   VTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119
            VTA +V+E+YVVRVQE+GS+ES++S ETKR  LMDG+S ASLELVCHYRLHGNVES+ +L
Sbjct: 61   VTAGSVLEVYVVRVQEDGSRESRSSRETKRGGLMDGVSGASLELVCHYRLHGNVESMVVL 120

Query: 120  SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179
               G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEWLHLKRGRESFAR
Sbjct: 121  PTEGGDSSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFAR 180

Query: 180  GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
            GPL+KVDPQGRCGG+LVY +QMIIL+A+Q  SGLVGD+D   SGG  SAR++SS+VINLR
Sbjct: 181  GPLLKVDPQGRCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINLR 240

Query: 240  DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
            D+DMKHVKDFIF+H YIEPV+VILHERELTWAGRVSWKHHTCMISALSISTTLKQ  LIW
Sbjct: 241  DMDMKHVKDFIFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLIW 300

Query: 300  SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
            S +NLPHDAYKLLAVP PIGGVLV+ ANTIHYHS+SA+ ALALNNYAVS+DSSQELPR+S
Sbjct: 301  SVVNLPHDAYKLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRAS 360

Query: 360  FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419
            FSVELDA  A WL NDVALLS K G+L+LL++VYDGRVVQRLDLSK+  SVLTSDITTIG
Sbjct: 361  FSVELDAVKAAWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTIG 420

Query: 420  NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
            NSLFFLGSRLGDSLLVQFT G G S++SSGLKEE G+IE D PS KRL+RS+SD LQDMV
Sbjct: 421  NSLFFLGSRLGDSLLVQFTNGLGPSVVSSGLKEEVGEIEGDVPSAKRLKRSASDGLQDMV 480

Query: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
            +GEELSLYGS +NNTESAQK+FSFAVRDSL+N+GPLKDFSYGLR N DASATGI+KQSNY
Sbjct: 481  SGEELSLYGSTANNTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSNY 540

Query: 540  ELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 573
            +LV                          +LPGC+GIWTVYHK++RGHN D S+MAA  D
Sbjct: 541  DLVCCSGHGKNGTLCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAAD 600

Query: 574  EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
            EYHAYLIIS+EARTMVLETADLL+EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI
Sbjct: 601  EYHAYLIISMEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 660

Query: 634  LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
            LDGS+MTQDLS G SNSES  GSE++TV SVSIADPYVL+ M+DGSIRLL+GD STC VS
Sbjct: 661  LDGSFMTQDLSIGSSNSESSPGSESATVSSVSIADPYVLIKMTDGSIRLLIGDSSTCMVS 720

Query: 694  VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGA---DGGPLDQG 750
            + TP+A E+S++ VS+CTLYHDKGPEPWLRK STDAWLSTGV EAIDGA   DGGP DQG
Sbjct: 721  INTPSAFENSERSVSACTLYHDKGPEPWLRKASTDAWLSTGVSEAIDGAESADGGPHDQG 780

Query: 751  DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
            DIY +VCYESGALEIFDVPNFN VF+VDKFVSG+TH+ D Y+RE  KDS+ + N  SEE 
Sbjct: 781  DIYCIVCYESGALEIFDVPNFNRVFSVDKFVSGKTHLADAYVREPPKDSQEKTNRISEEV 840

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
             G GRKEN H+MK VELAMQRWS HHSRPFLF +LTDGTILCY AYLFE P+ TSK++D 
Sbjct: 841  AGLGRKENAHNMKAVELAMQRWSGHHSRPFLFGVLTDGTILCYHAYLFEAPDATSKTEDS 900

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
            VS    + + ++SASRLRNLRF R PLD+Y +EET     CQRITIF NISGHQGFFL G
Sbjct: 901  VSAQNPVGLGSISASRLRNLRFVRVPLDSYIKEETSTENSCQRITIFNNISGHQGFFLLG 960

Query: 931  SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
            SRP W MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG LKICQLPS S YD
Sbjct: 961  SRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSFSNYD 1020

Query: 991  NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
            NYWPVQKIPLK TPHQ+TYF EKNLYPLIVSVPV KP+NQVLS L+DQEVGHQI+NHNLS
Sbjct: 1021 NYWPVQKIPLKGTPHQVTYFPEKNLYPLIVSVPVHKPVNQVLSSLVDQEVGHQIENHNLS 1080

Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1110
            S +L +TY+VEE+EVRILE +  GGPWQT+ATIPMQSSENALTVRVVTLFN TTKENETL
Sbjct: 1081 SDELLQTYSVEEFEVRILESENGGGPWQTKATIPMQSSENALTVRVVTLFNATTKENETL 1140

Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
            LAIGTAYVQGEDVAARGRVLLFS  ++ +N Q LV+EVYSKELKGAISALASLQGHLLIA
Sbjct: 1141 LAIGTAYVQGEDVAARGRVLLFSVVKSTENSQVLVSEVYSKELKGAISALASLQGHLLIA 1200

Query: 1171 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1230
            SGPKIILHKWTGTELNG+AFYDAPPLYV S+NIVKNFILLGDIHKSIYFLSWKEQGAQL+
Sbjct: 1201 SGPKIILHKWTGTELNGVAFYDAPPLYVASMNIVKNFILLGDIHKSIYFLSWKEQGAQLS 1260

Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1290
            LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM ESWKGQKLLSRAEFHV
Sbjct: 1261 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMLESWKGQKLLSRAEFHV 1320

Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1350
            GAH+TKF+RL ML+TSSDR+GAAPG DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL
Sbjct: 1321 GAHITKFIRLSMLSTSSDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1380

Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
            Q+KLVD+VPHVAGLNPRSFRQF S+GK HRPGP+SIVDCELLSH+EMLPLEEQLEIA Q 
Sbjct: 1381 QRKLVDAVPHVAGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEEQLEIAQQV 1440

Query: 1411 GTTRSQILSNLNDLALGTSFL 1431
            GTTR+QILSNLNDL+LGTSFL
Sbjct: 1441 GTTRAQILSNLNDLSLGTSFL 1461


>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1447

 Score = 2391 bits (6197), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1148/1459 (78%), Positives = 1290/1459 (88%), Gaps = 40/1459 (2%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
            MSFAAYKMM  PTGI NC +GF+THSR+D+VP    +Q ++LD+E PS+    +G +PNL
Sbjct: 1    MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAEWPSRPRHHVGSLPNL 56

Query: 59   VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAI 118
            VVTAANV+E+Y VR+QE+  +  K + +++R  L+DGI+ ASLELVCHYRLHGNVE++A+
Sbjct: 57   VVTAANVLEVYAVRLQED--QPPKAAADSRRGALLDGIAGASLELVCHYRLHGNVETMAV 114

Query: 119  LSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA 178
            LS GG D SRRRDSI+L F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE FA
Sbjct: 115  LSIGGGDVSRRRDSIMLTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFA 174

Query: 179  RGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
            RGP+VKVDPQGRCGGVL+Y LQMIILKA+Q GSGLVG++D  GS G  +ARIESS++INL
Sbjct: 175  RGPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMINL 234

Query: 239  RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
            RDLDM+HVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI
Sbjct: 235  RDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 294

Query: 299  WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
            WSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALN+YAV+LDSSQE+PRS
Sbjct: 295  WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPRS 354

Query: 359  SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            SF+VELDAA+ATWL +DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+  SVL+S ITTI
Sbjct: 355  SFNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTI 414

Query: 419  GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
            GNSLFFL SRLGDS+LVQF+CGSG SMLSS LKEE GDIEADAPS KRLRRS SDALQDM
Sbjct: 415  GNSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPS-KRLRRSPSDALQDM 473

Query: 479  VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN 538
            V+GEELSLYGSA N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQSN
Sbjct: 474  VSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 533

Query: 539  YEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
            YEL                          VELPGCKGIWTVYHKS+R HNADSS+MA  D
Sbjct: 534  YELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDD 593

Query: 573  DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            DEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGR RVIQV+ERGAR
Sbjct: 594  DEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGAR 653

Query: 633  ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
            ILDGS+MTQD+SFG SN ESGS S+++  LSVSIADP+VLL MSDGSIRLL+GDPSTCT+
Sbjct: 654  ILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCTI 713

Query: 693  SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
            SV +PA+ ESSK  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE IDG DG   D GDI
Sbjct: 714  SVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGETIDGTDGAAQDHGDI 773

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            Y VVC+++G LEIFDVPNFNCVF+V+ F+SG++H+VD  M+E LKDS+       +    
Sbjct: 774  YCVVCFDNGNLEIFDVPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVIN 830

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QGRKENI  MKVVELAMQRWS  HSRPFLF IL+DGTILCY AYL+E P++TSK +D  S
Sbjct: 831  QGRKENIPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDSTSKVEDSAS 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
               S+ +S+ + SRLRNLRF R PLDAY RE+T +G PCQ+ITIFKNI  ++GFFLSGSR
Sbjct: 891  AGGSIGLSSTNVSRLRNLRFVRVPLDAYAREDTSNGPPCQQITIFKNIGSYEGFFLSGSR 950

Query: 933  PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
            P W MV RERLRVHPQLCDGSIVAFTVLHNVNCN G IYVTSQG+LKICQLPSGS YD+Y
Sbjct: 951  PAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSY 1010

Query: 993  WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1052
            WPVQKIPLKATPHQ+TYFAEKNLYPLIVS PVLKPLNQV+S L+DQ++ HQ ++ N++  
Sbjct: 1011 WPVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVIS-LVDQDINHQNESQNMNPD 1069

Query: 1053 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLA 1112
            + +R Y ++E+EVRI+EP+++GGPWQT+ATIPMQSSENALTVR+VTL NTT+KENETLLA
Sbjct: 1070 EQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLVNTTSKENETLLA 1129

Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
            IGTAYVQGEDVAARGR+LLFS G+N DNPQ LV+EVYSKELKGAISALASLQGHLLIASG
Sbjct: 1130 IGTAYVQGEDVAARGRILLFSLGKNTDNPQTLVSEVYSKELKGAISALASLQGHLLIASG 1189

Query: 1173 PKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1232
            PKIILHKW GTELNGIAF+DAPPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LL
Sbjct: 1190 PKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLL 1249

Query: 1233 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1292
            AKDFGSLDCFATEFLIDGSTLSL+VSD+ +NIQIFYYAPKMSESWKGQKLLSRAEFHVGA
Sbjct: 1250 AKDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1309

Query: 1293 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
            HVTKFLRLQML+T SDR GA PGSDKTNRFALLFGTLDGSIGCIAPLDE+TFRRLQSLQ+
Sbjct: 1310 HVTKFLRLQMLST-SDRAGAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQR 1368

Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
            KLVD+VPHVAGLNPR+FR F SNGKAHRPGPDSIVDCELL HYEMLPLEEQLEIAHQ GT
Sbjct: 1369 KLVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVGT 1428

Query: 1413 TRSQILSNLNDLALGTSFL 1431
            TRSQILSNL+DL+LGTSFL
Sbjct: 1429 TRSQILSNLSDLSLGTSFL 1447


>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1449

 Score = 2367 bits (6134), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1147/1460 (78%), Positives = 1292/1460 (88%), Gaps = 40/1460 (2%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDS-ELPSK--RGIGPVPN 57
            MSFAAYKMM  PTGI NC +GF+THSR+D+VP    +Q ++LD+ E PS+    +GP+PN
Sbjct: 1    MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAAEWPSRPRHHVGPLPN 56

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            LVVTAANV+E+Y VR+QE+  +    S +++R  L+DGI+ ASLEL CHYRLHGNVE++A
Sbjct: 57   LVVTAANVLEVYAVRLQED-QQPKDASDDSRRGTLLDGIAGASLELECHYRLHGNVETMA 115

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +LS GG D SR+RDSIIL F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE F
Sbjct: 116  VLSIGGGDVSRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQF 175

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
            ARGP+VK+DPQGRCGGVL+Y LQMIILKA+Q GSGLVGD+D FGS G  +ARIESS++IN
Sbjct: 176  ARGPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMIN 235

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDLDM+HVKDF FV+GYIEPVMVILHERELTWAGRVSW HHTCMISALSISTTLKQHPL
Sbjct: 236  LRDLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHPL 295

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+GANTIHYHSQSASCALALNNYAV+LDSSQE+PR
Sbjct: 296  IWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIPR 355

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            SSF+VELDAA+ATWL +DVALLSTKTG+L+LL +VYDGRVVQRLDLSK+  SVL+S ITT
Sbjct: 356  SSFNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGITT 415

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            IGNSLFFL SRLGDS+LVQF+CGSG SM+SS LKEE GDIE DAPS KRLRRS SDALQD
Sbjct: 416  IGNSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPS-KRLRRSPSDALQD 474

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
            MV+GEELSLYGSA+N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQS
Sbjct: 475  MVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 534

Query: 538  NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYEL                          VELPGCKGIWTVYHKS+R HNADSS+MA  
Sbjct: 535  NYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADD 594

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            DDEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGRRRVIQV+ERGA
Sbjct: 595  DDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGA 654

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+MTQD+SFG SNSESGS SE++  LSVSIADP+VLL MSDGSIRLL+GDPSTCT
Sbjct: 655  RILDGSFMTQDVSFGASNSESGSASESAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCT 714

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            +SV +PA+ ESSK  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG DG   D GD
Sbjct: 715  ISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGD 774

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY VVC+++G LEIFD+PNFNCVF+V+ F+SG++H+VD  M+E LKDS+       +   
Sbjct: 775  IYCVVCFDNGNLEIFDIPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVV 831

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
             QGRK+NI +MKVVELAMQRWS  HSRPFLF IL+DGTILCY AYL+E P+ TSK +D  
Sbjct: 832  NQGRKDNIPNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSA 891

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S   S+ +S+ + SRLRNLRF R PLDAY RE+T +G+PCQ+ITIFKNI  +QGFFLSGS
Sbjct: 892  SAGGSIGLSSTNVSRLRNLRFVRVPLDAYPREDTSNGSPCQQITIFKNIGSYQGFFLSGS 951

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP W MV RERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG+LKICQLPSGS YD+
Sbjct: 952  RPAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDS 1011

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
            YWPVQKIPLKATPHQ+TYFAEKNLYPLIVS PVLKPLNQV+S L+DQ+  HQ ++ N++ 
Sbjct: 1012 YWPVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVIS-LVDQDFNHQNESQNMNP 1070

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
             + +R Y ++E+EVRI+EP+++GGPWQT+ATIPMQSSENALTVR+VTL NTT+KENETLL
Sbjct: 1071 DEQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETLL 1130

Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            AIGTAYVQGEDVAARGR+LLFS G+  DNPQ LV+EVYSKELKGAISALASLQGHLLIAS
Sbjct: 1131 AIGTAYVQGEDVAARGRILLFSLGKITDNPQTLVSEVYSKELKGAISALASLQGHLLIAS 1190

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
            GPKIILHKW GTELNGIAF+DAPPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+L
Sbjct: 1191 GPKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSL 1250

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
            LAKDFGSLDCFATEFLIDGSTLSL+VSD+ +NIQIFYYAPKMSESWKGQKLLSRAEFHVG
Sbjct: 1251 LAKDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1310

Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
            AHVTKFLRLQML+T SDR G+ PGSDKTNRFALLFGTLDGSIGCIAPLDE+TFRRLQSLQ
Sbjct: 1311 AHVTKFLRLQMLST-SDRAGSVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQ 1369

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
            +KLVD+VPHVAGLNPR+FR F SNGKAHRPGPDSIVDCELL HYEMLPLEEQLEIA+Q G
Sbjct: 1370 RKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQIG 1429

Query: 1412 TTRSQILSNLNDLALGTSFL 1431
            TTRSQILSNL+DL+LGTSFL
Sbjct: 1430 TTRSQILSNLSDLSLGTSFL 1449


>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
 gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
          Length = 1455

 Score = 2360 bits (6117), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1150/1466 (78%), Positives = 1271/1466 (86%), Gaps = 46/1466 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKR----GIGPVP 56
            MS+AAYKMMHWPT I  C SGF+THSR++    +P + T++LDS+ PS+R    GIGP P
Sbjct: 1    MSYAAYKMMHWPTTIDTCVSGFVTHSRSESA-HLPQLHTDDLDSDWPSRRRHGGGIGPTP 59

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V + NV+E+YVVRVQEEG++   +SGE KR  +MDG++ ASLELVCHYRLHGNVES+
Sbjct: 60   NLIVASGNVLELYVVRVQEEGAR---SSGELKRGGVMDGVAGASLELVCHYRLHGNVESM 116

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +LS  G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE P+W HLKRGRES
Sbjct: 117  GVLSVEGGDDSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRES 176

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
            FARGPLVKVDPQGRCGGVLVY LQMIILKA+Q GS LV DED FGSG   SA I SS++I
Sbjct: 177  FARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYII 236

Query: 237  NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
            NLRDLDMKHVKDFIFVH YIEPV+V+LHERELTWAGRV WKHHTCMISALSISTTLKQ  
Sbjct: 237  NLRDLDMKHVKDFIFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQPT 296

Query: 297  LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
            LIWS  NLPHDAYKLLAVPSPIGGVLV+G NTIHYHS+SASCALALN+YA S+DSSQELP
Sbjct: 297  LIWSIGNLPHDAYKLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQELP 356

Query: 357  RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
            R++FSVELDAA+ATWL  DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+  SVLTSDIT
Sbjct: 357  RATFSVELDAANATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDIT 416

Query: 417  TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ 476
            T+GNS FFLGSRLGDSLLVQFT G G+SMLS GLKEE GDIE D PS KRL+ SSSDALQ
Sbjct: 417  TLGNSFFFLGSRLGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDLPSAKRLKVSSSDALQ 476

Query: 477  DMVNGEELSLYGSASNNTESAQ-----KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
            DMV+GEELSLY SA NN ES+Q     KTFSF VRDSL+N+GPLKDF+YGLRINADA+AT
Sbjct: 477  DMVSGEELSLYSSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANAT 536

Query: 532  GISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADS 565
            GISKQSNYEL                          VELPGCKGIWTVYHK++R H+ DS
Sbjct: 537  GISKQSNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARIHSVDS 596

Query: 566  SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
             +MA+ DDEYHAYLIIS+EARTMVLETAD LTEVTESVDYFVQGRTIAAGNLFGRRRV+Q
Sbjct: 597  LKMAS-DDEYHAYLIISMEARTMVLETADHLTEVTESVDYFVQGRTIAAGNLFGRRRVVQ 655

Query: 626  VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
            VFERGARILDGS+MTQDLSFG SNSE+G  SE+STV+ VSI DPYVL+ M+DGSI++LVG
Sbjct: 656  VFERGARILDGSFMTQDLSFGGSNSETGR-SESSTVMHVSIVDPYVLVRMADGSIQILVG 714

Query: 686  DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
            DPS CTVSV TP+A +SS K VS+CTLYHDKGPEPWLRKTSTDAWLSTG+ EAIDGAD G
Sbjct: 715  DPSACTVSVNTPSAFQSSTKSVSACTLYHDKGPEPWLRKTSTDAWLSTGISEAIDGADSG 774

Query: 746  PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
              +QGDIY VVCYE+GALEIFDVPNFN VF VDKFVSG+TH++DT   E  KD    +  
Sbjct: 775  AHEQGDIYCVVCYETGALEIFDVPNFNSVFFVDKFVSGKTHLLDTCTGEPAKDM---MKG 831

Query: 806  SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
              EE  G GRKE+  +MKVVEL M RWS  HSRPFLF ILTDGTILCY AYLFEGP+ TS
Sbjct: 832  VKEEVAGAGRKESTQNMKVVELTMLRWSGRHSRPFLFGILTDGTILCYHAYLFEGPDGTS 891

Query: 866  KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
            K +D VS   S+  S +SASRLRNLRF R PLD YTREET     CQRIT FKNISG+QG
Sbjct: 892  KLEDSVSAQNSVGASTISASRLRNLRFVRVPLDTYTREETSSETSCQRITTFKNISGYQG 951

Query: 926  FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
            FFLSGSRP W MVFRERLRVHPQLCDGSIVAFTVLH VNCNHG IYVTSQG LKIC L S
Sbjct: 952  FFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSS 1011

Query: 986  GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1045
             S+YDNYWPVQKIPLK TPHQ+TYFAE+NLYPLIVSVPV KP+NQVLS L+DQEVGHQI+
Sbjct: 1012 VSSYDNYWPVQKIPLKGTPHQVTYFAERNLYPLIVSVPVQKPVNQVLSSLVDQEVGHQIE 1071

Query: 1046 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1105
            NHNLSS ++HRTY+V+E+EVRILEP  + GPWQ +ATIPMQ+SENALTVR+V+LFNT+TK
Sbjct: 1072 NHNLSSEEIHRTYSVDEFEVRILEP--SNGPWQVKATIPMQTSENALTVRMVSLFNTSTK 1129

Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQG 1165
            ENETLLA+GTAYVQGEDVAARGR+LLFS  +N +N Q LV+EVYSKELKGAISALASLQG
Sbjct: 1130 ENETLLAVGTAYVQGEDVAARGRILLFSVVKNPENSQILVSEVYSKELKGAISALASLQG 1189

Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
            HLLIASGPKIILHKWTGTEL G+AF DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ
Sbjct: 1190 HLLIASGPKIILHKWTGTELTGVAFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1249

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
            GAQL+LLAKDF SLDCF+TEFLIDGSTLSLVVSDEQKN+QIFYYAPKMSESWKGQKLLSR
Sbjct: 1250 GAQLSLLAKDFASLDCFSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQKLLSR 1309

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1345
            AEFHVGA VTKF+RLQML+ S DR+GAAP SDKTNRFALLFGTLDGSIGCIAPLDELTFR
Sbjct: 1310 AEFHVGALVTKFMRLQMLSPSLDRSGAAPVSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1369

Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
            RLQSLQKKLVD+VPHVAGLNP+SFRQF S+GKAHRPGP+SIVDCE+LS+YEM+PLEEQ+E
Sbjct: 1370 RLQSLQKKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQVE 1429

Query: 1406 IAHQTGTTRSQILSNLNDLALGTSFL 1431
            IA Q GTTR+QILSNLNDL LGTSFL
Sbjct: 1430 IAQQIGTTRAQILSNLNDLTLGTSFL 1455


>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1444

 Score = 2256 bits (5845), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1096/1460 (75%), Positives = 1254/1460 (85%), Gaps = 45/1460 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQ-TEELDSELPS-KRGIGPVPNL 58
            MSFAA+KMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAFKMMHWPTGVENCASGYITHSLSDSTLQIPIVSGDDDMEAEWPNHKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TA N++E+Y+VR QEEG ++E +     KR  +MDG+S  SLELVCHYRLHGNVES+A
Sbjct: 61   VITAGNILEVYIVRAQEEGNTQELRIPKLVKRGGVMDGVSGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILKASQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKASQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI+TTLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINTTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ +DVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISSDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR  SSD  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLR-ISSDTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
             +  EELSL+GS  NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 538  NYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYELV                          ELPGCKGIWTVYHKSSRGHNADSS+MAA 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSVRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +DEYHAYLIIS+EARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600  EDEYHAYLIISVEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+M Q+LSFG  NSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGAPNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            VS+ +P+ +E SKK +S+CTL+HDKGPEPWLRK STDAWLS+GVGEA+D ADGGP DQGD
Sbjct: 720  VSISSPSVLEGSKKKISACTLFHDKGPEPWLRKASTDAWLSSGVGEAVDSADGGPQDQGD 779

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY V+CYESGALEIFDVP FNCVF+VDKF SGR H+ D  + E     E E+N +SE+  
Sbjct: 780  IYCVLCYESGALEIFDVPGFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSED-N 834

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
               R E I + KVVEL+MQRWS  H+RPFLFA+L DGTILCY AYLFEG ++T K+++ V
Sbjct: 835  ASSRNEEIKNTKVVELSMQRWSGPHTRPFLFAVLADGTILCYHAYLFEGVDST-KAENSV 893

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S+    ++++  +S+LRNL+F R P D  TRE T  G   QRIT+FKNISGHQGFFLSGS
Sbjct: 894  SSENPAALNSSGSSKLRNLKFLRIPFDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 953

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVTSQ +LKICQLPS S YDN
Sbjct: 954  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTSQVVLKICQLPSASIYDN 1013

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
            YWPVQKIPLKATPHQ+TY+AEKNLYPLIVS PV KP+NQVLS L+DQE G QIDNHNLSS
Sbjct: 1014 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPINQVLSSLVDQEAGQQIDNHNLSS 1073

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
             DL RTYTVEE+E++ILEP+R+GGPW+T+ATIPMQSSE+ALTVRVVTL N +T ENETLL
Sbjct: 1074 DDLQRTYTVEEFEIQILEPERSGGPWETKATIPMQSSEHALTVRVVTLLNASTGENETLL 1133

Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            A+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+S
Sbjct: 1134 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1193

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
            GPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK FILLGD+HKSIYFLSWKEQG+QL+L
Sbjct: 1194 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKTFILLGDVHKSIYFLSWKEQGSQLSL 1253

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
            LAKDFGSLDCFATEFLIDG+TLSL VSDEQKNIQ+FYYAPKM+ESWKGQKLLSRAEFHVG
Sbjct: 1254 LAKDFGSLDCFATEFLIDGNTLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHVG 1313

Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
            +HVTKFLRLQM+ +         G+DKTNRFALLFGTLDGS GCIAPLDE+TFRRLQSLQ
Sbjct: 1314 SHVTKFLRLQMVTS---------GADKTNRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQ 1364

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
            KKLVD+VPHVAGLNP SFRQF ++GKA R GPDSI+DCELL HYEMLPLEEQLE+AHQ G
Sbjct: 1365 KKLVDAVPHVAGLNPHSFRQFRTSGKARRSGPDSIIDCELLCHYEMLPLEEQLELAHQIG 1424

Query: 1412 TTRSQILSNLNDLALGTSFL 1431
            TTRS IL NL +L++GTSFL
Sbjct: 1425 TTRSVILLNLVELSVGTSFL 1444


>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Cucumis sativus]
          Length = 1504

 Score = 2252 bits (5835), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1118/1509 (74%), Positives = 1248/1509 (82%), Gaps = 83/1509 (5%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MSFAAY+MMHWPTGI NC S +ITHSRAD+VP +    +++LDS+   +R IGPVPNLVV
Sbjct: 1    MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVT-SHSDDLDSDWHPRRDIGPVPNLVV 59

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
            TA NV+E+YVVRV EEG +ESK+SGE +R  +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60   TAGNVLEVYVVRVLEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILS 119

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
              G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120  SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
            P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G  SAR+ESS++INLRD
Sbjct: 180  PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239

Query: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            LD+KHVKDF+FVHGYIEPVMVILHE+ELTWAGRVSWKHHTCM+SALSISTTLKQHPLIWS
Sbjct: 240  LDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIWS 299

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            A NLPHDAYKLLAVPSPIGGVLV+ AN+IHY+SQSASC LALNNYAVS DSSQ++PRS+F
Sbjct: 300  ASNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNF 359

Query: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
            +VELDAA+ATWL NDVALLSTKTG+L+LL +VYDGRVVQRLDLSK+  SVLTS I +IGN
Sbjct: 360  NVELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGN 419

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-------------------------- 454
            SLFFLGSRLGDSLLVQF+CG G+S L+S LK+E                           
Sbjct: 420  SLFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEITYYTQNLQKEMVPPTLPSALVHESKP 479

Query: 455  ----GDIEADAPS----------------------TKRLRRSSSDALQDMVNGEELSLYG 488
                G IE +  +                        R+ R         V G+ELSLYG
Sbjct: 480  TQAKGTIELNNNNLCVENDIVDVVEVDITNMTILGENRIARRDETLTDTQVGGDELSLYG 539

Query: 489  SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------ 542
            SA+NNTESAQK FSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNYELV      
Sbjct: 540  SAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHG 599

Query: 543  --------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
                                ELPGCKGIWTVYHK++RG  ADSSRM   DDEYHAYLIIS
Sbjct: 600  KNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIIS 659

Query: 583  LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
            LEARTMVL T +LLTEVTESVDYFV GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQD
Sbjct: 660  LEARTMVLVTGELLTEVTESVDYFVHGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQD 719

Query: 643  LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
            L+   + +ESG+ SE  TVLS SI+DPYVLL M+DGSIRLLVGD S+C+VSV  PAA  S
Sbjct: 720  LNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVSAPAAFGS 779

Query: 703  SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
            SKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG   DQGDIY V CY++G 
Sbjct: 780  SKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVACYDNGD 839

Query: 763  LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
            LEIFDVPNF  VF VDKFVSG++H+VD  + +  K SE + NS  +E    GR E+  +M
Sbjct: 840  LEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNS--QELISHGRNESSQNM 897

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
            KV+E+AMQRWS  HSRPFLF ILTDGTILCY AYLFE  ++ SK DD VS   S+S SN+
Sbjct: 898  KVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNSVSSSNM 957

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
            S+SRLRNLRF R PLD   RE+ P+G    R++IFKNISG+QG FL GSRP W MVFRER
Sbjct: 958  SSSRLRNLRFLRVPLDIQGREDMPNGTLSCRLSIFKNISGYQGLFLCGSRPAWFMVFRER 1017

Query: 943  LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKA 1002
            LRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQK+PLK 
Sbjct: 1018 LRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKVPLKG 1077

Query: 1003 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE 1062
            TPHQ+TYF EKNLYP+I+S PV KPLNQVLS ++DQ+VGH ++NHNLS+ +L +TY+VEE
Sbjct: 1078 TPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGH-VENHNLSADELQQTYSVEE 1136

Query: 1063 YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED 1122
            +E+RILEP+++GGPWQTRATI M SSENALT+RVVTL NTTTKENETLLA+GTAYVQGED
Sbjct: 1137 FEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAYVQGED 1196

Query: 1123 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
            VAARGRVLLFS G++ADN Q LV+EVYSKELKGAISALASLQGHLLIASGPKIILHKWTG
Sbjct: 1197 VAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1256

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
             ELNGIAFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL+LLAKDFGSLDC+
Sbjct: 1257 AELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCY 1316

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
            ATEFLIDGSTLSL VSD+QKNIQIFYYAPK +ESWKGQKLLSRAEFHVGAHVTKFLRLQM
Sbjct: 1317 ATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQM 1376

Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
            L+TSSD+   +  SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL D+VPHV 
Sbjct: 1377 LSTSSDK-ACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAVPHVG 1435

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
            GLNPRSFRQFHSNGK HR GPDSIVDCELL HYEMLPLEEQL+IAHQ GTTRSQILSNLN
Sbjct: 1436 GLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLN 1495

Query: 1423 DLALGTSFL 1431
            DL+LGTSFL
Sbjct: 1496 DLSLGTSFL 1504


>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
 gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
            kDa subunit
 gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
          Length = 1442

 Score = 2249 bits (5829), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1092/1460 (74%), Positives = 1254/1460 (85%), Gaps = 47/1460 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
            MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TAAN++E+Y+VR QEEG ++E +N    KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR +S D  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
             +  EELSL+GS  NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 538  NYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYELV                          ELPGCKGIWTVYHKSSRGHNADSS+MAA 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600  EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D  DGGP DQGD
Sbjct: 720  VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D  + E     E E+N +SE+ T
Sbjct: 780  IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
                 + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836  S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S+    ++++  +S+LRNL+F R PLD  TRE T  G   QRIT+FKNISGHQGFFLSGS
Sbjct: 892  SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
            YWPVQKIPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+DNHN+SS
Sbjct: 1012 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSS 1071

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
             DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T ENETLL
Sbjct: 1072 DDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLL 1131

Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            A+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+S
Sbjct: 1132 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1191

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
            GPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQG+QL+L
Sbjct: 1192 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSL 1251

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
            LAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLSRAEFHVG
Sbjct: 1252 LAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVG 1311

Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
            AHV+KFLRLQM+++         G+DK NRFALLFGTLDGS GCIAPLDE+TFRRLQSLQ
Sbjct: 1312 AHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQ 1362

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
            KKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQLE+AHQ G
Sbjct: 1363 KKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIG 1422

Query: 1412 TTRSQILSNLNDLALGTSFL 1431
            TTR  IL +L DL++GTSFL
Sbjct: 1423 TTRYSILKDLVDLSVGTSFL 1442


>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
            subunit [Arabidopsis thaliana]
          Length = 1442

 Score = 2248 bits (5825), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1091/1460 (74%), Positives = 1253/1460 (85%), Gaps = 47/1460 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
            MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TAAN++E+Y+VR QEEG ++E +N    KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR +S D  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
             +  EELSL+GS  +N++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480  TIGNEELSLFGSTPDNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 538  NYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYELV                          ELPGCKGIWTVYHKSSRGHNADSS+MAA 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600  EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D  DGGP DQGD
Sbjct: 720  VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D  + E     E E+N +SE+ T
Sbjct: 780  IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
                 + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836  S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S     ++++  +S+LRNL+F R PLD  TRE T  G   QRIT+FKNISGHQGFFLSGS
Sbjct: 892  SPENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
            YWPVQKIPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+DNHN+SS
Sbjct: 1012 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSS 1071

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
             DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T ENETLL
Sbjct: 1072 DDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLL 1131

Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            A+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+S
Sbjct: 1132 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1191

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
            GPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQG+QL+L
Sbjct: 1192 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSL 1251

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
            LAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLSRAEFHVG
Sbjct: 1252 LAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVG 1311

Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
            AHV+KFLRLQM+++         G+DK NRFALLFGTLDGS GCIAPLDE+TFRRLQSLQ
Sbjct: 1312 AHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQ 1362

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
            KKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQLE+AHQ G
Sbjct: 1363 KKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIG 1422

Query: 1412 TTRSQILSNLNDLALGTSFL 1431
            TTR  IL +L DL++GTSFL
Sbjct: 1423 TTRYSILKDLVDLSVGTSFL 1442


>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
            thaliana]
          Length = 1448

 Score = 2244 bits (5815), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1092/1466 (74%), Positives = 1254/1466 (85%), Gaps = 53/1466 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
            MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TAAN++E+Y+VR QEEG ++E +N    KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR +S D  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
             +  EELSL+GS  NN++SAQ      K+FSFAVRDSLVN+GP+KDF+YGLRINADA+AT
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQVTSSVLKSFSFAVRDSLVNVGPVKDFAYGLRINADANAT 539

Query: 532  GISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNADS 565
            G+SKQSNYELV                          ELPGCKGIWTVYHKSSRGHNADS
Sbjct: 540  GVSKQSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADS 599

Query: 566  SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
            S+MAA +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQ
Sbjct: 600  SKMAADEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQ 659

Query: 626  VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
            VFE GARILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVG
Sbjct: 660  VFEHGARILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVG 719

Query: 686  DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
            DPSTCTVS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D  DGG
Sbjct: 720  DPSTCTVSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGG 779

Query: 746  PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
            P DQGDIY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D  + E     E E+N 
Sbjct: 780  PQDQGDIYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNK 835

Query: 806  SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
            +SE+ T     + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T 
Sbjct: 836  NSEDNTS---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST- 891

Query: 866  KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
            K+++ +S+    ++++  +S+LRNL+F R PLD  TRE T  G   QRIT+FKNISGHQG
Sbjct: 892  KAENSLSSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQG 951

Query: 926  FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
            FFLSGSRP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS
Sbjct: 952  FFLSGSRPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPS 1011

Query: 986  GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1045
             S YDNYWPVQKIPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+D
Sbjct: 1012 ASIYDNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLD 1071

Query: 1046 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1105
            NHN+SS DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T 
Sbjct: 1072 NHNMSSDDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTG 1131

Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQG 1165
            ENETLLA+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QG
Sbjct: 1132 ENETLLAVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQG 1191

Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
            HLLI+SGPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQ
Sbjct: 1192 HLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQ 1251

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
            G+QL+LLAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLSR
Sbjct: 1252 GSQLSLLAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSR 1311

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1345
            AEFHVGAHV+KFLRLQM+++         G+DK NRFALLFGTLDGS GCIAPLDE+TFR
Sbjct: 1312 AEFHVGAHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTFR 1362

Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
            RLQSLQKKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQLE
Sbjct: 1363 RLQSLQKKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLE 1422

Query: 1406 IAHQTGTTRSQILSNLNDLALGTSFL 1431
            +AHQ GTTR  IL +L DL++GTSFL
Sbjct: 1423 LAHQIGTTRYSILKDLVDLSVGTSFL 1448


>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
          Length = 1441

 Score = 1877 bits (4863), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 944/1472 (64%), Positives = 1123/1472 (76%), Gaps = 72/1472 (4%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
            MS+AAYKMMHWPTG+ +C +GF+THS +D                ++DS   + R   +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 54   PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
            P PNLVV AANV+E+Y VR +        G++ S +SG      ++DGIS A LELVC+Y
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RLHGN+ES+ +LS G A+N  RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116  RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
             HLKRGRESFA GP++K DP GRCG  L YGLQMIILKA+Q G  LVG+++   +    +
Sbjct: 173  QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+  KHHTCMISA S
Sbjct: 233  VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++ 
Sbjct: 293  ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
              D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+ 
Sbjct: 353  HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S +T+IGNS FFLGSRLGDSLLVQF+  +  S+L     E   DIE D P +KRL
Sbjct: 413  ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472

Query: 468  RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
            +R  SD LQD+ + EELS     A N+ ESAQK  S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473  KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531

Query: 527  DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
            D +A G +KQSNYEL                          VELP C+GIWTVY+KS RG
Sbjct: 532  DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591

Query: 561  HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              A+       D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ  TIAAGNLFGR
Sbjct: 592  QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
            RRVIQV+ +GAR+LDGS+MTQ+L+F  +++   S SE   V   SIADPYVLL M DGS+
Sbjct: 645  RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LL+GD  TCT+SV  P+   SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704  QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
            G      DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG   +VD + +   +DS 
Sbjct: 764  GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             E    ++      +KE   S+++VELAM RWS   SRPFLF +L DGT+LCY A+ +E 
Sbjct: 824  KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
             E+  K   P+S   S    N S SRLRNLRF R  +D  +RE+ P  G P  RIT F N
Sbjct: 881  SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938  VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997

Query: 980  ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
            ICQLPS   YD+YWPVQK+PL  TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQE
Sbjct: 998  ICQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQE 1057

Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
              H +DN   S+  LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ  ENALTVR+VTL
Sbjct: 1058 SVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVTL 1117

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1159
             NTTTKENETLLAIGTAYV GEDVAARGRVLLFS  + ++N QNLVTEVYSKE KGA+SA
Sbjct: 1118 HNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVSA 1176

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            +ASLQGHLLIASGPKI L+KWTG EL  +AFYDA PL+VVSLNIVKNF+L GDIHKSIYF
Sbjct: 1177 VASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIYF 1235

Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
            LSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD  KN+QIFYYAPKM ESWKG
Sbjct: 1236 LSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKG 1295

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
            QKLLSRAEFHVGAH+TKFLRLQML T          S+KTNRFALLFG LDG IGCIAP+
Sbjct: 1296 QKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTNRFALLFGNLDGGIGCIAPI 1349

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
            DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL+HYEML 
Sbjct: 1350 DELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLAHYEMLS 1409

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1410 LDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 1441


>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
 gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
          Length = 1441

 Score = 1873 bits (4852), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 943/1472 (64%), Positives = 1121/1472 (76%), Gaps = 72/1472 (4%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
            MS+AAYKMMHWPTG+ +C +GF+THS +D                ++DS   + R   +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 54   PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
            P PNLVV AANV+E+Y VR +        G++ S +SG      ++DGIS A LELVC+Y
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RLHGN+ES+ +LS G A+N  RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116  RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
             HLKRGRESFA GP++K DP GRCG  L YGLQMIILKA+Q G  LVG+++   +    +
Sbjct: 173  QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+  KHHTCMISA S
Sbjct: 233  VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++ 
Sbjct: 293  ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
              D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+ 
Sbjct: 353  HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S +T+IGNS FFLGSRLGDSLLVQF+  +  S+L     E   DIE D P +KRL
Sbjct: 413  ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472

Query: 468  RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
            +R  SD LQD+ + EELS     A N+ ESAQK  S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473  KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531

Query: 527  DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
            D +A G +KQSNYEL                          VELP C+GIWTVY+KS RG
Sbjct: 532  DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591

Query: 561  HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              A+       D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ  TIAAGNLFGR
Sbjct: 592  QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
            RRVIQV+ +GAR+LDGS+MTQ+L+F  +++   S SE   V   SIADPYVLL M DGS+
Sbjct: 645  RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LL+GD  TCT+SV  P+   SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704  QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
            G      DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG   +VD + +   +DS 
Sbjct: 764  GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             E    ++      +KE   S+++VELAM RWS   SRPFLF +L DGT+LCY A+ +E 
Sbjct: 824  KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
             E+  K   P+S   S    N S SRLRNLRF R  +D  +RE+ P  G P  RIT F N
Sbjct: 881  SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938  VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997

Query: 980  ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
            ICQLPS   YD+YWPVQK+PL  TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQE
Sbjct: 998  ICQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQE 1057

Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
              H +DN   S+  LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ  ENALTVR+VTL
Sbjct: 1058 SVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVTL 1117

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1159
             NTTTKENETLLAIGTAYV GEDVAARGRVLLFS  + ++N QNLVTEVYSKE KGA+SA
Sbjct: 1118 HNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVSA 1176

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            +ASLQGHLLIASGPKI L+KWTG EL  +AFYDA PL+VVSLNIVKNF+L GDIHKSIYF
Sbjct: 1177 VASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIYF 1235

Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
            LSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD  KN+QIFYYAPKM ESWKG
Sbjct: 1236 LSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKG 1295

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
            QKLLSRAEFHVGAH+TKFLRLQML T          S+KTNRFALLFG LDG IGCIAP+
Sbjct: 1296 QKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTNRFALLFGNLDGGIGCIAPI 1349

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
            DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL  YEML 
Sbjct: 1350 DELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLCSYEMLS 1409

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1410 LDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 1441


>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
            subunit 1-like [Brachypodium distachyon]
          Length = 1442

 Score = 1865 bits (4830), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 946/1471 (64%), Positives = 1128/1471 (76%), Gaps = 69/1471 (4%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--ELDSELPSK----RGIGP 54
            MS+AAYKMMHWPTGI +C +GFITH  +D             E D  L +     + +GP
Sbjct: 1    MSYAAYKMMHWPTGIDHCAAGFITHCPSDAAAFCSAAAASGPEGDVGLVAAARHPKRLGP 60

Query: 55   VPNLVVTAANVIEIYVVRVQEEGS------KESKNSGETKRRVLMDGISAASLELVCHYR 108
             PNLVV AANV+E+Y VR     +      + S +SG      + DGIS A LELVCHYR
Sbjct: 61   TPNLVVAAANVLEVYAVRADAAAADGAGGAQPSSSSG-----AVFDGISGARLELVCHYR 115

Query: 109  LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
            LHGN+ES+AILS G A+N  RRDSI LAF DAKI+ LEFDD+IHGLR +SMHCFE PEW 
Sbjct: 116  LHGNIESMAILSDG-AEN--RRDSIALAFRDAKITCLEFDDAIHGLRTSSMHCFEGPEWQ 172

Query: 169  HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
            HLKRGRESFA GP++K DP GRCG  LVYGLQMIILK++Q G  LVG+++   +    + 
Sbjct: 173  HLKRGRESFAWGPVIKSDPLGRCGAALVYGLQMIILKSAQVGQSLVGEDEPTRALSSAAV 232

Query: 229  RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
            RIESS++I+LR LD  HVKDF FVHGYIEPV+VILHERE TWAGR+S KHHTCMISA SI
Sbjct: 233  RIESSYLIDLRALDTNHVKDFTFVHGYIEPVLVILHEREPTWAGRISSKHHTCMISAFSI 292

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
            S TLKQHP+IWSA N+PHDAY++L+VP PI GVLV+ AN+IHYHSQS SC+LALNN+A  
Sbjct: 293  SMTLKQHPMIWSAANIPHDAYQILSVPPPISGVLVICANSIHYHSQSTSCSLALNNFASQ 352

Query: 349  LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
             D S E+ + +F VELDAA ATWL ND+ + S KTG+++LLTVVYDGR VQ+LDL K+  
Sbjct: 353  PDGSPEIHKVNFHVELDAAKATWLSNDIVMFSAKTGEMLLLTVVYDGRTVQKLDLMKSKA 412

Query: 409  SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
            SV++S +TTIG+S FFLGSR+GDSLLVQF+CG  TS++     E   DIE D P +KRL+
Sbjct: 413  SVISSGVTTIGSSFFFLGSRVGDSLLVQFSCGVPTSVIPDIADERSADIEGDLPFSKRLK 472

Query: 469  RSSSDALQDMVNGEELSLYGSA-SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
            R  SD LQD+ + EELS   +   N+ ESAQK  S+ VRD+LVN+GPLKDFSYGLR+NAD
Sbjct: 473  RVPSDILQDVTSVEELSFQNNMLPNSLESAQK-ISYVVRDALVNVGPLKDFSYGLRVNAD 531

Query: 528  ASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGH 561
             +ATG +KQSNYEL                          VELP C+GIWTVY+KSSRGH
Sbjct: 532  PNATGNAKQSNYELVCCSGHGKNGALSVLQQSIRPDLITEVELPSCRGIWTVYYKSSRGH 591

Query: 562  NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
              +       D+EYHAYLIISLE+RTMVLET D L EVTE+VDY+VQG TI AGNLFGRR
Sbjct: 592  TTE-------DNEYHAYLIISLESRTMVLETGDDLGEVTETVDYYVQGATITAGNLFGRR 644

Query: 622  RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST-VLSVSIADPYVLLGMSDGSI 680
            RVIQV+  GAR+LDGS+MTQ+L+F   +SES S       V S SIADPYVLL M DG+I
Sbjct: 645  RVIQVYATGARVLDGSFMTQELNFTALSSESSSSGSEPLGVASASIADPYVLLKMVDGTI 704

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LLVGD STC +S+  P+ + S  + +S+CTLYHD+GPEPWLRKT  DAWLS+GV  A+D
Sbjct: 705  QLLVGDHSTCALSINAPSTLTSRGERISACTLYHDRGPEPWLRKTRGDAWLSSGVTVAVD 764

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
             +     DQ DIY ++CYESG LEIF+VP+F  VF+V  F SG + +VD + +   +DS 
Sbjct: 765  VSGSSSQDQSDIYCIICYESGKLEIFEVPSFRQVFSVGSFFSGESLLVDAFAQGFTEDSA 824

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
                   +E     +KE  +++++VELAM RWS   SRPFLF +L DGT+LCYQAY +EG
Sbjct: 825  ---EGRQDETKVSLKKEVANNIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYQAYCYEG 881

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
             E+  K    +S   S+ + N S SRL+NLRF R  +D  +RE+    A   RITIF N+
Sbjct: 882  LESNIKGTS-LSPDGSVDLGNASDSRLKNLRFHRVSVDITSREDISSLAR-PRITIFNNV 939

Query: 921  SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
             G++G FLSG+RP W MV R+R RVHPQLCDG I AFTVLHNVNC+HG IYVTSQG LKI
Sbjct: 940  GGYEGLFLSGTRPVWVMVCRQRFRVHPQLCDGPIEAFTVLHNVNCSHGLIYVTSQGFLKI 999

Query: 981  CQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
            CQLPS   YDNYWPVQKIPL  TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS++ DQE+
Sbjct: 1000 CQLPSAYNYDNYWPVQKIPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSIMADQEM 1059

Query: 1041 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1100
             H +DN   S+ DL +TYTVEE+EVR+LE ++ GG W+TR+TIPMQS ENALTVR+VTL 
Sbjct: 1060 IHHMDNDASSADDLQKTYTVEEFEVRVLELEKPGGRWETRSTIPMQSFENALTVRIVTLH 1119

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1160
            NTTTKENETL+AIGTAYVQGEDVAARGRVLLFS  + ++N QNLVTEVYSKE KGA+SA+
Sbjct: 1120 NTTTKENETLMAIGTAYVQGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVSAV 1178

Query: 1161 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
            ASLQGHL+IASGPKI L+KW G+EL  +AFYDA PL+VVSLNIVKNF+L GDIHKS+YFL
Sbjct: 1179 ASLQGHLVIASGPKITLNKWNGSELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSVYFL 1237

Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
            SWKEQG+QL LLAKDFGSLDCFATEFLIDGSTLSLVVSD  KN+QIFYYAPKM ESWKGQ
Sbjct: 1238 SWKEQGSQLTLLAKDFGSLDCFATEFLIDGSTLSLVVSDSDKNLQIFYYAPKMVESWKGQ 1297

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
            KLLSRAE HVGAH+TKFLRLQML       G A  S+KTNRFALLFGTLDGSIGCIAP+D
Sbjct: 1298 KLLSRAELHVGAHMTKFLRLQMLPAQ----GLA--SEKTNRFALLFGTLDGSIGCIAPVD 1351

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            ELTFRRLQSLQ+KLVD+V HV GLNPRSFRQF SNGKAHRPGPD+I+D ELL++YE+L L
Sbjct: 1352 ELTFRRLQSLQRKLVDAVSHVCGLNPRSFRQFKSNGKAHRPGPDNIIDFELLTYYEILSL 1411

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            EEQL++A Q GTTR+QILSN +D++LGTSFL
Sbjct: 1412 EEQLDMAQQIGTTRAQILSNFSDISLGTSFL 1442


>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
          Length = 1503

 Score = 1796 bits (4651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 925/1528 (60%), Positives = 1114/1528 (72%), Gaps = 122/1528 (7%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
            MS+AAYKMMHWPTG+ +C +GF+THS +D                ++DS   + R   +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 54   PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
            P PNLVV AANV+E+Y VR +        G++ S +SG      ++DGIS A LELVC+Y
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RLHGN+ES+ +LS G A+N  RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116  RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
             HLKRGRESFA GP++K DP GRCG  L YGLQMIILKA+Q G  LVG+++   +    +
Sbjct: 173  QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
             RIESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+  KHHTCMISA S
Sbjct: 233  VRIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++ 
Sbjct: 293  ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
              D S E+ +S+F VELDAA ATW  ND+ + S+K G+++LLTVVYDGRVVQRLDL K+ 
Sbjct: 353  HPDGSPEISKSNFQVELDAAKATWFSNDIVMFSSKAGEMLLLTVVYDGRVVQRLDLMKSK 412

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S +T+IGNS FFLGSRLGDSLLVQF+ G+  S+L     E   DIE D P +KRL
Sbjct: 413  ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYGASKSVLQDLTNERSADIEGDLPFSKRL 472

Query: 468  RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
            +R  SD LQD+ + EELS     A N+ ESAQK  S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473  KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531

Query: 527  DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
            D +A G +KQSNYEL                          VELP C+GIWTVY+KS RG
Sbjct: 532  DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591

Query: 561  HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              A+       D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ  TIAAGNLFGR
Sbjct: 592  QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
            RRVIQV+ +GAR+LDGS+MTQ+L+F  +++   S SE   V   SIADPYVLL M DGS+
Sbjct: 645  RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LL+GD  TCT+SV  P+   SS + +++CTLY D+GPEPWLRKT +DAWLSTG+ EAID
Sbjct: 704  QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLRKTRSDAWLSTGIAEAID 763

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
            G      DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG   +VD + +   +DS 
Sbjct: 764  GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             E    ++      +KE   S+++VELAM RWS   SRPFLF +L DGT+LCY A+ +E 
Sbjct: 824  KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
             E+  K   P+S   S    N S SRLRNLRF R  +D  +RE+ P  G P  RIT F N
Sbjct: 881  SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938  VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997

Query: 980  ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
            ICQLPS   YDNYWPVQK+PL  TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQE
Sbjct: 998  ICQLPSAYNYDNYWPVQKVPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQE 1057

Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
              H +DN   S+  LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ  ENALTVR+VTL
Sbjct: 1058 SVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVTL 1117

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1159
             NTTTKENETLLAIGTAYV GEDVAARGRVLLFS  + ++N QNLVTEVYSKE KGA+SA
Sbjct: 1118 HNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFMK-SENSQNLVTEVYSKESKGAVSA 1176

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            +ASLQGHLLIASGPKI L+KWTG EL  +AFYDA PL+VVSLNIVKNF+L GDIHKSIYF
Sbjct: 1177 VASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIYF 1235

Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI--FYYAPKMSE-- 1275
            LSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD  KN+Q+  F     + +  
Sbjct: 1236 LSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQVKNFVLFGDIHKSI 1295

Query: 1276 ---SWKGQ----KLLSRAEFHVGAHVTKFL----RLQMLATSSDRTGA----AP------ 1314
               SWK Q     LL++    +    T+FL     L ++A+ SD+       AP      
Sbjct: 1296 YFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESW 1355

Query: 1315 -------------------------------GSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
                                            S+KTNRFALLFG LDG IGCIAP+DELT
Sbjct: 1356 KGQKLLSRAEFHVGAHITKFLRLQMLPTQGLSSEKTNRFALLFGNLDGGIGCIAPIDELT 1415

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            FRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL+HYEML L+EQ
Sbjct: 1416 FRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLAHYEMLSLDEQ 1475

Query: 1404 LEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            L++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1476 LDVAQQIGTTRSQILSNFSDISLGTSFL 1503


>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1452

 Score = 1585 bits (4103), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 827/1496 (55%), Positives = 1035/1496 (69%), Gaps = 109/1496 (7%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADY-VPQIPLIQTEELDSELPSKRGIGPVPNLV 59
            MS+AA+KM+H PTG+ NC + ++THS  +     IPL       ++L +  G G  PNLV
Sbjct: 1    MSYAAFKMVHCPTGVDNCVAAYVTHSAGETDSDSIPLP-----GADLIASGGSGFPPNLV 55

Query: 60   VTAANVIEIYVVRVQE------EGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
            +T ANV+E++ VR+ E       GS    N   T R  LM G+S   LEL CHYRLHGNV
Sbjct: 56   ITKANVLEVFHVRLLEGDDSAANGSNGVGNPETTPRGGLMAGLSYVKLELACHYRLHGNV 115

Query: 114  ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
            ESL +LS   A+  + RD+IIL F DAKISVLEFDDS HGLRI S+H FE PEW +LKRG
Sbjct: 116  ESLGVLSYRHAEGRKGRDAIILTFRDAKISVLEFDDSTHGLRIGSLHYFEGPEWQYLKRG 175

Query: 174  RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
            RE FA GP V+ DP GRC GVL+Y  Q+++LKA+Q G GL  ++++   GG   A + +S
Sbjct: 176  REQFASGPSVRADPVGRCAGVLIYNSQLVLLKAAQVGYGLGDEDESLIMGGKLCAHVATS 235

Query: 234  HVINLRDLDMKHVKDFIFVHG--------------YIEPVMVILHERELTWAGRVSWKHH 279
            ++++LRDLDMKH+KDF+F+HG              YIEPV+V+LHE++ TWAGRV+ + H
Sbjct: 236  YIVSLRDLDMKHIKDFVFLHGKLLFLIQYIFAFSSYIEPVLVVLHEKDPTWAGRVAVRRH 295

Query: 280  TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCA 339
            TC I+ALSI+TTLKQHP IWSA NLP+DAYKLLAVP+PIGGVLV  AN++HYHSQS SCA
Sbjct: 296  TCAITALSINTTLKQHPHIWSATNLPYDAYKLLAVPAPIGGVLVFCANSLHYHSQSGSCA 355

Query: 340  LALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
            L LN +AV+ + S E PRS  SVELD AHATW+ N+VAL+STK G L+ L +VY+GR VQ
Sbjct: 356  LGLNEFAVAPEGSAEYPRSKMSVELDCAHATWVANEVALISTKNGMLLFLNLVYEGRSVQ 415

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
            RL+L+K+  SVLTS + TIG + FFLGSRL DSLLVQ T GS +   SS +    GDIEA
Sbjct: 416  RLELTKSKASVLTSCMCTIGENFFFLGSRLADSLLVQHTLGSASGRTSSLM----GDIEA 471

Query: 460  D--APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVRDSLVNIGPLK 516
            D  AP+ KRL+R  S+  + +   EE+SLY S    ++ S +KTF+F VRDSLVNI PL+
Sbjct: 472  DLSAPAAKRLKREPSEEEEGVSA-EEMSLYYSTPTASDISQKKTFTFTVRDSLVNICPLR 530

Query: 517  DFSYGLRINADASATGISKQSNYEL--------------------------VELPGCKGI 550
            DF+YGLR NAD SATG+ KQSNYEL                          V LPGC GI
Sbjct: 531  DFAYGLRSNADQSATGLGKQSNYELVACSGHGKNGSLSVLHQSIRPDLINKVALPGCSGI 590

Query: 551  WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
            WTVYHK+ R  + +     + DDE+HAYLIISLE+RTMVLET D L EVTE+V+Y+ +G 
Sbjct: 591  WTVYHKTDRDDSNEFDFGTSEDDEFHAYLIISLESRTMVLETGDTLGEVTENVEYYTEGN 650

Query: 611  TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS-ENSTVLSVSIADP 669
            TIAAGNLFGRR V+QV++ G R+LDG+ M Q+L    S  E+ S    N+ V+   IADP
Sbjct: 651  TIAAGNLFGRRFVVQVYQNGLRLLDGAKMLQELLITNSELENNSSEVANNLVIEAVIADP 710

Query: 670  YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDA 729
            Y+LL M+DGS++L+VGD     +S+  P     +   +++ TLY DKGP  WLR+T ++ 
Sbjct: 711  YMLLKMTDGSLQLVVGDVENTKLSIPQPQGFGITTDAITAFTLYQDKGPHQWLRRTCSEM 770

Query: 730  ------WLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSG 783
                  W ST              DQG +Y +VC  SG  EI+++P   CV+ VD F  G
Sbjct: 771  NSDRSQWSSTS-------------DQGYVYCIVCRISGRFEIYELPRMVCVYAVDNFNHG 817

Query: 784  RTHIVDTYMREALKDSETEINSSSEEGTGQGR---KENIHSMKVVELAMQRWSAHHSRPF 840
             + + D  + E   +S   +   +EE    G    ++   S+ V ++  + W     RPF
Sbjct: 818  MSVLWDQKVLERRANSNAALKEGAEEDKAPGDALLRDAGLSLHVSQICFESWGEKFGRPF 877

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            L A L+DGT+LCY A+ ++  E++   +      R  + S    SRL +LRF+R P+D  
Sbjct: 878  LLATLSDGTMLCYHAFSYDANESSDALE-----FRETATSLKDLSRLTHLRFARIPIDWV 932

Query: 901  TREETPHGAPC---QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
            + +E   GA      +   FKN+    G F++G RP W MV R RLR HPQ CDG+I+ F
Sbjct: 933  SGQED--GAKVLYETKFCSFKNVGSFPGVFVTGLRPTWLMVCRGRLRPHPQFCDGAILGF 990

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYP 1017
            T LHNVNC HGFIY+T+QG LKICQLPS   YDN WPVQKIPL+ TPHQITY ++ NLY 
Sbjct: 991  TPLHNVNCAHGFIYITAQGQLKICQLPSLLFYDNDWPVQKIPLRGTPHQITYHSDVNLYA 1050

Query: 1018 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV--DLHRTYTVEEYEVRILEPDRAGG 1075
            LI+S PV +P +QVL        GH  D    +S+  D  R  T E+YEVRI+EP + GG
Sbjct: 1051 LIISTPVSRPTSQVL-----MGDGHPFDQQQENSIGEDGQRLVTSEDYEVRIIEPAQPGG 1105

Query: 1076 PWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1135
             W+ +A I M  +ENALTVR+V++ N TT + +TLLAIGT+YVQGEDVAA+GR++L S G
Sbjct: 1106 NWEAKAAIKMHLTENALTVRIVSIKNITTDQTQTLLAIGTSYVQGEDVAAKGRIILVSVG 1165

Query: 1136 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
            ++  +P +   EVYSKELKG+ISA+ASLQGHLLIA GPKIILH W G+ELNG AF+DA P
Sbjct: 1166 KDPQDPGSWAREVYSKELKGSISAIASLQGHLLIAIGPKIILHSWNGSELNGAAFFDA-P 1224

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            LYVVSLNIVKNFIL GDIHKSIYFL WKE GAQL LLAKDFGSLDC+ATEFLIDGSTLSL
Sbjct: 1225 LYVVSLNIVKNFILFGDIHKSIYFLCWKEDGAQLTLLAKDFGSLDCYATEFLIDGSTLSL 1284

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
            +VSD +KN+QIF YAPK  ESWKGQKLLSRAEFH+GAHV KF RLQML T        PG
Sbjct: 1285 LVSDSRKNLQIFSYAPKSMESWKGQKLLSRAEFHLGAHVNKFHRLQMLPT--------PG 1336

Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
            S ++NR+A+LFGTLDG+I  +APLDELTFRRL +LQ+KLVD V HVAG+NPR+FRQF  +
Sbjct: 1337 SARSNRYAVLFGTLDGAIDYLAPLDELTFRRLHTLQRKLVDCVSHVAGVNPRAFRQFRCD 1396

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            GKAHRPGPD+IVDCELLSHY+MLPL+EQLEIA Q GTTR+ +LSNL DLAL TSFL
Sbjct: 1397 GKAHRPGPDNIVDCELLSHYDMLPLDEQLEIARQIGTTRAHVLSNLRDLALSTSFL 1452


>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
 gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
          Length = 1413

 Score = 1455 bits (3766), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 773/1485 (52%), Positives = 989/1485 (66%), Gaps = 126/1485 (8%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS+AA K++H PTG++ C S FITHS     P  P        S   S      +PNLV+
Sbjct: 1    MSYAAIKLVHGPTGVSACASAFITHS-----PVNP-----ASSSGWKSGNAKDSLPNLVL 50

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
              ANV+EIY VR QE G ++S   GE              KR   M GI+AA LELVC Y
Sbjct: 51   VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RL G V+S+AIL +G  D  R RD+IILAF  AK SVL FDD+   L+ +SMH FE PEW
Sbjct: 110  RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
            +HLKRGRE F  GPLV+ D QGRC GVL+Y  Q++++KA+Q   GLV ++D   SG   S
Sbjct: 169  IHLKRGREKFPGGPLVRADSQGRCAGVLIYKSQLVMMKAAQEAYGLVEEDDP--SGNIVS 226

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++  TC ++ALS
Sbjct: 227  ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            I+T  K+HP +W    LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N  A 
Sbjct: 287  INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                S E+PRS FS+ELDAAHATWL  D ALLSTKTG LV L +++DGR VQRL+LSK+ 
Sbjct: 347  PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S + TIG+  FF+GSRLGDSLLVQF   S ++ LS       G+ +     +KR+
Sbjct: 407  GSVLSSSLCTIGDMFFFVGSRLGDSLLVQFGSASTSNSLSQSYD---GEDDIMVRPSKRM 463

Query: 468  RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFS-------- 519
            R      L D  N + L  Y SA ++++     F F+VRDSL NIGP++D +        
Sbjct: 464  R------LDDDANEQSLYQYKSAVSDSQK-NMNFLFSVRDSLCNIGPIRDITGRSQNPSE 516

Query: 520  -------------YG----LRINADASATGISKQSNYEL------------VELPGCKGI 550
                         +G    L I + +       Q+N  L            V+LPGC G+
Sbjct: 517  QPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCVGV 576

Query: 551  WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
            WTVYH+S +        + A  DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +G 
Sbjct: 577  WTVYHRSGQ--------IPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTEGP 628

Query: 611  TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
            +I+AGNLFGRRR+ QV+++G RILDG+  TQDL  G    E G+  E     S S ADPY
Sbjct: 629  SISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFADPY 679

Query: 671  VLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
            VLL M DGS +L+VGD  T TVSV TP  +  S  P+S+CTLY+D+GP PWLR+ + D W
Sbjct: 680  VLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGDVW 739

Query: 731  LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT 790
             + GV +A         DQGD+Y +VC  SG +E  ++P+  C++ V++   G   + D 
Sbjct: 740  QTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLADN 794

Query: 791  YMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTI 850
              R A K      ++  EEG  + R E +  +KVV++ +  W   + RPF+F +L+DGT+
Sbjct: 795  --RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDGTL 851

Query: 851  LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG-- 908
            L Y+A+++EG ++ + + D  S               RNLRF R  LD    EE  +   
Sbjct: 852  LSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNADE 897

Query: 909  -APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
                Q+I  FK++ G QG FL+G +P W M+FRE++R+HPQ  DG IVAFT LHNVNC H
Sbjct: 898  VRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNCQH 957

Query: 968  GFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1027
            G IYVT++  LKIC+L +   YDN WPVQKIPLK TPHQ+ +  + N+Y L++S  V  P
Sbjct: 958  GLIYVTNEASLKICRLSNILNYDNDWPVQKIPLKGTPHQMAHHPDLNIYVLVLSFSVSVP 1017

Query: 1028 LNQVLSLLIDQEVGHQIDNHNLSS-VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1086
             + VL    D   GHQID    S  +D  +   V+++EVR+LEP   G PW+T+ TI  Q
Sbjct: 1018 TSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQVDDFEVRLLEPMAQGVPWETKDTIKFQ 1077

Query: 1087 SSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1146
             +EN LTVR+V++ N  T++ E LLAIGT Y+QGEDVA+RGR++L S G +  +P+    
Sbjct: 1078 PAENVLTVRIVSIKNAATEQVENLLAIGTGYLQGEDVASRGRIILVSLGEDPSDPKVWAK 1137

Query: 1147 EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
            E+YSKELKGAISALA+LQGHLL+A GPKIILH W G+EL G AF+DAP LYVVSLNIVKN
Sbjct: 1138 ELYSKELKGAISALAALQGHLLLAIGPKIILHTWNGSELIGTAFFDAP-LYVVSLNIVKN 1196

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            F+L GD HKSIYFL WKE+GAQL LLAKDFGSLDC+ATEFLIDGSTLSL+VSD +KNIQ+
Sbjct: 1197 FVLFGDFHKSIYFLCWKEEGAQLVLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNIQV 1256

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLF 1326
            F YAPK +ESWKGQKLL R EFH+G+HVTKFLRLQML T        PGS +TNRFAL F
Sbjct: 1257 FSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRLQMLQT--------PGSSRTNRFALCF 1308

Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1386
            GTLDG IG I PLDELTFRRLQ+LQ+KLVD VPHVAGLNP+++RQF +NG+ H+ GPD+ 
Sbjct: 1309 GTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLNPKAYRQFQANGEHHKHGPDNT 1368

Query: 1387 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            VD E L  YE L L++Q+ IA Q GTTR QI +NL D++L TSF 
Sbjct: 1369 VDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDISLSTSFF 1413


>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
 gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
          Length = 1413

 Score = 1454 bits (3763), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 773/1487 (51%), Positives = 991/1487 (66%), Gaps = 130/1487 (8%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS+AA K++H PTG++ C S FITHS     P  P        S   S      +PNLV+
Sbjct: 1    MSYAAIKLVHGPTGVSACASAFITHS-----PVNP-----ASSSGWKSGNAKDSLPNLVL 50

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
              ANV+EIY VR QE G ++S   GE              KR   M GI+AA LELVC Y
Sbjct: 51   VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RL G V+S+AIL +G  D  R RD+IILAF  AK SVL FDD+   L+ +SMH FE PEW
Sbjct: 110  RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
            +HLKRGRE F  GPLV+ D QGRC GVL+Y  Q++++KA+Q   GLV ++D   SG   S
Sbjct: 169  IHLKRGREKFPGGPLVRADSQGRCAGVLIYKCQLVMMKAAQEAYGLVEEDDP--SGNIVS 226

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++  TC ++ALS
Sbjct: 227  ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            I+T  K+HP +W    LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N  A 
Sbjct: 287  INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                S E+PRS FS+ELDAAHATWL  D ALLSTKTG LV L +++DGR VQRL+LSK+ 
Sbjct: 347  PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-GDIEADAPSTKR 466
             SVL+S + TIG+  FF+GSRLGDSLLVQF    G++  S+ L+  + G+ +     +KR
Sbjct: 407  GSVLSSSLCTIGDKFFFVGSRLGDSLLVQF----GSASTSNSLEHSYDGEDDIMVRPSKR 462

Query: 467  LRRSSSDALQDMVNGEELSLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSY----- 520
            +R      L D  +  E SLY   S  ++S +   F F+VRDSL NIGP++D +      
Sbjct: 463  MR------LDD--DASEQSLYQYKSGVSDSQKNMNFLFSVRDSLCNIGPIRDITCRSQNP 514

Query: 521  --------------------GLRINADASATGISKQSNYEL------------VELPGCK 548
                                 L I + +       Q+N  L            V+LPGC 
Sbjct: 515  SEQPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCV 574

Query: 549  GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            G+WTVYH+S +        + A  DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +
Sbjct: 575  GVWTVYHRSGQ--------IPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTE 626

Query: 609  GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
            G +I+AGNLFGRRR+ QV+++G RILDG+  TQDL  G    E G+  E     S S AD
Sbjct: 627  GPSISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFAD 677

Query: 669  PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
            PYVLL M DGS +L+VGD  T TVSV TP  +  S  P+S+CTLY+D+GP PWLR+ + D
Sbjct: 678  PYVLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGD 737

Query: 729  AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             W + GV +A         DQGD+Y +VC  SG +E  ++P+  C++ V++   G   + 
Sbjct: 738  VWQTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLA 792

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D+  R A K      ++  EEG  + R E +  +KVV++ +  W   + RPF+F +L+DG
Sbjct: 793  DS--RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDG 849

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
            T+L Y+A+++EG ++ + + D  S               RNLRF R  LD    EE  + 
Sbjct: 850  TLLSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNA 895

Query: 909  ---APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
                  Q+I  FK++ G QG FL+G +P W M+FRE++R+HPQ  DG IVAFT LHNVNC
Sbjct: 896  DEVRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNC 955

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
             HG IYVT++  LKIC+L +   YDN WPVQKIPLK TPHQ+ +  + N+Y L++S  V 
Sbjct: 956  QHGLIYVTNEASLKICRLSNILNYDNDWPVQKIPLKGTPHQMAHHPDLNIYVLVLSFSVS 1015

Query: 1026 KPLNQVLSLLIDQEVGHQIDNHNLSS-VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIP 1084
             P + VL    D   GHQID    S  +D  +   V+++EVR+LEP   G PW+T+ TI 
Sbjct: 1016 VPTSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQVDDFEVRLLEPMAQGVPWETKDTIK 1075

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1144
             Q +EN LTVR+V++ N  T++ E LLAIGT Y+QGEDVA+RGR++L S G +  +P+  
Sbjct: 1076 FQPAENVLTVRIVSIKNAATEQVENLLAIGTGYLQGEDVASRGRIILVSLGEDPSDPKVW 1135

Query: 1145 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
              E+YSKELKGAISALA+LQGHLL+A GPKIILH W G+EL G AF+DAP LYVVSLNIV
Sbjct: 1136 AKELYSKELKGAISALAALQGHLLLAIGPKIILHTWNGSELIGTAFFDAP-LYVVSLNIV 1194

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
            KNF+L GD HKSIYFL WKE+GAQL LLAKDFGSLDC+ATEFLIDGSTLSL+VSD +KNI
Sbjct: 1195 KNFVLFGDFHKSIYFLCWKEEGAQLVLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNI 1254

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
            Q+F YAPK +ESWKGQKLL R EFH+G+HVTKFLRLQML T        PGS +TNRFAL
Sbjct: 1255 QVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRLQMLQT--------PGSSRTNRFAL 1306

Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
             FGTLDG IG I PLDELTFRRLQ+LQ+KLVD VPHVAGLNP+++RQF +NG+ H+ GPD
Sbjct: 1307 CFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLNPKAYRQFQANGEHHKHGPD 1366

Query: 1385 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            + VD E L  YE L L++Q+ IA Q GTTR QI +NL D++L TSF 
Sbjct: 1367 NTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDISLSTSFF 1413


>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like, partial [Cucumis sativus]
          Length = 741

 Score = 1203 bits (3113), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 583/735 (79%), Positives = 641/735 (87%), Gaps = 4/735 (0%)

Query: 697  PAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVV 756
            PAA  SSKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG   DQGDIY V 
Sbjct: 11   PAAFGSSKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVA 70

Query: 757  CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
            CY++G LEIFDVPNF  VF VDKFVSG++H+VD  + +  K SE + NS  +E    GR 
Sbjct: 71   CYDNGDLEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNS--QELISHGRN 128

Query: 817  ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
            E+  +MKV+E+AMQRWS  HSRPFLF ILTDGTILCY AYLFE  ++ SK DD VS   S
Sbjct: 129  ESSQNMKVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNS 188

Query: 877  LSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC 936
            +S SN+S+SRLRNLRF R PLD   RE+ P+G   +R++IFKNISG+QG FL GSRP W 
Sbjct: 189  VSSSNMSSSRLRNLRFLRVPLDIQGREDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 248

Query: 937  MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
            MVFRERLRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQ
Sbjct: 249  MVFRERLRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 308

Query: 997  KIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHR 1056
            K+PLK TPHQ+TYF EKNLYP+I+S PV KPLNQVLS ++DQ+VGH ++NHNLS+ +L +
Sbjct: 309  KVPLKGTPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGH-VENHNLSADELQQ 367

Query: 1057 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA 1116
            TY+VEE+E+RILEP+++GGPWQTRATI M SSENALT+RVVTL NTTTKENETLLA+GTA
Sbjct: 368  TYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTA 427

Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKII 1176
            YVQGEDVAARGRVLLFS G++ADN Q LV+EVYSKELKGAISALASLQGHLLIASGPKII
Sbjct: 428  YVQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKII 487

Query: 1177 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
            LHKWTG ELNGIAFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL+LLAKDF
Sbjct: 488  LHKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF 547

Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
            GSLDC+ATEFLIDGSTLSL VSD+QKNIQIFYYAPK +ESWKGQKLLSRAEFHVGAHVTK
Sbjct: 548  GSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTK 607

Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
            FLRLQML+TSSD+   +  SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL D
Sbjct: 608  FLRLQMLSTSSDK-ACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGD 666

Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1416
            +VPHV GLNPRSFRQFHSNGK HR GPDSIVDCELL HYEMLPLEEQL+IAHQ GTTRSQ
Sbjct: 667  AVPHVGGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQ 726

Query: 1417 ILSNLNDLALGTSFL 1431
            ILSNLNDL+LGTSFL
Sbjct: 727  ILSNLNDLSLGTSFL 741


>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
 gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
          Length = 1423

 Score =  682 bits (1761), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 505/1529 (33%), Positives = 762/1529 (49%), Gaps = 204/1529 (13%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MSFA +K +H PTG+ +  + + TH   D  P                       PNLVV
Sbjct: 1    MSFAIHKQVHPPTGVDHAVAAYFTHPIGDGGP-----------------------PNLVV 37

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
              AN + ++ +R        +  SG+        G  A SLE+V  + L+G V S+A++ 
Sbjct: 38   MQANHLTVFAIRRD----PSADASGDAAL-----GAKAMSLEVVAEFDLNGTVGSIAVMR 88

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR- 179
            +       +RD++++A  ++K+SV+E+D S   +  +S+H +E+P       G  S  R 
Sbjct: 89   RRSGAPRNQRDALLIAVRESKLSVIEWDPSEMTVVPSSLHSWETPVG---TGGVPSALRV 145

Query: 180  ---GPLVKVDPQGRCGGVLVY--GLQMIIL----KASQGGSGLVGDEDTFGSGGGFSARI 230
                PL   DP+GRC  VL+   G   + L     A     G  G +D    G G +A +
Sbjct: 146  APLPPLAIADPEGRCAAVLLRAEGRSRLALCPAVDADADADGDGGGDDGDRRGQGPAASV 205

Query: 231  ESSHVINLR-DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
              S V++L  DL +  V+D  F+HGY EPV++ILHERE TWA R+   + TC+++A+SI+
Sbjct: 206  RKSFVVDLTADLALSGVRDAAFLHGYGEPVVLILHEREPTWAARMPLVNDTCVLTAVSIN 265

Query: 290  TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
               K+  +IW    LP   Y+L A+P P+GG +V+  N + + SQ +S ALALN  A   
Sbjct: 266  LDTKRCTVIWQREKLPCTCYRLCAMPDPLGGAIVLSNNFLLHESQESSKALALNPLAGGG 325

Query: 350  DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ---RLDLSKT 406
              S        SVELD+AHA  L     L++TK G L+LL++  +GR +     + L + 
Sbjct: 326  TESA----LGVSVELDSAHAAVLSERQVLVTTKQGALMLLSLRVEGRRLAAHGAMHLRRA 381

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--KEEFGDIEADA 461
              +VL+S +  I   L FLGSR+GDSLLV            ML +    K + G+ E   
Sbjct: 382  GGAVLSSGMCLITKRLLFLGSRVGDSLLVSLKKKEAAGAAQMLPAAAPKKRKAGEAEPPK 441

Query: 462  PSTKRLRRSSS----DALQDMVNGE-ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLK 516
            P     +  +S    D L+ M+ GE E +   + +   E     ++F VRDS++ I P+ 
Sbjct: 442  PPPPPQKVGTSQDDEDELEAMLYGEGEAAAKAANAGRKE--DPGYTFTVRDSVLGISPII 499

Query: 517  DFSYGLRINADA---------SATGISKQSNYELVE---------------LPGCKGIWT 552
            D + G   +            +A G  K     +++               LPG  G WT
Sbjct: 500  DLTAGASASVQGDTEERAELVAACGHGKNGALAILQRGIQPELVTEVEAGTLPGLMGTWT 559

Query: 553  VYHKS---SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
            VYH+S    R   + ++  AA  D +H+YL+ISLE+ TMVLET + L EV+E+V+     
Sbjct: 560  VYHESRDNERLRESGAAAAAANVDPFHSYLVISLESTTMVLETGEELREVSEAVELVTDA 619

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
             T+AAGN+ GR+R+ QV + G RI +G    QDLS       +G  S +  +++  + DP
Sbjct: 620  ATLAAGNMHGRKRIAQVHKGGVRICEGPVKIQDLSAA-EMPAAGDVSPDLEIIAAQVLDP 678

Query: 670  YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES--SKKPVSSCTLYHDKGP--------- 718
            YVL  MSDGS+R+L GD    +V   +P++  +  + + ++S  L  D  P         
Sbjct: 679  YVLCRMSDGSLRVLKGDEEKGSVEAMSPSSYANLPTGESIASAALVDDSVPAAERPGLTT 738

Query: 719  -EP-WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT 776
             EP +LR+T+T    STGV          P D+      V    G LE++ +P+   +++
Sbjct: 739  REPGFLRRTAT----STGV---------LPEDEEGTVLAVTRVGGTLELYALPSCERIWS 785

Query: 777  VDKFVSGRTHIVDTYMREALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
             D    G   +      + +  D + E+  +          ++  + ++VE  +  +   
Sbjct: 786  ADGLSEGLNVLAPGGAGDDVNVDGDGEVEPT----------DDYPAPEIVEFRLDAFPRA 835

Query: 836  HSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
            H RP L A+  DG++L Y+A+L   G  N      P                   LRF R
Sbjct: 836  HERPMLTALRGDGSVLVYRAFLCPPGAGNVGHEAKP------------------QLRFCR 877

Query: 895  TPLD------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
             P++           +   G+   R     +  G +G F+SG RP W +V R R+   P 
Sbjct: 878  VPIELEGGGGGMVDTKALSGSRLTRFERVGDRGGIRGVFVSGPRPLWLLVRRSRVLALPI 937

Query: 949  LCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQI 1007
              +    V+FT  HNVNC +GF+  T+ G ++ICQ+P    Y+  WPV+K+ L+ TPH +
Sbjct: 938  RGEAQRTVSFTPFHNVNCLNGFMLGTAAGGVRICQIPGRMHYEAAWPVRKLALRCTPHHV 997

Query: 1008 TYFAEKNLYPLIVSVPVLKPLNQV------LSLLIDQEVGHQIDNHNLSSVDLHRTYTVE 1061
             Y  +  LY L  S PV    ++V      LS LI     + +    +  V         
Sbjct: 998  QYLPDFRLYALSTSAPVKWKDHEVNEDDIHLSTLIKVRKANAMAKGGVEQV--------- 1048

Query: 1062 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1121
             + +R+L P      WQ      +   E+  ++R V L NT T   +++L +GTA   GE
Sbjct: 1049 -FSLRLLVPGTLECAWQ----YTVDPGEHVQSIRNVQLRNTMTGALQSMLVVGTALPGGE 1103

Query: 1122 DVAARGRVLLFS-----TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKII 1176
            D   RGRVL+F      T R       LV     ++ K A +AL  + GHL +A G K+I
Sbjct: 1104 DAPCRGRVLIFEVVWQMTDRGTKWQGQLVC---VRDAKMACTALEGVGGHLAVAIGTKLI 1160

Query: 1177 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ--LNLLAK 1234
            +H W G  L  +AF+D  PL+ V++N+VKNFILLGDI K  +F  WK+   +  L  +AK
Sbjct: 1161 VHSWDGHSLMPVAFFDT-PLHTVTMNVVKNFILLGDIQKGAFFFRWKDTPDEKLLVQMAK 1219

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
            DF  +D  ATEFL+DGSTLS++ +D   N  IF Y PK  ESWKGQKLL++  FHVG+ V
Sbjct: 1220 DFEGMDILATEFLVDGSTLSMLTTDMTGNAFIFSYDPKSLESWKGQKLLTKGAFHVGSPV 1279

Query: 1295 TKFLRLQMLATSSDRTGAAPGSD--------KTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
             + +R ++ A +     AAPG          + NR A+ FGTLDGS+G + P++E     
Sbjct: 1280 HRMVRFRLKAPT-----AAPGQTISPAEQKAQANRHAVFFGTLDGSLGILVPIEEAAHAS 1334

Query: 1347 LQSLQKKLVDSVPH--VAGLNPRSFRQFHS-NGKAHR-PGPDSIVDCELLSHYEMLPLEE 1402
            LQSLQ+ L  + PH  +AGLN R+ R   +  G+  R P P S++D  LL+ YE +P + 
Sbjct: 1335 LQSLQRYLTYATPHAALAGLNARTHRHPKTVEGRPMRQPAPHSLLDGGLLAVYEHMPWKA 1394

Query: 1403 QLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            Q + A + G TR   L +L+ L+  T+F+
Sbjct: 1395 QAKAAKEAGMTRDVALGHLHQLSARTAFM 1423


>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1386

 Score =  677 bits (1746), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 476/1522 (31%), Positives = 747/1522 (49%), Gaps = 227/1522 (14%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS A ++ +H PTG+ +  + + T    D                       G  PNL+V
Sbjct: 1    MSHAVHREVHPPTGVDHAVTAYFTRPVGD-----------------------GGDPNLIV 37

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
             +AN I +Y V     G +ES                   L++   +   G + S+++L 
Sbjct: 38   ASANRITVYAV--NRRGDEES-------------------LDVCAEFDAQGAIGSMSVLR 76

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES-----PEWLHLKRGRE 175
            +       +RD++++A  + K+SV+E+D +   +  +SMH FES     P    L+  RE
Sbjct: 77   RRFGAPRNQRDALLIAIRERKLSVVEYDAATGDVCCSSMHSFESALGCNPLGTTLRMSRE 136

Query: 176  SFARGPLVKVDPQGRCGGVLV----YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
            +    PLV  DP+GRC  V++       ++ +L +  GG GLV ++D  G   G +A + 
Sbjct: 137  A----PLVVSDPEGRCAAVVLREDGVAGKVRVLPSVDGGLGLVANDDE-GRVRGPAASVR 191

Query: 232  SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
             S  ++L  + +  ++D  F+HGY EP + +L+E+  TWAGR +    TC I ALS+   
Sbjct: 192  ESFPLHLPGVRL--IRDACFLHGYGEPALAVLYEKTPTWAGRYNLSKDTCEIVALSVDVD 249

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             ++  +IW   NLP  +YKL A+  P+GG LV   + + + SQ +S  L LN +      
Sbjct: 250  KQKGTVIWRRQNLPSSSYKLTALLPPLGGALVFSQDFLLHESQESSSVLGLNTFGHG--G 307

Query: 352  SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
             QE   +   + LD A A+ +  D  L++TKTG L+LL +  DGR ++R+ L +   +VL
Sbjct: 308  PQE--GNDAEITLDGAQASVVSEDRVLVTTKTGALLLLALHTDGRSLRRMMLQRAGGAVL 365

Query: 412  TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---ML-----------SSGLKEEFGDI 457
            +S +  +   L FLGSR+GDSLLV+FT     +   ML           +   K++  ++
Sbjct: 366  SSGMCLLSRDLLFLGSRIGDSLLVKFTPKEEPTAPLMLPDAEDESEDEATEKSKDDDDEL 425

Query: 458  EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
            EA    T +     +DA+Q      E    G A          +   V+DSL+ + P+ D
Sbjct: 426  EALLYGTTKTETVQTDAVQ-----TEKKREGLAGIIPGLKVAGYDLKVKDSLLGVAPVVD 480

Query: 518  FSYGLRINADASATGISKQSNYELVE-----------------------------LPGCK 548
             + G      ++  G +K    EL+                              LP  +
Sbjct: 481  IAVGA-----SAPMGSNKNERTELITACGQGKNGALAILTRGVQPELVTEVESGTLPNLQ 535

Query: 549  GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            G+WT+++   R   +   R     + +H +L++S+++ TM++ET + L EV+ S+++   
Sbjct: 536  GLWTLHY---RKEGSKEER-----EPFHHHLLLSMKSSTMIMETGEELQEVSASLEFITN 587

Query: 609  GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
              T+AA N+FG    +QV   G R+L G    QD+     ++  G+      + S  I D
Sbjct: 588  QATLAASNIFGHYCSVQVTGTGIRVLKGGVKVQDVGLQDMDAPKGA-----AIASAQILD 642

Query: 669  PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK--------GPE- 719
            PY+++ +SDGSIRLL GD    +VS+    AI +S   V++  L  D         G E 
Sbjct: 643  PYIIVRLSDGSIRLLSGDEKQMSVSLMETGAIPTSS--VTAFALVDDSVEAADAAGGGER 700

Query: 720  --PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV 777
               W+ + +T+  ++   G    GA     +  +    +  E G+LE+F +P+   ++  
Sbjct: 701  KSGWIHRAATNGTITGLEGNKKSGA----CNNSEAIVALTREGGSLELFSLPSCTRIWCA 756

Query: 778  DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
            D    G        MR  +   +T +N+ S               ++V++ +  +   H 
Sbjct: 757  DGLSEG--------MR--VLSPQTPVNAESS------------VPEIVDIRIDSFQDAHE 794

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
            RP L A+  DGT+L Y+ ++          D+P+  +               LRFSR  +
Sbjct: 795  RPLLTAVRGDGTLLLYKGFIVPAGTTYEGQDEPLEKN--------------ELRFSRVNV 840

Query: 898  D-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLR 944
            D             A    ++  GA   RI       G QG F++G  P W +V R R+ 
Sbjct: 841  DVEGSGLNVAGIGAAGQLRDSLAGARLTRIGNVGEGQGVQGIFVAGPNPLWLIVRRSRVL 900

Query: 945  VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATP 1004
              P   +G +VAFTV HNVNC HGFI  T+ G ++ICQ+PS   Y+  WPV+K+ LK TP
Sbjct: 901  ALPTRGEGEVVAFTVFHNVNCPHGFILGTALGGVRICQMPSKMHYEAAWPVRKVALKCTP 960

Query: 1005 HQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE--- 1061
            H ITY  +  LY L+ S PV       +   I+Q+  H I    L+ V   R    +   
Sbjct: 961  HTITYLPDFKLYALVTSAPV-----PWVEREIEQDNVHGI---ALAKVRRERAKANDDME 1012

Query: 1062 -EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG 1120
             +Y VR+L P      WQ      ++  E+   VR V L +  T    +LLA+GTA   G
Sbjct: 1013 LQYSVRLLVPGSLDSAWQH----ALEPGEHVQCVRNVQLRDINTGALLSLLAVGTAMPGG 1068

Query: 1121 EDVAARGRVLLFST--GRNADNPQNLVTE---VYSKELKGAISALASLQGHLLIASGPKI 1175
            ED   RGRV+LF     R+A++      +      +E K A +AL++L GHL++A G K+
Sbjct: 1069 EDTPCRGRVILFQMVWERDAESMDGYRWKGQVCCVREAKMACTALSALDGHLIVAVGTKL 1128

Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL--LA 1233
             +H W G ELN +AF+D  P++ VS+N+VKNFIL+GD+ K ++F  WK  G + ++  L+
Sbjct: 1129 TVHTWDGVELNSVAFFDT-PIHTVSINVVKNFILVGDLEKGLHFFRWKANGFEKSIIQLS 1187

Query: 1234 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
            KDF  +D  +TEFLIDG+TLSL+ SD   N +IF Y PK  ESWKGQKLL R+ +HVG+ 
Sbjct: 1188 KDFDRMDVVSTEFLIDGATLSLLGSDMSGNARIFGYDPKSLESWKGQKLLVRSAYHVGSP 1247

Query: 1294 VTKFLRLQMLATSSDRTGAAPGS--DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
            +++ +R  +  T++    AAPG     TNR A+ FGTLDG++G   P DE T+ +L +LQ
Sbjct: 1248 ISRMVRFNVEGTTAK---AAPGERPKGTNRHAVFFGTLDGALGIFMPTDEPTYAKLHALQ 1304

Query: 1352 KKLVDSVPHVAGLNPRSFR--QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
            ++L  +V    G NPR+FR  +           P  ++D  LLS +E L   EQ  +A +
Sbjct: 1305 RELNTTVRSPIGCNPRTFRTPKVFEGKHVQLLAPLDVLDGGLLSKFETLTFTEQRAVAER 1364

Query: 1410 TGTTRSQILSNLNDLALGTSFL 1431
            +G  R   L  +  L+   +F+
Sbjct: 1365 SGVDRDLALGLIQHLSASNAFV 1386


>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1469

 Score =  662 bits (1707), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 487/1549 (31%), Positives = 773/1549 (49%), Gaps = 198/1549 (12%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MSFA +K +H PTG+ +  + + TH         P+              G G  PNLVV
Sbjct: 1    MSFAIHKQVHPPTGVDHACAAYFTH---------PI--------------GSGAPPNLVV 37

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRR---------VLMDGISAA------------ 99
              AN + IY +R   +G      SG   +          ++ D IS A            
Sbjct: 38   LQANRLTIYAIR--RDGDARDNPSGNATKEADDAAIAASLVADAISGAGATASATIDADD 95

Query: 100  ---SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
               SLE+V  + L+G V S+A L +       +RD+++LA  ++K+SV+EFD S   L  
Sbjct: 96   AEVSLEVVAEFDLNGTVGSIATLRRRFGAPREQRDALLLAVRESKLSVVEFDPSTLSLVC 155

Query: 157  TSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY---GLQMIILKASQG 209
            +S+H +E+P       G  S  R     P+V  DP+GRC  VL+    G ++ +L     
Sbjct: 156  SSLHSWETPPG---AGGVPSALRLAPTPPVVVADPEGRCAAVLLRAEGGTRLALLPTDND 212

Query: 210  GSGLVGDEDTFGSGG----GFSARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILH 264
               + G + + G G     G +A ++ S+V++L R++ +++V+D  F+HGY EPV+++LH
Sbjct: 213  AMDVDGGDGSEGKGRRTLRGTAAAVKKSYVVDLVREMGVRYVRDVCFLHGYGEPVLLVLH 272

Query: 265  ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
            E  LTWA R +    T  +SA+S++   ++H +IW    LPH  Y+L A+P+P+GG +V+
Sbjct: 273  EERLTWAARATLVKDTMRLSAISLNVDARKHTVIWRRSALPHSCYRLTAMPAPLGGAIVL 332

Query: 325  GANTIHYHSQSASCALALNNYA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
              N + + SQ +S ALALN  A      D + +   ++ +  LD A+A  +    AL++T
Sbjct: 333  SQNFLLHESQESSAALALNPLAGGGRGDDPAAKAAAAASAAALDGAYAAVISEKQALVTT 392

Query: 382  KTGDLVLLTVVYDGRVVQR---LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            K G L LL++  +GR +     + L +   +VL+S +  +   L FLGSR+GDSLLV   
Sbjct: 393  KAGALYLLSLRIEGRRLATRGGMHLKRAGGAVLSSGMCLVTRRLLFLGSRVGDSLLVS-R 451

Query: 439  CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG-EELSLYGSASNNTESA 497
            C +  +  ++  +       A   +   +R        D V G    SL  +A+ +    
Sbjct: 452  CSTARASTAAPGRRPRAAAAAATTAAAEVRLLPIRPQIDGVGGVSAASLRAAAAAHRAPD 511

Query: 498  QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATG----------------------ISK 535
               ++F VRDS++ I P+ D + G    A AS +G                      + +
Sbjct: 512  HPGYTFTVRDSVLGISPVIDLTVG----ASASVSGDTIERTELIAACGHGKNGALAVLQR 567

Query: 536  QSNYELVE------LPGCKGIWTVYHKSS---RGHNADSSRMAAYDDEYHAYLIISLEAR 586
                ELV       LPG KG WTV+H S+   R   + ++  A   D YHAYL+ISL + 
Sbjct: 568  GIQPELVTEVESGTLPGLKGTWTVHHDSADNERLRGSAAAAAAQAVDPYHAYLVISLASS 627

Query: 587  TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
            TM+LET + L EV+E V+      T+ AGN FGR R++QV+++G R+  G    QD++  
Sbjct: 628  TMILETGEELKEVSEHVELVTDAATLCAGNAFGRERIVQVYDKGVRVAAGPVKVQDIAST 687

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-------VQTP-- 697
               +++G G E   +++  I+ PYVL  +SDGS+ +L GD  + T+         + P  
Sbjct: 688  ELVADAGDG-EGIEIVAAEISFPYVLCRLSDGSLAVLKGDEESKTLVKLDVDALARLPPG 746

Query: 698  -----AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGADGGPLDQG 750
                 A +     P ++    HD+ P  +L++ +T    +T    +   +  D     + 
Sbjct: 747  GGIACATLVDDSTPAAAHGGLHDRSPG-FLKRATTATATTTTTTASASREDGDDDDDSRR 805

Query: 751  DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
             ++  V    GALE++ +P+ +  +T +    G   +                     + 
Sbjct: 806  PMFLAVTRTGGALELYSLPSCDKAWTANGLSEGVAVLSPA--------GSASAALVDRDA 857

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                      + ++VEL +  ++  H RP L A+  DG +L Y+A+              
Sbjct: 858  AAAADAGADRAPEIVELRVDAFARAHERPLLTALRADGAVLVYRAF-------------- 903

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA------PCQRITIFKNIS--- 921
                 + +V+      L  LRF+R P++    E    GA      P  R+T F+ +    
Sbjct: 904  -----TCAVAGPGGRALTQLRFARVPVEL---EGGGGGAVDLSALPGSRLTRFERVGDRG 955

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKI 980
            G +G F+SG +P W +  R R+   P   +   +V+FT  HNVNC+ GFI  T+ G ++I
Sbjct: 956  GIRGVFVSGPQPLWLLARRSRVLALPVRGEAQRVVSFTAFHNVNCHAGFILGTAAGGVRI 1015

Query: 981  CQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
            CQ+P    Y+  WPV+K+ L+ TPH + Y  +  LY L  S P         +  ++ EV
Sbjct: 1016 CQIPGRMHYEAAWPVRKLALRCTPHHVQYLPDFKLYALSTSAP---------AKWVEPEV 1066

Query: 1041 GHQIDNHNLSSVDLHRTYTV------EEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
              + D H  + V   R   +      E++ V++L P    G  +T  +  M   E+   V
Sbjct: 1067 AEE-DIHAATVVKTRRAKAMARGGVEEQFAVKLLVP----GSLETAWSRTMDPGEHVQAV 1121

Query: 1095 RVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1154
            + V + N  T    ++LA+GTA   GED   RGRV+LF       + +     +      
Sbjct: 1122 KNVQVRNLRTGALHSMLAVGTAMPGGEDTPCRGRVILFEISWQMVDGETRRVPLLLLFFD 1181

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
             A++AL+ L+GHL++A G K+I+H W G EL  +AF+D P ++ V++N+VKNF+ +GD+ 
Sbjct: 1182 DALAALSGLEGHLVVAIGTKLIVHAWDGAELIPVAFFDTP-VHTVTINVVKNFVCIGDVQ 1240

Query: 1215 KSIYFLSWKE--QGAQLNL--LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA 1270
            K  YF  WK+  +  + NL  LAKDF S+D  +TEFL+DGSTLSL+ +D   N  +F Y 
Sbjct: 1241 KGAYFFRWKDDPRTGEKNLIQLAKDFESMDVLSTEFLVDGSTLSLLAADTAGNAYVFAYD 1300

Query: 1271 PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML----ATSSDRTGAAPGSDK--TNRFAL 1324
            PK SESWKGQKLL++A FHVG+ V + +R ++     A +  R    P   K   NR A+
Sbjct: 1301 PKSSESWKGQKLLTKASFHVGSPVHRMVRFKLKTPTGAGNDGRAAPTPAEIKANANRHAV 1360

Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN-GKAHR-PG 1382
             FGTLDGS+G + P++  T  +L+ LQ+ L  +    AGLN RS+R   +  G+A R P 
Sbjct: 1361 FFGTLDGSLGILVPMESSTHAKLEVLQRWLNYNTAQNAGLNGRSYRAPKTTEGRAMRSPA 1420

Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            P +++D E+L  +E L   +Q E A   G TR + L+ L+ L+  T+F+
Sbjct: 1421 PHNLLDGEMLQGFESLAWTKQAEAADAAGMTREEALTYLHTLSAKTAFM 1469


>gi|414587801|tpg|DAA38372.1| TPA: hypothetical protein ZEAMMB73_993613 [Zea mays]
          Length = 573

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/569 (58%), Positives = 411/569 (72%), Gaps = 34/569 (5%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--------ELDSELP-SKRG 51
           MS+AAYKMMH PTGI +C +GFITHS AD                   ++DS    + R 
Sbjct: 1   MSYAAYKMMHLPTGIDHCAAGFITHSPADAAAFSTPAPAPTAAAGPDGDIDSTAARAPRR 60

Query: 52  IGPVPNLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYR-- 108
           +GP PNLVV+AANV+E+Y VR +   G++++ NS  T    ++DGIS A LELVCHYR  
Sbjct: 61  VGPTPNLVVSAANVLEVYAVRAEVATGAEDAGNSSSTG--TILDGISGARLELVCHYRCK 118

Query: 109 ----------------LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
                           LHGN+ES+A+LS G      RRDSI + F DAKI+ LEFDDSI+
Sbjct: 119 QMALASLHSLLAVNFRLHGNIESMAVLSDG---TENRRDSIAVTFNDAKITCLEFDDSIN 175

Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
           GLR +SMHCFE PEW HLKRGRESFA GP++K DPQGRCG VLVYGLQ+IILKA+Q G  
Sbjct: 176 GLRTSSMHCFEGPEWFHLKRGRESFAWGPIIKGDPQGRCGAVLVYGLQIIILKAAQVGQS 235

Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
           LVG+++        + RIESS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAG
Sbjct: 236 LVGEDEPTRVLSSTAVRIESSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAG 295

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           R+S K  TCM+SA SIS  LKQHP+IWSA  LPHDAY+LLAVP PI G+LV+ AN+IHYH
Sbjct: 296 RISSKSQTCMLSAFSISMGLKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYH 355

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
           SQS SC+LALN+++   D S E+ ++SF VELD A ATWL +D+ + S+K G+++LLTVV
Sbjct: 356 SQSTSCSLALNSFSSQPDGSPEILKTSFHVELDVAKATWLSHDIVMFSSKNGEILLLTVV 415

Query: 393 YDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
           YDGR VQRLDL K+  SVL+S  TT+G+S  FLGSRL DSLLVQF+CG  TS+L   L +
Sbjct: 416 YDGRAVQRLDLMKSKASVLSSGATTLGSSFIFLGSRLADSLLVQFSCGMPTSVLPD-LTD 474

Query: 453 EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
           E  DIE+D P +KRL+R  SD LQD+ + EELS +  A  N   + +  SF VRD+L+N+
Sbjct: 475 EPADIESDLPFSKRLKRIPSDVLQDVTSVEELSFHNKAVPNIVDSAEKISFVVRDALINV 534

Query: 513 GPLKDFSYGLRINADASATGISKQSNYEL 541
           GPLKDF+YGLR N+D +A GI+KQSNYEL
Sbjct: 535 GPLKDFAYGLRTNSDPNAAGIAKQSNYEL 563


>gi|297722899|ref|NP_001173813.1| Os04g0252200 [Oryza sativa Japonica Group]
 gi|255675253|dbj|BAH92541.1| Os04g0252200, partial [Oryza sativa Japonica Group]
          Length = 432

 Score =  644 bits (1662), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/411 (77%), Positives = 356/411 (86%), Gaps = 8/411 (1%)

Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1080
            SV   +PLNQVLS + DQE  H +DN   S+  LH+TYTV+E+EVRILE ++ GG W+T+
Sbjct: 30   SVCSFRPLNQVLSSMADQESVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETK 89

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
            +TIPMQ  ENALTVR+VTL NTTTKENETLLAIGTAYV GEDVAARGRVLLFS  + ++N
Sbjct: 90   STIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SEN 148

Query: 1141 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1200
             QNLVTEVYSKE KGA+SA+ASLQGHLLIASGPKI L+KWTG EL  +AFYDAP L+VVS
Sbjct: 149  SQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGAELTAVAFYDAP-LHVVS 207

Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
            LNIVKNF+L GDIHKSIYFLSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD 
Sbjct: 208  LNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDS 267

Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
             KN+QIFYYAPKM ESWKGQKLLSRAEFHVGAH+TKFLRLQML T          S+KTN
Sbjct: 268  DKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTN 321

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
            RFALLFG LDG IGCIAP+DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HR
Sbjct: 322  RFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHR 381

Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            PGPD+I+D ELL+HYEML L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 382  PGPDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 432


>gi|242075246|ref|XP_002447559.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
 gi|241938742|gb|EES11887.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
          Length = 389

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 311/397 (78%), Positives = 344/397 (86%), Gaps = 8/397 (2%)

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + DQE+G  I+N   S  DL + YTV+E+EVRI+E  +  G W+TR TIPMQS ENALTV
Sbjct: 1    MADQELGLHIENDVTSGDDLQKVYTVDEFEVRIMELGKPSGHWETRFTIPMQSFENALTV 60

Query: 1095 RVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1154
            R+VTL NT+TKENETL+AIGTAYVQGEDVAARGRVLL+S  R ++N QNLVTEVYSKE K
Sbjct: 61   RIVTLQNTSTKENETLMAIGTAYVQGEDVAARGRVLLYSFSR-SENSQNLVTEVYSKESK 119

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            GA+SA+ASLQGHLLIASGPKI L+KWTG+EL  +AFYDAP L+VVSLNIVKNF+L GDIH
Sbjct: 120  GAVSAVASLQGHLLIASGPKITLNKWTGSELTAVAFYDAP-LHVVSLNIVKNFVLFGDIH 178

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            KSIYFLSWKEQG+QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD  KN+QIFYYAPKM 
Sbjct: 179  KSIYFLSWKEQGSQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDSDKNVQIFYYAPKMV 238

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            ESWKGQKLLSRAEFHVGAHV+KFLRLQML T          S+KTNRFAL+FGTLDG IG
Sbjct: 239  ESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ------GLASEKTNRFALVFGTLDGGIG 292

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
            CIAP+DELTFRRLQSLQ+KLVD+VPHV GLNPRSFR F SNGKAHRPGPD+I+D ELLSH
Sbjct: 293  CIAPVDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELLSH 352

Query: 1395 YEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            YEML LEEQLEIA Q GTTRSQILSN +D  LGTSFL
Sbjct: 353  YEMLSLEEQLEIAQQIGTTRSQILSNFSDFLLGTSFL 389


>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oryzias latipes]
          Length = 1456

 Score =  612 bits (1579), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 461/1508 (30%), Positives = 732/1508 (48%), Gaps = 227/1508 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E +  S  S + K R          LE V  + L GNV S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSSSDKSSDAKTR-------KEKLEQVASFSLFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            A +   GA     +D+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82   ASVQLTGAS----KDALLLSFKDAKLSVIEYDPGTHDLKTLSLHYFEEPE---LRDGFFQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                P+V+VDP+ RC  +L+YG ++++L   +     + DE   G G G  +    S++I
Sbjct: 135  NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKD---TLSDEQEGGVGEGPKSSFLPSYII 191

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS         Y VSL+S   
Sbjct: 252  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305

Query: 352  -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
             +   P   +    + LD   + ++  D  ++S K G++ +LT++ DG R V+     K 
Sbjct: 306  GTTSFPLRVQEEVKITLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
              SVLT+ + T+     FLGSRLG+SLL+++T     +    G  ++  + E   P  K+
Sbjct: 366  AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPAEDGNDKQ--EKEKQEPPNKK 423

Query: 467  LRRSSSD-----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGP 514
             R  SS             L D V+  E+ +YGS A + T+ A  TFSF V DS++NIGP
Sbjct: 424  KRVESSSNWTGCSASYFFVLSDEVD--EIEVYGSEAQSGTQLA--TFSFEVCDSILNIGP 479

Query: 515  LKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGCKG 549
              + S G         + N +        +G  K     ++            ELPGC  
Sbjct: 480  CANASMGEPAFLSEEFQSNPEPDLEIVVCSGYGKNGALSVLQRSIRPQVVTTFELPGCHD 539

Query: 550  IWTVY----HKSSRG--HNADSSRMAAYDD---------EYHAYLIISLEARTMVLETAD 594
            +WTV      K S G    AD+ +    D          + H +LI+S E  TM+L+T  
Sbjct: 540  MWTVISGEDKKESEGGEKEADAEKKEEQDKTEPPLEDDAKKHGFLILSREDSTMILQTGQ 599

Query: 595  LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
             + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G    + L F P +     
Sbjct: 600  EIMELDTS-GFATQGPTVFAGNIGDNQYIIQVSPMGLRLLEG---VKQLHFIPVDL---- 651

Query: 655  GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT-----VSVQTPAAIESSK----- 704
                S ++  S+ADPYV++  ++G + + V    T       +++Q P     S+     
Sbjct: 652  ---GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDTYMGKTHRLALQKPQISTLSRVIALC 708

Query: 705  -----------KPVSSC---------------TLYHDKG----PEPWLRKTSTDAWLSTG 734
                       +  SSC               T+Y D       E  +    + A ++ G
Sbjct: 709  AYRDVSGMFTTENKSSCSSKEDLILRSNSETETVYQDLSNTVDDEEEMLYGESGASMAAG 768

Query: 735  V-----GEAIDGADGGPLDQGDI----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
                  G A     GG    G      + V+  E+G +EI+ +P++  VF V  F  G+ 
Sbjct: 769  KEEMSRGSAATAPPGGEGSAGKAEPSHWCVLIRENGVMEIYQLPDWRLVFLVKNFPVGQR 828

Query: 786  HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
             +VD+    +   S T+ +   EE T QG    +  + +V L   R     SRP+L  + 
Sbjct: 829  VLVDS----SSGQSATQGDGKKEEVTRQGEIPLVKEVALVALGNNR-----SRPYLL-VH 878

Query: 846  TDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
             +  +L Y+A+ +  + P+N  K                       +RF + P     RE
Sbjct: 879  VENELLVYEAFPYDQQQPQNNLK-----------------------VRFKKVPHSINFRE 915

Query: 904  ETPH---------GAPCQRITI---------FKNISGHQGFFLSGSRPCWCMVF-RERLR 944
            + P          G P + + +         F++ISG+ G F+ G  P W ++  R  LR
Sbjct: 916  KKPKLKKDKKAEGGGPEENVAVKSRISRFRYFEDISGYSGVFICGPSPHWMLITSRGGLR 975

Query: 945  VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATP 1004
            +HP   DG I +F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+KIPL+ T 
Sbjct: 976  LHPMTIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTV 1035

Query: 1005 HQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1064
            H ++Y  E  +Y +  SV       + L   I +  G + +   +   + +     E++ 
Sbjct: 1036 HFVSYHVESKVYAVCTSV-------KELCTRIPRMTGEEKEFETIERDERYINPLQEKFS 1088

Query: 1065 VRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGE 1121
            ++++ P      W+T     I ++  E+   ++ V L +  T    +  +A GT  +QGE
Sbjct: 1089 IQLISPVS----WETIPNTRIDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCVLQGE 1144

Query: 1122 DVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKII 1176
            +V  RGR+L+         P   +T+     +Y KE KG ++AL    G+L+ A G KI 
Sbjct: 1145 EVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCHGYLVSAIGQKIF 1204

Query: 1177 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
            L      +L G+AF D   LY+  +  +KNFIL  D+ KSI  L ++E+   L+L+++D 
Sbjct: 1205 LWALKDNDLTGMAFIDTQ-LYIHQMISIKNFILAADVMKSISLLRYQEESKTLSLVSRDA 1263

Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
              L+ ++ EF++D + L  +VSD  KN+ ++ Y P+  ES+ G +LL RA+F+ GAH+  
Sbjct: 1264 KPLEVYSIEFIVDNNQLGFLVSDRDKNLFVYMYLPEAKESFGGMRLLRRADFNAGAHINS 1323

Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
              R  M    +  +G+       N+    F TLDG IG + P+ E T+RRL  LQ  L  
Sbjct: 1324 LWR--MPCRGALDSGSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTT 1381

Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1416
             +PH AGLNP++FR  HSN ++ +    +I+D ELL+ Y  L   E+ E+A + GTT+  
Sbjct: 1382 MLPHHAGLNPKAFRMMHSNRRSLQNAVKNILDGELLAKYLYLSTMERSELAKKIGTTQDI 1441

Query: 1417 ILSNLNDL 1424
            IL +L ++
Sbjct: 1442 ILDDLLEI 1449


>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
            chinensis]
          Length = 1469

 Score =  611 bits (1575), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 458/1505 (30%), Positives = 716/1505 (47%), Gaps = 214/1505 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV  A   ++YV R+  +    +KN   T+ +   +      LELV  +   GNV S+
Sbjct: 29   NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82   ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
                P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 135  NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232  SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
             S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187  PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290  TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVS 348
             T + HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247  ITQRVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTAG 306

Query: 349  LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
              +     +    + LD AHA ++  D  ++S K G++ +LT+V DG R V+     K  
Sbjct: 307  TTAFPLRTQDGVRLTLDCAHAAFISYDKMVISLKGGEIYVLTLVTDGMRSVRAFHFDKAA 366

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367  ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424

Query: 468  RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFS-- 519
              +     SS   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + +  
Sbjct: 425  DPTGGWSGSSTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480

Query: 520  ---------------YGLRINADA--SATGI---------SKQSNYELV----------- 542
                           YGL   A+     TG+         S + + E+V           
Sbjct: 481  EPAFLSEEVGTGVAEYGLIGQAEGWGRRTGLTPAPVQFQNSPEPDLEIVVCSGYGKNGAL 540

Query: 543  ---------------ELPGCKGIWTVY-------HKSSRGHNADSSRMAAYDD--EYHAY 578
                           ELPGC  +WTV         ++ +    +  R A  +D    H +
Sbjct: 541  SVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKDEEETPKAEGTEQPRAAEAEDGVRRHGF 600

Query: 579  LIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY 638
            LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G  
Sbjct: 601  LILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG-- 657

Query: 639  MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-----TVS 693
                L F P +         + ++  ++ADPYV++  ++G + + +    T       ++
Sbjct: 658  -VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFLLKSDTYGGRHHRLA 709

Query: 694  VQTPAAIESSKKPVSSCTLYHDKG-------------PEPWLRKTSTDAWLSTGVGEAID 740
            +  P     SK  V +  LY D                EP  R +     L       +D
Sbjct: 710  LHKPPLHHQSK--VITLCLYRDVSGMFTTESRLGGARDEPGARGSCEVEGLGAETSPTVD 767

Query: 741  G------ADGG----------------PLDQGDI--------YSVVCYESGALEIFDVPN 770
                    D G                P D+           + ++  E+G +E++ +P+
Sbjct: 768  DEEEMLYGDSGSLFSPSKEETRRSSQPPADRDPAPFRAEPTHWCLLVRENGTMEMYQLPD 827

Query: 771  FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ 830
            +  VF V  F  G+  +VD+    +     T+  +  EE T QG    +  + +V L   
Sbjct: 828  WRLVFLVKNFPVGQRVLVDS----SFGQPATQAEARKEEATRQGELPLVKEVLLVALG-- 881

Query: 831  RWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL 890
               +  SRP+L  +  D  +L Y+A+    P ++      +         N++    +  
Sbjct: 882  ---SRQSRPYLL-VHVDQELLLYEAF----PHDSQLGQGNLKVRFKKVPHNINFREKKLK 933

Query: 891  RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQL 949
               +      T E         R   F++I G+ G F+ G  P W +V  R  LR+HP  
Sbjct: 934  PSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMG 993

Query: 950  CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
             DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD  WPV+KIPL+ T H + Y
Sbjct: 994  IDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAY 1053

Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
              E  +Y +  S     P  +     I +  G + +   +   D +     E + ++++ 
Sbjct: 1054 HVESKVYAVATSTNA--PCTR-----IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLIS 1106

Query: 1070 PDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAAR 1126
            P      W+    A I ++  E+   ++ V+L +  T    +  +A GT  +QGE+V  R
Sbjct: 1107 PVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCR 1162

Query: 1127 GRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
            GR+L+         P   +T+     +Y KE KG ++AL    GHL+ A G KI L    
Sbjct: 1163 GRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLR 1222

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
             +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++E+   L+L+++D   L+ 
Sbjct: 1223 ASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEV 1281

Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1301
            ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G  LL RA+FH+GAHV  F R  
Sbjct: 1282 YSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGLLLLRRADFHLGAHVNTFWR-- 1339

Query: 1302 MLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
                 +   GA  G  K      N+    F TLDG IG + P+ E T+RRL  LQ  L  
Sbjct: 1340 -----TPCRGAVEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTT 1394

Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1416
             +PH AGLNPR+FR  H + +  +    +++D ELLS Y  L   E+ E+A + GTT   
Sbjct: 1395 MLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLSRYLYLSTMERSELAKKIGTTPDI 1454

Query: 1417 ILSNL 1421
            IL +L
Sbjct: 1455 ILDDL 1459


>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oreochromis niloticus]
          Length = 1456

 Score =  608 bits (1569), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 455/1509 (30%), Positives = 732/1509 (48%), Gaps = 235/1509 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E + ++  S ++K R          LE V  + L GN+ S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSKADKSSDSKSR-------KEKLEQVASFSLFGNIMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            A +   GA     RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82   ASVQLVGAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                P+V+VDP+ RC  +LVYG ++++L   +     + DE   G G G  +    S++I
Sbjct: 135  NVHIPVVRVDPENRCAVMLVYGTKLVVLPFRKDT---LTDEQESGVGEGPKSSFLPSYII 191

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS         Y VSL+S   
Sbjct: 252  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305

Query: 352  -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
             +   P   +    + LD   + ++  D  ++S K G++ +LT++ DG R V+     K 
Sbjct: 306  GTTAFPLRVQDEVKLTLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA---PS 463
              SVLT+ + T+     FLGSRLG+SLL+++T     +    G + +  + + D    PS
Sbjct: 366  AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQETPAEEGKERQDKEKDKDKQEPPS 425

Query: 464  TKRLRRSSSD----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
             K+   SS++           L D V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 426  KKKRVESSTNWTVCVILDFFVLSDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 481

Query: 513  GPLKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGC 547
            GP  + S G         + N +        +G  K     ++            ELPGC
Sbjct: 482  GPCANASMGEPAFLSEEFQSNPEPDLEVVVCSGYGKNGALSVLQRSIRPQVVTTFELPGC 541

Query: 548  KGIWTVYHKSSRGHNADSSRMAAY------------DDEYHAYLIISLEARTMVLETADL 595
              +WTV     +    D   +               D + H +LI+S E  TM+L+T   
Sbjct: 542  HDMWTVISSDVKEDKTDKEEVEKEEEEKKTEPPLEDDAKKHGFLILSREDSTMILQTGQE 601

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G    + L F P +      
Sbjct: 602  IMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGLRLLEG---VRQLHFIPVDL----- 652

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDP-----STCTVSVQTPAAIESSKKPVSSC 710
               S ++  S+ADPYV++  ++G + + V         T  +++Q P  I S  + ++ C
Sbjct: 653  --GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDSYMGKTHRLALQKPQ-IPSQSRVITLC 709

Query: 711  -------------------------------TLYHDKG-----PEPWLRKTSTDAWLSTG 734
                                           T+ HD        E  L   S  +  +T 
Sbjct: 710  AYRDVSGMFTTENKVSCSIKEDTIRSQSEAETIIHDMSNTVDDEEEMLYGDSNAS--ATP 767

Query: 735  VGEAIDGADGGPLDQGDI----------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGR 784
              E I+ +   P   G            + ++  E+G +EI+ +P++  VF V  F  G+
Sbjct: 768  AKEDINRSFVAPTTSGSEATSSKAEPTHWCMIIRENGVMEIYQLPDWRLVFLVKNFPVGQ 827

Query: 785  THIVDTYMREALKDSETEINSSSEEGT-GQGRKENIHSMK----VVELAMQRWSAHHSRP 839
              +VD+              SS +  T G+G+KE +        V E+A+     +HS+P
Sbjct: 828  RVLVDS--------------SSGQSATQGEGKKEEVTRQGEIPLVKEVALVSLGNNHSKP 873

Query: 840  FLFAILTDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
            +L  +  +  +L Y+A+ +  + P+N  K                       +RF + P 
Sbjct: 874  YLL-VHVEQELLIYEAFQYDQQQPQNNLK-----------------------VRFKKVPH 909

Query: 898  DAYTRE----------------ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCMVF 939
            +   RE                E   G   +  R   F++ISG+ G F+ G  P W +V 
Sbjct: 910  NINFREKKSKLKKDKKAESSATEESSGVKGRIARFRFFEDISGYSGVFICGPSPHWMLVT 969

Query: 940  -RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKI 998
             R  LR+HP   DGSI +F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+KI
Sbjct: 970  SRGALRLHPMTIDGSIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 1029

Query: 999  PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1058
            PL+ T H ++Y  E  +Y +  SV   +P  +     I +  G + +   +   + +   
Sbjct: 1030 PLRCTVHYVSYHVESKVYAVCTSVK--EPCTR-----IPRMTGEEKEYEVIERDERYIHP 1082

Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAY 1117
              E++ ++++ P        TR  I ++  E+   ++ V L +  T    +  +A GT  
Sbjct: 1083 QQEKFSIQLISPVSWEAIPNTR--IDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCL 1140

Query: 1118 VQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASG 1172
            +QGE+V  RGR+L+         P     +N    +Y KE KG ++AL    G+L+ A G
Sbjct: 1141 MQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIG 1200

Query: 1173 PKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1232
             KI L      +L G+AF D   LY+  +  +KNFIL  D+ KSI  L ++E+   L+L+
Sbjct: 1201 QKIFLWVLKDNDLTGMAFIDTQ-LYIHQMFSIKNFILAADLMKSISLLRYQEESKTLSLV 1259

Query: 1233 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1292
            ++D   L+ ++ EF++D + L  +VSD  KN+ ++ Y P+  ES+ G +LL RA+F+ GA
Sbjct: 1260 SRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGA 1319

Query: 1293 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
            ++  F R+         +  A   D  N+    F TLDG IG + P+ E T+RRL  LQ 
Sbjct: 1320 NINTFWRMPCRGALDASSKKALTWD--NKHITWFATLDGGIGLLLPMQEKTYRRLLMLQN 1377

Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
             L   +PH AGLNP++FR  HS+ ++ +    +I+D ELL+ Y  L + E+ E+A + GT
Sbjct: 1378 ALTTMLPHHAGLNPKAFRMLHSDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGT 1437

Query: 1413 TRSQILSNL 1421
            T+  IL +L
Sbjct: 1438 TQDIILDDL 1446


>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Takifugu rubripes]
          Length = 1444

 Score =  606 bits (1563), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 440/1474 (29%), Positives = 721/1474 (48%), Gaps = 191/1474 (12%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E + ++  S ++K R          LE V  + L GNV S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +   GA+    RD+++L+F+DAK+SV+E+D   H L+  S+H FE    L L+ G   
Sbjct: 82   ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEE---LELRDGFVQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                P+V+VDP+ RC  +L+YG ++++L   +     + DE   G G G  +    +++I
Sbjct: 135  NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDT---LTDEQEVGVGEGPKSSFLPTYII 191

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 252  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 311

Query: 354  ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
               +    + LD + A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312  LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413  SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA-----PSTKRL 467
            + + T+     FLGSRLG+SLL+++T       L  G  ++  + E D      P +K+ 
Sbjct: 372  TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQDMPLEEGKDQQDKEKEKDMDKQEEPPSKKK 431

Query: 468  RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
            R  SS    D V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + S G     
Sbjct: 432  RVESSSNWTDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAFL 487

Query: 522  ---LRINADAS-----ATGISKQSNYELV------------ELPGCKGIWTVY------- 554
                + N +        +G  K     ++            ELPGC  +WTV        
Sbjct: 488  SEEFQSNPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEPVQK 547

Query: 555  --HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
               ++ R     +   A  D + H +LI+S E  TM+L+T   + E+  S  +  QG T+
Sbjct: 548  EQEETEREGKEKTEPPAEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 606

Query: 613  AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
             AGN+   + +IQV   G R+L+G  +TQ L F P +         S ++  S+ADPYV+
Sbjct: 607  FAGNIGDNKYIIQVSPMGIRLLEG--VTQ-LHFIPVDL-------GSPIVHCSLADPYVV 656

Query: 673  LGMSDGSIRLLVGD-----PSTCTVSVQTPAAIESSK-------KPVS---------SCT 711
            +  ++G + + V         T  +++Q P     S+       + VS         SC+
Sbjct: 657  IMTAEGVVTMFVLKIDSYMGKTHRLALQKPQISTQSRVIALCAYRDVSGMFTTENKVSCS 716

Query: 712  LYHDKGPEPWLRKTSTDAWLSTGV---------GEAIDGADGGPLDQGDI---------- 752
            +  D          +    LST +         G++  G     +++             
Sbjct: 717  ITEDISIRSQSEAETIIQDLSTNIVDDEEEMLYGDSNTGPSKEEMNRSSFAGPSEGSYSK 776

Query: 753  -----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
                 + ++  +SG +EI+ +P++  VF V  F  G+  +VD+   ++    E E     
Sbjct: 777  AEPSHWCLITRDSGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSATQGEKE--GKK 834

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF--EGPENTS 865
            EE T QG    +  + +V L       +HSRP+L  +  D  +L Y+A+ +  + P+N  
Sbjct: 835  EEVTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVDQELLIYEAFPYDQQQPQNNL 888

Query: 866  KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PHGAPCQ----- 912
            K                       +RF + P +   RE+           G   +     
Sbjct: 889  K-----------------------VRFKKVPHNINFREKKSKLRKDKKAEGTAAEDSVAA 925

Query: 913  -----RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCN 966
                 R   F++ISG+ G F+ G  P W +V  R  LR+HP   DG I +F+  HN+NC 
Sbjct: 926  RGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMSIDGPIESFSPFHNINCP 985

Query: 967  HGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLK 1026
             GF+Y   QG L+I  LP+  +YD  WPV+KIPL+ T H ++Y  E  +Y +  S+    
Sbjct: 986  KGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVESKVYAVCTSL---- 1041

Query: 1027 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1086
               + L   I +  G + +   +   + +     +++ ++++ P        TR  I ++
Sbjct: 1042 ---KELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLISPVSWEAIPNTR--IDLE 1096

Query: 1087 SSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP---- 1141
              E    ++ V L +  T    +  +A GT  +QGE+V  RGR+L+         P    
Sbjct: 1097 EWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRGRILILDVIEVVPEPGQPL 1156

Query: 1142 -QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1200
             +N    +Y KE KG ++AL    G+L+ A G KI L      +L G+AF D   L++  
Sbjct: 1157 TKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLKDNDLTGMAFIDTQ-LHIHQ 1215

Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
            +  +KNFIL  D+ KS+  L ++E+   L+L+++D   L+ ++ EF++D + L  +VSD 
Sbjct: 1216 MMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDR 1275

Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
             KN+ ++ Y P+  ES+ G +LL RA+F+ GA++  F R  M    +   G+       N
Sbjct: 1276 DKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTFWR--MPCRGALEAGSRKAMTWDN 1333

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
            +    F TLDG +G + P+ E T+RRL  LQ  L   + H AGLNP++FR  H + ++ +
Sbjct: 1334 KHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAGLNPKAFRMLHCDRRSLQ 1393

Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
                +I+D ELL+ Y  L + E+ E+A + GTT+
Sbjct: 1394 NPVKNILDGELLNKYLYLSMMERSELAKKIGTTQ 1427


>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
 gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
          Length = 1420

 Score =  600 bits (1547), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 455/1539 (29%), Positives = 728/1539 (47%), Gaps = 244/1539 (15%)

Query: 3    FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
            +A YK  H PTG+  C +     +R                             NLVV  
Sbjct: 2    YAIYKETHPPTGVEFCVNCHFYSARES---------------------------NLVVAG 34

Query: 63   ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
               + ++ +  Q+EGS  +++ G + +R          LELV  + L GN+ESL  +   
Sbjct: 35   TTEVRVFRLCYQQEGSSSAESGGSSLKR---------KLELVGQHSLFGNIESLHAIRLA 85

Query: 123  GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
            G      RDS++++F+DAK+S++++D   H ++  S+H FE  +   +K    +  R P+
Sbjct: 86   G----NTRDSLLMSFKDAKLSIVDYDPGKHDIKTRSLHFFEDEK---IKSHCLAQDRAPV 138

Query: 183  VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
            V++DP+ RC  +L YG  +++L   Q G      +D+  S       +  S++I+++++D
Sbjct: 139  VRIDPERRCAVMLAYGTHLVVLPFRQEGGIDDTAQDSIISSSD-RPPVLPSYIIDVKEID 197

Query: 243  MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
             K  ++ D  F+HGY EP ++IL+E   TWAGR++ ++ TC + A+S++ + K HP++W 
Sbjct: 198  EKTCNILDIQFLHGYYEPTLLILYEPLKTWAGRLAMRNDTCALVAVSLNMSQKAHPVVWQ 257

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE------ 354
               LP D   ++ VP PIGGVLV   N + Y +QS         Y VS++S  E      
Sbjct: 258  LSCLPFDCIYVMPVPKPIGGVLVCCMNALLYLNQSVP------PYGVSVNSIGENSTVFP 311

Query: 355  -LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              P+   ++ L+ ++A ++ ND  + S K G++ ++T++ DG R V+     KT  SVLT
Sbjct: 312  LKPQKGVTITLEGSNAIFIANDKLVFSLKGGEIYVVTLIADGVRSVRNFVFDKTAASVLT 371

Query: 413  SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
            S +   G+   FLGSRLG+SLLV++T       +  G +     ++ D  + +R +  + 
Sbjct: 372  SCVCECGDGYLFLGSRLGNSLLVKYT--EKPQDIVYGTENNAQSMQCD--NIERWQILNG 427

Query: 473  DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----LRIN--- 525
              L  + + +EL +YG A         +++F V DSL+NIGP      G    L ++   
Sbjct: 428  SLLLIVDDLDELEVYG-AQQEAGVELTSYTFEVCDSLLNIGPCSCMDIGEPAFLSVSSYF 486

Query: 526  ADA--------SATGISKQSNYELV------------ELPGCKGIWTVYHKSSRG----- 560
            ADA        S +G  K     ++            ELPGC  +WTV+ K  +      
Sbjct: 487  ADAQELDLEVVSCSGYGKNGALTVLQRSIRPQVVTTFELPGCTDMWTVFSKDQKKGAQTN 546

Query: 561  --HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
              H   S      +++YH++LI+S E  +M+L+T   + EV +S  +  Q  TI AGN  
Sbjct: 547  AIHRYPSQPCTQGNEKYHSFLILSREDSSMILKTEQEIMEVDQS-GFSTQCATIYAGNFG 605

Query: 619  GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
                ++QV   G R+L+G    Q +       +SG     S ++  S+ DPY +L M+DG
Sbjct: 606  NGSYILQVTPLGVRLLEGVNQLQHIPM-----DSGL----SNIVWCSVCDPYAVLLMADG 656

Query: 679  SIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------------KGPEP 720
            S+ L+  +   S   ++V  P+  +SSK  V +C  Y D                K P P
Sbjct: 657  SVILIEFIKSASGPKLTVSRPSLSQSSK--VCACCTYKDMSGLFTTENSNLEEVSKVPSP 714

Query: 721  WLRKTS----------------------TDAWLSTGVGEAIDGADGGPLD------QGDI 752
                T+                      T   L+    E        P++      Q   
Sbjct: 715  KPEMTAPPRQEKESLTIDEEDELLYGGDTSLTLTFEPPEPSKAESAAPVEVFEEPLQPSY 774

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + +VC E+G +EI+ +P F  VF V  F      IVD+       DS     SS  E   
Sbjct: 775  WCLVCRENGVMEIYSLPGFTRVFFVKNFSKAPRVIVDS------GDSGASTQSSVSEE-- 826

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                    S+ V E+ +      + R  L A++ D  +L Y+A+ +   E          
Sbjct: 827  -------ESLNVREVLLTGLGYKNRRATLVAVM-DQDLLIYEAFSYPTVEGH-------- 870

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-------------CQRITIFKN 919
                            NLRF +   +   RE+ P   P                + +F +
Sbjct: 871  ---------------LNLRFKKLQHNIQIREKKPKQEPKNDSETKSGLDPKVAMLRVFND 915

Query: 920  ISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
            IS + G F+ GS P W  V  R     HP   DG +  F   HNVNC  GF+Y  ++G L
Sbjct: 916  ISSYSGIFVCGSYPFWIFVTNRGAFHWHPMSIDGPVTCFAAFHNVNCPKGFLYFNTRGEL 975

Query: 979  KICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI-D 1037
            +I  LP+  +YD+ WPV+K+PL+ TPH ++Y  E   Y ++ S    +P  ++  +   D
Sbjct: 976  RISVLPTHLSYDSPWPVRKVPLRYTPHMVSYNRESKTYAIVTSEQ--EPCKKIPRVTAED 1033

Query: 1038 QEVGHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            +E    I        D    Y + E + ++++ P      W+    IP    +      V
Sbjct: 1034 KEFVDTIR-------DARFIYPSTERFVLQLISPIS----WEV---IPNTRHDLDEWEHV 1079

Query: 1097 VTLFNTTTKENET------LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1147
             T+ N      ET       + +GT  + GE++A RGR+L+F        P   +T+   
Sbjct: 1080 TTMKNLLLHSEETHTGRKGFICVGTTQLYGEEIAVRGRILIFDIIEVVPEPGQPLTKNKF 1139

Query: 1148 --VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1205
              +Y KE KG ++AL  + G+L+   G KI +  +T  +L G+AF D   LY+ SL  ++
Sbjct: 1140 KLLYEKEQKGPVTALNQVNGYLVSGIGQKIYIWNFTDNDLVGMAFIDTQ-LYIHSLVTIR 1198

Query: 1206 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
            NF++  D+ KSI  L  +E+   L  ++KD  +L+ +A +F IDG  +  +VSD +KN+ 
Sbjct: 1199 NFVIAADVCKSITLLRLQEETKTLAFVSKDPKNLEVYAADFFIDGPQIGFLVSDVEKNLV 1258

Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
            +F Y P+  ES  GQ+LL RA+ +VG H+T F R+   A       A+    K  R    
Sbjct: 1259 LFTYQPEAIESQGGQRLLQRADINVGTHITSFFRIAAKA----HLKASGEKSKEMRQLTC 1314

Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
            FGTLDG++G + P+ E TFRRL  LQ KLVD +PHVAGLNP++FR      +       +
Sbjct: 1315 FGTLDGALGLMLPMTEKTFRRLHMLQTKLVDCIPHVAGLNPKAFRMLQWRKRKLCNPHRN 1374

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            ++D +LL  Y  L   E+ E+A + GTT +QI+ ++ D+
Sbjct: 1375 VLDWQLLFKYMHLSFMERQEVARKIGTTPAQIMDDMMDI 1413


>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
            [Crassostrea gigas]
          Length = 1369

 Score =  592 bits (1526), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 439/1431 (30%), Positives = 700/1431 (48%), Gaps = 180/1431 (12%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            +E +  + L GN+ S+  +   GA     RDS++L+F +AK+SV+E+D   H L+ TS+H
Sbjct: 5    MECLATFTLFGNIMSMKYVKLPGA----LRDSLLLSFSEAKLSVVEYDPGTHDLQTTSLH 60

Query: 161  CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
             FE P    +K G  +    P V+VDP GRC  +LVYG  M+IL   +      GD    
Sbjct: 61   FFEEPS---MKGGFFTNYCIPEVRVDPDGRCAAMLVYGTHMVILPFRRDVMVEEGD---- 113

Query: 221  GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
               G   + I SS++I+LR+ D K  +VKDF F+HGY EP + IL E   TWAGR + + 
Sbjct: 114  NLAGTSKSPILSSYIIDLRNFDEKIINVKDFQFLHGYYEPTVFILFEPLQTWAGRTAVRA 173

Query: 279  HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
             TC I A+S++   K HP+IWS  +LP D  ++LAVP PIGGV+++  N++ Y +QS   
Sbjct: 174  DTCSIVAISLNLQEKVHPVIWSLGSLPFDCCQVLAVPRPIGGVIIIAVNSLLYLNQSVP- 232

Query: 339  ALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                  Y VSL+S    S   P   +    + LD   A ++  D  +LS K G+L +LT+
Sbjct: 233  -----PYGVSLNSISAQSTLFPLRVQEGVRIALDCCQAAFMSYDKIVLSLKGGELYVLTL 287

Query: 392  VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
            V DG R V+  +  K+  SVLTS +    +   FLGSRLG+SLL+++T  +   + +  L
Sbjct: 288  VVDGMRSVRSFNFDKSAASVLTSCMCICEDGFLFLGSRLGNSLLLKYTEKASECLENGDL 347

Query: 451  KEEFGDIEADAPSTKRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVR 506
             ++    + D P+ K+ +   S  +   V    N  +L +YGSA N T +   +++F V 
Sbjct: 348  DKK----KEDEPAAKKKKVEGSTEIASDVSQIENLYDLEVYGSAENPTSTTITSYTFEVC 403

Query: 507  DSLVNIGPL------------KDFSYGLRINADASAT-GISKQSNYELV----------- 542
            D++ NIGP             ++FS     + +   T G  K     ++           
Sbjct: 404  DNIWNIGPCGNIVMGEPAFLSEEFSSCEDPDIEMVMTSGYGKNGALSVLQRSIRPQVVTT 463

Query: 543  -ELPGCKGIWTVYH--KSSRGHNADSSRMAAYDDEY---HAYLIISLEARTMVLETADLL 596
             ELPGC  +WTV       +  + ++S     DD     H++LI+S    +M+LET   +
Sbjct: 464  FELPGCLDMWTVKSLVPKEKSEDKENSMEDDSDDNIEGGHSFLILSRSDSSMILETGQEM 523

Query: 597  TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
             E+  S  +  Q  TI AGN+ G R ++QV +   R+L+G    Q +       ++GS  
Sbjct: 524  NELDHS-GFSTQTTTIFAGNIGGDRYIVQVSDTSLRLLEGVRQIQHIPL-----DTGS-- 575

Query: 657  ENSTVLSVSIADPYVLLGMSDGSI--------------RLLVGDPSTCTVSVQTPAAIES 702
                V+  S+ADPY++L   +G I              RL+VG PS   +S  +   + S
Sbjct: 576  ---PVVQCSLADPYIVLLTQEGQILMFTLRTESVGLGVRLVVGKPS---ISQHSKVEVIS 629

Query: 703  SKKPVSSCTLYHDK------GPEPWLRKTSTDAWLSTGV-----------GEAIDGADGG 745
            + K VS      ++       P+    KT T+   S              GE        
Sbjct: 630  AYKDVSGLFTCMNQMEDVQVTPDTKATKTVTERSFSIDAKTADEEDELLYGETESNVFNS 689

Query: 746  PLDQGDI-------------------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTH 786
              + G                     + ++C E+G LEI+ +P++  V+ V  F  G+  
Sbjct: 690  SFNMGQTAEMESPTKEKKQTEAKPTYWLLLCRENGVLEIYSIPDYKKVYYVKNFPMGQKL 749

Query: 787  IVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILT 846
            +VD+           ++      G  Q  K N     + EL M       SRP L A + 
Sbjct: 750  LVDS----------VQVTDKLSSGERQ-EKVNAECPALKELLMVGLGYKDSRPHLLARVE 798

Query: 847  DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
            D        Y++E       S D     R   + +    R +  +  +   + + +EE  
Sbjct: 799  D------DLYIYEAFSYPQSSIDNHLKLRFKKIQHDLILREKRSKSKKKDPEEFQKEEKK 852

Query: 907  HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
             G    ++  FK+++G+ G F+ G+ P W  V  R  LR+HP   DG +  F+  HN+NC
Sbjct: 853  VG----KMRYFKDVAGYSGVFVCGAYPHWIFVTSRGSLRIHPMGIDGPVWCFSEFHNINC 908

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
             HGF+Y    G L+I  LP+  TYD  WPV+K+PL+ TPH + Y  E  +Y ++ S P +
Sbjct: 909  PHGFLYFNKMGELRISVLPTHLTYDAPWPVRKVPLRCTPHFVAYHFENKIYAVVTSTPEI 968

Query: 1026 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQT--RAT 1082
               N++     +      I+       D    Y T+  + +++  P      W+      
Sbjct: 969  --CNKLPKTTTEDREWDTIEK------DERFIYPTIPRFTLQLYSPTS----WEVVPNTK 1016

Query: 1083 IPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1141
            I  +  E+ ++++ + L +  T    ++ + +GT    GE+V +RGRV++         P
Sbjct: 1017 IECEEWEHVVSMKTIRLRSEETLSGFKSYIVMGTNLSLGEEVTSRGRVIIADIIEVVPEP 1076

Query: 1142 -----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
                 ++ +  +Y KE KG ++ALA + G L+ A G K+ + +    +L G+AF D   +
Sbjct: 1077 GMPLTKHKIKTLYEKEQKGPVTALADINGLLITAIGQKLYIWQLKDNDLMGVAFIDT-HI 1135

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
            Y+ +L  +K+ IL GDI KS+    ++E+   L+++++D   L+ +  +FLID + L  +
Sbjct: 1136 YIHTLVTIKHIILAGDILKSVSVYQYQEEHKVLSIVSRDPRPLEVYTADFLIDNTQLCCL 1195

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSD-RTGAA 1313
            VSD  KN+ ++ Y P+  ES  GQ+L+ +A+F+ G++V+   R+  ++   SSD R   A
Sbjct: 1196 VSDRMKNLVVYSYQPEARESHGGQRLIRKADFNAGSNVSSMFRVRCKLYDPSSDKRMTGA 1255

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
            P      R    F TLDGS+G + PL E  +RRL  LQ  LV  +PHVAGLNPRS+R   
Sbjct: 1256 P----EKRHITYFATLDGSLGFVLPLSEKVYRRLFMLQNALVTHIPHVAGLNPRSYRHVI 1311

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                  R    +I+D ELL  Y  L + E++EIA + GT+  QI+ +L ++
Sbjct: 1312 GTFPELRNPQKNILDGELLWKYTNLSIMEKIEIAKRLGTSNDQIMDDLMEI 1362


>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1 [Camponotus
            floridanus]
          Length = 1418

 Score =  578 bits (1490), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 436/1478 (29%), Positives = 717/1478 (48%), Gaps = 197/1478 (13%)

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            LVV  ANVI ++ +    + ++  K +     ++         LE +  Y LHGN+ S+ 
Sbjct: 30   LVVAGANVIRVFRLIPDIDMTRREKYTENRPPKM--------KLECLAQYTLHGNIMSMQ 81

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
             +   G+    +RDS++L+F DAK+SV+E+D  IH LR  S+H FE  E   +K G  + 
Sbjct: 82   AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
               P+V+VDP+GRC  +L+YG ++++L   +  S  + D D   S    S+    I SS+
Sbjct: 135  HHIPIVRVDPEGRCAIMLIYGRKLVVLPFRKDPS--LDDGDLLDSAKLTSSNKTPILSSY 192

Query: 235  VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
            +I L+ L+  M +V D  F++GY EP ++IL+E   T++GR++ +  TC + A+S++   
Sbjct: 193  MIVLKTLEEKMDNVIDLQFLYGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252

Query: 293  KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
            + HP+IWS  NLP D Y+++ V  P+GG L++  N++ Y +QS     ++LN+ A +  +
Sbjct: 253  RVHPIIWSVSNLPFDCYQVVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTN 312

Query: 352  SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
                P+    + L+ +   ++  D  ++S K+G+L +L++  D  R V+     K   SV
Sbjct: 313  FPLKPQEGVKMSLEGSQVAFISGDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372

Query: 411  LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRR 469
            LTS +    ++  FLGSRLG+SLL++FT     ++ +    E    + E++    K+ ++
Sbjct: 373  LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENESEETPAKKAKQ 432

Query: 470  S------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
                   +SD L D+ + EEL +YGS ++ T     ++ F V DSL+NIGP  + S G  
Sbjct: 433  DFLGDWMASDVL-DIKDPEELEVYGSETH-TSIQITSYIFEVCDSLLNIGPCGNISMGEP 490

Query: 522  ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
                     N D      + +G  K     ++            ELPGC+ +WTV     
Sbjct: 491  AFLSEEFLHNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI---- 546

Query: 559  RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
             G   +  ++ A  +  HA+LI+S E  TM+L+T   + EV +S  +  QG T+ AGNL 
Sbjct: 547  -GTLNNDEQVKAEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFAGNLG 604

Query: 619  GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
              R ++QV + G R+L G    Q +                 ++  S ADPYV L   DG
Sbjct: 605  ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLSEDG 654

Query: 679  SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP--EPWLRKTSTDAWLST--G 734
             + LL       T  +    A    +  + +   Y D        L +T+ D  +     
Sbjct: 655  QVVLLTLREVRGTARLHAQPANLLFRPQIEALCTYRDVSGIFTTQLSETTDDEQVEEEHN 714

Query: 735  VGEA-----IDGADG-----------------GPLD----------------QGDIYSVV 756
            V E      ID  D                   PLD                +   + +V
Sbjct: 715  VEEPSLLSNIDNEDDLLYGDAPAFQMPAPSYQKPLDGVSKKAPWWQRHLQEIKPTYWLLV 774

Query: 757  CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
              +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E        
Sbjct: 775  YRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQSAPVNEIPNPE-------- 826

Query: 817  ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
                 M+V E+ M     H +RP L   L D  +  YQAY +  P+   K          
Sbjct: 827  -----MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGYLK---------- 868

Query: 877  LSVSNVSASRLRNLRFSRTP--LDAYTREE-TPHGAPCQRITI---FKNISGHQGFFLSG 930
                     R + L     P  L    +EE  P  A   RI +   F NI+G+ G F+  
Sbjct: 869  --------LRFKKLEHGIIPGRLSPKPKEEDMPMNASETRICMMRYFSNIAGYNGVFICC 920

Query: 931  SRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
              P W  +  R  LR HP   DG I +F   +NVNC  GF+Y   +  L+IC LP+  +Y
Sbjct: 921  DYPHWIFLTGRGELRTHPMGIDGPITSFAAFNNVNCPQGFLYFNRKEELRICVLPTHLSY 980

Query: 990  DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
            D  WPV+K+PL+ TPH +TY  E   Y +I S+   +PL                  +  
Sbjct: 981  DAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YRF 1023

Query: 1050 SSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVTL 1099
            +  D  + +T EE   R L P +        +   W+T     I ++  E+   ++ V+L
Sbjct: 1024 NGED--KEFTEEERPERFLYPSQEQFSIVLFSPVSWETIPNTKIELEQWEHVTCLKNVSL 1081

Query: 1100 FNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKEL 1153
                T+   +  + +GT Y  GED+ +RGR+L+F        P     +N   ++Y+KE 
Sbjct: 1082 AYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQ 1141

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
            KG I+A+  + G L+ A G KI + +    +L G+AF D   +Y+  +  +K+ IL+ D+
Sbjct: 1142 KGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIADV 1200

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
            +KSI  L ++E+   L+L+++DF   + +  E+LID + L   ++D + N+ +F Y P+ 
Sbjct: 1201 YKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFFLADGESNLALFMYQPES 1260

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             ES  GQKL+ +A+FH+G  V  F R++  ++  ++      G+DK  R   ++ TLDGS
Sbjct: 1261 RESLGGQKLIRKADFHLGQKVNTFFRIRCRVSDPANDKKQFSGADK--RHVTMYATLDGS 1318

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCEL 1391
            +G I P+ E T+RRL  LQ  LV  + H+AGLNP+S+RQ + +   ++  P   I+D +L
Sbjct: 1319 LGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKSYRQTYKSYIRNQGNPARGIIDGDL 1378

Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            +  Y  LP  E+ ++A + GT   +I+ ++ ++   T+
Sbjct: 1379 VWRYLFLPNNEKTDVAKKIGTRVQEIIEDITEIDRQTA 1416


>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Bombus terrestris]
          Length = 1417

 Score =  578 bits (1489), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 428/1478 (28%), Positives = 709/1478 (47%), Gaps = 198/1478 (13%)

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            L V  AN+I I+ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30   LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
             ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82   AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
               P+V+VDP+GRC  +L+YG ++++L   +  S  + D D   +    S +  I SS++
Sbjct: 135  HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192

Query: 236  INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
            I L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   +
Sbjct: 193  IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 294  QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  
Sbjct: 253  VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306

Query: 354  EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
            E        P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K
Sbjct: 307  ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366

Query: 406  TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE-ADAPST 464
               SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   E   +    +    
Sbjct: 367  AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426

Query: 465  KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
            K++++       +SD L D+ + EEL +YGS      S Q T + F V DSL+NIGP  +
Sbjct: 427  KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485

Query: 518  FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
             S G         +  S+  + ELV                          ELPGC+ +W
Sbjct: 486  ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544

Query: 552  TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
            TV      G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG T
Sbjct: 545  TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598

Query: 612  IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
            I AGNL   R ++QV + G R+L G    Q +                 ++  S ADPYV
Sbjct: 599  IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648

Query: 672  LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
             L   DG + LL       T  +   AA    +  + +   Y                D+
Sbjct: 649  TLLSEDGQVMLLTLREGRGTAKLHAQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708

Query: 717  GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
             PE                           +   T +    S G+ +          +  
Sbjct: 709  APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGISKRTPWWQKHLQEIK 768

Query: 751  DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
              Y ++ Y +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E 
Sbjct: 769  PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                        M+V E+ M     H +RP L   L D  +  YQAY +  P+   K   
Sbjct: 828  ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL-- 870

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
                 + L    +       LR    P+   TR        C  +  F NI+G+ G F+ 
Sbjct: 871  ---RFKKLDHGIIPGQLKPKLRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919

Query: 930  GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
               P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 920  SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979

Query: 989  YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
            YD  WPV+K+PL+ TPH +TY  E   Y +I S+   +PL                  + 
Sbjct: 980  YDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YR 1022

Query: 1049 LSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVT 1098
             +  D  + +T EE   R + P +        +   W+T     I +   E+   ++ V+
Sbjct: 1023 FNGED--KEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVS 1080

Query: 1099 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1152
            L    T+   +  + +GT Y  GED+ +RGR+L+F        P     +N   ++Y+KE
Sbjct: 1081 LAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKE 1140

Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
             KG I+A+  + G L+ A G KI + +    +L G+AF D   +Y+  +  +K+ IL+ D
Sbjct: 1141 QKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIAD 1199

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            ++KSI  L ++E+   L+L+++DF   + +  E+LID + L  +V+D + N+ +F Y P+
Sbjct: 1200 VYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNMALFMYQPE 1259

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDG 1331
              ES  GQKL+ +A+FH+G  V  F R++  L+  ++      G+DK  R   ++ +LDG
Sbjct: 1260 SRESLGGQKLIRKADFHLGQKVNTFFRIRCRLSDPANDKKHFSGADK--RHVTMYASLDG 1317

Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
            S+G I P+ E T+RRL  LQ  LV  + H+AGLNP+++R + S+ +        I+D +L
Sbjct: 1318 SLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDL 1377

Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            +  Y  LP  E++++A + GT   +I+ +L ++   T+
Sbjct: 1378 VWRYFYLPNNEKIDVAKKIGTRVQEIIEDLTEIDRQTA 1415


>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
            cpsf [Tribolium castaneum]
          Length = 1413

 Score =  575 bits (1483), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 431/1477 (29%), Positives = 708/1477 (47%), Gaps = 208/1477 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV + ANVI+++ +    +         ET           + LE V  Y L GN+ S+
Sbjct: 29   SLVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              ++   +     RD+++LAF+DAK+SV+E+D   H L+  S+H FE  +   +K G   
Sbjct: 82   QSVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTH 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESS 233
                P+V+ DP+ RC  + V+G ++++L   +  +    D D     G   G  A I +S
Sbjct: 135  HYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILAS 194

Query: 234  HVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++I L+D    + ++ D  F+HGY EP ++IL E   T+AGRV+ +  TC ++A+S++  
Sbjct: 195  YMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQ 254

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             K HP+IWS  NLP D  K + +  P+GG L+   N + Y +QS      +  Y VSL+S
Sbjct: 255  QKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNS 308

Query: 352  SQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
              E        P+    + LD A AT+L++D  +LS K G+L +LT++ D  R V+    
Sbjct: 309  IAENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHF 368

Query: 404  SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADA 461
             K   SVLT+ I+   N+  FLGSRLG+SLL++FT  C    ++            E   
Sbjct: 369  EKAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIE 417

Query: 462  PSTKRLRRSSS----------DALQDMV--------NGEELSLYGSASNNTESAQKTFSF 503
            PS KRL+ S+S          D L D +        + EEL +YG+    +     ++ F
Sbjct: 418  PSAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVF 476

Query: 504  AVRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYEL 541
             V DSL+NIGP             ++FS  L ++ +   T           + K    ++
Sbjct: 477  EVCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQI 536

Query: 542  VE---LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
            V    LPGC  +WTV+    +                HA+LI+S E  TM+L+T D + E
Sbjct: 537  VTTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINE 580

Query: 599  VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
            + ++  +     T+ AGNL   + ++QV     R+L G    Q +       E GS    
Sbjct: 581  I-DNTGFATHIPTVYAGNLGNLKYIVQVTSSAVRLLQGINQLQHIPL-----ELGS---- 630

Query: 659  STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG- 717
              ++ V+  DPY+ L  +DG +  L+   +     +    +  S+  PV++  +Y D   
Sbjct: 631  -PIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSG 689

Query: 718  -------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS-- 754
                         PE ++ ++ T   +     + + G D          P  +  +Y   
Sbjct: 690  LFTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNW 748

Query: 755  --------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
                           V  E+  LEI+ +P+F   + +     G   +VD    E++  S 
Sbjct: 749  WKKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISA 806

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
            +   S++ E   Q R+ ++  + VV L       H SRP L   L +  +  Y+ + F  
Sbjct: 807  STPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF-- 857

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
            P    K          +   NVS       R      D +  +E        ++  F NI
Sbjct: 858  PRGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNI 906

Query: 921  SGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            +G+ G F+ G+ P W  M  R  LR HP   DG +++F   +NVNC  GF+Y   +  L+
Sbjct: 907  AGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELR 966

Query: 980  ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
            I  LP+  +YD  WPV+K+PL+ TPH +TY  E   Y L+ S+   +P N+         
Sbjct: 967  IGVLPTHLSYDAAWPVRKVPLRCTPHFVTYHLESKTYCLVTSIA--EPSNKYYKF----- 1019

Query: 1040 VGHQIDNHNLSSVDLHRTYTV---EEYEVRILEPDRAGGPWQT--RATIPMQSSENALTV 1094
                 ++  LS  D    +     E++ + +  P      W       I +   E+   +
Sbjct: 1020 ---NGEDKELSVEDRGDRFPYPLQEKFSLMLFSP----VSWDVIPNTKIDLDEWEHVNCL 1072

Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEV 1148
            + V+L    T+   +  +A+GT Y  GEDV +RGR+L+F        P     +N   E+
Sbjct: 1073 KNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFDIIEVVPEPGQPLTKNRFKEI 1132

Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
            Y+K+ KG ++AL+ ++G L+ A G KI + +    +L G+AF D   +Y   +  +K+ +
Sbjct: 1133 YAKDQKGPVTALSQVKGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYTHQILTIKSLL 1191

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            L+ D++KSI  L ++E+   L+L+++DF   + F+ E++ID +T+  +VSD +KN+ ++ 
Sbjct: 1192 LVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMIDNTTMGFLVSDSEKNLVLYM 1251

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFG 1327
            Y P+  ES  GQ+LL +A+FH+G  V  F R++  L    +      G+DK  R   ++ 
Sbjct: 1252 YQPESRESLGGQRLLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADK--RHITMYA 1309

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            TLDG +G I P+ E T+RRL  LQ  LV    H+AGLNP++FR + S  K       S++
Sbjct: 1310 TLDGGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTNPARSVI 1369

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            D EL+ +Y  L + E+LE++ + GT   ++L +L+D+
Sbjct: 1370 DGELVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDI 1406


>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Bombus impatiens]
          Length = 1417

 Score =  575 bits (1481), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 427/1478 (28%), Positives = 709/1478 (47%), Gaps = 198/1478 (13%)

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            L V  AN+I I+ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30   LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
             ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82   AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
               P+V+VDP+GRC  +L+YG ++++L   +  S  + D D   +    S +  I SS++
Sbjct: 135  HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192

Query: 236  INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
            I L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   +
Sbjct: 193  IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 294  QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  
Sbjct: 253  VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306

Query: 354  EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
            E        P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K
Sbjct: 307  ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366

Query: 406  TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPST 464
               SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   +    + E +    
Sbjct: 367  AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426

Query: 465  KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
            K++++       +SD L D+ + EEL +YGS      S Q T + F V DSL+NIGP  +
Sbjct: 427  KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485

Query: 518  FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
             S G         +  S+  + ELV                          ELPGC+ +W
Sbjct: 486  ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544

Query: 552  TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
            TV      G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG T
Sbjct: 545  TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598

Query: 612  IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
            I AGNL   R ++QV + G R+L G    Q +                 ++  S ADPYV
Sbjct: 599  IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648

Query: 672  LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
             L   DG + LL       T  +   AA    +  + +   Y                D+
Sbjct: 649  TLLSEDGQVMLLTLREGRGTAKLHVQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708

Query: 717  GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
             PE                           +   T +    S GV +          +  
Sbjct: 709  APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGVSKRTPWWQKHLQEIK 768

Query: 751  DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
              Y ++ Y +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E 
Sbjct: 769  PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                        M+V E+ M     H +RP L   L D  +  YQAY +  P+   K   
Sbjct: 828  ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLK--- 869

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
                 + L    +        R    P+   TR        C  +  F NI+G+ G F+ 
Sbjct: 870  --LRFKKLDHGIIPGQLRPKPRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919

Query: 930  GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
               P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 920  SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979

Query: 989  YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
            YD  WPV+K+PL+ TPH +TY  E   Y +I S+   +PL                  + 
Sbjct: 980  YDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YR 1022

Query: 1049 LSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVT 1098
             +  D  + +T EE   R + P +        +   W+T     I +   E+   ++ V+
Sbjct: 1023 FNGED--KEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVS 1080

Query: 1099 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1152
            L    T+   +  + +GT Y  GED+ +RGR+L+F        P     +N   ++Y+KE
Sbjct: 1081 LAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKE 1140

Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
             KG I+A+  + G L+ A G KI + +    +L G+AF D   +Y+  +  +K+ IL+ D
Sbjct: 1141 QKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIAD 1199

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            ++KSI  L ++E+   L+L+++DF   + +  E+LID + L  +V+D + N+ +F Y P+
Sbjct: 1200 VYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNMALFMYQPE 1259

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDG 1331
              ES  GQKL+ +A+FH+G  V  F R++  ++  ++      G+DK  R   ++ +LDG
Sbjct: 1260 SRESLGGQKLIRKADFHLGQKVNTFFRIKCRVSDPANDKKHFSGADK--RHVTMYASLDG 1317

Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
            S+G I P+ E T+RRL  LQ  LV  + H+AGLNP+++R + S+ +        I+D +L
Sbjct: 1318 SLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDL 1377

Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            +  Y  LP  E++++A + GT   +I+ +L ++   T+
Sbjct: 1378 VWRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDRQTA 1415


>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
            putative [Pediculus humanus corporis]
 gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
            putative [Pediculus humanus corporis]
          Length = 1409

 Score =  568 bits (1463), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 429/1457 (29%), Positives = 701/1457 (48%), Gaps = 178/1457 (12%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LVV   N++ ++ +    +    +K    T+RR          LE +  + L  NV S+
Sbjct: 29   SLVVAGKNILRVFQLIPDID---PTKRDAYTERRP-----PKMKLECLSSFSLFANVMSM 80

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  G+     RD+++L+F +AK+ V+E+D   H LR  S+H FE  +   +K G  +
Sbjct: 81   QAVSLAGSS----RDALLLSFREAKLCVVEYDPDSHDLRTLSLHYFEEED---MKGGWTN 133

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF------GSGGGFSARI 230
                P V+VDP+GRC  +LVYG +++IL   +     + D D         S     A +
Sbjct: 134  HYDIPYVRVDPEGRCAAMLVYGRKLVILPFRRESK--LDDPDIALLDPHSSSVATAKAPV 191

Query: 231  ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             SS+ I LR++D  +++V D  F++GY EP ++IL+E   T+AGR++ +  TC + A+S+
Sbjct: 192  LSSYTITLREIDEKLENVIDIQFLYGYYEPTLLILYEPLKTFAGRIAVRSDTCAMIAVSL 251

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAV 347
            +   + HP IWS  NLP +  + + VP P+GG L+   N + Y +QS     +++N+ A 
Sbjct: 252  NIQQRVHPAIWSVGNLPFNCTQAIPVPKPLGGTLIFSVNALIYLNQSIPPFGVSVNSIAE 311

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
            +  + Q   +    + L+ + AT++ +D  +LS KTG+L +L+++ D  R V+     K 
Sbjct: 312  NSTNFQLKIQEGVKITLEGSQATFISHDRLVLSLKTGELYVLSLLADNIRSVRGFHFDKA 371

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
              SVLT+ +    +   FLGSRLG+SLL++FT    +      L E     E   PS +R
Sbjct: 372  AASVLTTCLCVCEDKYLFLGSRLGNSLLLRFTEKESSEAPIITLDESIR--EVPVPSKRR 429

Query: 467  LRRSSSDALQ----DMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
             + +  D +     D+ + +EL +YG+   ++     +F F V DSL+NIGP  + S G 
Sbjct: 430  RQDALGDWMASDVADIRDLDELEVYGTQEASSSVQITSFMFEVCDSLLNIGPCGNVSMGE 489

Query: 523  RINADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHK 556
                    +  ++  + ELV                          ELPGC  +WTV   
Sbjct: 490  PAFLSEEFSN-NRDPDLELVTTSGHGKNGAICVLQRTIRPQVVTTFELPGCLDMWTVI-- 546

Query: 557  SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
               G  +DS    A DD  HA+LI+S +  TM+L+T   + EV  S  +  QG TI AGN
Sbjct: 547  ---GPQSDSGPTQAEDDISHAFLILSQKDSTMILQTGQEINEVDHS-GFNTQGPTIFAGN 602

Query: 617  LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
            L   + ++QV + G R+L G    Q +               S+V+  S ADPYV L   
Sbjct: 603  LASNKYIVQVSKAGVRLLRGLEQIQHIPL----------DLGSSVVHASTADPYVALLTE 652

Query: 677  DGSI------------RLLVGDPST---------CTVSVQTPAAIESSKKPVSSCTLYHD 715
            DG +            RL V  P+          CT    +     ++++ + + T   D
Sbjct: 653  DGQVVLLTLRESRGQGRLSVFKPTIPTNPRVSKICTYRDVSGLFTLTTEEELQNATFKSD 712

Query: 716  KGPEPWLRKTS--TDAWLSTG---------VGEAIDGADGGPLDQGDIYS---------V 755
                  ++K +   D  L  G         +    + +   P  +   YS          
Sbjct: 713  SKN---MKKEADDEDEMLYGGSEVKFQLLPITNTNEPSPPRPFVRWKKYSQEIKPNYWMF 769

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
            V  E+G L+I+ +P+F   F + +   G   + D             ++ +   G     
Sbjct: 770  VLRETGTLDIYSLPDFRPSFQIRRIGQGHRVLYDV------------LDMAQTSGMDGSD 817

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTS 874
               IH + VV L       H  R  +  + T+  ++ YQA+ F +GP    +        
Sbjct: 818  DPEIHELLVVSLG------HLGRRPILLLRTENDLMIYQAFKFAKGPNLKIRF------- 864

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
            R L  + +   R    +        Y  E     A   R+  F NISG+ G F+ G  P 
Sbjct: 865  RRLPQTLILKERKAKFKVK------YENEVESERA--TRLRYFSNISGYNGVFVCGPNPH 916

Query: 935  WC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
            W  +  R  LR HP L DG + +F   HNVNC  GF+Y TS+  L+IC LP+  +YD  W
Sbjct: 917  WLFLTARGELRSHPMLIDGRVTSFASFHNVNCPLGFLYFTSKCELRICILPTHLSYDAPW 976

Query: 994  PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
            PV+K+PL+ TPH +TY  E   Y LI S    +P N+       ++  H +++ +    D
Sbjct: 977  PVRKVPLRCTPHMVTYHLESKTYCLITSSS--EPSNEYFR-FNGEDKEHSVEDRD----D 1029

Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS--ENALTVRVVTL-FNTTTKENETL 1110
                   +++ + +  P      W+      M+    E+   V+ V L +  T    +  
Sbjct: 1030 RFPLPLQDKFSIVLFSP----VSWEVIPNTKMELDEWEHVTCVKTVNLSYEGTRSGLKGY 1085

Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQG 1165
            +A+GT Y   ED+ ++GR+L++        P     +N    VY+KE KG ++AL  + G
Sbjct: 1086 VAVGTNYNYSEDITSKGRILIYDIIEVVPEPGQPLTKNRFKTVYAKEQKGPVTALCHVLG 1145

Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
             L+ A G KI + +    +L GIAF D   +Y+  +  VK+ IL+ D++KSI  L ++E+
Sbjct: 1146 FLVTAMGQKIYIWQLKDNDLVGIAFIDT-QIYIHQMISVKSLILVADVYKSISLLRFQEE 1204

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
               L+L+++DF   + +A E L+D + +  ++SD + NI ++ Y P+  +S  GQKLL +
Sbjct: 1205 YRTLSLVSRDFRPCEVYAIELLLDNTQMGFLISDVEMNIIMYMYKPEDRDSVGGQKLLRK 1264

Query: 1286 AEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
            A+FH+G H+  + R++  L   ++      G++K  R   +F TLDG++G + P+ E T+
Sbjct: 1265 ADFHLGQHINSWFRIRCRLGDQAENYDFPIGAEK--RHISMFATLDGALGYLLPIPEKTY 1322

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
            RRLQ LQ  LV  +PH+AGLNP++FR + S  K        IVD EL+  Y  L + E+ 
Sbjct: 1323 RRLQMLQNILVYHIPHLAGLNPKAFRIYKSGRKLLGNPCKRIVDGELIWMYLSLTVMEKQ 1382

Query: 1405 EIAHQTGTTRSQILSNL 1421
            ++A + G+    I+ ++
Sbjct: 1383 DVAKKMGSKMDDIIEDI 1399


>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Megachile rotundata]
          Length = 1415

 Score =  567 bits (1462), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 429/1468 (29%), Positives = 705/1468 (48%), Gaps = 180/1468 (12%)

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            LVV   N+I ++ +    + +K  K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30   LVVAGGNIIRVFRLIPDVDITKREKYTESRPPKM--------KLECLAQYTLHGNVMSMQ 81

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
             ++  G+    +RDS++L+F DAK+SV+E+D  IH LR  S+H FE  E   ++ G  + 
Sbjct: 82   AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
               P+V+VDP+GRC  +L+YG ++++L   +  S   GD             I SS++I 
Sbjct: 135  HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194

Query: 238  LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
            L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + H
Sbjct: 195  LKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254

Query: 296  PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
            P+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  E 
Sbjct: 255  PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308

Query: 356  -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
                   P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K  
Sbjct: 309  STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG-SGTSMLSSGLKEEFGDIEADAPSTKR 466
             SVLTS +    ++  FLGSRLG+SLL++F    S  S   +  +    + E +    K+
Sbjct: 369  ASVLTSCVCMCDDNYLFLGSRLGNSLLLRFIEKESENSQNMNENEITIEENETEETPAKK 428

Query: 467  LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
            +++       +SD L D+ + EEL +YGS ++ T     ++ F V DSL+NIGP  + S 
Sbjct: 429  VKQDFIGDWMASDVL-DIKDPEELEVYGSETH-TSIQITSYIFEVCDSLLNIGPCGNISM 486

Query: 521  GLRINADASATGISKQSNYELV--------------------------ELPGCKGIWTVY 554
            G         +  S+  + ELV                          ELPGC+ +WTV 
Sbjct: 487  GEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI 545

Query: 555  HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAA 614
                 G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG T+ A
Sbjct: 546  -----GALNNDEQVRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFA 599

Query: 615  GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
            GNL   R ++QV + G R+L G    Q +                 ++  S ADPYV L 
Sbjct: 600  GNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVSLL 649

Query: 675  MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDKGPE 719
              DG + LL       T  +    A    +  + +   Y                D+ PE
Sbjct: 650  SEDGQVMLLTLREGRGTAKLHAQTANLLFRPQIEALCAYRDVSGIFTTQLPENVEDEVPE 709

Query: 720  --------PWLRKTSTDAWLSTGVGEAID--------GADG----GPLDQGDI------Y 753
                    P +     +  L  G G A           ++G     P  Q  +      Y
Sbjct: 710  EEHNTEEPPIVGNIDNEDDLLYGDGPAFQMPAPSQTKSSEGTSKRAPWWQKHLQEIKPTY 769

Query: 754  SVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
             ++ Y +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E    
Sbjct: 770  WLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTAPVNEIPNPE---- 825

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                     M+V E+ M     H +RP L   L D  +  YQ Y +  P+   K      
Sbjct: 826  ---------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQTYRY--PKGHLK------ 867

Query: 873  TSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
                L    +    +  NLR      D     ET H   C  +  F NI+G+ G F+   
Sbjct: 868  ----LRFKKLDHGIIPGNLRPKPKEEDMSAMNETRH---CM-MRYFSNIAGYNGVFICSD 919

Query: 932  RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             P W  +  R  LR HP   DG I +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 920  YPHWIFLTGRGELRTHPMGIDGPITSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979

Query: 991  NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
              WPV+K+PL+ TPH +TY  E   Y +I S+   +PL            G   +     
Sbjct: 980  APWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSYYRF-----NGEDKEFTEED 1032

Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1107
              D     + E++ + +  P      W+T     I +   E+   ++ V+L    T+   
Sbjct: 1033 RPDRFIFPSQEQFSIVLFSPVS----WETIPNTKIELDQWEHVTCLKNVSLAYEGTRSGL 1088

Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1162
            +  + +GT Y  GED+ +RGR+L+F        P     +N   ++Y+KE KG I+A+  
Sbjct: 1089 KGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAITQ 1148

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            + G L+ A G KI + +    +L G+AF D   +Y+  +  +K+ IL+ D++KSI  L +
Sbjct: 1149 VSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIADVYKSISLLRF 1207

Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
            +E+   L+L+++DF   + +  E+LID + L  +V+D + NI +F Y P+  ES  GQKL
Sbjct: 1208 QEEYRTLSLVSRDFRPAEVYTIEYLIDNNNLGFLVADGESNIALFMYQPESRESLGGQKL 1267

Query: 1283 LSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
            + +A+FH+G  V  F R++  ++  ++      G+DK  R   ++ +LDGS+G I P+ E
Sbjct: 1268 IRKADFHLGQKVNTFFRIRCRISDPANDKKHFSGADK--RHVTMYASLDGSLGYILPVPE 1325

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
             T+RRL  LQ  LV  + H+AGLNP+++R + S  +        I+D +L+  Y  LP  
Sbjct: 1326 KTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYIRTQGNPARGIIDGDLVWRYLYLPNN 1385

Query: 1402 EQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            E++++A + GT   +I+ +L ++   T+
Sbjct: 1386 EKIDVAKKIGTRVQEIIEDLTEIDRQTA 1413


>gi|414587798|tpg|DAA38369.1| TPA: hypothetical protein ZEAMMB73_163106, partial [Zea mays]
          Length = 483

 Score =  555 bits (1430), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 270/458 (58%), Positives = 328/458 (71%), Gaps = 10/458 (2%)

Query: 685  GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
             DPSTCT+S+  PA   SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID  D 
Sbjct: 33   ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92

Query: 745  GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
               D  DIY ++CYESG LEIF+VP+F  VF+VD FVSG   + D + R + KDS     
Sbjct: 93   SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152

Query: 805  SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
             +S+      +KE   ++K+VELAM RWS   SRPFLF +L DGT+LCY AY FEG E+ 
Sbjct: 153  DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209

Query: 865  SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
             +         S  + N + SRLRNLRF R  +D  +R++      C    RITIF N+ 
Sbjct: 210  VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDIS----CLVRPRITIFNNVG 265

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            G++G FL G RP W  V R+R RVHPQLCDG IVAFTVLHNVNC  G IYVTSQG LKIC
Sbjct: 266  GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
            QLPS   YDNYWPVQK+PL  TPHQ+TY+ E++LYPLIVSVP ++PLNQVLS + DQE+G
Sbjct: 326  QLPSAYNYDNYWPVQKVPLHGTPHQVTYYGEQSLYPLIVSVPQVRPLNQVLSSMADQELG 385

Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFN 1101
              ++N   S  DL   YTV+E+EVRI+E  ++ G W+TR+TIPMQS ENALTVR+VTL N
Sbjct: 386  LHMENDVTSGGDLQEVYTVDEFEVRIMELGKSNGRWETRSTIPMQSFENALTVRIVTLQN 445

Query: 1102 TTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1139
            T+TKENETL+AIGTAYVQGEDVAARGRVLLFS  ++ +
Sbjct: 446  TSTKENETLMAIGTAYVQGEDVAARGRVLLFSFSKSEN 483


>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
            tauri]
 gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
            [Ostreococcus tauri]
          Length = 1473

 Score =  555 bits (1430), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 390/1293 (30%), Positives = 603/1293 (46%), Gaps = 204/1293 (15%)

Query: 260  MVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
            + IL+E+  TWAGR +    TC I ALS+    ++  +IW   NLP  +YKL A+  P+G
Sbjct: 6    LAILYEKTPTWAGRYNLAKDTCEIVALSVDVDKQKSTVIWRRQNLPSSSYKLTALLPPLG 65

Query: 320  GVLVVGANTIHYHSQSASCALALNNYA------------------VSLDSSQELPRS--- 358
            GVLV   + + + SQ +S AL LN +                   +  D+    P +   
Sbjct: 66   GVLVFSQDFLLHESQESSSALCLNTFGRGGPQEGNDAETVARLAGMGEDAVANPPPACAA 125

Query: 359  -----SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
                    + LD A A  +  D  L++TK G L LL +  DGR ++R+ L +   +VL+S
Sbjct: 126  RAVDCGLEITLDGAQAAVVSEDRVLVTTKMGALFLLALHTDGRSLRRMMLQRAGGAVLSS 185

Query: 414  DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---MLSSGLKEEFGDIEADAPSTKRLRRS 470
             +  +   L FLGSR+GDSLLV+FT  S  +   ML  G  +E    E +  S KR +  
Sbjct: 186  GMCLLSRDLLFLGSRIGDSLLVKFTPKSEPAAPLMLPKGEDDEETVDEVEKGSGKRSKSG 245

Query: 471  SSDALQDMV-----------------NGEELS--LYGS-------ASNNTESAQKT---- 500
               A++                    + +EL   LYG+           T++A+K     
Sbjct: 246  DGAAIRKRAKSTEDPPPAPSTPSPEDDDDELEALLYGTTKAESVIGDETTQTAEKKREGL 305

Query: 501  -----------FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQ---- 536
                       + F V+DSL+ + P+ D + G    +  D        +A G  K     
Sbjct: 306  AGVVPGLKVAGYDFKVKDSLLGVAPVVDITVGASAPVGTDTAERTELVTACGQGKNGALA 365

Query: 537  -----------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
                       +  E   LP  +G+W ++ +       + +R     + +H +L++ L+ 
Sbjct: 366  ILTRGVQPELVTEVEAGTLPTLQGLWALHDRK------EGTR--EVREPFHNHLLLKLQ- 416

Query: 586  RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
                        EV+ S+++     T+AA N FG    +Q+ E   RIL      QD++ 
Sbjct: 417  ------------EVSASLEFITDQATLAAANFFGHFCSLQITETSIRILKSGMKVQDVTL 464

Query: 646  GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
                +  GS      + S  I DPY+++ +SDG++RLL GD    TVS+    A+ +S +
Sbjct: 465  ADIKAPKGS-----VIASAEILDPYIMIRLSDGTLRLLAGDEKKMTVSLMESGAMPTSSR 519

Query: 706  PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
                       G   W+ +++T+  ++   G    GA     +Q +    +  E G+LE+
Sbjct: 520  RTRLVEALKKSG---WIHRSATNGTITGLEGSKKSGAS----NQKEAIVAIAREGGSLEL 572

Query: 766  FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
            F +P+   ++  D    G   +  T        SE  I                   ++V
Sbjct: 573  FSLPSCTRIWNADGLSEGSRVLSPTRPVH----SELRIP------------------EIV 610

Query: 826  ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
            ++ +  +   H RP L A+  DGT+L Y+ ++         S++P++             
Sbjct: 611  DIRIDSFEEAHERPLLTAVRGDGTLLLYRGFIVPAGTTCEGSEEPLARG----------- 659

Query: 886  RLRNLRFSRTPLD-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                LRFSR  +D             A    ++  G    RI+      G QG F++G  
Sbjct: 660  ---ELRFSRVNIDVEGSGLNVAGVGVAGQVRDSLAGTRLTRISNVGEGQGLQGIFVAGPN 716

Query: 933  PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
            P W +V R R+   P   +G IVAFT  HNVNC +GFI  T+ G ++ICQ+PS   Y+  
Sbjct: 717  PLWLIVRRSRVLALPTRGEGEIVAFTDFHNVNCPYGFILGTAVGGVRICQMPSKMHYEAA 776

Query: 993  WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV-GHQIDNHNLSS 1051
            WPV+KI LK TPH + Y  +  LY L+ S  V           +D+E+ G  +   +LS 
Sbjct: 777  WPVRKIALKCTPHAVAYLPDFKLYALVTSANVP---------WVDREIDGENVHGLSLSK 827

Query: 1052 VDLHRTYTVE----EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1107
                R    +    +Y VR+L P      WQ      ++  E+   VR V L +  T  +
Sbjct: 828  ARRERAKAHDDMELQYSVRLLVPGSLDCVWQHT----LEPGEHVQCVRNVQLKDINTGHS 883

Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFST-----GRNADNPQNLVTEVYSKELKGAISALAS 1162
             + LA+GTA   GED   RGRV LF+        +AD  +        +E K A +AL  
Sbjct: 884  LSYLAVGTAMPGGEDTPCRGRVYLFNMVWERDSESADGYRWKGQVCCVREAKMACTALEG 943

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            L GHL++A G K+ +H W G ELN +AF+D P ++ VS+N+VKNFIL+GD+ K ++F  W
Sbjct: 944  LGGHLIVAVGTKLTVHTWDGRELNSVAFFDTP-IHTVSINVVKNFILVGDLEKGLHFFRW 1002

Query: 1223 KEQGAQLNL--LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
            K+ G + +L  L+KDF  +D  ++EFLIDG+TLSL+ SD   N + F Y PK  ESWKGQ
Sbjct: 1003 KDTGFEKSLIQLSKDFERMDVVSSEFLIDGTTLSLLGSDMSGNARTFGYDPKSIESWKGQ 1062

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
            KLL RA +HVG+ +++ +R  +  + S            NRFA+ FGTLDG++G   P D
Sbjct: 1063 KLLPRAAYHVGSPISRMVRFNVEGSKSKMASTDGKPKGANRFAVFFGTLDGALGIFMPTD 1122

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFR--QFHSNGKAHRPGPDSIVDCELLSHYEML 1398
             +T+ +L ++Q++L  +V    G NPR+FR  +           P  ++D  LLS +E L
Sbjct: 1123 PVTYEKLLAIQRELTTAVRSPIGCNPRTFRTPKVFEGKHVQLRAPLDVLDGGLLSKFETL 1182

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
               EQ++IA      R   L  +  L+   +FL
Sbjct: 1183 TFSEQVKIASSAQVDRDLTLGLIQQLSASNAFL 1215


>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
          Length = 1392

 Score =  551 bits (1420), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 423/1477 (28%), Positives = 698/1477 (47%), Gaps = 229/1477 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV + ANVI+++ +    +         ET           + LE V  Y L GN+ S+
Sbjct: 29   SLVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              ++   +     RD+++LAF+DAK+SV+E+D   H L+  S+H FE  +   +K G   
Sbjct: 82   QSVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTH 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESS 233
                P+V+ DP+ RC  + V+G ++++L   +  +    D D     G   G  A I +S
Sbjct: 135  HYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILAS 194

Query: 234  HVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++I L+D    + ++ D  F+HGY EP ++IL E   T+AGRV+ +  TC ++A+S++  
Sbjct: 195  YMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQ 254

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             K HP+IWS  NLP D  K + +  P+GG L+   N + Y +QS      +  Y VSL+S
Sbjct: 255  QKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNS 308

Query: 352  SQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
              E        P+    + LD A AT+L++D  +LS K G+L +LT++ D  R V+    
Sbjct: 309  IAENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHF 368

Query: 404  SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADA 461
             K   SVLT+ I+   N+  FLGSRLG+SLL++FT  C    ++            E   
Sbjct: 369  EKAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIE 417

Query: 462  PSTKRLRRSSS----------DALQDMV--------NGEELSLYGSASNNTESAQKTFSF 503
            PS KRL+ S+S          D L D +        + EEL +YG+    +     ++ F
Sbjct: 418  PSAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVF 476

Query: 504  AVRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYEL 541
             V DSL+NIGP             ++FS  L ++ +   T           + K    ++
Sbjct: 477  EVCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQI 536

Query: 542  VE---LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
            V    LPGC  +WTV+    +                HA+LI+S E  TM+L+T D + E
Sbjct: 537  VTTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINE 580

Query: 599  VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
            + ++  +     T+ AG                 I    ++  +L               
Sbjct: 581  I-DNTGFATHIPTVYAG-----------------INQLQHIPLELG-------------- 608

Query: 659  STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG- 717
            S ++ V+  DPY+ L  +DG +  L+   +     +    +  S+  PV++  +Y D   
Sbjct: 609  SPIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSG 668

Query: 718  -------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS-- 754
                         PE ++ ++ T   +     + + G D          P  +  +Y   
Sbjct: 669  LFTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNW 727

Query: 755  --------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
                           V  E+  LEI+ +P+F   + +     G   +VD    E++  S 
Sbjct: 728  WKKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISA 785

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
            +   S++ E   Q R+ ++  + VV L       H SRP L   L +  +  Y+ + F  
Sbjct: 786  STPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF-- 836

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
            P    K          +   NVS       R      D +  +E        ++  F NI
Sbjct: 837  PRGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNI 885

Query: 921  SGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            +G+ G F+ G+ P W  M  R  LR HP   DG +++F   +NVNC  GF+Y   +  L+
Sbjct: 886  AGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELR 945

Query: 980  ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
            I  LP+  +YD  WPV+K+PL+ TPH +TY  E   Y L+ S+   +P N+         
Sbjct: 946  IGVLPTHLSYDAAWPVRKVPLRCTPHFVTYHLESKTYCLVTSIA--EPSNKYYKF----- 998

Query: 1040 VGHQIDNHNLSSVDLHRTYTV---EEYEVRILEPDRAGGPWQT--RATIPMQSSENALTV 1094
                 ++  LS  D    +     E++ + +  P      W       I +   E+   +
Sbjct: 999  ---NGEDKELSVEDRGDRFPYPLQEKFSLMLFSP----VSWDVIPNTKIDLDEWEHVNCL 1051

Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEV 1148
            + V+L    T+   +  +A+GT Y  GEDV +RGR+L+F        P     +N   E+
Sbjct: 1052 KNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFDIIEVVPEPGQPLTKNRFKEI 1111

Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
            Y+K+ KG ++AL+ ++G L+ A G KI + +    +L G+AF D   +Y   +  +K+ +
Sbjct: 1112 YAKDQKGPVTALSQVKGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYTHQILTIKSLL 1170

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            L+ D++KSI  L ++E+   L+L+++DF   + F+ E++ID +T+  +VSD +KN+ ++ 
Sbjct: 1171 LVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMIDNTTMGFLVSDSEKNLVLYM 1230

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFG 1327
            Y P+  ES  GQ+LL +A+FH+G  V  F R++  L    +      G+DK  R   ++ 
Sbjct: 1231 YQPESRESLGGQRLLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADK--RHITMYA 1288

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            TLDG +G I P+ E T+RRL  LQ  LV    H+AGLNP++FR + S  K       S++
Sbjct: 1289 TLDGGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTNPARSVI 1348

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            D EL+ +Y  L + E+LE++ + GT   ++L +L+D+
Sbjct: 1349 DGELVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDI 1385


>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1446

 Score =  538 bits (1386), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 430/1516 (28%), Positives = 695/1516 (45%), Gaps = 273/1516 (18%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E + ++  S ++K R          LE V  + L GNV S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +   GA+    RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE       R++
Sbjct: 82   ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPEL------RDT 131

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                                                   DE   G G G  +    +++I
Sbjct: 132  LT-------------------------------------DEQELGVGEGPKSSFLPTYII 154

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +   C I A+S++   K 
Sbjct: 155  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQAQCSIVAISLNIMQKV 214

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 215  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 274

Query: 354  ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
               +    + LD + A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 275  LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 334

Query: 413  SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL----KEEFGDIEADAPSTKRLR 468
            + + T+     FLGSRLG+SLL+++T       L  G     KE+  D++        L 
Sbjct: 335  TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEMPLEEGKDKQEKEKDNDMDKQV-YVHTLN 393

Query: 469  RSSSDALQDMVNGE--ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
              S+ +  D    E  E+ +YGS A + T+ A  T+SF V DS++NIGP  + S G    
Sbjct: 394  SFSAHSQHDFFVDEVDEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAF 451

Query: 526  ADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSR 559
                  G + + + E+V                          ELPGC  +WTV     +
Sbjct: 452  LSEEFQG-NPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEVK 510

Query: 560  GHNADSSRMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
                      ++         D + H +LI+S E  TM+L+T   + E+  S  +  QG 
Sbjct: 511  EDKKVPQSPGSFTATHYSLEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGP 569

Query: 611  TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
            T+ AGN+   + +IQV   G R+L+G    + L F P +         S ++  S+ADPY
Sbjct: 570  TVFAGNIGDNKYIIQVSPMGIRLLEG---VKQLHFIPVDL-------GSPIVHCSVADPY 619

Query: 671  VLLGMSDGSI---------------RLLVGDPSTCTVSVQTPAAIESSKKPVS------- 708
            V++  ++G +               RL +  P    +S Q+      + + VS       
Sbjct: 620  VVIMTAEGVVTMFVLKVDSYMGKTHRLALQKPQ---ISTQSRVIALCAYRDVSGMFTTEN 676

Query: 709  --SC---------------TLYHD------KGPEPWLRKTST-------DAWLSTGVGEA 738
              SC               T+ HD         E  L   S+       +  + + V   
Sbjct: 677  KVSCAIAEDFNIRSQSETETVIHDLSSNIVDDEEEMLYGDSSSNAGPSKEEMIRSFVAPG 736

Query: 739  IDGADGGPLD-QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALK 797
               ++GGP   +   + +V  ESG +EI+ +P++  VF V  F  G+  +VD+   ++  
Sbjct: 737  PSVSEGGPSKAEPSHWCLVTRESGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSAT 796

Query: 798  DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYL 857
              + E     EE T QG    +  + +V L       +HSRP+L  +  +  +L Y+A+ 
Sbjct: 797  QGDKE--GKKEEMTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVEQELLVYEAFP 848

Query: 858  F--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PH 907
            +  + P+N  K                       +RF + P +   RE+           
Sbjct: 849  YDQQQPQNNLK-----------------------VRFKKVPHNINFREKKSKLRKDKKAE 885

Query: 908  GAPCQ----------RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVA 956
            GA  +          R   F++ISG+ G F+ G  P W +V  R  LR+HP   DG I +
Sbjct: 886  GAAAEDGVAARGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMTIDGPIES 945

Query: 957  FTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKN-- 1014
            F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+KIPL+ T H ++Y  E    
Sbjct: 946  FSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVESKAS 1005

Query: 1015 -----LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
                 +Y +  SV       + L   I +  G + +   +   + +     +++ ++++ 
Sbjct: 1006 LSHCCVYAVCTSV-------KELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLIS 1058

Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGR 1128
            P        TR  I ++  E    ++ V L +  T    +  +A GT  +QGE+V  RGR
Sbjct: 1059 PVSWEAIPNTR--IDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRGR 1116

Query: 1129 VLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
            +L+         P   +T+     +Y KE KG ++AL    G+L+ A G KI L      
Sbjct: 1117 ILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLKDN 1176

Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
            +L G+AF D   LY+  +  +KNFIL  D+ KS+  L ++E+   L+L+++D   L+ ++
Sbjct: 1177 DLTGMAFIDTQ-LYIHQMMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYS 1235

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
             EF++D S L  +VSD  KN+ ++ Y P+  ES+ G +LL RA+F+ GA++  F R  M 
Sbjct: 1236 IEFMVDNSQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTFWR--MP 1293

Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
               +   G+       N+    F TLDG +G + P+ E T+RRL  LQ  L   + H AG
Sbjct: 1294 CRGALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAG 1353

Query: 1364 LNPRSF-------------------------RQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            LNP++F                         R  H + ++ +    +I+D ELL+ Y  L
Sbjct: 1354 LNPKAFRCVGADRTSAAMLSGMLPDFATSVSRMLHCDRRSLQNPVKNILDGELLNKYLYL 1413

Query: 1399 PLEEQLEIAHQTGTTR 1414
             + E+ E+A + GTT+
Sbjct: 1414 SMMERSELAKKIGTTQ 1429


>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
          Length = 1410

 Score =  533 bits (1373), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 428/1471 (29%), Positives = 687/1471 (46%), Gaps = 205/1471 (13%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLV    N I +Y +  + +  ++  ++ ETK        +   LE V  Y L GNV S+
Sbjct: 29   NLVTAGVNQIRVYRLVAESKPVEKESHTTETKS-------AKQKLECVADYELCGNVSSI 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  GA     RD+++L FE+AK+S+ ++D     L+  S+H FE  +   L+ G   
Sbjct: 82   ESISLVGA----ARDALLLCFEEAKLSLCDYDPDTDDLKTISLHYFEDAD---LENG--C 132

Query: 177  FARG---PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
              RG     V+VDP+GRC  +L+YG  +I+L   +       D  +  S     + I S+
Sbjct: 133  CQRGLHHSEVRVDPEGRCAVMLIYGTHLIVLPFRKESPSDEIDATSCAS----KSPIMST 188

Query: 234  HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++I+LR LD +  +V D  F+HGY EP ++IL+E   TW  RV+ +  TC I A+S++  
Sbjct: 189  YIIDLRTLDERVTNVVDIQFLHGYYEPTVLILYEPLPTWTCRVAVRKDTCSIVAISLNLQ 248

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             K HP+IWS  NLP+D  +   VP PIGGV+V   N++ Y +QS         Y VSL+S
Sbjct: 249  DKTHPIIWSHSNLPYDCLRTFPVPKPIGGVIVFAVNSLLYLNQS------FPPYGVSLNS 302

Query: 352  SQEL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
                       P+    + LD A A ++ ND  ++S K G+L +LT+V D  R V+   L
Sbjct: 303  LTSFNTEFLLKPQEGVRMSLDCAQAEFIDNDKLVISLKGGELYVLTLVIDSMRAVRSFHL 362

Query: 404  SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
             K   SVLT+ +   G++  FLGSRLG+SLL+++         SS      G+ +     
Sbjct: 363  DKAAASVLTTCMCMCGDNYLFLGSRLGNSLLLRYQ--EKKPEASSSSDASPGEEQRKEKM 420

Query: 464  TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPL------- 515
            T  +    S  +  + + +EL +YG  S   ES   T F F V DS++NIGP        
Sbjct: 421  TLAIGLVGSSDVSKLDDLDELEVYGRDSQAVESEDITQFMFEVCDSIINIGPCGQVEMGE 480

Query: 516  -----KDFSYGLRINADASATG----------ISKQSNYELV---ELPGCKGIWTVYHKS 557
                 ++FS+    + +   T           + +Q   ++V   ELPGC  +WTV    
Sbjct: 481  PAFLSEEFSHQEDPDLELVTTSGYGKNGAISILQRQIRPQVVTTFELPGCTDVWTVLGSP 540

Query: 558  SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
                 +D     +     HA+L++S    +MVLET   + E+  S  +     T+ A N+
Sbjct: 541  DEQQGSDEKLAGS-----HAFLLLSRADSSMVLETGQEIMELDHS-GFCTDAPTVHAANI 594

Query: 618  FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
               R ++QV      +L G    Q L+   S          S V+S S+ADP+VLL   D
Sbjct: 595  GNGRYIVQVGPNAIWLLKGVERIQHLALDVS----------SPVVSCSLADPHVLLLCED 644

Query: 678  GSIRLLV-----GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG---------PEP--- 720
            G +  LV      DP   T+S+ T    + SK  V +  LY D            EP   
Sbjct: 645  GQLLHLVLSVQGDDP---TLSLLTTKLHQKSK--VIAINLYRDTSGLFVVASSESEPSAT 699

Query: 721  --------------------------------------WLRKTSTDAWLSTGVGEAIDGA 742
                                                  W ++ S          E  +GA
Sbjct: 700  TTTEATETTTPQQQTEEGVDDEDDLLYGDSDISAITSTWQKQESEKEEKKEEEEEEAEGA 759

Query: 743  DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
            D  P      ++V+   +G LE++ +P++   F V  F +G   ++D+     L  S   
Sbjct: 760  DIQP----TYWAVIIRATGNLELYSLPDWQLCFLVKNFATGNKLLIDSMQAADLSASFVA 815

Query: 803  INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
               S++E              V E+ +  +  + S+P L A + D  +  Y+ +   G +
Sbjct: 816  PERSTQEVPF-----------VHEVMLHGFGVNGSQPLLMARVHD-ELYIYKVFSHVGSK 863

Query: 863  NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTR-----EETPHGAPCQRITIF 917
                                +  RL+ +RF R       R     E+ P      R   F
Sbjct: 864  --------------------AKGRLQ-VRFKRRSHGLIIRPRDREEKIPENKKWLRP--F 900

Query: 918  KNISGHQGFFLSGSRPCW-CMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
             +ISG+ G F+ GS P W  M  R  LR HP   DG+I  FT  HNVNC  GF+Y +S  
Sbjct: 901  TDISGYSGVFICGSYPHWLIMTQRGTLRGHPMAIDGTIPCFTAFHNVNCPKGFLYFSSNE 960

Query: 977  ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
             L+IC LP+  +YD  WPV+K+PL+ TPH + Y  +   Y ++ S  V  P  Q++ +  
Sbjct: 961  ELRICVLPTHLSYDAPWPVRKVPLRCTPHFVVYHPDSKTYSVVSSQQV--PCTQLVRVAG 1018

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            D E   +I+   +   D      + ++ +++  P        TR    ++  E+ + ++ 
Sbjct: 1019 DGE--KEIE--AVQKDDRFVFPIMNKFNIQLFSPVSWEPIPNTR--FDLEEWEHVMCIKT 1072

Query: 1097 VTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYS 1150
            + L +  T    +  + +GT     EDV++RG++ ++        P     +N +  VY+
Sbjct: 1073 INLKSEGTLSGLKGYVVVGTNLNYNEDVSSRGKLTIYDVIDVVPEPGQPLTKNKIKVVYN 1132

Query: 1151 KELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILL 1210
            KE KG ++AL  +QG L+ A G K+ + +    +L GIAF D   +Y+  +  +KN I++
Sbjct: 1133 KEQKGPVTALDGVQGFLVTAIGQKVYIWQLKDNDLAGIAFIDT-QIYIHKMEALKNLIII 1191

Query: 1211 GDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA 1270
            GD+ KSI  L ++E    L+L++KD   L  +   +L+D ++L+ +V+D+ KN  ++ Y 
Sbjct: 1192 GDVCKSISVLRYQEDMKVLSLVSKDVRPLAVYGVAYLVDETSLAFIVADKLKNFLVYCYQ 1251

Query: 1271 PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1330
            P + +S  GQ+L+ +A+ ++G+ V  F R++     SD + +        +    + TLD
Sbjct: 1252 PDLVQSQGGQRLIRKADINIGSLVNAFFRVK--CRVSDPSTSKTDQSLAMKHITYYVTLD 1309

Query: 1331 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
            GSIG + P+ E  +RRL  LQK L+  V   AGLNP+++R   +  +       +I+D +
Sbjct: 1310 GSIGYLLPISESLYRRLYMLQKMLIQQVQQTAGLNPKAYRTCQTEFRQLINIQRNIIDGD 1369

Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            L   Y  L   ++ E+A + GTT  QI  +L
Sbjct: 1370 LAWKYLALTSHDRAEMAKRIGTTSHQIEDDL 1400


>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit [Rhipicephalus pulchellus]
          Length = 1237

 Score =  528 bits (1359), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 398/1305 (30%), Positives = 624/1305 (47%), Gaps = 198/1305 (15%)

Query: 251  FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
            F+HGY EP ++IL+E   TW GR++ +  TC I ALS++   + HP+IWS  NLP D  +
Sbjct: 3    FLHGYYEPTLLILYEPLRTWPGRIAIRQDTCCILALSLNLQQRVHPVIWSYTNLPFDCLR 62

Query: 311  LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL-------PRSSFSVE 363
            LLAVP P+GGVL++  +++ Y +QS         Y VSL+S  +        P+    + 
Sbjct: 63   LLAVPRPLGGVLIMAVDSLLYLNQSVP------PYGVSLNSFTDFSTSFPLKPQEGLKIS 116

Query: 364  LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSL 422
            LD A A +L  D  +LS K G+L +LT+  DG R V+     K   SVLT+ +T   +  
Sbjct: 117  LDCAQACFLSYDRLVLSLKGGELYVLTLFNDGMRSVRNFYFDKAAASVLTTSMTLCEDGY 176

Query: 423  FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----- 477
             FLGSRLG+SLL+ +T      +     ++E  + +A+ P +K+ R    DA+ D     
Sbjct: 177  LFLGSRLGNSLLLHYT-EKAAEVDDIAKRDEKTESDANDPPSKKKRM---DAIGDWMASD 232

Query: 478  --MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINAD 527
              +++ +EL +YGS +  T+    +++F V DSL+NIGP      G           N D
Sbjct: 233  VALIDPDELEVYGSETMATKQL-TSYTFEVCDSLINIGPCGKICMGEPAFLSEEFVQNTD 291

Query: 528  -----ASATGISKQSNYELV------------ELPGCKGIWTVY---------HKSSRGH 561
                  +  G  K     ++            ELPGC  +WTV           K+    
Sbjct: 292  PDLELVTTAGYGKNGALCVLQRSVRPQVVTTFELPGCVHMWTVMGPPAEKKPPEKTEESD 351

Query: 562  NADSSRMAAYDD--EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFG 619
            +  S   AA       HA+LI+S    +M+L+T   + E+  S  +  Q  T+ AGNL  
Sbjct: 352  DPASEDKAAEQPLTNTHAFLILSRADSSMILQTDQEINELDHS-GFSTQNPTVFAGNLGD 410

Query: 620  RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
             R V+QV   G R+LDG+   Q +               S++++ S+ADP+V++  ++G 
Sbjct: 411  GRYVLQVCPMGVRLLDGTRQLQHIPL----------DVGSSIVAGSLADPHVIIRSAEGL 460

Query: 680  I--RLLVGDPST-CTVSVQTPAAIESSKKPVSSCT------LYHDKGPEPWLRKTSTDAW 730
            +    L GDP+  C ++V  P       K +S C       L+  +  EP   + +    
Sbjct: 461  VIHLTLRGDPAAGCRLAVLRPQLTAVKAKILSICVYKDVSGLFTTQYREP--DEPAKPEK 518

Query: 731  LSTGVGEAIDGADGGPLDQGD--------------------------------------- 751
                  E+ID +  G LD  D                                       
Sbjct: 519  PLPPPKESIDMSSNGLLDDEDELLYGESEENPIQKEPVRMTSEEAPSVAESMFEIKEVAP 578

Query: 752  -IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEE 809
              +  V  E+G LEI+ +P +   F V  F  G+  +VD+    A   +++E ++  S E
Sbjct: 579  TYWLFVARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHE 638

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                     +H + VV L ++     HSRP L A + D  +L Y+A+ F           
Sbjct: 639  SM-----PVVHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF----------- 676

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIF 917
               T R   +          LRF +   D + RE     + P     ++       +  F
Sbjct: 677  -YETQREGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPF 726

Query: 918  KNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
             +ISG+ G FL G RP W  M  R  LR HP   DG I  F   HNVNC  GF++   QG
Sbjct: 727  SDISGYSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQG 786

Query: 977  ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
             L+I  LP+  TYD  WPV+K+PL+ TPH + Y  +   Y ++ S P   P N ++    
Sbjct: 787  ELRISTLPTHLTYDAPWPVRKVPLRCTPHFVNYHVDSKTYCVVTSQP--DPCNHLVRF-- 842

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTV 1094
                G + +   L     +   T++++ +++L P      W+T     + +   E+   +
Sbjct: 843  ---TGEEKEYELLERDSRYIFPTMDKFSLQLLSPVS----WETIPNTRVDLDEWEHLTCL 895

Query: 1095 RVVTLFNT-TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEV 1148
            + V L +  TT   +  LA+GT Y  GEDV +RGR+++         P     +N +  V
Sbjct: 896  KNVMLSSEGTTTGMKGYLALGTNYCYGEDVTSRGRIIILDIIDVVPEPGQPLTKNKIKIV 955

Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
            YSKE KG ++AL+ + G LL A G KI + +    EL G+AF D   +Y+ S+  VKN I
Sbjct: 956  YSKEQKGPVTALSQVVGFLLSAIGQKIYIWQLKDNELVGVAFIDTQ-IYIHSVVTVKNLI 1014

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            L+GD+ KS+  L ++E    L+L+++D   L+ +A EF ID + +S +V+D ++N+ ++ 
Sbjct: 1015 LVGDVFKSVSLLRYQEASRTLSLVSRDVRPLEVYAVEFFIDNTQMSFLVTDAERNLLLYM 1074

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
            Y P+  ES  GQ+LL R +FHVG+ V    R++       +      S    R   +  T
Sbjct: 1075 YQPESRESCGGQRLLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMAT 1134

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN----GKAHRPGPD 1384
            LDGS+  + P+ E T+RRL  LQ  LV ++PH AGLNP+++R ++S     G  H+    
Sbjct: 1135 LDGSLAYVLPVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHK---- 1190

Query: 1385 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            +I+D EL+  +  L   E+ E++ + GTT +QI  +L ++   T+
Sbjct: 1191 NILDGELIWKFMHLSFMERSELSKKIGTTVTQITDDLLEIETYTA 1235


>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
 gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
          Length = 1455

 Score =  517 bits (1332), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 424/1486 (28%), Positives = 697/1486 (46%), Gaps = 190/1486 (12%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E G ++  N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E   I  +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430  RNLQDE-EQIMEEIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPVNYMCAGERVEFEEDGATLRPHAESLQDVKIELVAATGHSKNGALSVFVNCIN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++       E G  
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-----EVG-- 651

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 652  ---SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+    G   IV
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV--GNGAIV 826

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
             T   E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 827  LTDAMEFVPISLTTQENSKAGIVQACMPQHANSPLPLELSLTGLGLNGERPLLM-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R L   N+   +  ++       D     E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEDIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  E  +Y LI      
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSTWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1054

Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
            +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    A
Sbjct: 1055 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1103

Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
            +I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++        
Sbjct: 1104 SISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1163

Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
            P   +T+   KE+     KG +SA++ + G L+   G KI + +    +L G+AF D   
Sbjct: 1164 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-N 1222

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ +  EF++D S L  
Sbjct: 1223 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
            +V+D ++N+ ++ Y P+  ES  GQKLL +A++H+G  V    R+Q       +      
Sbjct: 1283 LVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1341

Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
                N+  +++GTLDG++G   PL E  +RR   LQ  LV    H+ GLNP+ +R   S 
Sbjct: 1342 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLVSYQEHLCGLNPKEYRTLKSF 1399

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
             K        I+D +L+  Y ++   E+ E+A + GT   +IL +L
Sbjct: 1400 KKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445


>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
 gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
            Short=dCPSF 160
 gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
          Length = 1455

 Score =  516 bits (1329), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 420/1486 (28%), Positives = 698/1486 (46%), Gaps = 190/1486 (12%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E  ++E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430  RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 649  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+   +G   + 
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 828

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D    E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 829  DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R +   N+   +  ++       D     E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  E  +Y LI      
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1054

Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
            +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    A
Sbjct: 1055 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1103

Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
            +I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++        
Sbjct: 1104 SITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1163

Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
            P   +T+   KE+     KG +SA++ + G L+   G KI + +    +L G+AF D   
Sbjct: 1164 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-N 1222

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ +  EF++D S L  
Sbjct: 1223 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
            +V+D ++NI ++ Y P+  ES  GQKLL +A++H+G  V    R+Q       +      
Sbjct: 1283 LVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1341

Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
                N+  +++GTLDG++G   PL E  +RR   LQ  L+    H+ GLNP+ +R   S+
Sbjct: 1342 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSS 1399

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
             K        I+D +L+  Y ++   E+ E+A + GT   +IL +L
Sbjct: 1400 KKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445


>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
 gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
          Length = 1455

 Score =  516 bits (1329), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 417/1486 (28%), Positives = 699/1486 (47%), Gaps = 190/1486 (12%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E G ++  N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E  ++E +     +L  + + A    +  EEL +YG+ +  +    + F F V D
Sbjct: 430  RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGTGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPINYMCAGERVEFEEDGATLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 649  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGNAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             +V  +SG LEI+ +P+   V+ V+   +G   + 
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLIVARQSGTLEIYSMPDMKLVYLVNDVGNGAMVLT 828

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D    E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 829  DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R L   N+   +  ++       DA    E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DAQEEIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  E  +Y LI      
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSTWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1054

Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
            +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    A
Sbjct: 1055 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1103

Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
            +I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++        
Sbjct: 1104 SISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1163

Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
            P   +T+   KE+     KG +SA++ + G L+   G KI + +    +L G+AF D   
Sbjct: 1164 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-N 1222

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ +  EF++D S L  
Sbjct: 1223 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
            +V+D ++N+ ++ Y P+  ES  GQKLL +A++H+G  V    R+Q       +      
Sbjct: 1283 LVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1341

Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
                N+  +++GTLDG++G   PL E  +RR   LQ  L+    H+ GLNP+ +R   S 
Sbjct: 1342 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSF 1399

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
             K        ++D +L+  Y ++   E+ E+A + GT   +IL++L
Sbjct: 1400 KKQGINPSRCVIDGDLIWSYRLMANSERNEVAKKIGTRTEEILADL 1445


>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
 gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
          Length = 1455

 Score =  516 bits (1328), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 418/1494 (27%), Positives = 691/1494 (46%), Gaps = 206/1494 (13%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E G ++  N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEAGQRQKLNPTE------MRVAPKMRLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  +     L   +    +      R  
Sbjct: 136  GRYFVPVVRVDPDSRCAVMLVYGKRLVVLPFRKDNTLDEIELADVKPIKKAPTAMVTRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCQQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK-------- 451
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVDQQADQQL 429

Query: 452  ----------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                      +E  D++    AP+  + RR         +  EEL +YGS +  +    +
Sbjct: 430  QRQQSEDQTLDEILDVDQLELAPTQAKSRR---------IEDEELEVYGSGAKASVLQLR 480

Query: 500  TFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS-- 537
             F F V DSL+N+ P+     G R+  +                     +ATG SK    
Sbjct: 481  KFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLNDLKIELVAATGHSKNGAL 540

Query: 538  -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                   N +++   EL GC  +WTV+         D+++  +  D+ H ++++S    T
Sbjct: 541  SVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLLSQRNST 591

Query: 588  MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
            +VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++    
Sbjct: 592  LVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 648

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                       S V+ VSIADPYV L + +G +  L    +  T  +       SS   V
Sbjct: 649  --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 700

Query: 708  SSCTLYHD-----------------------------KGPEPWLRKTSTDAWLSTGVGEA 738
             + + Y D                                EP ++    +  L    G A
Sbjct: 701  VAISAYKDLSGLFTVKADDVNLTGSSSSAFGHSFGGYMKAEPHMKVEDEEDLLYGDAGNA 760

Query: 739  I------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKF 780
                   D A        D +             VV  +SG LEI+ +P+   V+ V+  
Sbjct: 761  FKMNSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV 820

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
             +G   + D    E +  S T   +S          ++ +S   +EL +     +  RP 
Sbjct: 821  GNGAMVLTDAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELTVLGLGLNGERPL 878

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            L  + T   +L YQ  +F  P+   K        R L   N+   +  ++       D  
Sbjct: 879  LL-VRTRVELLIYQ--VFRYPKGHLKI-----RFRKLEQLNLMDHQPSHIELDEN--DER 928

Query: 901  TREETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAF 957
               E+    P   Q++  F N+ G  G  + G  PC+  +  R  LR+H  L +G + +F
Sbjct: 929  EEMESYQMQPKYVQKLRPFANVGGLSGIMVCGVNPCFVFLTSRGELRIHRLLGNGDVRSF 988

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYP 1017
               +NVN  +GF+Y  +   LKI  LPS  +YD+ WP++K+PL+ TP Q+ Y  E  +Y 
Sbjct: 989  AAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSTWPIRKVPLRCTPRQLVYHRENRVYC 1048

Query: 1018 LIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGG 1075
            LI      +P+ +       D+E+  +              Y +  ++E+ ++ P+    
Sbjct: 1049 LITQNE--EPMTKFYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET--- 1096

Query: 1076 PWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1132
             W+    A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++
Sbjct: 1097 -WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 1155

Query: 1133 STGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNG 1187
                    P   +T     EV+ KE KG +SA++ + G L+   G KI + +    +L G
Sbjct: 1156 DIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIG 1215

Query: 1188 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
            +AF D   +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ +  EF+
Sbjct: 1216 VAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFM 1274

Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
            +D S L  +V+D ++N+ ++ Y P+  ES  GQKLL +A++H+G  V    R+Q      
Sbjct: 1275 VDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGL 1334

Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
             +          N+  +++GTLDG++G   PL E  +RR   LQ  L+    H+ GLNP+
Sbjct: 1335 HQRQPFLYE---NKHFVVYGTLDGALGYCLPLPEKLYRRFLMLQNVLLSYQEHLCGLNPK 1391

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
             +R   +  K        I+D +L+  Y +L   E+ E+A + GT   +ILS+L
Sbjct: 1392 EYRTIKAVKKQGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILSDL 1445


>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
 gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
          Length = 1455

 Score =  514 bits (1325), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 419/1491 (28%), Positives = 695/1491 (46%), Gaps = 200/1491 (13%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++        D+E 
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVIT------LDDVEQ 423

Query: 460  DAPSTKRLRRSSSDALQDM-----------------VNGEELSLYGSASNNTESAQKTFS 502
             +   +R  +     L+++                 +  EEL +YGS +  +    + F 
Sbjct: 424  QSEQQQRNLQDEDQNLEEIFDVDQVEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFI 483

Query: 503  FAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS----- 537
            F V DSL+N+ P+     G R+  +                     +ATG SK       
Sbjct: 484  FEVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVF 543

Query: 538  ----NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
                N +++   EL GC  +WTV+         D+++ ++ +D+ H ++ +S    T+VL
Sbjct: 544  VNCLNPQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMFLSQRNSTLVL 594

Query: 591  ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
            +T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++       
Sbjct: 595  QTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----- 648

Query: 651  ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSC 710
                    S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + 
Sbjct: 649  -----DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAI 703

Query: 711  TLYHD-------KG----------------------PEPWLRKTSTDAWLSTGVGEAI-- 739
            + Y D       KG                       EP ++    +  L    G A   
Sbjct: 704  SAYKDLSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKM 763

Query: 740  ----DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
                D A        D +             VV  +SG LEI+ +P+   V+ V+   +G
Sbjct: 764  NSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNG 823

Query: 784  RTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
               + D    E +  S T   +S          ++ +S   +EL++     +  RP L  
Sbjct: 824  AMVLTDAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL- 880

Query: 844  ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
            + T   +L YQ  +F  P+   K        R L   N+   +  ++       D     
Sbjct: 881  VRTRVELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEEI 931

Query: 904  ETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVL 960
            E+    P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   
Sbjct: 932  ESYQMQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAF 991

Query: 961  HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIV 1020
            +NVN  +GF+Y  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  E  +Y LI 
Sbjct: 992  NNVNIPNGFLYFDTTYELKISVLPSYLSYDSIWPVRKVPLRCTPRQLVYHRENRVYCLIT 1051

Query: 1021 SVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQ 1078
                 +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+
Sbjct: 1052 QTE--EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WE 1098

Query: 1079 T--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1135
                A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++   
Sbjct: 1099 IVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDII 1158

Query: 1136 RNADNPQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAF 1190
                 P   +T+   KE+     KG +SA++ + G L+   G KI + +    +L G+AF
Sbjct: 1159 EVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAF 1218

Query: 1191 YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDG 1250
             D   +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ +  EF++D 
Sbjct: 1219 IDT-NIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDN 1277

Query: 1251 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRT 1310
            S L  +V+D ++N+ ++ Y P+  ES  GQKLL +A++H+G  V    R+Q       + 
Sbjct: 1278 SNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQR 1337

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
                     N+  +++GTLDG++G   PL E  +RR   LQ  L+    H+ GLNP+ +R
Sbjct: 1338 QPFL---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYR 1394

Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
               S+ K        I+D +L+  Y ++   E+ E+A + GT   +IL +L
Sbjct: 1395 TLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445


>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
 gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
          Length = 1463

 Score =  514 bits (1324), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 417/1498 (27%), Positives = 688/1498 (45%), Gaps = 206/1498 (13%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSE------MRVAPKMRLECLATYSLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA--GAMRDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 137

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +L+YG ++++L   +  S     L   +    +      R  
Sbjct: 138  GRYYVPEVRVDPDARCAVMLIYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVTRTP 197

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T AGR+  +  TC++ A+S
Sbjct: 198  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDTCVLVAIS 257

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  NLP D  +LL +  PIGG LV+  N + Y +QS         Y V
Sbjct: 258  LNIQQRVHPIIWTVNNLPFDCLRLLPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 311

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 312  SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVVSLRTGDLYVLTLCVDSMRTVR 371

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++            
Sbjct: 372  NFHFHKAASSVLTSCICVCHMEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQQQQQA 431

Query: 448  ----SGLKEEFGDIEAD-------APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES 496
                S   E  G ++ D       APS  + RR         +  EEL +YG+ +  +  
Sbjct: 432  AEEPSEEAEIEGILDMDQLEAATSAPSQAKSRR---------IEDEELEVYGTGAKASVL 482

Query: 497  AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQ 536
              + F F V DSL+N+ P+     G R+  +                     +ATG SK 
Sbjct: 483  QLRKFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLQDVKIELVAATGHSKN 542

Query: 537  S---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
                      N +++   EL GC  +WTV+         D+++  +  D+ H ++++S +
Sbjct: 543  GALSVFVNCINPQIITSFELEGCLDVWTVFD--------DATKKTSRQDQ-HDFMLLSQK 593

Query: 585  ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
              T+VL+T   + E+ E+  + V   TI  GNL   R ++QV  R  R+L G+ + Q++ 
Sbjct: 594  NSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQNRFIVQVTTRHVRLLQGTRLVQNVP 652

Query: 645  FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
                          S V+ V+IADPYV L + +G +  L    S  T  +       SS 
Sbjct: 653  I----------DVGSPVVQVAIADPYVCLRVFNGQVITLALRESRGTPRLAINKHTISSS 702

Query: 705  KPVSSCTLYHD-------------------------------KGPEPWLRKTSTDAWLST 733
              V +   Y D                                  EP ++    +  L  
Sbjct: 703  PAVVAIAAYKDLSGLFTVKSDDILNLTGSGSNSAFGSTFGGYMKSEPHMKVEDEEDLLYG 762

Query: 734  GVGEAI------DGADGGPLDQGDIYS-------------VVCYESGALEIFDVPNFNCV 774
              G A       D A        D +              VV  +SG LEI+ +P+   V
Sbjct: 763  DAGNAFKMNTMADLAKQSKQKNSDWWRRMLVQAAKPTYWLVVARQSGTLEIYSMPDMKLV 822

Query: 775  FTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSA 834
            + V+   +G   + D    E +  S T   +S          ++ +S   +EL++     
Sbjct: 823  YLVNDVGNGAMVLTDAM--EFVPISLTSQENSKAGIVQSCMPQHANSPLPLELSLVGLGL 880

Query: 835  HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
            +  RP L  + T   +L YQ  +F  P+   K        R +   N+   +  ++    
Sbjct: 881  NGERPLLL-VRTRLELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHVNLDD 932

Query: 895  TPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
               +             Q++  F N+ G  G  + G  PC+  +  R  LR+H  L +G 
Sbjct: 933  NEENEELESYNMQPKYVQKLRPFNNVGGMSGVMICGVNPCFLFLTSRGELRIHRLLGNGE 992

Query: 954  IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEK 1013
            + +F   +N+N  +GF++  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  E 
Sbjct: 993  VRSFAAFNNINIPNGFLFFDTTFELKISVLPSYLSYDSTWPVRKVPLRCTPRQLVYHREN 1052

Query: 1014 NLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPD 1071
             +Y LI      +P+ +       D+E+  +              Y +  ++++ ++ P+
Sbjct: 1053 RVYCLITQTE--EPMTKFYRFNGEDKELSEESRGERF-------IYPIGSQFDMVLISPE 1103

Query: 1072 RAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
                 W+    A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG 
Sbjct: 1104 ----TWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGN 1159

Query: 1129 VLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
            + ++        P   +T     EV+ KE KG +SA++ + G L+   G KI + +    
Sbjct: 1160 IHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDG 1219

Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
            +L G+AF D   +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ + 
Sbjct: 1220 DLIGVAFIDT-NIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYG 1278

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
             EF++D + L  +V+D + N+ ++ Y P+  ES  GQKLL +A++H+G  V    R+Q  
Sbjct: 1279 IEFMVDNTNLGFLVTDAESNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCH 1338

Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
                 +          N+  +++GTLDG++G   PL E  +RR   LQ  L+    H+ G
Sbjct: 1339 QRGLHQRQPFL---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCG 1395

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            LNP+ +R   S+ +        I+D +L+  Y +L   E+ E+A + GT   +IL++L
Sbjct: 1396 LNPKEYRTLKSSKRLGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILADL 1453


>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
 gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
          Length = 1373

 Score =  513 bits (1321), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 413/1533 (26%), Positives = 686/1533 (44%), Gaps = 265/1533 (17%)

Query: 3    FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
            FA YK +H PT ++ C     T +  +                           NL++  
Sbjct: 2    FACYKQLHPPTAVSFCLKARFTSANDE---------------------------NLIIVK 34

Query: 63   ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL-AILSQ 121
             N++E+Y+++                        + +++ LV  + L G ++S+ A+  Q
Sbjct: 35   NNIMEVYLIKP-----------------------NTSNIVLVKVFELFGVIDSIIAVCLQ 71

Query: 122  GGADNSRRRDSIILAFED-AKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
            G      +++ +++ FED AK+SV+EFD+    L+  S+H  E      L+ G+  F   
Sbjct: 72   G-----MKKEMLLINFEDEAKVSVVEFDEKRSDLKTLSLHYLEDD---FLREGKARFFHN 123

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------GLVGDEDTFGSGGGFSA 228
              + +DPQ R   V++   +++IL   Q G              L GD++      G   
Sbjct: 124  QPIILDPQNRFATVIICDSKLVILPFRQSGEDVSLSTEDNFLFALSGDQEEANENVGDQK 183

Query: 229  R-----IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
            +     ++   +I+L DL +K+VKD+ F++GY EP ++ LHE E TW+GR++ K +T  +
Sbjct: 184  KHHQPEVQRQVIIDLNDLGIKNVKDYCFLNGYNEPTILFLHENEQTWSGRLAAKSNTSTV 243

Query: 284  SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-GGVLVVGANTIHYHSQSASCALAL 342
            +A+S     K +P IWS  +LPHD  KL+ +   + GG LV+G N+I + +Q A+  L+ 
Sbjct: 244  TAVSFDLFRKYYPKIWSVGSLPHDCNKLIPLQEDVAGGALVIGMNSIIHINQCATYGLSF 303

Query: 343  NNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
            N++AVS + +  +  ++F   ++  D    T++  D  L+S K G+L  + +   G  + 
Sbjct: 304  NDFAVS-NPNLSINFNTFDGPALFFDTVAYTFIARDKLLVSLKDGELYTMYLESGGSRIN 362

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
             +++ KT+ +   S + T+  +L FLGS++GDS+L ++         S    EE   + A
Sbjct: 363  NINIKKTSNTTPASCMCTLKGNLIFLGSKIGDSVLYEYQEKVEVETSSLDTDEEMSSVFA 422

Query: 460  -----DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
                 +    KR      D    +   EE ++  S S  ++         ++    NIGP
Sbjct: 423  AGENFEPEKKKRKLADDDDFFAALEKDEEPTVIESFSKVSKKETTKVELKIKHVFTNIGP 482

Query: 515  LKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTVYHKSSRGHNADSSRMA 569
            +   +  +  + D S  G   ++N   +    C GI      TV ++S +      + + 
Sbjct: 483  ISHLTAAVTSSFDMS--GFKSKTNDNQLSAIACSGIGRHGCLTVLNRSLQPDIQSEATLP 540

Query: 570  ---------AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
                     +   E+  YLI+SLE +T V E+   L EVT    +     T+  G +  R
Sbjct: 541  FLVKQVWTISQKTEHDLYLILSLEDKTKVFESKATLAEVTSKSMFVTNETTLNIGKI--R 598

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
              ++QV  R + +L GS         P           S++    I DPYVLL   DGS+
Sbjct: 599  ESIVQV-TRKSVMLIGS--------EPKQVHHSKKEIRSSI----ILDPYVLLHFYDGSL 645

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
             LL  D    T        IES+   +++  LY    PE          +   G+ E   
Sbjct: 646  VLLTHDNGRVT---SKQLDIESNHGKITAVCLYK-TNPE----------FEFFGINEK-- 689

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
                    +G     V +  GA EI  VP+  CVF+  +F    T + D           
Sbjct: 690  --------EGKYLCCVYWTDGAFEILSVPDMTCVFSFSQFYQFHTTLFD----------- 730

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
                   E  +    +  +    V E+A++   +    P+L ++L+D T+  Y+++L   
Sbjct: 731  -------EGQSSNTTQSEVKYPYVTEMALRGIGSDSEMPYLVSVLSDNTVHIYRSFL--- 780

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR------TPLDAYTREETPHGAPCQ-- 912
             + T+KS D               +RL  LRFS+       P+    ++        +  
Sbjct: 781  -DRTTKSKD---------------NRLTRLRFSKFQHDDLLPISEIDKKSQTFTLNLKSK 824

Query: 913  -----------RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 961
                       ++  FKNI G+ G F +G +P W       LRVHP      +  FT  H
Sbjct: 825  YLFPKSDLGRSQLIPFKNIGGYGGLFKTGEKPFWLFTEHSNLRVHPTQSRDPVTTFTPYH 884

Query: 962  NVNCNHGFIYVT-------SQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKN 1014
            + NC HGFIY+T        Q  L I  L +   ++ YWP +KI LK+TP+ IT+  + N
Sbjct: 885  HENCPHGFIYLTDKEQDNKKQSKLHISSLNANVKFNAYWPQRKILLKSTPNVITFHQDTN 944

Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1074
                  SVPV      +L   I    G              + +TV+ +         +G
Sbjct: 945  TCLAFTSVPV----KAILPDSIPFPEGK-------CPPPAEQKHTVKLF---------SG 984

Query: 1075 GPWQTRATIPMQSSENALTVRVVTL----------------FNTTTKENETLLAIGTAYV 1118
              WQ          E+A+  +VV L                 N+  ++  +++A+GTAYV
Sbjct: 985  HNWQEMDKFEFDLHESAVAAKVVYLSKEEYNDDTDISFEEPLNSRKQDLVSVVAVGTAYV 1044

Query: 1119 QGEDVAARGRVLLFST----GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
            Q E    RGR+LLF      GR  +   NL++   S  +KG I+ L  +  +++ + G +
Sbjct: 1045 QSERELCRGRLLLFDLDPILGRENEYKLNLIS---STSVKGPITTLEQVDRYIICSVGNR 1101

Query: 1175 IILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1232
            I  +   W    ++  +FYD    Y  SLN V+NFI+ GDI+KS+ FL WKE+G +L LL
Sbjct: 1102 IYTYYFDWEEKRMHITSFYDTQ-FYTASLNTVRNFIMFGDIYKSVSFLRWKEKGHRLILL 1160

Query: 1233 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1292
            AKD   L   ++EFL++   L L V D  KN+QIF Y P+  ES  G+ L+   +FH+G 
Sbjct: 1161 AKDNRPLQVVSSEFLVNNDLLGLAVIDTSKNLQIFSYLPQHQESNDGRNLVPVCDFHIGT 1220

Query: 1293 HVTKFLRLQM----------LATSSDRTGAAPGSDKT----NRFALLFGTLDGSIGCIAP 1338
             +   +R+++          L   +++   +   D T    N   +LFG++DG+IG +AP
Sbjct: 1221 LINSLIRMKVRELPDDNTIRLGNVNEKPKQSGKKDITKTNPNHQFILFGSVDGAIGYVAP 1280

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            ++E+T RRL +LQ K+   +   AGL+P+SFR +    +       +I+D +L+ +Y  +
Sbjct: 1281 INEVTHRRLFALQLKMYTQLEQAAGLHPKSFRLYKPLERTEYNYKKNIIDGQLIWNYANI 1340

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
                Q ++A Q GT    IL ++ +L   T F 
Sbjct: 1341 NTILQRDLARQIGTNSDNILRSIQELNQATFFF 1373


>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
 gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
          Length = 1459

 Score =  511 bits (1316), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 418/1504 (27%), Positives = 693/1504 (46%), Gaps = 222/1504 (14%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  AN++++Y +    E G ++  N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     +D+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
                 P+V+VDP  RC  +LVYG ++++L   +  S    DE        F         
Sbjct: 136  GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192

Query: 230  ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
               I +S++I L++LD K  +V D  F+HGY EP ++IL+E   T +GR+  +  TC++ 
Sbjct: 193  RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCSGRIKVRSDTCVLV 252

Query: 285  ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
            A+S++   + HP+IW+  +LP D +++  +  PIGG LV+  N + Y +QS         
Sbjct: 253  AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306

Query: 345  YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
            Y VSL+SS +        P+    + LD A+  ++  D  ++S +TG+L +LT+  D  R
Sbjct: 307  YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366

Query: 397  VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
             V+     K   SVLTS I    +   FLGSRLG+SLL+ FT    +++++  +      
Sbjct: 367  TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426

Query: 452  ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
                            EE  D++    AP   + RR         +  EEL +YGS +  
Sbjct: 427  EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477

Query: 494  TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
            +    + F F V DSL+N+ P+     G R+  +                     +ATG 
Sbjct: 478  SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537

Query: 534  SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
            SK           N +++   EL GC  +WTV+         D+++  +  D+ H ++++
Sbjct: 538  SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588

Query: 582  SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
            S    T+VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q
Sbjct: 589  SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
            ++               S V+ V+IADPYV L M +G     ++R   G P         
Sbjct: 648  NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697

Query: 694  VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
              +PA  AI + K      T+  D                       EP ++    +  L
Sbjct: 698  TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGTGSGFGHSFGGYMKAEPNMKVEDEEDLL 757

Query: 732  STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
                G A        L Q                     + VV  +SG LEI+ +P+   
Sbjct: 758  YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
            V+ ++   +G   + D     +L  S  E   +S+ G  Q    ++ +S   +EL++   
Sbjct: 818  VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              +  RP L  + T   +L YQ  +F  P+   K        R L   N+   +  ++  
Sbjct: 875  GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
                 +             Q++  F N+ G  G  + G  PC+  +  R  LR+H    +
Sbjct: 927  EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFA 1011
            G + +F   +NVN  +GF+Y  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  
Sbjct: 987  GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHR 1046

Query: 1012 EKNLYPLIVSVPVLKPLNQVLSL------LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEV 1065
            E  +Y LI      +P+ +          L ++  G +    N S           ++E+
Sbjct: 1047 ENRVYCLITQTE--EPMTKYYRFNGEDKELSEESRGERFIYPNGS-----------QFEM 1093

Query: 1066 RILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGED 1122
             ++ P+     W+    A+I  +  E+    ++V L +  T    +  L IGT +   ED
Sbjct: 1094 VLISPET----WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSED 1149

Query: 1123 VAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIIL 1177
            + +RG + ++        P   +T     EV+ KE KG +SA++ + G L+   G KI +
Sbjct: 1150 ITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYI 1209

Query: 1178 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
             +    +L G+AF D   +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF 
Sbjct: 1210 WQLRDGDLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDFN 1268

Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
             L+ +  EF++D S L  +V+D ++N+ ++ Y P+  ES  GQKL+ +A++H+G  V   
Sbjct: 1269 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTM 1328

Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
             R+Q       +          N+  +++GTLDG +G   PL E  +RR   LQ  L+  
Sbjct: 1329 FRVQCHQRGVHQRQPFLYE---NKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLSY 1385

Query: 1358 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
              H+ GLNP+ FR   S  K        I+D +L+  Y +LP  ++ E+A + GT   +I
Sbjct: 1386 QDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEEI 1445

Query: 1418 LSNL 1421
            LS+L
Sbjct: 1446 LSDL 1449


>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
 gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
          Length = 1459

 Score =  510 bits (1313), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 418/1504 (27%), Positives = 692/1504 (46%), Gaps = 222/1504 (14%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  AN++++Y +    E G ++  N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     +D+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
                 P+V+VDP  RC  +LVYG ++++L   +  S    DE        F         
Sbjct: 136  GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192

Query: 230  ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
               I +S++I L++LD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ 
Sbjct: 193  RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLV 252

Query: 285  ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
            A+S++   + HP+IW+  +LP D +++  +  PIGG LV+  N + Y +QS         
Sbjct: 253  AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306

Query: 345  YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
            Y VSL+SS +        P+    + LD A+  ++  D  ++S +TG+L +LT+  D  R
Sbjct: 307  YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366

Query: 397  VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
             V+     K   SVLTS I    +   FLGSRLG+SLL+ FT    +++++  +      
Sbjct: 367  TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426

Query: 452  ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
                            EE  D++    AP   + RR         +  EEL +YGS +  
Sbjct: 427  EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477

Query: 494  TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
            +    + F F V DSL+N+ P+     G R+  +                     +ATG 
Sbjct: 478  SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537

Query: 534  SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
            SK           N +++   EL GC  +WTV+         D+++  +  D+ H ++++
Sbjct: 538  SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588

Query: 582  SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
            S    T+VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q
Sbjct: 589  SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
            ++               S V+ V+IADPYV L M +G     ++R   G P         
Sbjct: 648  NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697

Query: 694  VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
              +PA  AI + K      T+  D                       EP ++    +  L
Sbjct: 698  TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGSGSGFGHSFGGYMKAEPNMKVEDEEDLL 757

Query: 732  STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
                G A        L Q                     + VV  +SG LEI+ +P+   
Sbjct: 758  YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
            V+ ++   +G   + D     +L  S  E   +S+ G  Q    ++ +S   +EL++   
Sbjct: 818  VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              +  RP L  + T   +L YQ  +F  P+   K        R L   N+   +  ++  
Sbjct: 875  GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
                 +             Q++  F N+ G  G  + G  PC+  +  R  LR+H    +
Sbjct: 927  EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFA 1011
            G + +F   +NVN  +GF+Y  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  
Sbjct: 987  GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHR 1046

Query: 1012 EKNLYPLIVSVPVLKPLNQVLSL------LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEV 1065
            E  +Y LI      +P+ +          L ++  G +    N S           ++E+
Sbjct: 1047 ENRVYCLITQTE--EPMTKYYRFNGEDKELSEESRGERFIYPNGS-----------QFEM 1093

Query: 1066 RILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGED 1122
             ++ P+     W+    A+I  +  E+    ++V L +  T    +  L IGT +   ED
Sbjct: 1094 VLISPET----WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSED 1149

Query: 1123 VAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIIL 1177
            + +RG + ++        P   +T     EV+ KE KG +SA++ + G L+   G KI +
Sbjct: 1150 ITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYI 1209

Query: 1178 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
             +    +L G+AF D   +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF 
Sbjct: 1210 WQLRDGDLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDFN 1268

Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
             L+ +  EF++D S L  +V+D ++N+ ++ Y P+  ES  GQKL+ +A++H+G  V   
Sbjct: 1269 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTM 1328

Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
             R+Q       +          N+  +++GTLDG +G   PL E  +RR   LQ  L+  
Sbjct: 1329 FRVQCHQRGVHQRQPFLYE---NKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLSY 1385

Query: 1358 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
              H+ GLNP+ FR   S  K        I+D +L+  Y +LP  ++ E+A + GT   +I
Sbjct: 1386 QDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEEI 1445

Query: 1418 LSNL 1421
            LS+L
Sbjct: 1446 LSDL 1449


>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
 gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
          Length = 1450

 Score =  506 bits (1304), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 413/1463 (28%), Positives = 685/1463 (46%), Gaps = 190/1463 (12%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E   +E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430  RNLQDEDQSLE-EILDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCLN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 649  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+   +G T + 
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGATVLT 828

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D    E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 829  DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R L   N+   +  ++       D     E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKLDXXNLLDQQPTHIELDEN--DEQEEIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  E  +Y LI      
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSIWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1054

Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
            +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    A
Sbjct: 1055 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1103

Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
            +I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++        
Sbjct: 1104 SITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1163

Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
            P   +T+   KE+     KG +SA++ + G L+   G KI + +    +L G+AF D   
Sbjct: 1164 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-N 1222

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ +  EF++D S L  
Sbjct: 1223 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
            +V+D ++N+ ++ Y P+  ES  GQKLL +A++H+G  V    R+Q       +      
Sbjct: 1283 LVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1341

Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
                N+  +++GTLDG++G   PL E  +RR   LQ  L+    H+ GLNP+ +R   S+
Sbjct: 1342 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSS 1399

Query: 1376 GKAHRPGPDSIVDCELLSHYEML 1398
             K        I+D +L+  Y ++
Sbjct: 1400 KKQGINPSRCIIDGDLIWSYRLM 1422


>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
            1, partial [Ciona intestinalis]
          Length = 1370

 Score =  504 bits (1299), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 409/1476 (27%), Positives = 678/1476 (45%), Gaps = 215/1476 (14%)

Query: 3    FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
            +A Y+ +H PTG+  C        +                             NL+VTA
Sbjct: 2    YAWYRQIHAPTGVEQCVYCNFASEKEK---------------------------NLLVTA 34

Query: 63   ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
            A+ + +Y +    E + +++N  E       + +    L+ +  ++L GNV  +  +   
Sbjct: 35   ASQLTVYRLERNYEVTTKTENGEE-------NTVVKEKLQQIGSWQLFGNVVRMRSVRLA 87

Query: 123  GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
            GA    + DS++L+F +AK+S++EFD + H ++ TS+H FE   +   K G       P 
Sbjct: 88   GA----KLDSVLLSFAEAKLSIIEFDQATHDIKTTSLHYFEDALY---KDGSYQRITLPK 140

Query: 183  VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF--SARIESSHVINLRD 240
            + VDP+ RC  + +    + ++      + L  D+           + R  +S+ I+L  
Sbjct: 141  IAVDPESRCVALQLTTKSVAVVPLRANTAALATDDGAAPQDNVSLQNKRSTTSYTIDLHA 200

Query: 241  LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
            +D  ++ + D  F+HGY EP +++L E   TWAGRV+ +  TC I A+S++   + HP++
Sbjct: 201  VDARLQRIIDIQFLHGYNEPTLLVLFESLRTWAGRVAMRQDTCNIVAISLNMAEQLHPVV 260

Query: 299  WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE---- 354
            WS   LP D      VP PIGGVL+   N+I Y +QS         Y  SL+S+ E    
Sbjct: 261  WSLNGLPFDCKYAYPVPKPIGGVLIFAVNSILYLNQSVP------PYGTSLNSTTENSTS 314

Query: 355  ---LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
                P+    + LD +HA ++  +  ++S K G+L +LT++ D  R V+     K+  SV
Sbjct: 315  FPLKPQEDVCMTLDCSHAMFISPESLVISLKNGELYVLTLLVDSMRSVRNFHFDKSASSV 374

Query: 411  LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
            LTS +T + +   FLGSRLG+SLL+++T        +  +       E  A   KRL  +
Sbjct: 375  LTSCLTVLDDGFLFLGSRLGNSLLLKYT-------EARPVFRNCYHTEEPAAKRKRLNTA 427

Query: 471  SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
            +  A  D  N  +L +YG  +  +E    ++ F V DSLVNIGP      G    A  S 
Sbjct: 428  ADWAASD-TNDIDLQMYGKDTVTSEPL-SSYKFEVCDSLVNIGPCGAAELGEP--AFLSE 483

Query: 531  TGIS-KQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNA 563
              +S ++S+ EL                           ELPGC  +WTV     +    
Sbjct: 484  EFVSQRESDLELAILSGHGKNGAISVLQRSVKPQVVTTFELPGCIDMWTVKSVCEKTELP 543

Query: 564  DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
              ++      + H+YLI+S E  T++LET   + EV E+  +  + +++  GN+ G + +
Sbjct: 544  TKTQ-----QQQHSYLILSREESTLILETGKEIMEV-ENSGFNTREQSVFVGNIGGDKEL 597

Query: 624  I-QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
            I QV   G  +L G  + Q +       E GS      +   SI DPY LL  SDG + +
Sbjct: 598  ILQVCASGVWLLAGVKLLQHIPL-----ELGS-----PITQCSICDPYALLLTSDGDLIM 647

Query: 683  --LVGDPST--------CTVSVQTPAAIE--------------------------SSKK- 705
              L  D  +        C  S+     IE                          SS K 
Sbjct: 648  LTLTNDLDSENGVKLECCNPSINQVPQIEHVCLYKDTSGLFKTASGPSDVFLPEDSSNKG 707

Query: 706  -------------PVSSCTLYHDKGPEPWLRKTSTDAWL----------STGVGEAIDGA 742
                         P+SS T   D+  E    ++  D             S    E +DG 
Sbjct: 708  VSDSEIPSSLPRTPLSSKTFTVDEEDELLYGESDPDVIFAPQFAPNVPKSPTQNEPLDGD 767

Query: 743  DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
              G  ++   ++++  E+  LEI+ +P+ + V+T+  F  G+  + ++    +   S+ +
Sbjct: 768  KEGN-EEFTFWAIIARENRNLEIYSMPSLDLVYTIKNFSFGQKLLTNSGPVHSYSVSKDD 826

Query: 803  INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
             ++S    T    K  I  + +V L  +     +S P L A + +  IL Y+ + F  PE
Sbjct: 827  KSTS----TRYSDKPRIFEILLVGLGYK-----NSSPHLIARIEE-EILIYEVFKFSAPE 876

Query: 863  NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG 922
               K +     S  +    V+ S +      R P+   T+ +      C R   F NI G
Sbjct: 877  KFKKYN-----SLQIRFKKVNHSMM----IRRAPVTHETKTDQLEHRNCLRT--FSNIGG 925

Query: 923  HQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G FL G  P W  V  R  L  HP   DGS+  F   HNVNC +GF+Y  SQG L+IC
Sbjct: 926  YSGVFLCGPYPYWIFVTIRGALCCHPMSVDGSVSCFVPFHNVNCPNGFLYFNSQGELRIC 985

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
             LP    YD  WP++KI L+ + H + Y  E  +Y L+ SV   +P  ++  L  + E  
Sbjct: 986  MLPPHMKYDTAWPMRKITLRCSVHFLAYSIEHKVYALVTSVS--EPCTRLPYLTFENERE 1043

Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-F 1100
             +    +L   D      ++++ V+++ P  A       A + M   E+   ++ V L  
Sbjct: 1044 FE----DLEKGDRFIYPHIDKFSVQLISP--ASWDLVPNARLDMGEFEHITCMKNVWLSC 1097

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKG 1155
               +   +  L +GT  V GE++++RG++++         P     +N + ++YS+E KG
Sbjct: 1098 GQDSSARQNFLVLGTVNVFGEEMSSRGKIIILEVIEVVPEPGQPLTKNKLKQIYSEEQKG 1157

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
             ++A+  L+G+LL A G KI + ++   + L G+AF D   +Y+      ++F L+GDI 
Sbjct: 1158 PVTAVCGLEGNLLTAIGQKIFIWRFDENQSLRGLAFVDT-NVYIHHALSFRSFALVGDIQ 1216

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +SI  L ++     L++ ++D   L+ +  + ++DG+ ++ +VSD +KN+ +F Y P+  
Sbjct: 1217 RSITLLRYQTDFKTLSVTSRDVRPLEVYTADLVVDGTGINFLVSDHEKNLVLFAYDPEDH 1276

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            ES  G +L  RA+ H+G+      R+   A   DR+   P          + GTLDGSI 
Sbjct: 1277 ESHGGSRLTKRADMHIGSRANCMWRVA--ACGVDRSTGLPNQPYAGVHITMMGTLDGSIC 1334

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
             + P+ E  +RRL  LQ  ++  + H+AGLNP++FR
Sbjct: 1335 HVLPVAEKVYRRLLMLQNIMITGLQHIAGLNPKAFR 1370


>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes aegypti]
 gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
          Length = 1417

 Score =  503 bits (1294), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 415/1461 (28%), Positives = 684/1461 (46%), Gaps = 172/1461 (11%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV   ANV+++Y  R+  +    S++   T R   M       LE +  Y L GN+ S+
Sbjct: 29   SLVTGGANVLKVY--RLIPDADATSRDKFTTTRPPNM------KLECMATYTLFGNIMSM 80

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  G+    +RD+++++F+DAK+SV++FD     L+  S+H FE  +   +K G   
Sbjct: 81   QSVSLAGS----QRDALLISFQDAKLSVVQFDPDNFELKTLSLHYFEEED---IKGGWTG 133

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGG---FSARI 230
                P+V+VDP  RC  +LVYG ++++L   K S      V D                I
Sbjct: 134  HYHTPIVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEVQDVKPMKKAPTQLIAKTPI 193

Query: 231  ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             +S+VI L++ +  + +V D  F+HGY EP ++IL+E   T+ GR++ +  TCM+ ALS+
Sbjct: 194  LASYVIELKESEERIDNVIDIQFLHGYYEPTLLILYEPVKTFPGRIAVRSDTCMMVALSL 253

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAV 347
            +   + HP+IW+   LP D  + +A+  PIGG L++  N + Y +QS     ++LN+ A 
Sbjct: 254  NIQQRVHPVIWTVNCLPFDCLQAIAISKPIGGCLILSVNALIYLNQSVPPYGVSLNSIAD 313

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
               +    P+    + LDAA   +++ +  +LS K G+L +LT+  D  R V+    SK 
Sbjct: 314  HCTNFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRSFHFSKA 373

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
              SVLT  I  +     FLGSRLG+SLL++F     + +++    EE  + E      KR
Sbjct: 374  ASSVLTCCICVVEEEYLFLGSRLGNSLLLRFKEKDESMVITIDDTEEVVEKEP-----KR 428

Query: 467  LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI-- 524
            LR          +  EEL +YGS    T     ++ F V DS++NIGP+   + G RI  
Sbjct: 429  LR----------LEQEELEVYGSG-QKTSVQLTSYIFEVCDSILNIGPIGHMAVGERISE 477

Query: 525  ---NADASATGISKQSNYELVELP--GCKGIWTVYHKSSRGHNADSSRMAAY--DDEYHA 577
               + +     +  + + E+V     G  G   V   S +     S  ++     D+ H+
Sbjct: 478  EEQDENKDVQFVPNKLDLEIVTSSGHGKNGALCVLQNSIKPQVITSFGLSGCLDVDDMHS 537

Query: 578  YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
            ++I+S EA TMVL+T D + E+ E+  +     TI  GN+ G R ++QV  +  R+L G+
Sbjct: 538  FMILSQEAGTMVLQTGDEINEI-ENTGFATNVPTIHVGNIGGNRFIVQVTTKSIRLLQGT 596

Query: 638  YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV-----GDPSTC-- 690
             + Q++                 + +VSIADPYV +  S+G +  L      G P     
Sbjct: 597  RLLQNIPI----------DLGCPLAAVSIADPYVCVRSSEGRVITLALREGKGTPRLAVN 646

Query: 691  --TVSVQTPAAIE-SSKKPVSS--CTLYHD------------------KGPEPWLRKTST 727
              T+S  TPA +  S  K VS    T Y D                    PEP ++    
Sbjct: 647  KNTIST-TPAVVAISVYKDVSGMFTTKYEDFYDGSKAGSSAYSSGFGYMKPEPHMKIEDE 705

Query: 728  DAWLSTGVGEAI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVP 769
            +  L    G +       D A        D +                 ++G LEI+ +P
Sbjct: 706  EDLLYGESGRSFKMTSMADMAIETKKKNTDFWRKFMQPVKPTFWLYAVRDNGNLEIYSMP 765

Query: 770  NFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN---SSSEEGTGQGRKENIHSMKVVE 826
            +   V+ +    +G   + D+     L+  +T  +   +S+   +  G   N+   +++ 
Sbjct: 766  DLKLVYLITNIGNGNKVLQDSMEFVPLQVGQTAADADVTSNAFTSPFGFNPNLLPKEILM 825

Query: 827  LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
            +A+     H +RP LF  L +  +L Y+ Y +      SK    +   R    S V+   
Sbjct: 826  VAL---GHHGTRPMLFVRL-ENDLLVYRVYRY------SKGHLKLRFRR--VPSGVTGPI 873

Query: 887  LRNLRFSRTPLDAYTREETPHGAPCQ-----RITIFKNISGHQGFFLSGSRPCWCMVFRE 941
             +       P D    +   H           I  F N++G+ G  + G +P + M+   
Sbjct: 874  FKIAPRQSAPTDQEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKP-YIMLLTS 932

Query: 942  R--LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
            R  LR H       +  F   +NVNC +GF+Y   Q  LKI   P   +YD+ WPV+KIP
Sbjct: 933  RGELRAHRLYAKTIMKGFAPFNNVNCPNGFLYFDEQYELKIAVFPGYLSYDSIWPVRKIP 992

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT 1059
            L+++P QI Y  E  +Y +++                 +EV ++    N    +L     
Sbjct: 993  LRSSPKQIVYHKENKVYCVVMDA---------------EEVCNKYYRFNGEDKELTEENK 1037

Query: 1060 VE--------EYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENE 1108
             E        ++ V ++ P      W+     +I +   E+ + ++ V+L +       +
Sbjct: 1038 GERFLYPMAHKFSVVLVTP----SAWEIIPETSINLDEWEHVIALKNVSLSYEGARSGFK 1093

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASL 1163
              +A+GT +   ED+ +RGR+LL+        P   +T     EV  KE KG +SA+  +
Sbjct: 1094 EYIAVGTNFNYSEDITSRGRLLLYDIIEVVPEPGKPLTRYKFKEVIVKEQKGPVSAITHV 1153

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             G L+ A G K+ L +    +L G+AF D   ++V  L  +K+ IL+ D++KS+  L ++
Sbjct: 1154 SGFLVGAVGQKVYLWQLKDDDLVGVAFIDT-NIFVHQLVSIKSLILVADVYKSVSLLRFQ 1212

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E    L+L+++D+  L+ F  E+++D   L  +VSDEQ NI  + Y P+  ES+ GQ+LL
Sbjct: 1213 EDYRTLSLVSRDYQPLNVFQIEYVVDNHNLGFLVSDEQCNIITYMYQPESRESFGGQRLL 1272

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
             + ++HVG  +    R+Q      D       S+   +    F TLDG IG + PL E T
Sbjct: 1273 RKCDYHVGQKINSMFRVQCDFHEMDYKR---NSNYECKHTTYFATLDGGIGYVLPLPEKT 1329

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            +RRL  LQ  L+   PH+ GLNP++FR   +  K        +VD +L+  +  LP  E+
Sbjct: 1330 YRRLFMLQNVLMTHSPHLCGLNPKAFRTIKTVKKLPINPARCVVDGDLIWTFLTLPANEK 1389

Query: 1404 LEIAHQTGTTRSQILSNLNDL 1424
            LE+A + GT    I ++L ++
Sbjct: 1390 LEVAKKIGTRIDDICADLMEI 1410


>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Acyrthosiphon pisum]
          Length = 1335

 Score =  499 bits (1284), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 412/1440 (28%), Positives = 674/1440 (46%), Gaps = 214/1440 (14%)

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            LVV   N++ +Y +   +   +  K                   E +  Y L GN+  L 
Sbjct: 30   LVVAGVNILRVYRLVPTDTTCQPPK----------------TKFECLAQYTLFGNIMCL- 72

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
               Q         D+++L+F +AK S++E+D  +H LR  S+H FE  ++   K G    
Sbjct: 73   ---QSVTLCPSSPDALLLSFSEAKFSLVEYDRDMHSLRTLSLHYFEDDKF---KNGHTQH 126

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
               PL++VDP GRC   LVYG   ++L       G   D++        SA++  S+ I 
Sbjct: 127  WSPPLIRVDPDGRCVVGLVYGSYFVVLPF-----GRTIDDN------AKSAQVMPSYTIP 175

Query: 238  LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
            +  +D  M ++ DF F+HGY EP ++IL+E   T+AGR++ +  TC + A+S++     H
Sbjct: 176  ISKIDPKMNNIMDFDFLHGYYEPTLLILYEPVKTFAGRIAVRKDTCAMVAISLNIQQHVH 235

Query: 296  PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQE 354
            P+IWS  +LP+D  K++AV  PIGGVL++  N++ Y +QS     +ALN+ A +L +   
Sbjct: 236  PVIWSLDSLPYDCQKVIAVSRPIGGVLIMAVNSLIYLNQSVPPFGVALNSIAKTLTNFPL 295

Query: 355  LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTS 413
              +   ++ LD A AT++ +D  + S   GDL ++T+  D  R V+     K   SVLT+
Sbjct: 296  GQQEDINLVLDRATATFISSDKLVTSLCNGDLYVITLYADSMRAVRSFHFEKCASSVLTT 355

Query: 414  DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
             IT   +S  FLGSRLG+SLL+++   S ++               D PS KR +   +D
Sbjct: 356  CITVCLDSYLFLGSRLGNSLLLRYYARSQSN--------------DDEPSIKRKKTDETD 401

Query: 474  ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
              +D+V   EL +YGS    T    +++SF V DS++NIGP    S G    A  S    
Sbjct: 402  --EDLV---ELEVYGSEVQ-TSICLESYSFEVCDSIINIGPCSQASIGE--PAYISDEFS 453

Query: 534  SKQSNYELVELPG--CKGIWTVYHKSSRGHNADSSRMAAYDD--------EYHAYLIISL 583
            S + + EL+   G    G  +V H+S +     +  +  Y D        ++H ++I++ 
Sbjct: 454  SDEHDVELLCTSGHGKNGALSVLHRSIKPQLVTTFHLDGYKDMWTVHGENDFHTFMILTN 513

Query: 584  EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
               T++L+T   + E+ +S  Y  +  T+   N+   + VIQV     R+L+GS   Q +
Sbjct: 514  VDSTLILQTGQEINEL-DSSGYATREHTVFVCNM--NKFVIQVLRYSVRLLNGSEQLQSV 570

Query: 644  S--FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------RLLVGDPSTCTV 692
            S  FG            S ++  S  +PY +L   DG +         R+L+  P+    
Sbjct: 571  SLDFG------------SPIIHGSSCNPYAVLLTEDGQVIVLTVKSTGRILLMRPTNFEQ 618

Query: 693  SVQTP--------AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
              QT         + + SS  P +   L    GP     K   D ++S  V +  +   G
Sbjct: 619  IPQTKTLAVYRDVSGLFSSTMPQAEIPLV---GP-----KLQHDHFVSDSVEDEEEMLYG 670

Query: 745  GPLDQGD--------------------------IYSVVCYESGALEIFDVPNFNCVFTVD 778
               D                              + V+  ++G +EI+ +P+F       
Sbjct: 671  DARDPSSRETPHNSVSNKNTMWWLKFLEVPTPTYWVVLTRDNGYMEIYTLPDFKI----- 725

Query: 779  KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK-ENIHSMKVVELAMQRWSAHHS 837
                        Y    + +S   +  S EEG    +K E I  + +V L  Q       
Sbjct: 726  -----------KYRAANIDESPMILKDSLEEGCYFPKKTEIIKEILIVPLGYQ-----DK 769

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
            RP +F  L D  ++ Y  +    PE T K                       +RF +  +
Sbjct: 770  RPIMFVRL-DNEVVIYGIH--RHPEGTLK-----------------------MRFHK--M 801

Query: 898  DAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSRPCWCMV-FRERLRVHPQLCDGS 953
             +    ++  G P +  ++   F  ++GH G F+ G  P   ++  R  LR HP   DG 
Sbjct: 802  TSLLTFQSRSGNPLEGTSLLRYFSKVAGHNGVFICGQNPHLILLTVRGELRCHPLHIDGP 861

Query: 954  IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEK 1013
            I+ F   HNVNC+ GF+Y  S   L+I  LP+  +YD  WP++K+PL+ TPH I Y  E 
Sbjct: 862  IMCFAPFHNVNCSQGFLYFNSDHKLRISILPTHLSYDEPWPLRKVPLRKTPHFIAYHLET 921

Query: 1014 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1073
              Y ++ S   L       +   D+E+  + +   L  +  H  +T+E +     EP   
Sbjct: 922  KTYCVVTSSSELSASYYRFNGE-DKELTTE-ERDPLFPLPSHEVFTLELFSPASWEP--- 976

Query: 1074 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLF 1132
                    +I  +  E+   ++ V L     +   +  +A+GT Y   ED+ +RGR+ LF
Sbjct: 977  ----IPDTSIETEDWEHITCLKNVALAYEGARSGLKGYIAMGTNYSYSEDITSRGRIFLF 1032

Query: 1133 STGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNG 1187
                    P     +N +  +Y+KE KG ++A+  + G L+ A G KI + +    +L G
Sbjct: 1033 DIIDVVPEPGKPLTKNKIKMIYAKEQKGPVTAITHVVGFLVTAVGQKIYIWQLKDNDLIG 1092

Query: 1188 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
            IAF D   +YV  +  +K+ IL+ D+ KSI  L ++E+   L+L+ +D   L+ F   FL
Sbjct: 1093 IAFIDT-EVYVHQMLSIKSLILVADLFKSITLLRFQEEYRTLSLVCRDSKPLEVFDINFL 1151

Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS- 1306
            ID + L  + SD  +N+ ++ Y P   ES+ GQ L+ R +F++G++V  F RL+   ++ 
Sbjct: 1152 IDNTELGFLASDRDQNLLLYLYQPMARESYGGQHLVRRGDFNIGSNVNSFFRLRCKQSTV 1211

Query: 1307 -SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
              DR  A  GSDK  R   ++ TLDGSIG I P+ E  +RRL +LQ  LV ++ H+AGLN
Sbjct: 1212 APDRREAI-GSDK--RHVTMYTTLDGSIGYIVPIHEKNYRRLLTLQNMLVKNITHLAGLN 1268

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHY-EMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            P+++R F +           ++D EL+  +   +   ++ EIA++ G    ++L ++ +L
Sbjct: 1269 PKAYRSFKATAPERMNQARRVIDGELVWMFVTCMNARQRNEIANKVGVKTIELLQDIYEL 1328


>gi|414587799|tpg|DAA38370.1| TPA: hypothetical protein ZEAMMB73_163106 [Zea mays]
          Length = 461

 Score =  494 bits (1271), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 240/420 (57%), Positives = 293/420 (69%), Gaps = 10/420 (2%)

Query: 685  GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
             DPSTCT+S+  PA   SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID  D 
Sbjct: 33   ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92

Query: 745  GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
               D  DIY ++CYESG LEIF+VP+F  VF+VD FVSG   + D + R + KDS     
Sbjct: 93   SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152

Query: 805  SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
             +S+      +KE   ++K+VELAM RWS   SRPFLF +L DGT+LCY AY FEG E+ 
Sbjct: 153  DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209

Query: 865  SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
             +         S  + N + SRLRNLRF R  +D  +R++      C    RITIF N+ 
Sbjct: 210  VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDIS----CLVRPRITIFNNVG 265

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            G++G FL G RP W  V R+R RVHPQLCDG IVAFTVLHNVNC  G IYVTSQG LKIC
Sbjct: 266  GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
            QLPS   YDNYWPVQK+PL  TPHQ+TY+ E++LYPLIVSVP ++PLNQVLS + DQE+G
Sbjct: 326  QLPSAYNYDNYWPVQKVPLHGTPHQVTYYGEQSLYPLIVSVPQVRPLNQVLSSMADQELG 385

Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFN 1101
              ++N   S  DL   YTV+E+EVRI+E  ++ G W+TR+TIPMQS ENALTVR+VTL N
Sbjct: 386  LHMENDVTSGGDLQEVYTVDEFEVRIMELGKSNGRWETRSTIPMQSFENALTVRIVTLQN 445


>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
 gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
          Length = 1434

 Score =  487 bits (1254), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 408/1485 (27%), Positives = 701/1485 (47%), Gaps = 203/1485 (13%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV   ANV+++Y  RV  +    +++     R   M       LE V  YRL+GN++S+
Sbjct: 29   SLVTGGANVLKVY--RVIPDADPATRDKYTAARPPNM------KLECVASYRLNGNIKSM 80

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  G+     RD+++++F DAK+SV++FD     L+  S+H FE  +   ++ G   
Sbjct: 81   QSVSLAGS----LRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 133

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR--I 230
                P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +     A+  I
Sbjct: 134  HYHIPMVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIELQDVKPIKKAPMQLVAKTPI 193

Query: 231  ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             +S++I L+DLD K  +V D  F+HGY EP ++IL+E   T+ GR++ +  TC + ALS+
Sbjct: 194  LASYIIELKDLDEKIDNVIDIQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 253

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
            +   + HP+IW+  +LP D  + + +  PIGG LV+  N++ Y +QS         Y VS
Sbjct: 254  NIQQRVHPVIWTVNSLPFDCIQAIPINKPIGGCLVMCVNSLIYLNQSVP------PYGVS 307

Query: 349  LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQR 400
            L+SS +        P+    + LDAA   +++ +  +LS K G+L +LT+  D  R V+ 
Sbjct: 308  LNSSADHSTSFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRN 367

Query: 401  LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
               +K   SVLTS I    +   FLGSRLG+SLL++F     + +++    ++ G +E +
Sbjct: 368  FHFNKAAASVLTSCICVCEDEYLFLGSRLGNSLLLRFKEKDESLVITI---DDSGAVEKE 424

Query: 461  APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                KR R    +            +YGS    T     ++ F V D+++NIGP+   + 
Sbjct: 425  P---KRPRLEEEEL----------EVYGSGYK-TSVQLTSYIFEVCDNVLNIGPIAHMAV 470

Query: 521  GLRINADASATG-----ISKQSNYELVE--------------------------LPGCKG 549
            G R+  + +        +  + + E+V                           L GC  
Sbjct: 471  GERVAEEDAENQPDVQIVQNKLDIEVVTSSGHGKNGALCVLQSSIKPQVITSFGLSGCVD 530

Query: 550  IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
            +WTV+ ++        +R A      HA++I+S E  TMVL+T + + E+ E+  +    
Sbjct: 531  VWTVFDEAV-------ARRAEDGPSTHAFMILSQEGGTMVLQTGEEINEI-ENTGFATTV 582

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
             TI  GN+   R ++QV  +  R+L G+ + Q++                 + SV+I DP
Sbjct: 583  PTIHVGNIGTNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVAIVDP 632

Query: 670  YVLLGMSDGSIRLLV-----GDPSTC----TVSVQTPAAIE-SSKKPVSSCTLYHDK--- 716
            YV +  S+G +  L      G P       T+S  TPA +  S+ + VS   L+  K   
Sbjct: 633  YVCVRSSEGRVITLALREGKGTPRLAVNKNTIS-PTPAVVAISAYRDVSG--LFTKKIED 689

Query: 717  --------------------GPEPWLRKTSTDAWLSTGVGE----------AIDGADGGP 746
                                 PEP ++    +  L    G           AI G  GG 
Sbjct: 690  VYDLSRGGAASAYSSGFGSMKPEPHMKIEDEEDLLYGESGRSFKMTSMADMAIAGKSGGS 749

Query: 747  LD---------QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT--YMREA 795
             D         +   +     ++G LEI+ +P+   V+ +    +G   + D+  ++   
Sbjct: 750  ADFWMKYMQQVKPTYWLFAARDNGTLEIYSMPDLKLVYLITNVGNGNKVLSDSMEFVPLP 809

Query: 796  LKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQA 855
            +  S ++ ++SS  G   G   ++   +++ +A+    ++ SRP LF I  +  +L Y+ 
Sbjct: 810  MGKSASQEDASSAFGASFGVSASLLPKEILMVAL---GSYGSRPLLF-IRLEHDLLIYRV 865

Query: 856  YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP----- 910
            + +      SK    +   R LS S V+    R    S         E+    A      
Sbjct: 866  FRY------SKGHLKLRFKR-LSTS-VTCPVFRTPEPSGAGATEAANEQQQARATKVLYE 917

Query: 911  -CQRITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHG 968
                I  F N+SG+ G  + G +P +  +     LR H       + AF   +NVNC +G
Sbjct: 918  NISMIRYFANVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFNNVNCPNG 977

Query: 969  FIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
            F+Y   Q  LKI   P+  +YD+ WPV+KIPL+++P QI Y  E  +Y +++    +   
Sbjct: 978  FLYFDEQYELKISIFPTYLSYDSVWPVRKIPLRSSPKQIVYHRENKVYCVVMDAEEI--C 1035

Query: 1029 NQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPM 1085
            N+       D+E+  +           HR      + V ++ P      W+     +I +
Sbjct: 1036 NKYYRFNGEDKELTEENKGERFLYPMGHR------FSVVLVTP----AAWEVVPETSINL 1085

Query: 1086 QSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1144
            +  E+ + ++ V+L     +   +  +A+GT +   ED+ +RGR+LL+        P   
Sbjct: 1086 EEWEHVIALKNVSLTYEGARSGLKEYIAVGTNFNYSEDITSRGRLLLYDIIEVVPEPGKP 1145

Query: 1145 VT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
            +T     EV  K+ KG +SA++ + G L+ A G K+ L +    +L G+AF D   ++V 
Sbjct: 1146 LTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKVYLWQMKDDDLVGVAFIDTN-IFVH 1204

Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
             +  +K+ IL+ D++KS+  L ++E+   L+++++D+  L+ F  E+++D + L  +VSD
Sbjct: 1205 QMVSIKSLILVADVYKSVSLLRFQEEYRTLSVVSRDYHPLNVFQVEYVVDNANLGFLVSD 1264

Query: 1260 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT 1319
            +Q N+  + Y P+  ES+ GQ+LL ++++H+G  V    R+Q     +D        D  
Sbjct: 1265 DQCNLITYMYQPESRESFGGQRLLRKSDYHLGQQVNCMFRVQCDFHETDVMKRTLNYD-- 1322

Query: 1320 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
            N+    F TLDG IG + PL E T+RRL  LQ  L+   PH  GLNP+++R      K  
Sbjct: 1323 NKHTTFFATLDGGIGFVLPLPEKTYRRLFMLQNVLLTHSPHTCGLNPKAYRTIKQTRKLP 1382

Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                  +VD +L+  +  LP  E+ E+A + GT   +I ++L ++
Sbjct: 1383 INPSRCVVDGDLVWSFLELPANEKHEVAKKIGTRIEEICADLMEI 1427


>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
 gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
          Length = 1426

 Score =  478 bits (1230), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 415/1498 (27%), Positives = 668/1498 (44%), Gaps = 243/1498 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    +  +  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVDAVQRQKLNPSEMRLAPKM------RLECLASYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQSVSLAGA----MRDALLISFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +      R  
Sbjct: 136  GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I L DLD K  +V D  F+HGY EP ++IL+E   T AGR+  +  T       
Sbjct: 196  IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDT------- 248

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                      +  PIGG LV+  N + Y +QS         Y V
Sbjct: 249  -----------------------FFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 279

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+ +  + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 280  SLNSSADNSTAFPLKPQDNVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 339

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++            
Sbjct: 340  NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEATVEQQT 399

Query: 448  -----SGLKEE--FGDIEA-DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                   L EE    D+E  +AP   + RR         +  EEL +YGS +  +    +
Sbjct: 400  IEQSPEELAEESPVYDVEQHEAPPQSKSRR---------IEDEELEVYGSGAKASVLQLR 450

Query: 500  TFSFAVRDSLVNIGPLKDFSYG-----------LRINAD---------ASATGISKQS-- 537
             F F V DSL+N+ P+     G           LR +AD          +ATG SK    
Sbjct: 451  KFIFEVCDSLINVAPINYMCAGERVEFEEDGATLRPHADNLNDLKIELVAATGHSKNGAL 510

Query: 538  -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                   N +++   EL GC  +WTV+  ++R   A ++R      + H ++++S  + T
Sbjct: 511  SVFVNCINPQIITSFELEGCLDVWTVFDDATR--KATTAR-----QDQHDFMLLSQRSST 563

Query: 588  MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
            +VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++    
Sbjct: 564  LVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 620

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                       S V+ VSIADPYV L + +G +  L    +  T  +       SS   V
Sbjct: 621  --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 672

Query: 708  SSCTLYHD------------------------------KGPEPWLRKTSTDAWLSTGVGE 737
             +   Y D                                 EP ++    +  L    G 
Sbjct: 673  VAIAAYKDLSGLFTCKADDVLNLTGSSGAGFANSFGGYMKAEPHMKVEDEEDLLYGDAGS 732

Query: 738  AI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDK 779
            A       D A        D +             VV  +SG LEI+ +P+   V+ V+ 
Sbjct: 733  AFKLNSMADLAKQSKQKNSDWWRRQLIQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVND 792

Query: 780  FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSR 838
              +G   + D    E +  S T+ NS +  G       ++ +S   +EL +     H  R
Sbjct: 793  IGNGALVLSDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGER 848

Query: 839  PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
            P L  + T   +L YQ + +                  +    +    L   + +   LD
Sbjct: 849  PLLL-VRTRLELLIYQVFRY------------AKGHLKIRFRKLEQLHLLEQQPTHIELD 895

Query: 899  AYTREETP----HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
                EE           Q++  F N+ G  G  + G  PC+  +  R  LR+H  L +G 
Sbjct: 896  GEDVEEAESYNMQAKYVQKLRYFANVGGLAGIMVCGVNPCFVFLTSRGELRIHRLLGNGD 955

Query: 954  IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEK 1013
            + +F   +NVN  HGF+Y  +   LKI  LPS  +YD  WPV+K+PL+ TP Q+ Y  E 
Sbjct: 956  VRSFAAFNNVNIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKVPLRCTPRQLVYHREN 1015

Query: 1014 NLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPD 1071
             +Y LI      +P+ +       D+E+  +              Y +   +E+ ++ P+
Sbjct: 1016 RVYCLITQKE--EPMTKYYRFNGEDKELSEECRGERF-------IYPIGSLFEMVLISPE 1066

Query: 1072 RAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
                 W+    A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG 
Sbjct: 1067 ----TWEIVPDASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGN 1122

Query: 1129 VLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
            + ++        P   +T     EV+ KE KG +SA++ + G L+   G KI + +    
Sbjct: 1123 IHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDG 1182

Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
            +L G+AF D   +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  ++ F 
Sbjct: 1183 DLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDFNPMEVFG 1241

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
             EF++D S L  +V+D ++N+ ++ Y P+  ES  GQKLL +A++H+G  V    R+Q  
Sbjct: 1242 IEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCH 1301

Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
                 +          N+  +++G+LDG++G   PL E  +RR   LQ  L+    H+ G
Sbjct: 1302 QRGLHQRQPFL---YENKHLVIYGSLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCG 1358

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            LNP+ +R   S  K        I+D +L+  + ML   E+ E+A + GT   +IL++L
Sbjct: 1359 LNPKEYRTIKSVKKLGINPSRCIIDGDLIWSFRMLAHSERNEVAKKIGTRTEEILADL 1416


>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
 gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
          Length = 1431

 Score =  474 bits (1219), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 411/1510 (27%), Positives = 668/1510 (44%), Gaps = 262/1510 (17%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    + ++  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVDATQRQKLNPSEMRLAPKM------RLECLASYSLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  G      RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQSVSLAGG----MRDALLVSFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I L DLD K  +V D  F+HGY EP ++IL+E   T AGR+             
Sbjct: 196  IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                  K+  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 243  ----------------------KVFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+ +  + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 275  SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++            
Sbjct: 335  NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVESAATAAA 394

Query: 448  ----------------SGLKEEFGDIEA-DAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
                               +++  D+E  +APS  + RR         +  EEL +YGS 
Sbjct: 395  TGAGEQQQQAIDQSPPQMDEDQVYDVEQHEAPSQAKSRR---------IEDEELEVYGSG 445

Query: 491  SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASA 530
            +  +    + F F V DSL+N+ P+     G R+  +                     +A
Sbjct: 446  AKASVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLTDLKIELVAA 505

Query: 531  TGISKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
            TG SK           N +++   EL GC  +WTV+  ++R       + +    E H +
Sbjct: 506  TGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVFDDATR-------KPSTARQEQHDF 558

Query: 579  LIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY 638
            +++S  + T+VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ 
Sbjct: 559  MLLSQRSSTLVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTR 617

Query: 639  MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPA 698
            + Q++               S V+ VSIADPYV L + +G +  L    +  T  +    
Sbjct: 618  LIQNVPI----------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINK 667

Query: 699  AIESSKKPVSSCTLYHD------------------------------KGPEPWLRKTSTD 728
               SS   V +   Y D                                 EP ++    +
Sbjct: 668  HTISSAPAVVAIAAYKDLSGLFTCKADDVLNLTGSTGAGFANSFGGYMKAEPHMKVEDEE 727

Query: 729  AWLSTGVGEAI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPN 770
              L    G A       D A        D +             VV  +SG LEI+ +P+
Sbjct: 728  DLLYGDAGNAFKLNSMADLAKQSKQKNTDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPD 787

Query: 771  FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAM 829
               V+ V+   +G   + D    E +  S T+ NS +  G       ++ +S   +EL++
Sbjct: 788  MKLVYLVNDVGNGALVLTDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELSL 843

Query: 830  QRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRN 889
                 H  RP L  + T   +L YQ + +                  L +      +L  
Sbjct: 844  VGLGQHGDRPLLL-VRTRLELLIYQVFRY--------------AKGHLKIRFRKLEQLHL 888

Query: 890  LRFSRTPLDAYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPCWC-MVFRE 941
            L    T ++    EET             Q++  F N+ G  G  + G  PC+  +  R 
Sbjct: 889  LDQQPTHIELINEEETDEAESYNMQPKYVQKLRYFNNVGGLAGIMVCGVNPCFIFLTARG 948

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
             LR+H  L +  + +F   +NVN  HGF+Y  +   LKI  LP+  +YD  WPV+K+PL+
Sbjct: 949  ELRIHRLLGNAEVRSFAAFNNVNIPHGFLYFDTTYELKISVLPTYLSYDAAWPVRKVPLR 1008

Query: 1002 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV 1060
             TP Q+ Y  E  +Y LI      +P+ +       D+E+  +              Y +
Sbjct: 1009 CTPRQLVYHRENRVYCLITQKE--EPMTKYYRFNGEDKELSEESRGERF-------IYPI 1059

Query: 1061 EE-YEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
               +E+ ++ P+     W+    A+I  +  E+    ++V L +  T    +  L IGT 
Sbjct: 1060 GSLFEMVLISPE----TWEIVPDASIQFEPWEHVTAFKLVKLSYEGTRSGLKEYLCIGTN 1115

Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIAS 1171
            +   ED+ +RG + ++        P   +T     EV+ KE KG +SA++ + G L+   
Sbjct: 1116 FNYSEDITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGL 1175

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
            G KI + +    +L G+AF D   +YV  +  VK+ I + D++KSI  L ++E+   L+L
Sbjct: 1176 GQKIYIWQLRDGDLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1234

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
             ++DF  L+ F  EF++D S L  +V+D ++NI ++ Y P+  ES  GQKLL +A++H+G
Sbjct: 1235 ASRDFNPLEVFGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLG 1294

Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
              V    R+Q       +          N+  +++GTLDG++G   PL E  +RR   LQ
Sbjct: 1295 QVVNTMFRVQCHQRGLHQRQPFLYE---NKHFVIYGTLDGALGYCLPLPEKVYRRFLMLQ 1351

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
              L+    H+ GLNP+ +R   +  K        I+D +L+  Y ML   E+ E+A + G
Sbjct: 1352 NVLLSYQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIG 1411

Query: 1412 TTRSQILSNL 1421
            T   +IL++L
Sbjct: 1412 TRTEEILADL 1421


>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
 gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
          Length = 1420

 Score =  473 bits (1218), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 408/1488 (27%), Positives = 661/1488 (44%), Gaps = 229/1488 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    + ++  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVDAAQRQKLNPTEMRLAPKM------RLECLASYSLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  G      RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQSVSLAGG----MRDALLISFKDAKLSVLQLDADTQALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +      R  
Sbjct: 136  GRYHVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I L DLD K  +V D  F+HGY EP ++IL+E   T AGR+             
Sbjct: 196  IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                  K+  +  PIGG LV+  N I Y +QS         Y V
Sbjct: 243  ----------------------KVFPIQKPIGGCLVMTVNAIIYLNQSVP------PYGV 274

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+ +  + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 275  SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++    E   + +A
Sbjct: 335  NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDMENAVEQQA 394

Query: 460  DAPSTKRL----------RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSL 509
               +  +L            + S A    +  EEL +YGS +  +    + F F V DSL
Sbjct: 395  VEQAPPQLDEEQVYDVDQHEAPSQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSL 454

Query: 510  VNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------NYE 540
            +N+ P+     G R+  +                     +ATG SK           N +
Sbjct: 455  INVAPINYMCAGERVEFEEDGSTLRPHAESLNEVKIELVAATGHSKNGALSVFVNCINPQ 514

Query: 541  LV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
            ++   EL GC  +WTV+  ++R       +      E H ++++S  + T+VL+T   + 
Sbjct: 515  IITSFELDGCLDVWTVFDDATR-------KPTTARQEQHDFMLLSQRSSTLVLQTGQEIN 567

Query: 598  EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
            E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++              
Sbjct: 568  EI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----------DV 616

Query: 658  NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-- 715
             S V+ VSIADPYV L + +G +  L    +  T  +       SS   V +   Y D  
Sbjct: 617  GSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAIAAYKDLS 676

Query: 716  ----------------KGP------------EPWLRKTSTDAWLSTGVGEAI------DG 741
                             GP            EP ++    +  L    G A       D 
Sbjct: 677  GLFTCKADDVLNLTGSSGPGFVNSFGGYMKAEPHMKVEDEEDLLYGDAGNAFKLNSMADL 736

Query: 742  ADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
            A        D +             VV  +SG LEI+ +P+   V+ V+   +G   + D
Sbjct: 737  AKQSKQKNSDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGALVLND 796

Query: 790  TYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
                E +  S T+ NS +  G       ++ +S   +EL +     H  RP L  + T  
Sbjct: 797  AM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGERPLLL-VRTRL 851

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP-- 906
             +L YQ + +                  +    +    L + + +   LD    EE    
Sbjct: 852  ELLIYQVFRY------------AKGHLKIRFRKLEQLHLLDQQPTHIELDGDEAEEAESY 899

Query: 907  --HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNV 963
                   Q++  F N+ G  G  + G  P +  +  R  LR+H  L +  + +F   +NV
Sbjct: 900  NMQPKYVQKLRYFSNVGGLAGIMVCGMNPVFVFLTARGELRIHRLLGNADVRSFAAFNNV 959

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
            N  HGF+Y  +   LKI  LPS  +YD  WPV+K+PL+ TP Q+ Y  E  +Y LI    
Sbjct: 960  NIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKVPLRCTPRQLVYHRENRVYCLITQKE 1019

Query: 1024 VLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRAGGPWQT-- 1079
              +P+ +       D+E+  +              Y +   +E+ ++ P+     W+   
Sbjct: 1020 --EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSLFEMVLISPET----WEIVP 1066

Query: 1080 RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNA 1138
             A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++      
Sbjct: 1067 DASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVV 1126

Query: 1139 DNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
              P   +T     EV+ KE KG +SA++ + G L+   G KI + +    +L G+AF D 
Sbjct: 1127 PEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDGDLIGVAFIDT 1186

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
              +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ F  EF++D S L
Sbjct: 1187 N-IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVFGIEFMVDNSNL 1245

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
              +V+D ++N+ ++ Y P+  ES  GQKLL +A++H+G  V    R+Q            
Sbjct: 1246 GFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQRGLHHRQPF 1305

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
                  N+  +++GTLDG++G   PL E  +RR   LQ  L+    H+ GLNP+ +R   
Sbjct: 1306 LYE---NKHLVIYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTIK 1362

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +  K        I+D +L+  Y ML   E+ E+A + GT   +IL+++
Sbjct: 1363 TVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGTRTEEILADM 1410


>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
            [Drosophila melanogaster]
 gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
 gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
            [Drosophila melanogaster]
          Length = 1420

 Score =  472 bits (1215), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 410/1486 (27%), Positives = 677/1486 (45%), Gaps = 225/1486 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+             
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRI------------- 242

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                  K+  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 243  ----------------------KVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 275  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 335  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 394

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E  ++E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 395  RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 453

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 454  SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 513

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 514  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 564

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 565  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 613

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 614  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 673

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 674  LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 733

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+   +G   + 
Sbjct: 734  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 793

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D    E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 794  DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 850

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R +   N+   +  ++       D     E+   
Sbjct: 851  ELLIYQ--VFRYPKGHLKI-----RFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQM 901

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 902  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 961

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  E  +Y LI      
Sbjct: 962  PNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1019

Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
            +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    A
Sbjct: 1020 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1068

Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
            +I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++        
Sbjct: 1069 SITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1128

Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
            P   +T+   KE+     KG +SA++ + G L+   G KI + +    +L G+AF D   
Sbjct: 1129 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDTN- 1187

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ +  EF++D S L  
Sbjct: 1188 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1247

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
            +V+D ++NI ++ Y P+  ES  GQKLL +A++H+G  V    R+Q       +      
Sbjct: 1248 LVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1306

Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
                N+  +++GTLDG++G   PL E  +RR   LQ  L+    H+ GLNP+ +R   S+
Sbjct: 1307 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSS 1364

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
             K        I+D +L+  Y ++   E+ E+A + GT   +IL +L
Sbjct: 1365 KKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1410


>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
          Length = 1503

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 398/1499 (26%), Positives = 679/1499 (45%), Gaps = 223/1499 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV   ANV+++Y  R+  +    ++      R   M       LE +  YRL GN+ SL
Sbjct: 42   SLVTGGANVLKVY--RIIPDADPATREKYSATRPPNM------KLECMASYRLFGNIMSL 93

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  G+    +RD+++++F DAK+SV++FD     L+  S+H FE  +   ++ G   
Sbjct: 94   QSVSLAGS----QRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 146

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGGF---SARI 230
                PLV+VDP  RC  +LVYG ++++L   K S      + D                I
Sbjct: 147  HYHIPLVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEMQDVKPIKKTPTLLIAKTPI 206

Query: 231  ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             +S++I L+DLD K  +V D  F+HGY EP ++IL+E   T+ GR++ +  TC + ALS+
Sbjct: 207  LASYIIELKDLDEKIDNVIDVQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 266

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
            +   + HP+IW+  +LP D  + + +  PIGG LV+  N++ Y +QS         Y VS
Sbjct: 267  NIQQRVHPVIWTVNSLPFDCLQAVPISKPIGGCLVMCVNSLIYLNQSVP------PYGVS 320

Query: 349  LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
            L+SS +        P+    + LDAA   +++++  +LS K G+L +LT+  D       
Sbjct: 321  LNSSADHSTNFPLKPQDGVRISLDAAQVCFIESEKLVLSLKGGELYVLTLCADS------ 374

Query: 402  DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
                         I        FLGSRLG+SLL++F     + +++    ++ G +E + 
Sbjct: 375  ----------MRSICVCETEYLFLGSRLGNSLLLRFREKDESLVITI---DDSGTVEKEQ 421

Query: 462  PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
               KR R    +            +YGS    T     ++ F V DS++NIGP+   + G
Sbjct: 422  ---KRQRLEEEEL----------EVYGSGYK-TSVQLTSYIFEVCDSVLNIGPIAHMAVG 467

Query: 522  LRINAD-------------------ASATGISKQSNYELVE------------LPGCKGI 550
             RI  +                    +A+G  K     +++            L GC  +
Sbjct: 468  ERICEEEMEEGAEVQFVPNKLDVEVVTASGHGKNGALCVLQSSIKPQVITSFGLSGCLDV 527

Query: 551  WTVYHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
            WTV+ +++       +R    DD     HA++I+S E  TMVL+T + + E+ E+  +  
Sbjct: 528  WTVFDEAAGPGGVTGTRKP--DDAPPPNHAFMILSQEGATMVLQTGEEINEI-ENTGFAT 584

Query: 608  QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
               TI  GN+   R ++QV  +  R+L G+ + Q++                 + SVSI 
Sbjct: 585  DVPTIHVGNIGSNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVSIV 634

Query: 668  DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD------------ 715
            DPYV +  S+G +  L       T  +       S+  PV + + Y D            
Sbjct: 635  DPYVCVRSSEGRVITLALREGKGTPRLAVNKNTISASPPVIAISAYRDVSGMFTRKLEDS 694

Query: 716  ----KG---------------PEPWLRKTSTDAWLSTGVGEAID---GADGGPLDQG--- 750
                KG               PEP ++    +  L    G +      AD    D+G   
Sbjct: 695  FDVSKGGGATSAYSSGFGSMKPEPNMKIEDEEDLLYGESGRSFKVTSMADMALADKGGGN 754

Query: 751  -------------DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL- 796
                           + +   ++G LEI+ +P+    + +    +G   + D+     L 
Sbjct: 755  ADFWLKYMQQIKPTYWLLAARDNGNLEIYSMPDLKLAYLISNVGNGNKVLSDSMEFVPLP 814

Query: 797  --KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
              K   ++  ++S  G   G      S+   E+ M    ++ SRP LF I  +  +L Y+
Sbjct: 815  MAKPGTSQEEATSAFGASFGSGGVPVSLLPKEILMVALGSYGSRPILF-IRLEQDLLIYR 873

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR- 913
             + +       +     S+    +   V A RL NL   +    A T    P+G   Q  
Sbjct: 874  VFRYAKGHLKLRFKRLTSSVTCPAFRTVPA-RLANLP-DKPATGATTDATEPNGKDTQEH 931

Query: 914  -----------ITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLH 961
                       I  F N+SG+ G  + G +P +  +     LR H       + AF   +
Sbjct: 932  ATKVQYENISMIRYFGNVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFN 991

Query: 962  NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
            NVNC +GF+Y   Q  LKI  LP+  +YD+ WPV+KIPL+++P QI Y  E  +Y +++ 
Sbjct: 992  NVNCPNGFLYFDEQYQLKISILPTYLSYDSVWPVRKIPLRSSPKQIVYHRENRVYCVVMD 1051

Query: 1022 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE--------EYEVRILEPDRA 1073
                            +E+ ++    N    +L      E        ++ V ++ P   
Sbjct: 1052 A---------------EEICNKYYRFNGEDKELTEENKGERFLYPMGHQFSVVLVNP--- 1093

Query: 1074 GGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVL 1130
               W+      I ++  E+ ++++ V+L     +   +  +A+GT +   ED+ +RGR+L
Sbjct: 1094 -AAWEIVPDTAIALEEWEHVVSLKNVSLAYEGARSGLKEYIAVGTNFNYSEDITSRGRLL 1152

Query: 1131 LFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1185
            L+        P   +T     EV  K+ KG +SA++ + G L+ A G K+ L +    +L
Sbjct: 1153 LYDIIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKVYLWQMKDDDL 1212

Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
             G+AF D   ++V  +  +K+ IL+ D++KS+  L ++++   L+L+++D+  L+ +  E
Sbjct: 1213 VGVAFIDTN-IFVHQMVSIKSLILVADVYKSVSLLRFQDEFRTLSLVSRDYHPLNVYQVE 1271

Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
            +++D + L  +V+D+Q N+  + Y P+  ES+ GQ+LL + ++H+G  V    R+Q    
Sbjct: 1272 YVVDNTNLGFLVADDQANLITYMYQPESRESFGGQRLLRKGDYHLGQRVNAMFRVQCDFH 1331

Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
             SD        D  N+    F TLDG  G + PL E T+RRL  LQ  L+   PH  GLN
Sbjct: 1332 ESDVMRRTLNYD--NKHTTFFATLDGGFGFVLPLPEKTYRRLFMLQNVLLTHSPHTCGLN 1389

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            P+++R    +          +VD +L+  +  LP  E+ E+A + GT   +I ++L ++
Sbjct: 1390 PKAYRTIKQSRALPINPSRCVVDGDLVWSFLELPANEKQEVAKKIGTRIEEICADLMEI 1448


>gi|242075248|ref|XP_002447560.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
 gi|241938743|gb|EES11888.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
          Length = 374

 Score =  445 bits (1144), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 220/359 (61%), Positives = 267/359 (74%), Gaps = 14/359 (3%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-------ELDSELPSK-RGI 52
           MS+AAYKMMHWPT I +C +GFITHS AD                  ++DS   S  R +
Sbjct: 1   MSYAAYKMMHWPTSIDHCAAGFITHSPADAAAFSSAAPAAAASGPDGDIDSAAASAPRRV 60

Query: 53  GPVPNLVVTAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHG 111
           GP PNLVV+AANV+E+Y VR     G+++  NS  T    ++DGIS A LELVCHYRLHG
Sbjct: 61  GPTPNLVVSAANVLEVYAVRADSATGAEDVGNSSSTG--AILDGISGARLELVCHYRLHG 118

Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
           N+ES+A+LS G      RRDSI + F+DAKI+ +EFDDS +GLR +SMHCFE PEW HLK
Sbjct: 119 NIESMAVLSDG---TENRRDSIAVTFKDAKIACMEFDDSTNGLRTSSMHCFEGPEWFHLK 175

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
           RGRESFA GP++K DPQGRCG VLVYGLQMIILKA++ G  LVG+++        + RIE
Sbjct: 176 RGRESFAWGPIIKADPQGRCGAVLVYGLQMIILKAAEVGQSLVGEDEPTRMLSSTAVRIE 235

Query: 232 SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
           SS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAGR+S K  TCM+SA SIS  
Sbjct: 236 SSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAGRISSKSQTCMLSAFSISMG 295

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
           LKQHP+IWSA  LPHDAY+LLAVP PI G+LV+ AN+IHYHSQS SC+LALN+++   D
Sbjct: 296 LKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYHSQSTSCSLALNSFSSQPD 354


>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
          Length = 1260

 Score =  443 bits (1140), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 371/1327 (27%), Positives = 612/1327 (46%), Gaps = 177/1327 (13%)

Query: 57   NLVVTAANVIEIY--VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
            NLVV  ANV+ ++  +    E+  ++    G+  +           LE +  Y L G V 
Sbjct: 29   NLVVAGANVLRVFRLIPNTDEKMLRKESADGQPPK---------MKLECLASYNLFGKVM 79

Query: 115  SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            S+A +S  G+     +D+I+++F  AK+S++E+D     L+  S+H FE    L    G 
Sbjct: 80   SIAAVSLPGSS----QDTILMSFAHAKLSLIEYDPVSDNLKTLSLHNFEVVSIL--DEGI 133

Query: 175  ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE-SS 233
             S  + P ++VDP+GRC  +L++   + IL               F       + +  SS
Sbjct: 134  GSNHKIPEIRVDPEGRCAALLIFRNTLAILP--------------FRKDSAHDSNVTLSS 179

Query: 234  HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++I L DL+ +  +V D  F+HGY EP ++IL+E   T+ GR++ +  TC + A+S++T 
Sbjct: 180  YIIKLTDLEERVDNVIDVQFLHGYYEPTLIILYEPVGTFPGRIAVRQDTCNMVAVSLNTQ 239

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLD 350
             + HP+IWS  +LP D  +LL VP P+GG L++  N++ Y +QS     +++N+ A    
Sbjct: 240  QRVHPIIWSLNSLPFDCSQLLPVPKPLGGALIMAVNSVIYVNQSVPPYGVSVNSIADHCT 299

Query: 351  SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPS 409
            S    P     + LD A A +LQ D  +LS K G+L +LT+  D  R V++  L K   S
Sbjct: 300  SFPLKPYEGSRIGLDCARAAFLQYDRVVLSLKGGELYVLTLFADSMRSVRKFHLEKAAAS 359

Query: 410  VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
            VLT+ +    N LF LGSRLG+SLL+ F         +    +      A  P  ++   
Sbjct: 360  VLTTCLCICDNYLF-LGSRLGNSLLLAFQ--------TKDYNQYATPFAAKKPKMEQFSL 410

Query: 470  SSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
                 L D ++ EE+   LYG    +T+S   ++ F V DSL+NIGP    + G      
Sbjct: 411  LFDQEL-DHLDEEEIDNYLYGEDHESTDSKAISYQFEVCDSLLNIGPCGQMAVG---EPA 466

Query: 528  ASATGISKQSNYELVELP-----GCKGIWTVYHKSSRGHNADSSRMAAYDDEY------- 575
            ++ T   K+S    VE+      G  G   V  ++ +     +  +    D +       
Sbjct: 467  STCTDFDKKSPDPDVEIVTTSGYGKNGAICVLQRTMKPQVVTTFELPEVSDMFTVFASRN 526

Query: 576  ------HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
                  H YL++S    TMVL+T   + E+ +S  + V   TI A NL   R ++QV   
Sbjct: 527  NEDAIMHTYLLLSRADSTMVLQTGQEINEMDQS-GFSVTSPTILAANLGNNRFIVQVCPT 585

Query: 630  GARILDGS-YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV----------LLGMSDG 678
              R+LD +  + Q+L              +  + S S +DPYV          LL   +G
Sbjct: 586  SVRLLDATATVIQELVM----------DSDFLITSASASDPYVAVLTENGRIGLLTFVEG 635

Query: 679  SIRLLV------GDPSTCTVSVQTPAAIESSKKPVS---SCTLYHDKGPEPWLRKTSTDA 729
            S   ++        P  C    +  + + ++  P +     T  H        +K   D 
Sbjct: 636  SQLEMIFPVLSKNSPVVCVCLYRDISGLFNTTIPETDSPETTKLHTANKSLNAKKEMDDE 695

Query: 730  ----WLSTGVGEAIDGADGGPLDQGDIYSVVCY--------------ESGALEIFDVPNF 771
                +  T   E+    D           V+ Y              ++G LEI+ +   
Sbjct: 696  EDYLYGDTNTEESRPTEDKTHTKFTPQQKVIDYFREIKPTFWLSIIRQNGTLEIYSLAGQ 755

Query: 772  NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
            +    V+ F +   H+    +   +K  ET + SS+                +VE+ +  
Sbjct: 756  S---VVETFQTVHVHLGHRLIFN-MKADETSLPSSTH-------------CNIVEMGIFG 798

Query: 832  WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRN-- 889
                H RP L    +D  +L Y+A              PV  S+  +   +   +L +  
Sbjct: 799  LGHLHRRPLLMIRTSDFGVLLYEAI----------PALPVYDSKQKNELKIRFRKLNHSL 848

Query: 890  -LRFSRTPLDAYTREE------TPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRE 941
             LR ++T    Y R+        P+     +   F NI+G+ G F+ G  P W  M  R 
Sbjct: 849  LLRETKT----YVRKGGQSVVLEPYAWKTNQFKYFSNIAGYTGVFIGGPYPHWLFMTSRG 904

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
             LR+HP   DGSI  F   HNVNC  GFIY+  +  L+IC LP+   YD  WPV+K+PL+
Sbjct: 905  ELRLHPMSIDGSIKCFACFHNVNCAQGFIYLNRKDELRICLLPTLFNYDAPWPVRKVPLR 964

Query: 1002 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT-V 1060
             TPH + Y  E   Y  I++  + +P N++     D +        +L   D    Y  V
Sbjct: 965  CTPHYLIYHVETKTY--ILATSLAEPTNRIYRFNGDDK------ELSLEERDDRFPYPHV 1016

Query: 1061 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQ 1119
            E++ ++++ P        TR  + +   E+   ++ V+L +       +  LA+ T Y  
Sbjct: 1017 EKFAIQLISPVTWEAVPNTR--MDLDDWEHVTCLKTVSLEYEGHASGLKDYLAVSTNYNY 1074

Query: 1120 GEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
            GED+ +RGR+ +         P     +N +  +Y+K+ KG ++A++S+ G+L+ A G K
Sbjct: 1075 GEDIISRGRIFILDLIEVVPEPGQPLTKNKIKTLYAKDQKGPVAAISSVCGYLVAAIGQK 1134

Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
            I L +    +L GIAF D   +Y+  L  +K+FIL  D++KS+  L ++E+   L ++A+
Sbjct: 1135 IYLWQLKNDDLVGIAFIDT-EIYIHQLLNIKSFILAADVYKSVSILRFQEEYRTLCIVAR 1193

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
            D+  L+  A ++ ID + L  +VSD +KN+ ++ Y P+  ES  G +L+ +A+FHVG  V
Sbjct: 1194 DYQPLEVMAVDYYIDNTQLGFLVSDAEKNLILYMYQPEARESQGGHRLIRKADFHVGQVV 1253

Query: 1295 TKFLRLQ 1301
            +   R++
Sbjct: 1254 STMFRIK 1260


>gi|307107849|gb|EFN56091.1| hypothetical protein CHLNCDRAFT_145620 [Chlorella variabilis]
          Length = 1626

 Score =  432 bits (1112), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 427/1602 (26%), Positives = 653/1602 (40%), Gaps = 375/1602 (23%)

Query: 4    AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
            A    +H PT + +C + ++TH++          Q               P+P+LVV  +
Sbjct: 7    AVCTQVHPPTAVTHCTAAWLTHAQRQ--------QGSGSADGDDGGGSGDPLPDLVVVRS 58

Query: 64   NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
              +E+Y VR  E G   + ++             A SL+ +   RL G  ES+A+L +G 
Sbjct: 59   TQLELYSVRGSEAGGPATTHT-------------AQSLDQLASCRLFGVAESVAVL-RGR 104

Query: 124  ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
            A    +RD ++L F DAK+SVL +D   H L  +S+H FE      LK GR  F   PL 
Sbjct: 105  APG--QRDVLLLTFRDAKLSVLHWDAGRHELAPSSLHYFEGDA--SLKLGRTVFPYPPLA 160

Query: 184  KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG-------------------- 223
              DP GRCG V+++  Q+ +L A         D + FG G                    
Sbjct: 161  VTDPLGRCGAVIIFRHQLAVLPAV--------DSELFGLGLSAAEEDEEEAAATAALGLA 212

Query: 224  --------------------------GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIE 257
                                         +A + +S+V NL    +K V+D  F+HGY E
Sbjct: 213  PPDGGGAADGEAGAPRGGAAAAAAGLPAAAAAVGNSYVDNLGKAGIKEVRDACFLHGYSE 272

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            PV+++LHE E TWAG +  K  TC+++ALS++ T K HP IW A  LP DAY+L A  +P
Sbjct: 273  PVLMVLHEAEPTWAGNLRQKKDTCVLTALSLNLTRKHHPKIWGAQELPSDAYRLSA--AP 330

Query: 318  IGGVLVVGANTIHYHSQSASCALALN---------------------------------- 343
             GGVLV+  + + ++ Q     + L+                                  
Sbjct: 331  CGGVLVLCQHLVLHYRQGQQSGVVLHPSALPPAAAPPPLLFDPQAMAEAGGPGPASAAYA 390

Query: 344  -NYAVSL------------DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
              +AV +            D+SQ    ++  V  D A   WL  + ALL  ++G L+ L 
Sbjct: 391  RQHAVDVHPETVPAAVRFCDASQA---AALKVTADGASVCWLSPESALLCLRSGQLLQLA 447

Query: 391  VVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS----------------------LFFLG 426
            ++    G   + L +++   +   S   ++  +                      L FLG
Sbjct: 448  LLPQQAGGSARHLAVARAGAAPHPSCCCSLSGAHRAPHMPGSAAAAAAGQAPQPALVFLG 507

Query: 427  SRLGDSLLVQFT----CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--- 479
            S  GDSLLV+ T     G+     ++       D  AD P++KRLR    +         
Sbjct: 508  SAAGDSLLVRATPAAAAGTKRPAEAATGAAGEEDGTADEPASKRLRLEGIEVGSAAAAVE 567

Query: 480  --------------------------------NGEELSLYGSA--------------SNN 493
                                              EE  +YG+A              +  
Sbjct: 568  ATAAAAAAAQGAAAAAAEARAAAGGGPAGSDSEDEEALIYGTALYSSAAGVAPAAAAAVP 627

Query: 494  TESAQ-KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWT 552
            T S Q + +   V DSL NIGPL+DF+        A A G +           G  G  T
Sbjct: 628  TPSWQLQRYQLKVLDSLANIGPLRDFAVA---EPAAGAGGEAVPPALVGCSGEGKGGTLT 684

Query: 553  VYHKS----------------SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
            V  +S                  G    +   A  +  +HAYL++S +  T VL T + L
Sbjct: 685  VLRRSVVPDVITEHRGAASASGGGSGQAAGEAAGQEGGHHAYLLLSFQGATKVLATGEEL 744

Query: 597  TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD-----LSFGPSNSE 651
             EVTESV++ V   T+AAG++   RR+ Q F +G R+LDG    QD     L+   + + 
Sbjct: 745  REVTESVEFAVDTPTLAAGSVCCGRRIAQAFPQGLRLLDGEESVQDVWASELAAPAAAAA 804

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQT--------------- 696
            +G       ++S  + DPYVLL ++DG+ R L  DP  C +S  +               
Sbjct: 805  AGGAPGGGAIVSADMCDPYVLLYLADGTARFLTADPVACRLSAASAAGAGPEAAAAAEAA 864

Query: 697  -----PAAIESSKKPVSSCTLYHD-------KGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
                 P A E     +++C+L+ D       + P+   +            G     A  
Sbjct: 865  EAALRPVAAEER---ITACSLFADSCGWLAARLPQTQQQTQQQQQQQGQQDGGTTAQAAA 921

Query: 745  GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
                 G +Y+VVC  SGA +++ +P +  VF+    ++G   ++ T    A   +     
Sbjct: 922  SGGGCGAVYAVVCRASGACQLYALPAWQPVFSSSTSLAGGPALL-TGSGGAGGVAAAAAA 980

Query: 805  SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-----------------PFLFAILTD 847
            +++         E     +VVE+ +  +    +                  P L A+  D
Sbjct: 981  AAAAAAAAGVEDEMDGPGEVVEVRLVSFGPAAAGRRDAAAARASPAPACEPPLLLALTAD 1040

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-----NLRFSRTPLDAYTR 902
              +L YQA+        ++      T R   +       L       LR  R        
Sbjct: 1041 HQLLAYQAFSASPGSGGTRGSSGSGTPRFRRLRLDLPPLLPPAGGPQLRLRRLHCFEGLG 1100

Query: 903  EETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD---------GS 953
            EE P+                 G F++G  P W +  R  L  HP               
Sbjct: 1101 EEAPY----------------SGVFVAGQHPHWLVASRGGLLPHPHFLPQPAGPGAAAVG 1144

Query: 954  IVAFTVLHNVNCNHGFIYVTS--QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFA 1011
               FT  HNVNC HGFI  TS  +  ++I QLP  +  D  WP Q++ +K TP ++ ++A
Sbjct: 1145 AAGFTPFHNVNCPHGFIVATSGARSGIQISQLPPRTRLDAPWPRQRVSIKGTPLKVAHYA 1204

Query: 1012 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1071
            E +++              VLS    +  G  +               +E +EVR + P 
Sbjct: 1205 EADMF-------------AVLSSRQGRARGRGV---------------MEGHEVRWVWP- 1235

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1131
              GG WQ       +  E AL+V  V L +  T     LLA+G A   GED    GR+LL
Sbjct: 1236 --GGGWQGVGRHQRRPGERALSVGAVRLKDHATGATVPLLAVGAALPAGEDYPCGGRLLL 1293

Query: 1132 FSTGRNADNP----QNLVTEVYSKELKGAISALASLQGHLLIASGPKI------------ 1175
            F   R         Q     +Y++E KG +++++ L+G+LL+ASG +I            
Sbjct: 1294 FEVTRGDGGGGGGGQWAGRLIYTREFKGPVTSVSGLEGYLLLASGNRIETCSLSSTTITS 1353

Query: 1176 -----ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1230
                  +   T  ++   AFYD P L + SLN+VKNF+LLGD   S+ F+ +K++G QL+
Sbjct: 1354 TADDGTVAATTTWKVQRSAFYDGPVL-LTSLNVVKNFVLLGDCQHSVQFVRYKDEGRQLS 1412

Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1290
            LL+KDF   D  AT+FLI+GS+L L   D    +++  YAP    SWKGQ+L++   FHV
Sbjct: 1413 LLSKDFNRADTAATQFLINGSSLHLASCDSAGTLRLLSYAPSHPASWKGQRLVAWGSFHV 1472

Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
            G   +   RL++  +S +        D+T R A+L  +  GS
Sbjct: 1473 GEAASCMRRLRLHPSSPE--------DRTVRQAVLLSSAAGS 1506


>gi|449477808|ref|XP_004155129.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Cucumis sativus]
          Length = 643

 Score =  404 bits (1039), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 188/255 (73%), Positives = 220/255 (86%), Gaps = 1/255 (0%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
           MSFAAY+MMHWPTGI NC S +ITHSRAD+VP +    +++LDS+   +R IGPVPNLVV
Sbjct: 1   MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTS-HSDDLDSDWHPRRDIGPVPNLVV 59

Query: 61  TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
           TA NV+E+YVVRV EEG +ESK+SGE KR  +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60  TAGNVLEVYVVRVLEEGGRESKSSGEVKRGGIMDGVSWASLELVCHYRLHGNVESMAILS 119

Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120 SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179

Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
           P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G  SAR+ESS++INLRD
Sbjct: 180 PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239

Query: 241 LDMKHVKDFIFVHGY 255
           LD+KHVKDF+FVH Y
Sbjct: 240 LDVKHVKDFVFVHVY 254



 Score =  368 bits (945), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 189/258 (73%), Positives = 208/258 (80%), Gaps = 26/258 (10%)

Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
           GDIE DA + KR+RRSSSDALQDMV G+ELSLYGSA+NNTESAQK FSFAVRDSL+NIGP
Sbjct: 342 GDIEVDAHTAKRMRRSSSDALQDMVGGDELSLYGSAANNTESAQKIFSFAVRDSLINIGP 401

Query: 515 LKDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCK 548
           LKDFSYGLRINAD +ATGI+KQSNYELV                          ELPGCK
Sbjct: 402 LKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEMITEVELPGCK 461

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
           GIWTVYHK++RG  ADSSRM   DDEYHAYLIISLEARTMVL T +LLTEVTESVDYFV 
Sbjct: 462 GIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIISLEARTMVLVTGELLTEVTESVDYFVH 521

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
           GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQDL+   + +ESG+ SE  TVLS SI+D
Sbjct: 522 GRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLNLVVNGNESGNASEGCTVLSASISD 581

Query: 669 PYVLLGMSDGSIRLLVGD 686
           PYVLL M+DGSIRLLVG+
Sbjct: 582 PYVLLTMTDGSIRLLVGE 599


>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
          Length = 1595

 Score =  385 bits (988), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 306/1011 (30%), Positives = 469/1011 (46%), Gaps = 163/1011 (16%)

Query: 501  FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQSNY---------ELV 542
            + F+V+DSL+ I P+ D + G    +  D        +A G  K             ELV
Sbjct: 668  YKFSVKDSLLCISPVVDLTVGASAPVGTDLDPRTELVAACGHGKNGALAILTRGITPELV 727

Query: 543  ------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA--RTMVLETAD 594
                   LPG +  W      +   N  + R    D+ +  +LI+SL +   TMVLET +
Sbjct: 728  TEVESGALPGLRACWAT---RTEDDNDGTVRPKRKDELFDEHLILSLSSTKTTMVLETGE 784

Query: 595  LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG- 653
             L EV++ VD+ V   T+A   +F  R + QV +   R     +  +   F   + +   
Sbjct: 785  ELREVSKEVDFIVDEETLACERIFNGRAIAQVTKTKIR-----FTRKGKKFAVDDIDLAF 839

Query: 654  -SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-------ESSKK 705
              G E + +    I +  + L +SDGSIR+++GD  T T ++              ++  
Sbjct: 840  LKGGEGAQITLAIIQNDAIALRLSDGSIRIILGDSKTNTFTLLEKVGELFASDNHSNTGS 899

Query: 706  PVSSCTLYHD----------------KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ 749
             V++ TLY D                + P  WL +T      + G  E  D +     + 
Sbjct: 900  DVTAFTLYDDSVACTDSFGGGGGGLNRAP-GWLERT------ACGDREEKDESK----EN 948

Query: 750  GDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
             ++        G L ++ +P+   +++      G         RE L  + T I+S    
Sbjct: 949  NNVVFATISRDGTLALYSLPSLKKLWSSGGVSDG---------REILAPNSTGIDSIDFN 999

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
               +  K  +  +++   A    +A + RP L     DG++L YQA  F+ P +      
Sbjct: 1000 DECEVEKYTVSDIRLDAFA----NAAYERPLLTCFRADGSVLAYQA--FKSPSSN----- 1048

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT--REETPHGAPCQ---RITIFKNIS--- 921
                                LRF+R P++  T   E T +    Q   R+T  +NI    
Sbjct: 1049 -------------------ELRFARVPIEIETAGSELTNNDVSVQGGSRLTRIENIGDGR 1089

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI-VAFTVLHNVNCNHGFIYVTSQGILKI 980
            G  G F+SG  P W +V R R+   P   +G   +AF   HNVNC  GFI  T++G +++
Sbjct: 1090 GIAGVFVSGLNPIWLIVRRGRVLALPTRGEGGARIAFAPFHNVNCPKGFILATNEGGIRV 1149

Query: 981  CQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
            C+LP     +  WPV+K+ L+ TP  ITY  +  LY L+ S  V  P         D E+
Sbjct: 1150 CRLPGKMHIEAQWPVRKLALRCTPRAITYMNDFKLYALVTSASV--PWK-------DFEI 1200

Query: 1041 GHQIDNHNLSSVDLHRTYT------VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
              + D+H  +     +         V+++ +R+L P      WQ      ++  E+ L V
Sbjct: 1201 -DETDSHARALYRFRKEKAKSEGNVVQQFAIRLLVPGTLETAWQK----AVEPGEHILCV 1255

Query: 1095 RVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST--GRNADNPQNLVTEVY-SK 1151
            + V + + +T    ++LAIGTA   GED   RGR+LLF+    R  D       E+   K
Sbjct: 1256 KNVQIRDQSTGALLSMLAIGTAMPGGEDTPCRGRILLFAIMWERARDGGVRWRGELKCEK 1315

Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
              K A SA+ S+ G  ++A G K+  H W G  LN IAFYD  PLY  +L  VKNF+L G
Sbjct: 1316 PSKMACSAIESVDGTFMVAIGTKLTAHSWDGKHLNPIAFYDT-PLYTTTLCCVKNFLLCG 1374

Query: 1212 DIHKSIYFLSWKE-QGAQ-LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
            D+HKSI F+ WK+ QG + L+ L KD+  LDC A+EF+IDG TLSL+ +D   N  +F Y
Sbjct: 1375 DLHKSIRFVRWKDSQGEKTLSQLGKDYEVLDCIASEFMIDGGTLSLLAADANGNAHVFQY 1434

Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
            APK++ESWKG KLL ++ +H G+ + K +R Q+            G  K NR A+ FG+ 
Sbjct: 1435 APKLAESWKGDKLLPKSAYHAGSLIRKMVRFQI----------GVGEQKQNRHAVFFGSS 1484

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA------GLNPRSFRQFHSN--GKAHRP 1381
            DG +G  +P+DE TF  L+ LQ  +  ++   +      GLN +++R   S+    A + 
Sbjct: 1485 DGGLGIFSPVDEHTFLNLEKLQDAMRSNIVASSNSINPLGLNSKTYRALKSSEGSVARQT 1544

Query: 1382 GPDSIVDCELLSHYE-MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
             P +IVD  LLS +E  L +  Q  +A + G TR Q LS    +    SF+
Sbjct: 1545 PPRTIVDGGLLSKFEHSLSITAQTRVAAKAGLTRDQALSLARTIIAEQSFM 1595



 Score =  142 bits (358), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 100/325 (30%), Positives = 163/325 (50%), Gaps = 72/325 (22%)

Query: 181 PLV-KVDPQGRCGGVLVYGLQMIILK----------------ASQGGSGLVGDEDTFGSG 223
           P++ + DP+GRC  VL+   +   +K                +S G   +   ++  G G
Sbjct: 208 PIIGRADPEGRCAAVLLRNEEKAKVKIMPASETSTSSNYIKESSNGSKKMTTKKE--GEG 265

Query: 224 GGF-SARIESSHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKH 278
             +  A I SS  +++R +       V+D  F+HGY EPV++IL+E    TW+GR+S + 
Sbjct: 266 TVYVPATIGSSFDLDVRKILGPSAAFVRDCCFLHGYGEPVLMILYESNPPTWSGRLSLRM 325

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
            TC + A+SI  T K++ ++W+   LP  AY L  VP+P+GGVLV+ +  I Y SQS+S 
Sbjct: 326 DTCKLVAVSIDCTKKKYTIVWTREKLPSAAYSLFPVPNPLGGVLVLSSGHILYESQSSSA 385

Query: 339 ALALN----------NYAVSLD------------------------SSQELPRSSFSVEL 364
               +          N+A  +                         SS E  ++ F V+L
Sbjct: 386 TYISDFLGKGGPQEGNFAEEIARNNGVEGQAAHANPVPHVNSNKNVSSYETTQNEFQVQL 445

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTVVYD------------GRVVQRLDLSKTNPSVLT 412
           DAA    ++ +VA++S+KTG L+  TV+ +            GR  +R+ + K+  +VL+
Sbjct: 446 DAAKIEMIRENVAIISSKTGQLI--TVILETVGGAASVGSKVGRRCRRIRVLKSGNAVLS 503

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
           S +  +G  L F+GSR+GDSLL+ +
Sbjct: 504 SGLAAVGKDLLFIGSRVGDSLLIGY 528


>gi|301093545|ref|XP_002997618.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110008|gb|EEY68060.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1744

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 372/1498 (24%), Positives = 634/1498 (42%), Gaps = 337/1498 (22%)

Query: 235  VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
            ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 278  LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 337

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A    +
Sbjct: 338  TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 397

Query: 352  SQELPRS------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
                P S              +V L      +LQ    LL+  +G + +L++ Y+     
Sbjct: 398  QSVFPLSEAVYETPEHETVQLNVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSR 457

Query: 395  ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
                      GR    L L     S+  S +         F+GSR GDS+L         
Sbjct: 458  GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 516

Query: 444  SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
            +      K+E   I      +  AP  K     S  A ++  + ++L LYG+A    E A
Sbjct: 517  TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 571

Query: 498  QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
              +                           + + +R  D L +IG +     G+  NAD+
Sbjct: 572  ATSSTECTNGVGVSSVKTEENGAPEQDTGSYDYELRQIDVLPSIGQITSIELGVENNADS 631

Query: 529  S--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
            +        + G  +     ++            EL GC+ +WTV         +   R 
Sbjct: 632  NEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATRSSDGR- 690

Query: 569  AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
                  Y+AYLI+S+  RTMVL T + +  + +   ++  G T+AA NLF ++R++Q+F+
Sbjct: 691  -----SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGSTLAAANLFNKQRIVQIFK 745

Query: 629  RGARIL------------------DGSYM----------------TQDLSFGPSNSESGS 654
            +GAR++                  +G+                  TQ+++        G 
Sbjct: 746  QGARVMMEVPEEETSNGQEKSAKTEGAEDEEEDDEDDGPRVKLVCTQEITLEGDVECGGM 805

Query: 655  GSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP------------AAI 700
              + S+V  +SV + DPY+LL ++DGS+RLL+GD     +SV  P             + 
Sbjct: 806  NVDTSSVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDLELSVIDPEIDYAEGISEANGSA 865

Query: 701  ESSKKPVSSCTLYHD----------------------------------------KGPEP 720
            + SK   SS  L++D                                          P P
Sbjct: 866  DMSKHGSSSACLFYDWAGMFVENAWVEEEQEERHEATQSRAKRAEDDDDMDALYSSKPSP 925

Query: 721  WLRKT-STDAWLSTGVGEAIDGADGGPLDQ---GDIYSVVCYESGALEIFDVPNF----- 771
             +  T +T +  ST      DG+   PL Q     +   +C+  G+L +F +P+F     
Sbjct: 926  KVATTNATKSTPSTATPRNEDGSVSIPLLQQKDAKMMCGMCFGDGSLHVFSLPDFKKRGV 985

Query: 772  ---------NCVFTVDKFVSGRTHIVDTYMREALKDSETEIN-SSSEEGTGQGRKENIHS 821
                     + V T++ +  GR   V       L      +N S+S    G+ +K +  +
Sbjct: 986  FPYLTFAPQSLVNTLEHYQVGRNKTVK------LSAPVLGLNASTSSANDGRIKKSHTIN 1039

Query: 822  MKVVELAMQRW--------SAHHSRPFLFAILTDGTILCYQAY-LFEGPENTSKSD-DPV 871
              V ++ + R         + + SR  +   L +G +L Y A   FE  +  +  +  PV
Sbjct: 1040 SPVADIVIHRVGPSEGQHNAQYLSRMVMLVFLANGDLLMYSAAPKFESLKPRANGEIAPV 1099

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI---------TIFKNISG 922
                 +    ++   L     +    +A    E    A   ++         T F N++ 
Sbjct: 1100 FHFVRVGTELITRPFLPPKARTNAHNEAGNNPEVNTSAVLAKLRAGFRYPMLTCFHNVNN 1159

Query: 923  HQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNHGFIYVTSQG 976
              G F  G+ P W +  R      P +C+ +      +++FT  H+ NC +GFIY  S+G
Sbjct: 1160 MSGAFFRGAHPMWILGDRGHASFVP-MCNAAPRVSVPVLSFTSFHHWNCPNGFIYFHSRG 1218

Query: 977  ILKICQLPSGSTY-----DNYWPVQKIPLKATPHQITYFA-----------EKNLYPLIV 1020
             L++C+LPS  T         + +QK    AT H + Y             E   Y ++ 
Sbjct: 1219 ALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVAEALEAPTYAVVC 1278

Query: 1021 SV--------------------------PVLKPL-NQVLSLLIDQEVGHQIDNHNLSSVD 1053
            S                           P   PL + V++   +    ++ D        
Sbjct: 1279 SARLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFADYETD-------- 1330

Query: 1054 LHRTYTVEE-YEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT------- 1103
             H  +T E+ YE+R+++ D  G  W  R    +  +  E  L+V+++ L++++       
Sbjct: 1331 -HMAHTEEDVYELRLVQTDEFG-EWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKEEVA 1388

Query: 1104 ------TKENETLLAIGTAYV--QGEDVAARGRVLLFST----------GRNADNPQNLV 1145
                   K+    L +GT +V   GED + RGR+LL+            G  +     L 
Sbjct: 1389 STSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLYELDYAQYVNEEGGATSGKLPKLR 1448

Query: 1146 TEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1205
                 +  +GAIS ++ L  ++L A G K+I++++   +L G AFYDA  +Y+V+L++VK
Sbjct: 1449 LVFIKEHRQGAISMVSQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQ-MYIVTLSVVK 1507

Query: 1206 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
            +F++ GD++KS++FL W+E   QL LLAKD+  L   ATEF +    L+L+  D  +N+ 
Sbjct: 1508 DFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEPLAVSATEFSVFEKKLALLAVDMDENLH 1567

Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTN 1320
            +  +AP+  ES  GQ+LL  ++FH+G  V+   R ++ A+ S     +   AAP S+  N
Sbjct: 1568 VMQFAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN 1627

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
                + GT +G +G + P+ E  FRRL +LQ  +V+++P    LNPR FR   +N +   
Sbjct: 1628 ----VMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRC 1683

Query: 1381 PGPDS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
              PD+          +D  +L  +  L    Q E+A   GTT   ++ NL ++   TS
Sbjct: 1684 GRPDAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATS 1741


>gi|301103686|ref|XP_002900929.1| cleavage and polyadenylation specificity factor subunit, putative
            [Phytophthora infestans T30-4]
 gi|262101684|gb|EEY59736.1| cleavage and polyadenylation specificity factor subunit, putative
            [Phytophthora infestans T30-4]
          Length = 1561

 Score =  372 bits (954), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 370/1484 (24%), Positives = 630/1484 (42%), Gaps = 322/1484 (21%)

Query: 235  VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
            ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 108  LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYYLTVISINMK 167

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
             + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A     
Sbjct: 168  TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 227

Query: 350  -------DSSQELPR---SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
                   D+  E P    +  +V L      +LQ+   LL+   G + +L++ Y+     
Sbjct: 228  QSVFPLSDAVYETPEHETAQLNVVLYDCQFEYLQDKELLLTMPCGQVYVLSLPYEDTSSR 287

Query: 395  ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
                      GR    L L     S+  S +         F+GSR GDS+L         
Sbjct: 288  GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 346

Query: 444  SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
            +      K+E   I      +  AP  K     S  A ++  + ++L LYG+A    E A
Sbjct: 347  TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 401

Query: 498  QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
              +                           + + +R  D L +IG +     G+  NAD+
Sbjct: 402  ATSSTECTNGVGVSSVKTEENGAPEQDTGPYDYELRQIDVLPSIGQITSIELGVENNADS 461

Query: 529  S--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
            +        + G  +     ++            EL GC+ +WTV         +   R 
Sbjct: 462  NEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATRSSDGR- 520

Query: 569  AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
                  Y+AYLI+S+  RTMVL T + +  + +   ++  G T+AA NLF ++R++Q+F+
Sbjct: 521  -----SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQRIVQIFK 575

Query: 629  RGARIL------------------DGSYM----------------TQDLSFGPSNSESGS 654
            +GAR++                  +G+                  TQ+++        G 
Sbjct: 576  QGARVMMEVPEEETSNGQEKSGKAEGAEDEEEDDEDDGPRVKLVCTQEITLEGDVECGGM 635

Query: 655  GSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP------------AAI 700
              + S+V  +SV + DPY+LL ++D S+RLL+GD     +SV  P             + 
Sbjct: 636  NVDTSSVGIVSVDVVDPYILLLLTDVSVRLLMGDEEDLELSVIDPEIDYAEGISEANGSA 695

Query: 701  ESSKKPVSSCTLYHD-------------KGPEPWLRK-TSTDAWLSTGVGEAIDGADGGP 746
            + SK   SS  L++D               P P +    +T +  ST      DG+   P
Sbjct: 696  DMSKHGSSSACLFYDWAEDDDDMDALYSSKPSPKVATMNATKSMPSTATPRNEDGSVSIP 755

Query: 747  LDQ---GDIYSVVCYESGALEIFDVPNF--------------NCVFTVDKFVSGRTHIVD 789
            L Q     +   +C+  G+L +F +P+F              + V T++ +  GR   V 
Sbjct: 756  LLQQKDAKMMCSMCFGDGSLHVFSLPDFKKRGVFPYLTFAPQSLVNTLEHYQVGRNKTVK 815

Query: 790  TYMREALKDSETEIN-SSSEEGTGQGRKENIHSMKVVELAMQRW--------SAHHSRPF 840
                  L      +N S+S    G+ +K +  +  V ++ + R         + + SR  
Sbjct: 816  ------LSAPALGLNASTSSANDGRIKKSHTINSPVADIVIHRVGPSEGQHNAQYLSRMV 869

Query: 841  LFAILTDGTILCYQAY-LFEGPENTSKSD-DPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
            +   L +G +L Y A   FE  +  +  +  PV     +    ++   L     +    +
Sbjct: 870  MLVFLANGDLLMYSAAPKFESLKPRANGEIAPVFHFVRVGTELITRPFLPPKARTNAHNE 929

Query: 899  AYTREETPHGAPCQRI---------TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQL 949
            A    E    A   ++         T F N++   G F  G+ P W +  R      P  
Sbjct: 930  AGNNPEVNTSAVLAKLRAGFRYPMLTCFYNVNNMSGAFFRGAHPMWILGDRGHASFVPMC 989

Query: 950  CDGS-------------------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY- 989
               S                   +++FT  H+ +C +GFIY  S+G L++C+LPS  T  
Sbjct: 990  VPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSCPNGFIYFHSRGALRVCELPSSKTST 1049

Query: 990  ----DNYWPVQKIPLKATPHQITYFA-----------EKNLYPLIVSV------------ 1022
                   + +QK    AT H + Y             E   Y ++ S             
Sbjct: 1050 ILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVAEALEAPTYAVVCSARLKPADADRATE 1109

Query: 1023 --------------PVLKPL-NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVR 1066
                          P   PL + V++   +    ++ D         H  +T E+ YE+R
Sbjct: 1110 VEGAEEELEPENLDPNGNPLGSNVMAPTAEMFADYETD---------HMAHTEEDVYELR 1160

Query: 1067 ILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT-------------TKENETLL 1111
            +++ D  G  W  R    +  +  E  L+V+++ L++++              K+    L
Sbjct: 1161 LVQTDEFG-EWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKEEVASTSPEWNKKKRPYL 1219

Query: 1112 AIGTAYV--QGEDVAARGRVLLFST----------GRNADNPQNLVTEVYSKELKGAISA 1159
             +GT +V   GED + RGR+LL+            G  +     L      +  +GAIS 
Sbjct: 1220 VVGTGWVGPHGEDESGRGRLLLYELDYAQYVNEEGGATSGKLPKLRLVFIKEHRQGAISM 1279

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            ++ L  ++L A G K+I++++   +L G AFYDA  +Y+V+L++VK+F++ GD++KS++F
Sbjct: 1280 VSQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQ-MYIVTLSVVKDFVMYGDVYKSVHF 1338

Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
            L W+E   QL LLAKD+  L   ATEF +    L+L+  D  +N+ +  +AP+  ES  G
Sbjct: 1339 LRWREMQRQLVLLAKDYEPLAVSATEFSVFEKKLALLAVDMDENLHVMQFAPQDIESRGG 1398

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            Q+LL  ++FH+G  V+   R ++ A+ S     +   AAP S+  N    + GT +G +G
Sbjct: 1399 QRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN----VMGTSEGGVG 1454

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS--------- 1385
             + P+ E  FRRL +LQ  +V+++P    LNPR FR   +N +     PD+         
Sbjct: 1455 ALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRILKTNAQRRCGRPDAWSKKKWKKS 1514

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
             +D  +L  +  L    Q E+A   GTT    + NL ++   TS
Sbjct: 1515 FLDAFVLFRFLQLDYVAQKELARCIGTTPEVAMHNLLEVQHATS 1558


>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Metaseiulus occidentalis]
          Length = 1500

 Score =  355 bits (911), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 273/958 (28%), Positives = 443/958 (46%), Gaps = 143/958 (14%)

Query: 543  ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
            ELPGC  +WTV   S+R  + D        ++ H +LI+S    TM+L+T   + E+  S
Sbjct: 603  ELPGCTDLWTVRSSSTRSPDVD--------EDSHQFLILSRPDSTMILQTGQEINELDHS 654

Query: 603  VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
              +  Q  TI AGNL   R +IQV     R+L+G    Q +               S ++
Sbjct: 655  -GFCTQSPTIFAGNLADGRYIIQVCPNSVRLLEGVKQLQQVPI----------DVGSPLV 703

Query: 663  SVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTPAAIESSKKPVSSCTLYHD---- 715
            S SIAD +VL+   DG +  L   GD +T   +SV  P     +K  +++  +Y D    
Sbjct: 704  SASIADLHVLVMSQDGLVIQLTLRGDDTTGYKLSVLKPQ-FPGAKSKITALCIYKDVSGL 762

Query: 716  -----KGPEPWLR-KTSTDAWLSTGV------------------GEAIDGAD--GGPLDQ 749
                 + PE   + KT     + T V                  G ++D  D   G L+ 
Sbjct: 763  FVTKIQKPEDIAKPKTEAKTKVKTEVAKKVLRSADFDDEDELLYGSSVDIKDLVAGGLNA 822

Query: 750  GDI-----------------------------YSVVCYESGALEIFDVPNFNCVFTVDKF 780
             +I                             +  +  E+GALEI+  P++   + V  F
Sbjct: 823  ANIVPTTQTKDTAEEEDYEENVRKIAPVEPTFWVFLARENGALEIYSFPDYKLRYFVKNF 882

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                  + +  ++ A    +T   S+SE              KV+E+ +     H SRP 
Sbjct: 883  -----PLCNKILQNAAATGQTTSASTSEAQLP----------KVMEIFVCALGMHQSRPL 927

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            LFA + D  +  Y+AY F               ++      +   RL++   +  P   Y
Sbjct: 928  LFARV-DSELHIYEAYPF--------------VNQKEGHLKLQFRRLQH-AVTMEPRRVY 971

Query: 901  TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTV 959
             ++E       + I  F+++ G+ G F+ G RP W  +  R  LR HP L DG I +F  
Sbjct: 972  KQKEGDPTLSLRWIRAFQDVCGYNGVFVCGRRPHWIFLTARGELRAHPMLNDGRIYSFAT 1031

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLI 1019
             HNVNC  GF++    G L+IC LPS   YD  WP++KIP+  TPH + Y  +   Y + 
Sbjct: 1032 FHNVNCEKGFLFFNKYGELRICALPSYLNYDAPWPMRKIPIYETPHSVNYHVDSRTYCVA 1091

Query: 1020 VS-------VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDR 1072
             S       VP L   ++     I++E    I              TV+++ + +  P  
Sbjct: 1092 TSKEETATCVPKLANEDKEFEP-IERESSRFIPP------------TVDKFALELWSPVS 1138

Query: 1073 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLL 1131
                  TR  +PM+  E    V+ V + +  T   E  L+A+GT +  GED+ A+GR+LL
Sbjct: 1139 WEAIPNTR--MPMEDWEKITCVKNVMIASEGTTSGEKGLIAVGTIHNFGEDITAKGRILL 1196

Query: 1132 FSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELN 1186
                     P     ++ V  + SK     ++AL S++GHL+ A G K+ L +    +L 
Sbjct: 1197 IDIIEVVPEPGQPLTRSKVKTILSKPQNAPVTALCSVKGHLMAAVGQKLFLFQLKDNDLV 1256

Query: 1187 GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1246
            G+AF D   +Y++S   +K+FIL+GD+HKSI  L ++E+   L +++KD   +  ++ E+
Sbjct: 1257 GMAFLDTQ-IYILSAISIKSFILIGDVHKSITLLRYQEESKTLAVVSKDTKPVQIYSIEY 1315

Query: 1247 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS 1306
            L+D S ++ + +D Q NI ++ Y P+  E++ GQ+L+ R +F++G+ +    R++     
Sbjct: 1316 LVDNSQMAFLATDAQCNILVYMYQPENRETFGGQRLIRRGDFNIGSRINTMFRIRCRLAE 1375

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1366
              R+     SD   R   L+ +LDG+ G + P+ E T+RRL  LQ  L     HV GLNP
Sbjct: 1376 VPRSERRLLSDLEARHVTLYASLDGAFGYLLPISEKTYRRLLMLQNVLNSYCQHVGGLNP 1435

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            ++FR   ++ +A      +IVD +L++ +  L   E+ E+A + GTT  QI  +L ++
Sbjct: 1436 KAFRIMQTDVRALSNPQKNIVDGDLINVFMDLNFNEKAEVARKIGTTVHQIQLDLAEI 1493



 Score =  184 bits (466), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 122/404 (30%), Positives = 199/404 (49%), Gaps = 61/404 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS----AASLELVCHYRLHGN 112
           NLVV    VI++Y                    R++ DG++     A LE    +   GN
Sbjct: 29  NLVVAGGTVIKVY--------------------RLVCDGLNETDDKAKLEHQQTFNCFGN 68

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           +  +  +    +     RDS++  F++ KIS++E+D + H L+  ++   E  E+   K 
Sbjct: 69  ISGMEKIRLNAS-----RDSLLFVFKETKISLVEYDPATHELQTLAIRSLEKEEY---KE 120

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFS 227
           G  +F    L+KVDP  RC  VL+YG  + I+      A+     +   + T  +  GF 
Sbjct: 121 GFYNFVGNTLIKVDPLNRCAAVLIYGKHLAIIPFVKKDATDLSDPIASSKSTQTNTSGFL 180

Query: 228 ARIESSHVINLRDLD----MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
                 + I L DLD    + ++ D  F++GY EP +++L+E   TW GRV+ +  TC I
Sbjct: 181 ----EYYTIRLIDLDEEKGVNNIHDMTFLNGYYEPTLLLLYEPIRTWTGRVAIRQDTCSI 236

Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALN 343
            ALS++   + HP +WS   LP +++K+L VP PIGGVL++  N + Y +QS        
Sbjct: 237 MALSLNVYQRVHPPVWSFSGLPFNSFKVLPVPKPIGGVLILSVNALLYLNQSVPA----- 291

Query: 344 NYAVSLDSSQELPRSSFSVE--------LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
            Y VSL+   E   +SF ++        LD     +L     LLS   GDL +L++  DG
Sbjct: 292 -YGVSLNCFTEC-STSFPLKDQAGPPLTLDCCRCEFLSETKILLSVANGDLYVLSLFTDG 349

Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            R + + +  K   + + + I+       F+GSR+G+SLL+++T
Sbjct: 350 MRSINQFEFKKIATTTVATCISLCEPGYLFVGSRIGNSLLLRYT 393


>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
            scapularis]
 gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
            scapularis]
          Length = 1241

 Score =  354 bits (908), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 311/1109 (28%), Positives = 496/1109 (44%), Gaps = 156/1109 (14%)

Query: 388  LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
            +LT+  DG R V+  +  K   SVLT+ +T       FLGSRLG+SLL+ +T      M 
Sbjct: 198  VLTLFNDGMRSVRNFNFDKAAASVLTTSMTLCEEGYLFLGSRLGNSLLLHYT-EKAAEME 256

Query: 447  SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
             +G KE+               ++  D    +++ +EL +YGS +  T+    +++F V 
Sbjct: 257  EAGKKED---------------KAEGDVNVALIDPDELEVYGSETLATKQL-TSYTFEVC 300

Query: 507  DSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV----------- 542
            DSL+NIGP      G           N+D      +  G  K     ++           
Sbjct: 301  DSLINIGPCGKICMGEPAFLSEEFTQNSDPDLELVTTAGYGKNGALCVLQRSVRPQVVTT 360

Query: 543  -ELPGCKGIWTVYHKSSRGHNA------DSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             ELPGC  +WTV    +           D     A     HA+LI+S    +M+L+T   
Sbjct: 361  FELPGCVHMWTVMGPPTEKKKKEASEESDEQAADATLTNTHAFLILSRADSSMILQTDQE 420

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+  S  +  Q  T+ AGNL   R V+QV   G R+LDG+   Q +            
Sbjct: 421  INELDHS-GFSTQNPTVFAGNLGDGRYVLQVCPMGVRLLDGTRQLQHIPL---------- 469

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTP--AAIESSKKPVSSC 710
               S ++  S+ADP+VL+    G +  L   GDP++ C ++V  P   A+ S +    +C
Sbjct: 470  DVGSPIVGGSLADPHVLIRSEGGLVVHLTLRGDPASGCRLAVLRPQLTAVVSHRANALTC 529

Query: 711  --------------TLYHD----KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
                           LY D    +  +  +R T+ +    +      +  +  P      
Sbjct: 530  HCIAVSGVLDDEDELLYGDSEDTRATKEPVRVTAMET--ESETANVFELKEVKP----TF 583

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGT 811
            +  V  E+G LEI+ +P++   F V  F  G+  +VD+    A   +++E ++  S E  
Sbjct: 584  WVFVARENGVLEIYSLPDYKLCFLVKNFPMGQRVLVDSVQMTAPSGTKSEKLSDMSHE-- 641

Query: 812  GQGRKENIHSMKVV-ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                      M VV E+ M       SRP L A + D  +L Y+A+ F   +        
Sbjct: 642  ---------CMPVVHEILMVGLGVRQSRPLLLARV-DEDLLIYEAFPFYETQREGH---- 687

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
                  L    ++   +   R  +T       EE    +    +  F +ISG+ G FL G
Sbjct: 688  ----LKLRFKKLNHDIILRSRKYKTQKPENEEEEKAFQSRLW-LQPFSDISGYSGVFLCG 742

Query: 931  SRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-----ILK---IC 981
             RP W  M  R  LR HP   DG +  F   HNVNC  GF++   Q      +L    + 
Sbjct: 743  HRPHWLFMSSRGELRYHPMFVDGPVYCFAPFHNVNCPKGFLHFNKQSDSYALLLHSYWLS 802

Query: 982  QLPSGSTYDNYW----PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1037
            QLPS   +        P  K   K   H+  +FA +      +  P        L  + D
Sbjct: 803  QLPSPKRHGERLLFNCPSHK---KICIHRCHFFALQQKAADFLWPPPFVTTVSPLPFVAD 859

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVR 1095
                             +   T++++ +++L P      W+T     + +   E+   ++
Sbjct: 860  SR---------------YIFPTMDKFSLQLLSPVS----WETIPNTRVDLDEWEHLTCIK 900

Query: 1096 VVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVY 1149
             V L +  T    +  LA+GT Y  GEDV +RGR+ +         P     +N +  VY
Sbjct: 901  NVMLSSEGTSTGMKGYLALGTNYCYGEDVTSRGRITILDIIDVVPEPGQPLTKNKIKIVY 960

Query: 1150 SKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1209
            SKE KG ++AL+ + G LL A G K+ + +     L G+AF D   +Y+ S+  VKN IL
Sbjct: 961  SKEQKGPVTALSQVVGFLLSAIGQKMYIWQLKDNGLVGVAFIDTQ-IYIHSVVTVKNLIL 1019

Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
            +GD+ KS+  L ++E    L+L+++D   L+ FA EF ID S +S +V+D ++N+ ++ Y
Sbjct: 1020 VGDVFKSVSLLRYQEASRTLSLVSRDVRPLEVFAVEFFIDNSQMSFLVTDSERNMILYMY 1079

Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
             P+  ES  GQ+LL R +FH+G+ V    R++       +      +    R   +  TL
Sbjct: 1080 QPESRESCGGQRLLRRGDFHIGSPVVSMFRIKCRMGEVAKHDRRLAASVDGRHITMLATL 1139

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
            DGS+G + P+ E T+RRL  LQ  LV ++PH AGLNP++FR +HS  +       +I+D 
Sbjct: 1140 DGSLGYVLPVPEKTYRRLLMLQNVLVTNMPHYAGLNPKAFRMYHSQRRVLGNPHKNILDG 1199

Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
            EL+  +  L   E+ E++ + GTT +Q++
Sbjct: 1200 ELIWKFMHLSFMERSELSKKIGTTVTQVV 1228



 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 5/95 (5%)

Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
           P+++VDP  RC  +LV+   + ++   +  +    +E   G   G    +   + + L +
Sbjct: 58  PMIRVDPCNRCAAMLVFSRTIAVVPFRKDTAA---EEQETGPTFGNKPPLLDWYPVALTE 114

Query: 241 LDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
           LD K  +V D  F+HGY EP ++IL+E   TW G+
Sbjct: 115 LDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGK 149



 Score = 40.0 bits (92), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 9/103 (8%)

Query: 236 INLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           + L +LD K  +V D  F+HGY EP ++IL+E   TW G V    +  M S  + +    
Sbjct: 158 VALTELDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGYVLTLFNDGMRSVRNFNFDKA 217

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
              ++ ++M L  + Y  L   S +G  L+     +HY  ++A
Sbjct: 218 AASVLTTSMTLCEEGYLFLG--SRLGNSLL-----LHYTEKAA 253


>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
          Length = 1477

 Score =  348 bits (892), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 256/898 (28%), Positives = 429/898 (47%), Gaps = 169/898 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGI----------------SAAS 100
           NL+V   NV+E+Y +   E+      +   T+     DG+                +  S
Sbjct: 30  NLIVAKTNVLEVYALHRHEDSKARPIDRQSTRP---TDGVISLRGEEPKDAPPYAGTQHS 86

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           + LV    L GN+ES+A +   G      +D+++L+F DAKISVLEFD + + LR  S+H
Sbjct: 87  MRLVLSSSLFGNIESMAAVRFPGTS----KDALLLSFRDAKISVLEFDIATNDLRTISLH 142

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE      +K G + +   P ++VDPQ RC  +L +  ++++L   Q  S +       
Sbjct: 143 YFED---YKVKEGHDHYIHVPELRVDPQQRCAAMLAFDRKLVVLPFRQHASLM-----EI 194

Query: 221 GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
            +GG     ++ S +++LR + + +VKDF+F+ GY EP ++IL+E   TW+GRV+   +T
Sbjct: 195 ENGGQEDQPVKPSFLLDLRAMGIINVKDFVFLQGYYEPTLLILYEPTQTWSGRVAVNRNT 254

Query: 281 CMISALSIST-----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI------ 329
           C+ +A+S++          HP++WSA  LP+D  +L+AVP PIGG L +  N++      
Sbjct: 255 CVAAAVSLNLWQHRGQTSAHPVVWSAEFLPYDTQRLIAVPGPIGGALALSTNSLLYLNQV 314

Query: 330 -------------------HYHSQSASCALALNNYA-VSLDSSQELP---RSSFSVELDA 366
                              H+ +Q+++  L LN +A + L      P   ++   + LDA
Sbjct: 315 SFPYRLILPAHGADVSITSHHDTQASASCLPLNVFADLYLSPQTPFPSAGKNRVGIALDA 374

Query: 367 AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNS----- 421
           A   +L +D  L+S K G+L +  ++ DGR V  + L+K   SV+TS + T+        
Sbjct: 375 ARDVFLADDQLLVSLKGGELYIFHLLSDGRTVNDIQLTKAGSSVITSCMATLSGEGADER 434

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE--FGDIEADAPSTKRLRRSSSDA----- 474
             FLGSR+GDSLL+Q+T    ++   +G  +   F DI+ +  +         +A     
Sbjct: 435 FLFLGSRVGDSLLLQYTTADASAPKQNGATKGSLFDDIKKEEDNDDDDEDEEEEASGEGE 494

Query: 475 LQDMVNGE-ELSLYGSASNNTESAQK-----TFSFAVRDSLVNIGPLKDFSYGLRIN-AD 527
           +++  +GE E+  +G      +  +K     T+ F V DSLVN+GP+ DF+ G   + A 
Sbjct: 495 VKEEPDGEGEVDEFGRRIREEDRRKKKGLLTTYKFKVCDSLVNVGPITDFAIGESFDPAS 554

Query: 528 ASATGISKQSNYELV---------------------------ELPGCKGIWTVYHKSSRG 560
            S      Q + E+V                           +L GCK  WT+YH+S   
Sbjct: 555 VSMAEQEGQRSVEIVTCSGQGKNGSLCVLQHGVRPELVHASADLAGCKAFWTLYHRSEER 614

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT-ESVDYFVQGRTIAAGNLFG 619
              ++        EYHAYL++S E +T V+   D L E++ E  D+ V   T+ AGNLF 
Sbjct: 615 QGEEA--------EYHAYLLLSEEEQTRVI-AGDGLDELSNEETDFNVAAPTVDAGNLFE 665

Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
           + R++QV + G  +LDG   TQ +            S    + + SIADPYVL+ M+DG+
Sbjct: 666 QTRIVQVHQHGLILLDGVKATQRI------------STPGQIAAASIADPYVLVLMADGA 713

Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
           +RL   DP++  + VQT        + + +  L++                     G A+
Sbjct: 714 LRLYFADPTSSKL-VQTSLQNIHEVRDIMAMHLFY---------------------GGAM 751

Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
            G      D+  I++ +  ++G L+I+ VP F+ VF+ ++  +G   I +  MR   + +
Sbjct: 752 RGKKARTNDE--IFAAIAKDNGRLDIYSVPEFDLVFSAERAANGPRLINNVLMRPPPQSA 809

Query: 800 ETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYL 857
             + ++ +             S ++ E+A+       S P LF  L +G +L Y+ +L
Sbjct: 810 AAQQSADTT------------SARIAEIALHSIGNIPSLPHLFLYLDNGELLLYRGFL 855



 Score =  305 bits (780), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 191/539 (35%), Positives = 277/539 (51%), Gaps = 62/539 (11%)

Query: 912  QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
            +RI  F  +    G F+SGS P W    R   R++P   D  + AF   HN NC HGFIY
Sbjct: 967  RRIHYFGTVGKSNGVFISGSAPAWVFAQRGYARLYPMKLDTFVRAFAEFHNANCPHGFIY 1026

Query: 972  VTSQGILKICQLPSGSTYDNYWP----VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1027
               +G LKICQLP+      +W     V+K+PL  TP +I Y      Y  +V++     
Sbjct: 1027 FNHEGTLKICQLPAAEGA-IHWELPGVVRKVPLGRTPREIAYHPPSRTY--VVALATPVT 1083

Query: 1028 LNQVLSLLIDQE-------------VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1074
                     D E             +G + +               E +E+ ++ P    
Sbjct: 1084 TVVPTPPETDMERQEREREEEESREMGIEPEEKQRDMGPREIAMMEERHELHLISPRT-- 1141

Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL----LAIGTAYVQGEDVAARGRVL 1130
              WQ    + ++  E+ LT+ V+ L +  ++ N  L    L I    V GE+        
Sbjct: 1142 --WQILHHVELEPKEHVLTLSVLKLGDNYSQVNRELRPPHLLIYEIDVTGEE-------- 1191

Query: 1131 LFSTGRNADNPQNLVTEVYSKELK-----GAISALASLQGHLLIASGPKIILHKWTGTEL 1185
                       Q  +T  Y K +K     G +SA ASLQG+L+IA GPKI +  + G   
Sbjct: 1192 -----------QCKLTMAYQKPMKEKPMKGPVSAAASLQGYLIIAVGPKIWVFNFDGGST 1240

Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
              +AFYDAP  Y+VS+  +KNF+L GDI+KSI+FL WK+  +QL LLAKD G +  FATE
Sbjct: 1241 EAVAFYDAPH-YIVSIKTLKNFVLCGDIYKSIFFLRWKDSASQLALLAKDVGRVSVFATE 1299

Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
            +++D   L+L++SDE++N+Q+  YAP  +ES  GQ L+ R +F+VG  + KF+RL M   
Sbjct: 1300 YVVDKQNLALLMSDERQNLQVTAYAPHTAESRGGQLLVPRGDFNVGQSINKFVRLPMTLP 1359

Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
            S        G+    R AL FGTL G +G +AP+DE  FRRL  LQ  L+ ++PH AGL+
Sbjct: 1360 S--------GTTSLQRHALWFGTLSGGVGYLAPMDESVFRRLGMLQSALLSAIPHTAGLH 1411

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            P+++R      +  R    +I+D  LLS Y  L    Q +IA + GT+R +IL++L  +
Sbjct: 1412 PQAYRALQRE-RLLRNRKHTILDGLLLSRYLALDSATQQQIALKLGTSRERILNDLQGI 1469


>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
            alecto]
          Length = 820

 Score =  345 bits (884), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 220/683 (32%), Positives = 344/683 (50%), Gaps = 48/683 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 162  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 217

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 218  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLVYEAF----PHDSQLGQGNLK 267

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++  R +  R S+   +    E         R   F++I G+ G F+ G  
Sbjct: 268  VRFKKVPHNINF-REKKPRPSKKKAEGGAEEGPGARGRVARFRYFEDIYGYSGVFICGPS 326

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 327  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 386

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S              I +  G + +   +  
Sbjct: 387  PWPVRKIPLRCTAHYVAYHVESKVYAVATS-------TNTPCTRIPRMTGEEKEFETIER 439

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 440  DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 495

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 496  GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 555

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 556  NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 614

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 615  EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 674

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 675  RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 727

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 728  MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYL 787

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 788  STMERGELAKKIGTTPDIILDDL 810


>gi|393245434|gb|EJD52944.1| hypothetical protein AURDEDRAFT_81080 [Auricularia delicata TFB-10046
            SS5]
          Length = 1422

 Score =  342 bits (878), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 356/1475 (24%), Positives = 647/1475 (43%), Gaps = 198/1475 (13%)

Query: 52   IGPVPNLVVTAANVIEIYVVRVQ-------------EEGSKE---SKNSGETKRRVLMDG 95
            +G   NLVV   N++ ++ VR++             E+G          GE +  V  +G
Sbjct: 40   LGVATNLVVARQNLLRVFEVRIEAAPLPSQEKLLADEQGRGRRGMEAVEGEVEMDVGGEG 99

Query: 96   ISAAS------------------LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAF 137
              +A                   L LV  +RLHG V  L  + Q  A    + D ++++F
Sbjct: 100  FVSAGIVKSAGQHARQRQRTVTRLYLVRQHRLHGIVTGLGRV-QTMASLEDKLDRLLVSF 158

Query: 138  EDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLV 196
            +DAKI++LE+ +  H L   S+H +E +P+ L     R        +++DP  RC  + +
Sbjct: 159  KDAKIALLEWSEVSHDLSTISIHTYERAPQMLAFDSARALTE----LRIDPNSRCAALTL 214

Query: 197  YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFV 252
             G  + IL   +  + L  D D     GG S  +    S +++L ++D  ++++ D  F+
Sbjct: 215  PGDAVAILPFYESQAELDMDVDQ----GGVSRDVPYSPSFILSLPEVDNDIRNIIDIAFL 270

Query: 253  HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
             G+  P + +L E + TW GR++    T  +  L++    + +P+I S   LP+D+ +L+
Sbjct: 271  PGFNNPTLAVLFETQRTWTGRLAEFKDTVRLRILTLDVVTRTYPIIGSVDGLPYDSMRLV 330

Query: 313  AVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
            A P+ +GGV+V+ AN + +  QS  + A+A N +A  + S    P       L+ + A +
Sbjct: 331  ACPAALGGVIVLTANAVLHIDQSGKNVAVAANGWAARV-SEFPTPAPERDETLEGSRAVF 389

Query: 372  LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLG 430
            + +   LL  + G +V + ++ +GR+V ++D+  +   + + + +  + + L  +GS  G
Sbjct: 390  VSDKTFLLVYRDGSIVPVELILEGRMVTKIDMGQRLAQTTIPTVVCAVQDDLVLVGSTAG 449

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-----DMVNGEELS 485
             S+L++ T              E  DI  DA S +    ++ ++       D ++ E+  
Sbjct: 450  PSILLKVT-------------HEEEDITPDAGSARENGAANGNSTNGATYDDPMDSEDED 496

Query: 486  LYGSASNNTESAQKTFS--------------FAVRDSLVNIGPLKDFSYGLRINAD---- 527
            LYG       S   T +               A+ DSL   GP+ D ++ L  N +    
Sbjct: 497  LYGGTDMMVTSTSGTLTVGGTAALEKRRILRLALADSLCGHGPISDMAFILGRNGERHVP 556

Query: 528  -------ASATG--------ISKQSNYELVELPGCKGIWTVYHKSS---RGHNADSSRMA 569
                      TG        +  +   +L  + G +G+W+   + +    G N +    A
Sbjct: 557  ELLAGVGVGHTGGLARFQRDLPARVKRKLHRISGNRGVWSFPVRRAVKVAGMNIERPTGA 616

Query: 570  AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ--GRTIAAGNLFGRRRVIQVF 627
            A  D     +I+S +A      +   + + +  VD   +    TI AG  F R  ++QV 
Sbjct: 617  ADWD----TVIVSTDATPSPGLSRVAVKDSSTDVDILTRLPAITIGAGPFFQRTAILQVV 672

Query: 628  ERGARIL--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
                R+L  DGS   + +DL            +  + + + SI+DP+V++   D ++ L 
Sbjct: 673  NNAIRVLEADGSERQVIKDLD---------GTTPRAKIRACSISDPFVVVVREDDTLGLF 723

Query: 684  VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK------GPEPWLR-KTSTDAWLSTGVG 736
            VG+     +  +  + +        + + Y D       G    L+ K   +A  ST + 
Sbjct: 724  VGETGKGKLRRKDMSMLGDKASRYLAASFYQDHSGLFQVGTARSLKGKEKANAPASTTIE 783

Query: 737  EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL 796
             A+D        +G  + V+C   G +EI+ +P    VF+          + DT+     
Sbjct: 784  AAMDEG------RGSQWLVLCRPQGVVEIWALPKLTLVFSCGGVSDIPPVMADTF----- 832

Query: 797  KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
               +    S  ++   Q    ++      E+ +        RP L  +L  GT+  Y   
Sbjct: 833  ---DLATPSPVQDPPRQAEDHDVE-----EILISPIGETTPRPHLLVLLRSGTVAVYDTA 884

Query: 857  LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
              E P        PV+T R   +  ++  R+ +      P++   +   P  AP   I  
Sbjct: 885  PVELP--------PVATGREAGL-QLAFVRIMSRAVDTAPIERAEKRGAP--APRHLIPF 933

Query: 917  FKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
              ++S   G FL+G +P W +   +  +R+ P + +  + +FT          F+  T +
Sbjct: 934  STSVS---GVFLTGGKPGWILGTDKTAVRLVPAV-NQVVHSFTACSLWGNRGEFLMNTDE 989

Query: 976  GILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1035
            G   +  +P     D   P   +P +  P+  T  A +    ++++   L+        +
Sbjct: 990  GPCLVEWMPD-LRLDEELPSFFMP-RGRPY--TSIAYEATTGMVIAASSLRS-----RFV 1040

Query: 1036 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1095
            +  E G+ +   +   +      T +   + +++P+     W T        +E   TVR
Sbjct: 1041 LFDEDGNTVWKPDAEFIS---DPTTDTSSLELIDPET----WTTVDGFEFAFNEMINTVR 1093

Query: 1096 VVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNA-DNPQNLVTEVY---S 1150
             V L   +T+  ++  +A+GT   +GED+A +G   +F       D+ Q    ++     
Sbjct: 1094 TVNLETVSTEAGSKDFIAVGTTVFRGEDLAVKGATYIFEVIEVVPDDTQQRRHKLKLWCR 1153

Query: 1151 KELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFIL 1209
             E KG +SAL  + G+L+ + G K+ +  +   E L G+AF D   +YV SL  +KN +L
Sbjct: 1154 DEAKGPVSALCGINGYLVSSMGQKVFVRAFDLNERLVGVAFMDV-GIYVTSLRTLKNLLL 1212

Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
            +GD  KS++F++++E   +L LL KDF      + EF      +++V +DEQ  ++IF Y
Sbjct: 1213 IGDAVKSVWFVAFQEDPFKLQLLGKDFQRAALTSAEFFFGFGEMTIVSTDEQNVLRIFRY 1272

Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
             P  +E+  GQKLL + EF+  +         +L  +SD     P S       +++   
Sbjct: 1273 DPMHAEAQDGQKLLCQTEFNTQSDARG--TTTILRRTSDEDILLPQS------KIMYCGT 1324

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
            DGS+  + P++E  F+RL  LQ ++  ++ HVAGL+P++FR   ++  A RP    I+D 
Sbjct: 1325 DGSLSALLPVEEHVFKRLHLLQGQMTRNIQHVAGLHPKAFRVVRNDFTA-RPLARGILDS 1383

Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             LL+ +E LPL  Q+E   Q G +R  IL +   L
Sbjct: 1384 NLLAKFEELPLSRQVEFTKQIGQSREVILGDWMQL 1418


>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Sarcophilus harrisii]
          Length = 1449

 Score =  342 bits (876), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 225/703 (32%), Positives = 352/703 (50%), Gaps = 81/703 (11%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V+  ESGA+EI+ +P++  VF V  F  G+  +VD+    +     T+ ++  EE T 
Sbjct: 790  WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L  ++     +RP+L  +  D  +L Y+A+  +             
Sbjct: 846  QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
                   S +  S L+ +RF + P +   RE+ P  +                    R  
Sbjct: 887  -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGAEEGAGARGRVARFR 938

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 939  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
            QG L+I  LP+  +YD  WPV+KIPL+ T H + Y  E  +Y +  S          L  
Sbjct: 999  QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATS-------TNALCT 1051

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
             I +  G + +   +   D +     E + ++++ P      W+    A I ++  E+  
Sbjct: 1052 RIPRMTGEEKEFETIERDDRYIHPLQEAFSIQLISPVS----WEAIPNARIELEEWEHVT 1107

Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
             ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +T+    
Sbjct: 1108 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1167

Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
             +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   LY+  +  VKN
Sbjct: 1168 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1226

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            FIL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ +
Sbjct: 1227 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDSAQLGFLVSDRDRNLMV 1286

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
            + Y P+  ES+ G +LL RA+FHVGAHV  F R       +   GAA G  K      N+
Sbjct: 1287 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPTKKSIVWENK 1339

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
                F TLDG IG + P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  + 
Sbjct: 1340 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1399

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
               +++D ELL+ Y  L   E+ E+A + GTT   IL +L ++
Sbjct: 1400 AVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEI 1442



 Score =  305 bits (781), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 224/687 (32%), Positives = 353/687 (51%), Gaps = 99/687 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E S +S  + E K    +       LELV  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
              K HP+IWS  NLP D  + LAVP PIGGV++   N++ Y +QS         Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303

Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
           +S    +   P   +    + LD A A ++  D  ++S K G++ +LT++ DG R V+  
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
              K   SVLT+ + T+     FLGSRLG+SLL+++T        SS  +    ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSAREAPSREVSDKD 423

Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
            P  K+ R  S+        A QD V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479

Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
           GP  + + G                L I        + + + + K    ++V   ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539

Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
             +WTV         ++++G   +   SS     D + H +LI+S E  TM+L+T   + 
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGPEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
           E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +        
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
            S ++  ++ADPYV++  ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675


>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Felis catus]
          Length = 1432

 Score =  340 bits (873), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 226/700 (32%), Positives = 350/700 (50%), Gaps = 81/700 (11%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G LEI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 773  WCLLVRENGTLEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 828

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 829  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 871

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
                L   N+       +RF + P +   RE               E   GA  +  R  
Sbjct: 872  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 921

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 922  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 981

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
            QG L+I  LP+  +YD  WPV+KIPL+ T H + Y  E  +Y +  S  +  P  +    
Sbjct: 982  QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNM--PCTR---- 1035

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
             I +  G + +   +   D +     E + ++++ P      W+    A I ++  E+  
Sbjct: 1036 -IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVT 1090

Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
             ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +T+    
Sbjct: 1091 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1150

Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
             +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   LY+  +  VKN
Sbjct: 1151 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1209

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            FIL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ +
Sbjct: 1210 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMV 1269

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
            + Y P+  ES+ G +LL RA+FHVGAHV  F R       +   GAA G  K      N+
Sbjct: 1270 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENK 1322

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
                F TLDG IG + P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  + 
Sbjct: 1323 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1382

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
               +++D ELL+ Y  L   E+ E+A + GTT   IL +L
Sbjct: 1383 AVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILEDL 1422



 Score =  300 bits (769), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 211/631 (33%), Positives = 334/631 (52%), Gaps = 76/631 (12%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 57  LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 112

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE PE   L+ G       P V+VDP GRC  +L+YG ++++L   +     + +E   
Sbjct: 113 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 166

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
             G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ + 
Sbjct: 167 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 226

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
            TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS   
Sbjct: 227 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 286

Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
             +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ DG R
Sbjct: 287 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 346

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
            V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        +S ++E    
Sbjct: 347 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA--- 402

Query: 457 IEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDS 508
            + + P +K+ R  S+          QD V+  E+ +YGS A + T+ A  T+SF V DS
Sbjct: 403 ADKEEPPSKKKRVDSTVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDS 458

Query: 509 LVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---E 543
           ++NIGP  + + G                L I        + + + + K    ++V   E
Sbjct: 459 ILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFE 518

Query: 544 LPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLETA 593
           LPGC  +WTV         ++S+G  A+   S + A DD   H +LI+S E  TM+L+T 
Sbjct: 519 LPGCYDMWTVIAPVRKEQEETSKGEGAEQEPSTLEAEDDGRRHGFLILSREDSTMILQTG 578

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
             + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +    
Sbjct: 579 QEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL--- 631

Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                S ++  ++ADPYV++  ++G + + +
Sbjct: 632 ----GSPIVQCAVADPYVVIMSAEGHVTMFL 658


>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Monodelphis domestica]
          Length = 1449

 Score =  340 bits (873), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 224/703 (31%), Positives = 352/703 (50%), Gaps = 81/703 (11%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V+  ESGA+EI+ +P++  VF V  F  G+  +VD+    +     T+ ++  EE T 
Sbjct: 790  WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L  ++     +RP+L  +  D  +L Y+A+  +             
Sbjct: 846  QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
                   S +  S L+ +RF + P +   RE+ P  +                    R  
Sbjct: 887  -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGTEEGAGARGRVARFR 938

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 939  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
            QG L+I  LP+  +YD  WPV+KIPL+ T H + Y  E  +Y +  S          L  
Sbjct: 999  QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATS-------TNALCT 1051

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
             I +  G + +   +   + +     E + ++++ P      W+    A I ++  E+  
Sbjct: 1052 RIPRMTGEEKEFETIERDERYIHPLQEAFSIQLISPVS----WEAIPNARIELEEWEHVT 1107

Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
             ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +T+    
Sbjct: 1108 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1167

Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
             +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   LY+  +  VKN
Sbjct: 1168 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1226

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            FIL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ +
Sbjct: 1227 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDSAQLGFLVSDRDRNLMV 1286

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
            + Y P+  ES+ G +LL RA+FHVGAHV  F R       +   GAA G  K      N+
Sbjct: 1287 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSIVWENK 1339

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
                F TLDG IG + P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  + 
Sbjct: 1340 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1399

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
               +++D ELL+ Y  L   E+ E+A + GTT   IL +L ++
Sbjct: 1400 AVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEI 1442



 Score =  306 bits (783), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 224/687 (32%), Positives = 354/687 (51%), Gaps = 99/687 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E S +S  + E K    +       LELV  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
              K HP+IWS  NLP D  + LAVP PIGGV++   N++ Y +QS         Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303

Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
           +S    +   P   +    + LD A A ++  D  ++S K G++ +LT++ DG R V+  
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
              K   SVLT+ + T+     FLGSRLG+SLL+++T        S+  +    ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAAREAPSREVSDKD 423

Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
            P  K+ R  S+        A QD V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479

Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
           GP  + + G                L I        + + + + K    ++V   ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539

Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
             +WTV         ++++G  A+   SS     D + H +LI+S E  TM+L+T   + 
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGAEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
           E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +        
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
            S ++  ++ADPYV++  ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675


>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
            putorius furo]
          Length = 1399

 Score =  340 bits (871), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 218/683 (31%), Positives = 344/683 (50%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 741  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 796

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 797  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 846

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 847  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAAARGRVARFRYFEDIYGYSGVFICGPS 906

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 907  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 966

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S  +  P  +     I +  G + +   +  
Sbjct: 967  PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----IPRMTGEEKEFEAIER 1019

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1020 DDRYVHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1075

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1076 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1135

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1136 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1194

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1195 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1254

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1255 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1307

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H++ +A +    +++D ELL+ Y  L
Sbjct: 1308 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRLLHADRRALQNAVRNVLDGELLNRYLYL 1367

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1368 STMERGELAKKIGTTPDIILDDL 1390



 Score =  298 bits (762), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 210/635 (33%), Positives = 329/635 (51%), Gaps = 81/635 (12%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 22  LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 77

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVL----VYGLQMIILKASQGGSGLVGD 216
            FE PE   L+ G       P V+VDP GRC  +L    +YG ++++L   +     + +
Sbjct: 78  YFEEPE---LRDGFVQNVHAPRVRVDPDGRCAAMLTAMLIYGSRLVVLPFRRES---LAE 131

Query: 217 EDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRV 274
           E     G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV
Sbjct: 132 EHEGLMGEGQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRV 191

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
           + +  TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +Q
Sbjct: 192 AVRQDTCCIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQ 251

Query: 335 SA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
           S     +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ 
Sbjct: 252 SVPPYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLIT 311

Query: 394 DG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
           DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++     T  L      
Sbjct: 312 DGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY-----TEKLQEAPAG 366

Query: 453 EFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
              + + D P +K+ R  S+        A QD V+  E+ +YGS A + T+ A  T+SF 
Sbjct: 367 AVRETDKDEPPSKKKRVESAVGWSGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 422

Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
           V DS++NIGP  + + G                L I        + + + + K    ++V
Sbjct: 423 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 482

Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
              ELPGC  +WTV   + +         G   + S + A DD   H +LI+S E  TM+
Sbjct: 483 TTFELPGCYDMWTVIAPARKEQEETPKGDGAEQEPSALEADDDGRRHGFLILSREDSTMI 542

Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
           L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +
Sbjct: 543 LQTGQEIMELDTS-GFATQGPTVFAGNIGDGRYIVQVSPLGIRLLEG---VSQLHFIPVD 598

Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                    S ++  ++ADPYV++  ++G + + +
Sbjct: 599 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 626


>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Bos taurus]
          Length = 1490

 Score =  340 bits (871), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 217/683 (31%), Positives = 343/683 (50%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 831  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 886

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 887  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 936

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 937  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 996

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 997  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1056

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +V  +      G + +   +  
Sbjct: 1057 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIER 1109

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             + +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1110 DERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1165

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1166 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1225

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1226 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1284

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1285 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1344

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1345 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1397

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1398 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYL 1457

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1458 STMERGELAKKIGTTPDIILDDL 1480



 Score =  295 bits (755), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 209/635 (32%), Positives = 327/635 (51%), Gaps = 82/635 (12%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 114 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 169

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLV 214
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L       ++   GLV
Sbjct: 170 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLV 226

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAG 272
           G+        G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW G
Sbjct: 227 GE--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPG 278

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           +V+ +  TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y 
Sbjct: 279 KVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYL 338

Query: 333 SQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
           +QS     +ALN+      +     +    + LD A A ++  D  ++S K G++ +LT+
Sbjct: 339 NQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTL 398

Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
           + DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        S+  
Sbjct: 399 ITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA- 457

Query: 451 KEEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
             E  D E      KR+  +     S    QD V+  E+ +YGS A + T+ A  T+SF 
Sbjct: 458 -REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 512

Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
           V DS++NIGP  + + G                L I        + + + + K    ++V
Sbjct: 513 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 572

Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
              ELPGC  +WTV     +         G   +     A DD   H +LI+S E  TM+
Sbjct: 573 TTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMI 632

Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
           L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +
Sbjct: 633 LQTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVD 688

Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                    S ++  ++ADPYV++  ++G + + +
Sbjct: 689 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 716


>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Canis lupus familiaris]
          Length = 1460

 Score =  339 bits (869), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 217/683 (31%), Positives = 342/683 (50%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 801  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 856

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 857  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 906

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 907  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 966

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 967  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1026

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S  +  P  +     I +  G + +   +  
Sbjct: 1027 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----IPRMTGEEKEFETIER 1079

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1080 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1135

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1136 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1195

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1196 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1254

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1255 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1314

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1315 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1367

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1368 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1427

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1428 STMERGELAKKIGTTPDIILDDL 1450



 Score =  301 bits (771), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 212/629 (33%), Positives = 331/629 (52%), Gaps = 72/629 (11%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 85  LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 140

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE PE   L+ G       P V+VDP GRC  +L+YG ++++L   +     + +E   
Sbjct: 141 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 194

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
             G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ + 
Sbjct: 195 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 254

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
            TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS   
Sbjct: 255 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 314

Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
             +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ DG R
Sbjct: 315 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 374

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
            V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  D
Sbjct: 375 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAA--REAAD 432

Query: 457 IEADAPSTKRLRRSS-----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLV 510
            E      KR+  ++         QD V+  E+ +YGS A + T+ A  T+SF V DS++
Sbjct: 433 KEEPPSKKKRVDCAAGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSIL 488

Query: 511 NIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELP 545
           NIGP  + + G                L I        + + + + K    ++V   ELP
Sbjct: 489 NIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELP 548

Query: 546 GCKGIWTVY-------HKSSRGHNA--DSSRMAAYDD-EYHAYLIISLEARTMVLETADL 595
           GC  +WTV         ++S+G  A  +SS + A DD   H +LI+S E  TM+L+T   
Sbjct: 549 GCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQTGQE 608

Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
           + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +      
Sbjct: 609 IMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL----- 659

Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLV 684
              S ++  ++ADPYV++  ++G + + +
Sbjct: 660 --GSPIVQCAVADPYVVIMSAEGHVTMFL 686


>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
            taurus]
 gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit [Bos
            taurus]
 gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
            taurus]
          Length = 1444

 Score =  338 bits (868), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 217/683 (31%), Positives = 343/683 (50%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 785  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 841  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 891  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 950

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 951  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1010

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +V  +      G + +   +  
Sbjct: 1011 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIER 1063

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             + +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1064 DERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1119

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1120 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1179

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1180 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1238

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1239 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1298

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1299 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1351

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1352 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYL 1411

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1412 STMERGELAKKIGTTPDIILDDL 1434



 Score =  304 bits (779), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 221/678 (32%), Positives = 345/678 (50%), Gaps = 86/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T  +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 309

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 369

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  D E      KR+
Sbjct: 370 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRV 427

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 428 DATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 484 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 543

Query: 557 SSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
             +         G   +     A DD   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 544 VRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFA 602

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++
Sbjct: 603 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAV 652

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 653 ADPYVVIMSAEGHVTMFL 670


>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Callithrix jacchus]
          Length = 1394

 Score =  338 bits (868), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 216/683 (31%), Positives = 342/683 (50%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 735  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 790

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++  S   + 
Sbjct: 791  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLSQGNLK 840

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 841  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 900

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 901  PHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 960

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 961  PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIER 1013

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T    +
Sbjct: 1014 DERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLK 1069

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P+  +T      +Y KE KG ++AL   
Sbjct: 1070 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVTEPRQTLTXXKFKVLYEKEQKGPVTALCHC 1129

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1130 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1188

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1189 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1248

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GA  G  K      N+    F TLDG IG + P
Sbjct: 1249 RRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVMWENKHITWFATLDGGIGLLLP 1301

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1302 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYL 1361

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1362 STMERSELAKKIGTTPDIILDDL 1384



 Score =  267 bits (682), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 200/672 (29%), Positives = 324/672 (48%), Gaps = 124/672 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSAEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ ++                    F C +G   +                     
Sbjct: 367 ASVLTTSVSGTEG----------------FLCAAGGKSVP-------------------- 390

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------ 521
                   QD    +E+ +YGS + +  +   T+SF V DS++NIGP  + + G      
Sbjct: 391 --------QD--EXDEIEVYGSETQSG-TQLATYSFEVCDSILNIGPCANAAMGEPAFLS 439

Query: 522 ----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------H 555
                     L I        + + + + K    ++V   ELPGC  +WTV         
Sbjct: 440 EEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEE 499

Query: 556 KSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
           ++ +G   +   S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+
Sbjct: 500 ENPKGEGTEQEPSTPEADDDSRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 558

Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
            AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV+
Sbjct: 559 FAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFVPVDL-------GAPIVQCAVADPYVV 608

Query: 673 LGMSDGSIRLLV 684
           +  ++G + + +
Sbjct: 609 IMSAEGHVTMFL 620


>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
            [Mus musculus]
 gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
 gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
          Length = 1441

 Score =  338 bits (868), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 218/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1060

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1061 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1116

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1117 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1176

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1177 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1235

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1236 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1295

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1296 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1348

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1349 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1408

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1409 STMERSELAKKIGTTPDIILDDL 1431



 Score =  307 bits (787), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667


>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Otolemur garnettii]
          Length = 1441

 Score =  338 bits (867), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 224/700 (32%), Positives = 348/700 (49%), Gaps = 81/700 (11%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 880

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
                L   N+       +RF + P +   RE              T  GA  +    R  
Sbjct: 881  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGVRGRVARFR 930

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 931  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 990

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
            QG L+I  LP+  +YD  WPV+KIPL+ T H + Y  E  +Y +  S     P  +    
Sbjct: 991  QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR---- 1044

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
             I +  G + +   +   D +     E + ++++ P      W+    A I ++  E+  
Sbjct: 1045 -IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVT 1099

Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
             ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +T+    
Sbjct: 1100 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1159

Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
             +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   LY+  +  VKN
Sbjct: 1160 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1218

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            FIL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ +
Sbjct: 1219 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMV 1278

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
            + Y P+  ES+ G +LL RA+FHVGAHV  F R       +   GA  G  K      N+
Sbjct: 1279 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGPSKKSVVWENK 1331

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
                F TLDG IG + P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  + 
Sbjct: 1332 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQN 1391

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
               +++D ELL+ Y  L   E+ E+A + GTT   IL +L
Sbjct: 1392 AVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 1431



 Score =  300 bits (767), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 217/678 (32%), Positives = 343/678 (50%), Gaps = 89/678 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEVLTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASATRESADKEEPPSKKKRV 424

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             S          QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 425 DPSVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-- 554
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 481 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAS 540

Query: 555 -----HKSSRGHNADSSR---MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                 ++ +G   +          D   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 541 VRKEEEETPKGEGTEQESGVPEGEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 599

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 600 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 649

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 650 ADPYVVIMSAEGHVTMFL 667


>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 1 [Cricetulus griseus]
          Length = 1441

 Score =  338 bits (867), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 218/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1060

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1061 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1116

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1117 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1176

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1177 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1235

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1236 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1295

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1296 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLP 1348

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1349 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1408

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1409 STMERSELAKKIGTTPDIILDDL 1431



 Score =  303 bits (775), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  ++ 
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429

Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
            ++ R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667


>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1 [Cricetulus
            griseus]
          Length = 1419

 Score =  338 bits (867), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 218/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 760  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 815

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 816  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 865

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 866  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 925

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 926  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 985

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 986  PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1038

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1039 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1094

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1095 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1154

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1155 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1213

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1214 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1273

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1274 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLP 1326

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1327 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1386

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1387 STMERSELAKKIGTTPDIILDDL 1409



 Score =  299 bits (765), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 82/671 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA--DAPSTKRLRRS 470
           + + T+     FLGSRLG+SLL+++T        SS ++E      A  + P +K+ R  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS-VREAADKASAHNEEPPSKKKRVD 430

Query: 471 SSDAL-------QDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
            +          QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 431 PTAGWTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGE 486

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 487 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 546

Query: 555 ----HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                ++ R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 547 RKEEEEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 605

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 606 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 655

Query: 668 DPYVLLGMSDG 678
           DPYV++  ++G
Sbjct: 656 DPYVVIMSAEG 666


>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
          Length = 1439

 Score =  338 bits (867), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 218/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 780  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 836  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 886  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 946  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1006 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1058

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1059 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1114

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1115 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1174

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1175 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1233

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1234 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1293

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1294 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLP 1346

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1347 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1406

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1407 STMERSELAKKIGTTPDIILDDL 1429



 Score =  308 bits (789), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 219/671 (32%), Positives = 345/671 (51%), Gaps = 77/671 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429

Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
            +    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G       
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485

Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
                    L I        + + + + K    ++V   ELPGC  +WTV          
Sbjct: 486 EFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545

Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
             K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ 
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654

Query: 674 GMSDGSIRLLV 684
             ++G + + +
Sbjct: 655 MSAEGHVTMFL 665


>gi|213407244|ref|XP_002174393.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
 gi|212002440|gb|EEB08100.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
          Length = 1431

 Score =  337 bits (864), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 378/1513 (24%), Positives = 641/1513 (42%), Gaps = 248/1513 (16%)

Query: 57   NLVVTAANVIEIY-VVRVQ------EEGSKESKNSGETKRRVLMDGI---------SAAS 100
            NL+V   + ++++ +VRVQ      E+    S N    +    M+           +  +
Sbjct: 29   NLIVAKDDFLQVFDIVRVQRDSDDVEDAFGSSMNLRMEENDAFMETNMQLIRTHEHTVYT 88

Query: 101  LELVCHYRLHGNVESLAILSQ--GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
            L LV   R+ G ++ LA++    GG       D ++L    AK+S+L +D     L   S
Sbjct: 89   LRLVYQTRVFGTIKDLAVVKPKLGGFTT----DLLVLLTNYAKVSILVWDSLTQQLSTVS 144

Query: 159  MHCFES---PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
            MH +ES   P+ +      E+ A+   + VDP+  C  +  YG  M I+   +     + 
Sbjct: 145  MHYYESVVPPKPI----AEETPAQ---LIVDPESTCCVLRFYGDMMAIIPFRKPEDLEME 197

Query: 216  DEDTFGSGG-GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
            D +               S V+    LD  +  V D  F+ GY E  + +L+  E T   
Sbjct: 198  DANAQSEKPVDVQCVYLPSFVLTASQLDYSIARVLDSKFLEGYREATLALLYCPEQTSTV 257

Query: 273  RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY- 331
             +  +  T  ++ +++    +   +I S  NLP+D Y +L VP+P+GG L++G N I Y 
Sbjct: 258  FLPVRKDTVSLAVITLDIEQRASAVITSIHNLPYDIYCILPVPAPLGGSLLLGGNEIIYV 317

Query: 332  HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDL 386
             S  ++  +A+N +  +  + Q   RSSF +EL+      L     ++   LL   TG L
Sbjct: 318  DSAGSTVGIAVNPFYRNATNFQLEDRSSFQLELEGTIGVPLSSPRTESVSVLLIHPTGQL 377

Query: 387  VLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTC 439
              L  + DG+ V+ LDL     + N ++L S +T    + +   FLGS+ GDS LVQ++ 
Sbjct: 378  FYLDFLMDGKNVKNLDLHPASDELNNALLQSGVTCALPVADHELFLGSQTGDSYLVQWSR 437

Query: 440  GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
             S  +         + D E DA           D L D        +Y + S       K
Sbjct: 438  RSINNQTQEEGTLTYKDEENDA-------DEEVDELDD--------IYDTGSKEKAKRNK 482

Query: 500  -----TFSFAVRDSLVNIGPLKDFSYG---------------LRINADASATGIS----- 534
                      V D L N+GP+ +F  G               L +    + TG S     
Sbjct: 483  FVELGPLRLEVHDVLSNVGPIIEFCTGKAGSLAYFPQDNHGPLEVTC-VTGTGKSGSLVV 541

Query: 535  -KQSNYELVE----LPGCKGIWTVYHKSSRGHNADSSRMAAY-DDEYHAYLIISLEARTM 588
             ++S   +VE      GC+ +WT+ H + R  N  S     Y DDEY  YL++S E  + 
Sbjct: 542  FRRSISPVVEGKFNFEGCQSLWTI-HVTGRLKNPRSHGSERYLDDEYDTYLVVSKEKESF 600

Query: 589  VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGP 647
            V    +   EV +S D+  +G TI  G L G  R++Q+     R+ D + ++ Q ++ G 
Sbjct: 601  VFTAGETFDEVEDS-DFNTKGSTINVGGLLGGMRIVQICTTSLRVYDPNIHLVQRINLG- 658

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                     +   V++ S+ DPYV+L +  G I L   D  T  +       +    K V
Sbjct: 659  ---------KKQNVVAASVCDPYVVLVLLGGRILLYSMDAETQRL---IKMDLHKQLKNV 706

Query: 708  SSCTLYHDKGPEPWLRKTSTDAWL-----STGVGE-AIDGADGGP--------------- 746
             + +LY     +P +++  ++  L     S G  +  +DG D  P               
Sbjct: 707  KAASLYSTN--DPVMQELFSELDLGRNNSSPGKSDIQMDGVDTQPDRPSMPAGNQVTETN 764

Query: 747  ---LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
               LD+    +     V ++ G L++  +P ++CV   D F       + T + + L   
Sbjct: 765  VSTLDEQSFAAHFVLFVLHDDGRLKVLHLPTYSCVLECDVF------DLPTVLYDGLSSE 818

Query: 800  E-TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF 858
              TE++ SS+E              +VE+             L        I  Y+ ++ 
Sbjct: 819  RVTEMHESSQE--------------LVEVLATDLGDEAKEAHLLIRSRMNEITVYKPFVC 864

Query: 859  EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET------------- 905
              P  T K++                     LRFS+ P +  TRE T             
Sbjct: 865  SNPV-THKTE---------------------LRFSKIPQEGMTRESTECSLQDLVAETEQ 902

Query: 906  ---PHGAPCQ------------RITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQL 949
               P  A  Q            R+   + I  H   F++G++P + +     + + HP L
Sbjct: 903  ENAPKDASEQKPQKSSSTVDKPRMVALQRIGNHSAVFITGAKPFFLLKTAHSVAKFHPLL 962

Query: 950  CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
             +  I++    H  +   G+I+V     + IC+      YD+ W  +K+ +  + H I Y
Sbjct: 963  SECRILSLASFHTEHAPKGYIFVDENYDINICRFQDDINYDHRWGYKKVNVGRSVHGIAY 1022

Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
               K +Y +  S   L P        +  E G+ +         L RT +     + ++ 
Sbjct: 1023 HPTKMVYAIATST--LTPYE------VTDEEGNVVYPLKNEGEYLPRTNS---GMLELVS 1071

Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
            P      W           E  L VR+V L  +  TK  +  +A+GT+  +GED+A RG 
Sbjct: 1072 P----LTWTVIDRYKFLDYEIPLCVRLVNLEISDVTKLRKPFIAVGTSITKGEDIAVRGS 1127

Query: 1129 VLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
              LF        P +  T      V  +E+KG ++ ++ + G+LL   G K+I+      
Sbjct: 1128 TYLFEIIDVVPQPGHPETRHKLKLVTREEIKGTVAVVSEINGYLLSGQGQKVIVRALEDE 1187

Query: 1184 E-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
            + L G+AF D     VV+ ++ +N ++ GDI +SI F+ + E+  ++ L AK    L   
Sbjct: 1188 DHLVGVAFIDLGSYTVVAKSL-RNLLIFGDIRQSISFVGFAEEPYRMTLFAKGQDPLSVS 1246

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
            + +FL+ G +L   V+D + N++I  Y P+  ES  G++L++R + HVG H+   + L +
Sbjct: 1247 SADFLVQGQSLYFAVADMRGNLRILAYDPENPESHSGERLVTRGDIHVG-HIITAIHL-V 1304

Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
                 DR G     D+ + FA +    DGS+  + P+ E  +RRL  +Q  L + +  V 
Sbjct: 1305 PKMKKDRPGEV-DYDEGDEFACITTNSDGSLQALCPISERVYRRLNIIQNYLANRIETVG 1363

Query: 1363 GLNPRSFRQFHS----NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
            GLNPRS+R  ++    N   HR     I+D  L+ H+  + +  + E+A++ G   S I+
Sbjct: 1364 GLNPRSYRLINTVSSLNNATHR-----ILDGGLIEHFSYMSVAHRQEMAYKCGVPISTIM 1418

Query: 1419 SNLNDLALGTSFL 1431
            ++L +L    +++
Sbjct: 1419 NDLVELDEALNYM 1431


>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit [Desmodus rotundus]
          Length = 1444

 Score =  337 bits (863), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 223/700 (31%), Positives = 346/700 (49%), Gaps = 81/700 (11%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 785  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+  +             
Sbjct: 841  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAFAHD------------- 881

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
                   S +    L+ +RF + P +   RE+ P  +                    R  
Sbjct: 882  -------SQLGQGNLK-VRFKKVPHNINFREKKPKPSKKKADGGGAEEGAGARGRVARFR 933

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 934  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 993

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
            QG L+I  LP+  +YD  WPV+KIPL+ T H + Y  E  +Y +  S     P  +    
Sbjct: 994  QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR---- 1047

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
             I +  G + +   +   D +     E + ++++ P      W+    A I ++  E+  
Sbjct: 1048 -IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVT 1102

Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
             ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +T+    
Sbjct: 1103 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1162

Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
             +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   LY+  +  VKN
Sbjct: 1163 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1221

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            FIL  D+ KSI  L ++E    L+L+++D   L+ ++ +F++D + L  +VSD  +N+ +
Sbjct: 1222 FILAADVMKSISLLRYQEDSKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMV 1281

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
            + Y P+  ES+ G +LL RA+FHVGAHV  F R       +   GAA G  K      N+
Sbjct: 1282 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENK 1334

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
                F TLDG IG + P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  + 
Sbjct: 1335 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQN 1394

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
               +I+D ELL+ Y  L   E+ E+A + GTT   IL +L
Sbjct: 1395 AVRNILDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1434



 Score =  300 bits (767), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 216/675 (32%), Positives = 346/675 (51%), Gaps = 80/675 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRSTEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTSGTTAFP 314

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +      LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 315 LRTQEGVRTTLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAAASVLT 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR---- 468
           S + T+     FLGSRLG+SLL+++T        +S ++E     + + PS+K+ R    
Sbjct: 375 SSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEAPASAVREA---ADKEEPSSKKKRVDPT 430

Query: 469 ---RSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
                     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G   
Sbjct: 431 VGWSGGQSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486

Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
                        L I        + + + + K    ++V   ELPGC  +WTV      
Sbjct: 487 FLSEEFQNSPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVVAPVRK 546

Query: 555 --HKSSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
              ++ +G   +   +      D   H +LI+S E  TM+L+T   + E+  S  +  QG
Sbjct: 547 EQEETPKGEGTEQEPITPETEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655

Query: 670 YVLLGMSDGSIRLLV 684
            V++  ++G + + +
Sbjct: 656 CVVIMSAEGHVAMFL 670


>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Equus caballus]
          Length = 1450

 Score =  336 bits (862), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 217/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 791  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 846

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 847  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 896

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 897  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 956

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 957  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1016

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1017 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIER 1069

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1070 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1125

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1126 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1185

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1186 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1244

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1245 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1304

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1305 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1357

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1358 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1417

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1418 STMERGELAKKIGTTPDIILDDL 1440



 Score =  303 bits (775), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 219/681 (32%), Positives = 350/681 (51%), Gaps = 86/681 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           + + T+     FLGSRLG+SLL+++T        +S ++E     E + P +K+ R  S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430

Query: 473 -------------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
                           QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + 
Sbjct: 431 VGWSGSPRAAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANA 486

Query: 519 SYG----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
           + G                L I        + + + + K    ++V   ELPGC  +WTV
Sbjct: 487 AMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 546

Query: 554 Y-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
                    ++ +G   +   S+  A  D   H +LI+S E  TM+L+T   + E+  S 
Sbjct: 547 IAPVRKEQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS- 605

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
            +  QG T+ AGN+   R ++QV   G R+L+G      L F P +         S ++ 
Sbjct: 606 GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQ 655

Query: 664 VSIADPYVLLGMSDGSIRLLV 684
            ++ADPYV++  ++G + + +
Sbjct: 656 CAVADPYVVIMSAEGHVTMFL 676


>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform 1 [Equus caballus]
          Length = 1444

 Score =  336 bits (862), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 217/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 785  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 841  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 891  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 950

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 951  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1010

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1011 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIER 1063

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1064 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1119

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1120 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1179

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1180 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1238

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1239 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1298

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1299 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1351

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1352 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1411

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1412 STMERGELAKKIGTTPDIILDDL 1434



 Score =  305 bits (781), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 219/675 (32%), Positives = 350/675 (51%), Gaps = 80/675 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           + + T+     FLGSRLG+SLL+++T        +S ++E     E + P +K+ R  S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430

Query: 473 -------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
                     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G   
Sbjct: 431 VGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486

Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
                        L I        + + + + K    ++V   ELPGC  +WTV      
Sbjct: 487 FLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 546

Query: 555 --HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
              ++ +G   +   S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG
Sbjct: 547 EQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655

Query: 670 YVLLGMSDGSIRLLV 684
           YV++  ++G + + +
Sbjct: 656 YVVIMSAEGHVTMFL 670


>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Saimiri boliviensis boliviensis]
          Length = 1390

 Score =  336 bits (862), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 223/700 (31%), Positives = 349/700 (49%), Gaps = 81/700 (11%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 731  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 787  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
                LS  N+       +RF + P +   RE               E   GA  +  R  
Sbjct: 830  ----LSQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSAEEGAGARGRVARFR 879

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG + +F   HN+NC  GF+Y   
Sbjct: 880  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNR 939

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
            QG L+I  LP+  +YD  WPV+KIPL+ T H + Y  E  +Y +  S     P  +    
Sbjct: 940  QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR---- 993

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
             I +  G + +   +   + +     E + ++++ P      W+    A I +Q  E+  
Sbjct: 994  -IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVT 1048

Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
             ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +T+    
Sbjct: 1049 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1108

Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
             +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   LY+  +  VKN
Sbjct: 1109 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1167

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            FIL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ +
Sbjct: 1168 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMV 1227

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
            + Y P+  ES+ G +LL RA+FHVGAHV  F R       +   GA  G  K      N+
Sbjct: 1228 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENK 1280

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
                F TLDG IG + P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  + 
Sbjct: 1281 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQN 1340

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
               +++D ELL+ Y  L   E+ E+A + GTT   IL +L
Sbjct: 1341 AVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1380



 Score =  273 bits (699), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 203/645 (31%), Positives = 322/645 (49%), Gaps = 105/645 (16%)

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
           S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G 
Sbjct: 2   SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
                 P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  + 
Sbjct: 55  VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--------------------------GGV 321
           ++ T K HP+IWS  +LP D  + LAVP PI                          GGV
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGENPGGAEGSAGRGAVSLPTSLCPPPGGV 226

Query: 322 LVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
           ++   N++ Y +QS     +ALN+      +     +    + LD A AT++  D  ++S
Sbjct: 227 VIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQATFISYDKMVIS 286

Query: 381 TKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
            K G++ +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T 
Sbjct: 287 LKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTE 346

Query: 440 G----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNT 494
                  +++  +G KEE    +    +T           QD V+  E+ +YGS A + T
Sbjct: 347 KLQEPPASAVREAGDKEEPPSKKKRVDATAGWSAGGKSVPQDEVD--EIEVYGSEAQSGT 404

Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASATG 532
           + A  T+SF V DS++NIGP  + + G                L I        + + + 
Sbjct: 405 QLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSV 462

Query: 533 ISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYL 579
           + K    ++V   ELPGC  +WTV          +    G   + S   A DD   H +L
Sbjct: 463 LQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEENPKGEGTEQEPSTPEADDDSRRHGFL 522

Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
           I+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G   
Sbjct: 523 ILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNVGDDRYIVQVSPLGIRLLEG--- 578

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
              L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 579 VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616


>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 1 [Cavia porcellus]
          Length = 1440

 Score =  335 bits (860), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 217/683 (31%), Positives = 340/683 (49%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 781  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 836

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 837  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 887  VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 946

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 947  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1007 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIER 1059

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1060 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1115

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1116 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1175

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1176 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1234

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1235 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1294

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GA  G  K      N+    F TLDG IG + P
Sbjct: 1295 RRADFHVGAHVNTFWR-------TPCRGATEGPSKKSVVWENKHITWFATLDGGIGLLLP 1347

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1348 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1407

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1408 STMERGELAKKIGTTPDIILDDL 1430



 Score =  271 bits (694), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 209/678 (30%), Positives = 332/678 (48%), Gaps = 90/678 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   + + A     +      GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +              I      ++SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 83  ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+  + 
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 423

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH- 555
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 480 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 539

Query: 556 --------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                     + G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 540 VRKEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 598

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 599 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 648

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 649 ADPYVVIMSAEGHVTMFL 666


>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
            [Mus musculus]
          Length = 1450

 Score =  335 bits (860), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 215/675 (31%), Positives = 337/675 (49%), Gaps = 47/675 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1060

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1061 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1116

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1117 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1176

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1177 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1235

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1236 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1295

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1296 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1348

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1349 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1408

Query: 1399 PLEEQLEIAHQTGTT 1413
               E+ E+A + GTT
Sbjct: 1409 STMERSELAKKIGTT 1423



 Score =  307 bits (787), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667


>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
 gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
 gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
          Length = 1365

 Score =  335 bits (858), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 215/685 (31%), Positives = 340/685 (49%), Gaps = 51/685 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 706  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 761

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 762  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 811

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 812  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 871

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 872  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 931

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
             WPV+KIPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   +
Sbjct: 932  PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 982

Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
               + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T   
Sbjct: 983  ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1038

Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
             +  +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL 
Sbjct: 1039 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1098

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
               GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L 
Sbjct: 1099 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1157

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +
Sbjct: 1158 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1217

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
            LL RA+FHVGAHV  F R       +   GA  G  K      N+    F TLDG IG +
Sbjct: 1218 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1270

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
             P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y 
Sbjct: 1271 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1330

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
             L   E+ E+A + GTT   IL +L
Sbjct: 1331 YLSTMERSELAKKIGTTPDIILDDL 1355



 Score =  287 bits (734), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 202/620 (32%), Positives = 322/620 (51%), Gaps = 80/620 (12%)

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
           S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G 
Sbjct: 2   SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
                 P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  + 
Sbjct: 55  VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYA 346
           ++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+  
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLT 226

Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
               +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K
Sbjct: 227 TGTTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDK 286

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADA 461
              SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +   
Sbjct: 287 AAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRV 346

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
            +T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + 
Sbjct: 347 DATAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAV 402

Query: 521 G----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY- 554
           G                L I        + + + + K    ++V   ELPGC  +WTV  
Sbjct: 403 GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA 462

Query: 555 --------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVD 604
                   +    G   + S     DD+   H +LI+S E  TM+L+T   + E+  S  
Sbjct: 463 PVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-G 521

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
           +  QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  
Sbjct: 522 FATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQC 571

Query: 665 SIADPYVLLGMSDGSIRLLV 684
           ++ADPYV++  ++G + + +
Sbjct: 572 AVADPYVVIMSAEGHVTMFL 591


>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Papio anubis]
          Length = 1389

 Score =  335 bits (858), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 215/683 (31%), Positives = 339/683 (49%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 730  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 785

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 786  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 835

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 836  VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGXRGRVARFRYFEDIYGYSGVFICGPS 895

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 896  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 955

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S              I +  G + +   +  
Sbjct: 956  PWPVRKIPLRCTAHYVAYHVESKVYAVATS-------TNTPCARIPRMTGEEKEFETIER 1008

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T    +
Sbjct: 1009 DERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLK 1064

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1065 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1124

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1125 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1183

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1184 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1243

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GA  G  K      N+    F TLDG IG + P
Sbjct: 1244 RRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLP 1296

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1297 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYL 1356

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1357 STMERSELAKKIGTTPDIILDDL 1379



 Score =  272 bits (696), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 202/644 (31%), Positives = 321/644 (49%), Gaps = 104/644 (16%)

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
           S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G 
Sbjct: 2   SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
                 P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  + 
Sbjct: 55  VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-------------------------GGVL 322
           ++ T K HP+IWS  +LP D  + LAVP PI                         GGV+
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGEYPGSGWGCVEGALSLPTSLCPPPGGVV 226

Query: 323 VVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
           V   N++ Y +QS     +ALN+      +     +    + LD A AT++  D  ++S 
Sbjct: 227 VFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRIQEGVRITLDCAQATFISYDKMVISL 286

Query: 382 KTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
           K G++ +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T  
Sbjct: 287 KGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEK 346

Query: 441 ----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNTE 495
                 +++  +  KEE    +    +T           QD V+  E+ +YGS A + T+
Sbjct: 347 LQEPPASAVREAADKEEPPSKKKRVDATASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQ 404

Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASATGI 533
            A  T+SF V DS++NIGP  + + G                L I        + + + +
Sbjct: 405 LA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVL 462

Query: 534 SKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYLI 580
            K    ++V   ELPGC  +WTV          +    G   ++    A DD   H +LI
Sbjct: 463 QKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLI 522

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
           +S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G    
Sbjct: 523 LSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---V 578

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
             L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 579 NQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 615


>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Gorilla gorilla gorilla]
          Length = 1440

 Score =  334 bits (857), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 215/685 (31%), Positives = 340/685 (49%), Gaps = 51/685 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 781  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 836

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 837  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 887  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 946

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 947  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
             WPV+KIPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   +
Sbjct: 1007 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1057

Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
               + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T   
Sbjct: 1058 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1113

Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
             +  +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL 
Sbjct: 1114 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1173

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
               GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L 
Sbjct: 1174 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1232

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +
Sbjct: 1233 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1292

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
            LL RA+FHVGAHV  F R       +   GA  G  K      N+    F TLDG IG +
Sbjct: 1293 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1345

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
             P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y 
Sbjct: 1346 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1405

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
             L   E+ E+A + GTT   IL +L
Sbjct: 1406 YLSTMERSELAKKIGTTPDIILDDL 1430



 Score =  287 bits (735), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 210/675 (31%), Positives = 336/675 (49%), Gaps = 84/675 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP G C  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGTCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T        +S ++E     + + P +K+ 
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---ADKEEPPSKKK 422

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR---DSLVNIGPLKDFSYG--- 521
           R  ++               G  +     A    + AV    DS++NIGP  + + G   
Sbjct: 423 RVDATAGWSGEGRSRAGQERGQVTQGWSGAGAPLTVAVPQVCDSILNIGPCANAAMGEPA 482

Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
                        L I        + + + + K    ++V   ELPGC  +WTV      
Sbjct: 483 FLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 542

Query: 555 ----HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
               +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  QG
Sbjct: 543 EEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 601

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADP
Sbjct: 602 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADP 651

Query: 670 YVLLGMSDGSIRLLV 684
           YV++  ++G + + +
Sbjct: 652 YVVIMSAEGHVTMFL 666


>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
            [Bos grunniens mutus]
          Length = 1417

 Score =  334 bits (857), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 214/675 (31%), Positives = 339/675 (50%), Gaps = 47/675 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 775  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 830

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 831  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 880

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 881  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 940

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 941  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1000

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +V  +      G + +   +  
Sbjct: 1001 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIER 1053

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             + +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1054 DERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1109

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1110 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1169

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1170 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1228

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1229 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1288

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1289 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1341

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1342 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYL 1401

Query: 1399 PLEEQLEIAHQTGTT 1413
               E+ E+A + GTT
Sbjct: 1402 SPMERGELAKKIGTT 1416



 Score =  280 bits (717), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 206/634 (32%), Positives = 320/634 (50%), Gaps = 89/634 (14%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LELV  +   GNV S+A +   GA    +RD+++L       SV+E+D   H L+  S+H
Sbjct: 66  LELVASFSFFGNVMSMASVQLAGA----KRDALLL-------SVVEYDPGTHDLKTLSLH 114

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVG 215
            FE PE   L+ G       P V+VDP GRC  +L+YG ++++L       ++   GLVG
Sbjct: 115 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVG 171

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
           +        G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GR
Sbjct: 172 E--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGR 223

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           V+ +  TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +
Sbjct: 224 VAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLN 283

Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
           QS     +ALN+      +     +    + LD A A ++  D  ++S K G++ +LT++
Sbjct: 284 QSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLI 343

Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        S+   
Sbjct: 344 TDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA-- 401

Query: 452 EEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAV 505
            E  D E      KR+  +     S    QD V+  E+ +YGS A + T+ A  T+SF V
Sbjct: 402 REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEV 457

Query: 506 RDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV- 542
            DS++NIGP  + + G                L I        + + + + K    ++V 
Sbjct: 458 CDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVT 517

Query: 543 --ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVL 590
             ELPGC  +WTV     +         G   +     A DD   H +LI+S E  TM+L
Sbjct: 518 TFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMIL 577

Query: 591 ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
           +T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P + 
Sbjct: 578 QTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL 633

Query: 651 ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                   S ++  ++ADPYV++  ++G + + +
Sbjct: 634 -------GSPIVQCAVADPYVVIMSAEGHVTMFL 660


>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 3 [Cricetulus griseus]
          Length = 1449

 Score =  334 bits (857), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 215/675 (31%), Positives = 337/675 (49%), Gaps = 47/675 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1060

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1061 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1116

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1117 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1176

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1177 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1235

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1236 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1295

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1296 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLP 1348

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1349 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1408

Query: 1399 PLEEQLEIAHQTGTT 1413
               E+ E+A + GTT
Sbjct: 1409 STMERSELAKKIGTT 1423



 Score =  302 bits (774), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  ++ 
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429

Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
            ++ R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667


>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
 gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
 gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
          Length = 1442

 Score =  334 bits (857), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 216/685 (31%), Positives = 341/685 (49%), Gaps = 51/685 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 783  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 839  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 889  VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 949  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
             WPV+KIPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   +
Sbjct: 1009 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1059

Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
               + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T   
Sbjct: 1060 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1115

Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
             +  +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL 
Sbjct: 1116 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1175

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
               GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L 
Sbjct: 1176 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1234

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +
Sbjct: 1235 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1294

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
            LL RA+FHVGAHV  F R       +   GA  G  K      N+    F TLDG IG +
Sbjct: 1295 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1347

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
             P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y 
Sbjct: 1348 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1407

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
             L   E+ E+A + GTT   IL +L
Sbjct: 1408 YLSTMERSELAKKIGTTPDIILDDL 1432



 Score =  304 bits (779), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 219/677 (32%), Positives = 347/677 (51%), Gaps = 86/677 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T           QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                 +    G   ++    A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 543 RKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668


>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
            [Heterocephalus glaber]
          Length = 1440

 Score =  334 bits (856), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 216/682 (31%), Positives = 341/682 (50%), Gaps = 46/682 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SSGQPTTQGEARKEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
            P W +V    LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD  
Sbjct: 948  PHWLLVTGRGLRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAP 1007

Query: 993  WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1052
            WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +   
Sbjct: 1008 WPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIERD 1060

Query: 1053 DLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ET 1109
            D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    + 
Sbjct: 1061 DRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKG 1116

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQ 1164
             +A GT  +QGE+V  RGRV  +        P   +T+     +Y KE KG ++AL    
Sbjct: 1117 YVAAGTCLMQGEEVTCRGRVRDWERIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCN 1176

Query: 1165 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
            GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++E
Sbjct: 1177 GHLVSAIGQKIFLWSLRASELTGMAFIDT-QLYIHQMISVKNFILAADVMKSISLLRYQE 1235

Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
            +   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL 
Sbjct: 1236 ESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLR 1295

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPL 1339
            RA+FHVGAHV  F R       +   GA+ G  K      N+    F TLDG IG + P+
Sbjct: 1296 RADFHVGAHVNTFWR-------TPCRGASEGPSKKSVVWENKHITWFATLDGGIGLLLPM 1348

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
             E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L 
Sbjct: 1349 QEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLS 1408

Query: 1400 LEEQLEIAHQTGTTRSQILSNL 1421
              E+ E+A + GTT   IL +L
Sbjct: 1409 TMERGELAKKIGTTPDIILDDL 1430



 Score =  303 bits (775), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 221/684 (32%), Positives = 351/684 (51%), Gaps = 101/684 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEGLTKNDKTTEGKSHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE        P 
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTVREAADKEE--------PP 418

Query: 464 TKRLRRSSSDAL-------QDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPL 515
           +K+ R  S+          QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP 
Sbjct: 419 SKKKRVDSAAGWAGNKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPC 474

Query: 516 KDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGCKGI 550
            + + G                L I        + + + + K    ++V   ELPGC  +
Sbjct: 475 ANAAVGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDM 534

Query: 551 WTVYH---------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVT 600
           WTV             + G   + S   A DD   H +LI+S E  TM+L+T   + E+ 
Sbjct: 535 WTVIAPVRKEEEETPKAEGSEQEPSAPEAQDDGRRHGFLILSREDSTMILQTGQEIMELD 594

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
            S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +         + 
Sbjct: 595 TS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAP 643

Query: 661 VLSVSIADPYVLLGMSDGSIRLLV 684
           ++  ++ADPYV++  ++G + + +
Sbjct: 644 IVQCAVADPYVVIMSAEGHVTMFL 667


>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Pan paniscus]
 gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
            troglodytes]
          Length = 1442

 Score =  334 bits (856), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 215/685 (31%), Positives = 340/685 (49%), Gaps = 51/685 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 783  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 839  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 889  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 949  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
             WPV+KIPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   +
Sbjct: 1009 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1059

Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
               + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T   
Sbjct: 1060 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1115

Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
             +  +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL 
Sbjct: 1116 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1175

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
               GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L 
Sbjct: 1176 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1234

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +
Sbjct: 1235 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1294

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
            LL RA+FHVGAHV  F R       +   GA  G  K      N+    F TLDG IG +
Sbjct: 1295 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1347

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
             P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y 
Sbjct: 1348 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1407

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
             L   E+ E+A + GTT   IL +L
Sbjct: 1408 YLSTMERSELAKKIGTTPDIILDDL 1432



 Score =  305 bits (781), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                 +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668


>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
            sapiens]
 gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo sapiens]
 gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_c
            [Homo sapiens]
 gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
 gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
 gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
          Length = 1443

 Score =  334 bits (856), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 215/685 (31%), Positives = 340/685 (49%), Gaps = 51/685 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 784  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 840  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 890  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 950  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
             WPV+KIPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   +
Sbjct: 1010 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1060

Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
               + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T   
Sbjct: 1061 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1116

Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
             +  +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL 
Sbjct: 1117 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1176

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
               GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L 
Sbjct: 1177 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1235

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +
Sbjct: 1236 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1295

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
            LL RA+FHVGAHV  F R       +   GA  G  K      N+    F TLDG IG +
Sbjct: 1296 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1348

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
             P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y 
Sbjct: 1349 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1408

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
             L   E+ E+A + GTT   IL +L
Sbjct: 1409 YLSTMERSELAKKIGTTPDIILDDL 1433



 Score =  305 bits (780), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                 +    G   + S     DD+   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669


>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
          Length = 1442

 Score =  333 bits (855), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 213/680 (31%), Positives = 338/680 (49%), Gaps = 42/680 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 784  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 840  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 890  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 950  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
             WPV+KIPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   +
Sbjct: 1010 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1060

Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
               + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T   
Sbjct: 1061 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1116

Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
             +  +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL 
Sbjct: 1117 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1176

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
               GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L 
Sbjct: 1177 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1235

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +
Sbjct: 1236 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1295

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
            LL RA+FHVGAHV  F R    AT      +       N+    F TLDG IG + P+ E
Sbjct: 1296 LLRRADFHVGAHVNTFWRTPCRATEGLSKKSVVWE---NKHITWFATLDGGIGLLLPMQE 1352

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
             T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L   
Sbjct: 1353 KTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTM 1412

Query: 1402 EQLEIAHQTGTTRSQILSNL 1421
            E+ E+A + GTT   IL +L
Sbjct: 1413 ERSELAKKIGTTPDIILDDL 1432



 Score =  305 bits (780), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                 +    G   + S     DD+   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669


>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
            rerio]
          Length = 1449

 Score =  333 bits (854), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 217/696 (31%), Positives = 357/696 (51%), Gaps = 67/696 (9%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +   S T+     EE T 
Sbjct: 790  WCLLVRENGVMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTR 845

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG   +I  +K  E+A+     +HSRP+L A + +  +L Y+A+ ++  +          
Sbjct: 846  QG---DIPLVK--EVALVSLGYNHSRPYLLAHV-EQELLIYEAFPYDQQQ---------- 889

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PHG---------APCQRIT 915
                      + S L+ +RF + P +   RE+         P G             R  
Sbjct: 890  ----------AQSNLK-VRFKKMPHNINYREKKVKVRKDKKPEGQGEDTLGVKGRVARFR 938

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++ISG+ G F+ G  P W +V  R  +R+HP   DG+I +F+  HN+NC  GF+Y   
Sbjct: 939  YFQDISGYSGVFICGPSPHWMLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNK 998

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
            QG L+I  LP+  +YD  WPV+KIPL+ T H ++Y  E  +Y +  SV   +P  +    
Sbjct: 999  QGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVESKVYAVCTSVK--EPCTR---- 1052

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
             I +  G + +   +   + +     +++ ++++ P        TR  + ++  E+   +
Sbjct: 1053 -IPRMTGEEKEFETIERDERYIHPQQDKFSIQLISPVSWEAIPNTR--VDLEEWEHVTCM 1109

Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1148
            + V L +  T    +  +A+GT  +QGE+V  RGR+L+         P   +T+     +
Sbjct: 1110 KTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVL 1169

Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
            Y KE KG ++AL    G L+ A G KI L      +L G+AF D   LY+  +  +KNFI
Sbjct: 1170 YEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKDNDLTGMAFIDTQ-LYIHQMYSIKNFI 1228

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            L  D+ KSI  L ++ +   L+L+++D   L+ ++ EF++D + L  +VSD  KN+ ++ 
Sbjct: 1229 LAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLMVYM 1288

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
            Y P+  ES+ G +LL RA+F+VG+HV  F R+    T       A   D  N+    F T
Sbjct: 1289 YLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWD--NKHITWFAT 1346

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
            LDG +G + P+ E T+RRL  LQ  L   +PH AGLNP++FR  H + +  +    +I+D
Sbjct: 1347 LDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILD 1406

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             ELL+ Y  L   E+ E+A + GTT   IL +L ++
Sbjct: 1407 GELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEI 1442



 Score =  292 bits (747), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 209/641 (32%), Positives = 334/641 (52%), Gaps = 78/641 (12%)

Query: 94  DGIS-AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
           DG S    LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H
Sbjct: 58  DGKSRKEKLEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTH 113

Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
            L+  S+H FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L   +    
Sbjct: 114 DLKTLSLHYFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT-- 168

Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTW 270
            + DE     G G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW
Sbjct: 169 -LADEQEGIVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTW 227

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
            GRV+ +  TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ 
Sbjct: 228 PGRVAVRQDTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLL 287

Query: 331 YHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
           Y +QS     ++LN+      +    P+    + LD + A+++ +D  ++S K G++ +L
Sbjct: 288 YLNQSVPPFGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVL 347

Query: 390 TVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           T++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T     + +  
Sbjct: 348 TLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEE 407

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEELSLYGS-ASNNTESAQKT 500
           G + E  + +   P  K+ R  S+ A       L D ++  E+ +YGS A + T+ A  T
Sbjct: 408 GKENEEKEKQ---PPNKKKRVDSNWAGCPGKGNLPDELD--EIEVYGSEAQSGTQLA--T 460

Query: 501 FSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV----- 542
           +SF V DS++NIGP    S G         + N +        +G  K     ++     
Sbjct: 461 YSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSIR 520

Query: 543 -------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLIISL 583
                  ELPGC  +WTV +   +         G + +  +       D + H +LI+S 
Sbjct: 521 PQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSR 580

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
           E  TM+L+T   + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G      L
Sbjct: 581 EDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQL 636

Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
            F P +         S ++  S+ADPYV++  ++G + + V
Sbjct: 637 HFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 670


>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit, partial [Rhipicephalus pulchellus]
          Length = 726

 Score =  332 bits (852), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 224/700 (32%), Positives = 354/700 (50%), Gaps = 74/700 (10%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGTGQG 814
            V  E+G LEI+ +P +   F V  F  G+  +VD+    A   +++E ++  S E     
Sbjct: 73   VARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHESMPV- 131

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                +H + VV L ++     HSRP L A + D  +L Y+A+ F              T 
Sbjct: 132  ----VHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF------------YETQ 169

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIFKNISG 922
            R   +          LRF +   D + RE     + P     ++       +  F +ISG
Sbjct: 170  REGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPFSDISG 220

Query: 923  HQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G FL G RP W  M  R  LR HP   DG I  F   HNVNC  GF++   QG L+I 
Sbjct: 221  YSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQGELRIS 280

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
             LP+  TYD  WPV+K+PL+ TPH + Y  +   Y ++ S P   P N ++        G
Sbjct: 281  TLPTHLTYDAPWPVRKVPLRCTPHFVNYHVDSKTYCVVTSQP--DPCNHLVRF-----TG 333

Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL 1099
             + +   L     +   T++++ +++L P      W+T     + +   E+   ++ V L
Sbjct: 334  EEKEYELLERDSRYIFPTMDKFSLQLLSP----VSWETIPNTRVDLDEWEHLTCLKNVML 389

Query: 1100 FNT-TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKEL 1153
             +  TT   +  LA+GT Y  GEDV +RGR+++         P     +N +  VYSKE 
Sbjct: 390  SSEGTTTGMKGYLALGTNYCYGEDVTSRGRIIILDIIDVVPEPGQPLTKNKIKIVYSKEQ 449

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
            KG ++AL+ + G LL A G KI + +    EL G+AF D   +Y+ S+  VKN IL+GD+
Sbjct: 450  KGPVTALSQVVGFLLSAIGQKIYIWQLKDNELVGVAFIDTQ-IYIHSVVTVKNLILVGDV 508

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
             KS+  L ++E    L+L+++D   L+ +A EF ID + +S +V+D ++N+ ++ Y P+ 
Sbjct: 509  FKSVSLLRYQEASRTLSLVSRDVRPLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPES 568

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
             ES  GQ+LL R +FHVG+ V    R++       +      S    R   +  TLDGS+
Sbjct: 569  RESCGGQRLLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSL 628

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN----GKAHRPGPDSIVDC 1389
              + P+ E T+RRL  LQ  LV ++PH AGLNP+++R ++S     G  H+    +I+D 
Sbjct: 629  AYVLPVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHK----NILDG 684

Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            EL+  +  L   E+ E++ + GTT +QI  +L ++   T+
Sbjct: 685  ELIWKFMHLSFMERSELSKKIGTTVTQITDDLLEIETYTA 724


>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
          Length = 1105

 Score =  332 bits (851), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 217/696 (31%), Positives = 356/696 (51%), Gaps = 67/696 (9%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +   S T+     EE T 
Sbjct: 446  WCLLVRENGVMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTR 501

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG   +I  +K  E+A+      HSRP+L A + +  +L Y+A+ ++  +          
Sbjct: 502  QG---DIPLVK--EVALVSLGYSHSRPYLLAHV-EQELLIYEAFPYDQQQ---------- 545

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PHG---------APCQRIT 915
                      + S L+ +RF + P +   RE+         P G             R  
Sbjct: 546  ----------AQSNLK-VRFKKMPHNINYREKKVKVRKDKKPEGQGEDSLGVKGRVARFR 594

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++ISG+ G F+ G  P W +V  R  +R+HP   DG+I +F+  HN+NC  GF+Y   
Sbjct: 595  YFQDISGYSGVFICGPSPHWMLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNK 654

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
            QG L+I  LP+  +YD  WPV+KIPL+ T H ++Y  E  +Y +  SV   +P  +    
Sbjct: 655  QGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVESKVYAVCTSVK--EPCTR---- 708

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
             I +  G + +   +   + +     +++ ++++ P        TR  + ++  E+   +
Sbjct: 709  -IPRMTGEEKEFETIERDERYIHPQQDKFSIQLISPVSWEAIPNTR--VDLEEWEHVTCM 765

Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1148
            + V L +  T    +  +A+GT  +QGE+V  RGR+L+         P   +T+     +
Sbjct: 766  KTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVL 825

Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
            Y KE KG ++AL    G L+ A G KI L      +L G+AF D   LY+  +  +KNFI
Sbjct: 826  YEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKYNDLTGMAFIDTQ-LYIHQMYSIKNFI 884

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            L  D+ KSI  L ++ +   L+L+++D   L+ ++ EF++D + L  +VSD  KN+ ++ 
Sbjct: 885  LAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLMVYM 944

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
            Y P+  ES+ G +LL RA+F+VG+HV  F R+    T       A   D  N+    F T
Sbjct: 945  YLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWD--NKHITWFAT 1002

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
            LDG +G + P+ E T+RRL  LQ  L   +PH AGLNP++FR  H + +  +    +I+D
Sbjct: 1003 LDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILD 1062

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             ELL+ Y  L   E+ E+A + GTT   IL +L ++
Sbjct: 1063 GELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEI 1098



 Score =  100 bits (250), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 96/342 (28%), Positives = 159/342 (46%), Gaps = 62/342 (18%)

Query: 388 LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
           +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T      + 
Sbjct: 2   VLTLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT----EKLQ 57

Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG------EELSLYGS-ASNNTESAQK 499
            + ++E   + E +     + +R  S+       G      +E+ +YGS A + T+ A  
Sbjct: 58  ETPMEEGKENEEKEKEPPNKKKRVDSNWAGCPKKGNLPDELDEIEVYGSEAQSGTQLA-- 115

Query: 500 TFSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV---- 542
           T+SF V DS++NIGP    S G         + N +        +G  K     ++    
Sbjct: 116 TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSI 175

Query: 543 --------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLIIS 582
                   ELPGC  +WTV +   +         G + +  +       D + H +LI+S
Sbjct: 176 RPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILS 235

Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
            E  TM+L+T   + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G      
Sbjct: 236 REDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQ 291

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
           L F P +         S ++  S+ADPYV++  ++G + + V
Sbjct: 292 LHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 326


>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
 gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
          Length = 1003

 Score =  328 bits (841), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 268/962 (27%), Positives = 429/962 (44%), Gaps = 152/962 (15%)

Query: 543  ELPGCKGIWTVY------HKSSRGHNADSSRMAAYDDEY-----------------HAYL 579
            +LPGC  +WTV            G  A+S+      +                   H +L
Sbjct: 107  DLPGCLDMWTVIGIPPESKPQEEGEKAESAGSEEKPEGEKEETKEEGPPDVDLTNSHGFL 166

Query: 580  IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
            I+S E  TMVL+T   + E+  S  +  QG T+ AGN+   + +IQV   G R+L G   
Sbjct: 167  ILSREDSTMVLQTGKEIMELDHS-GFSTQGPTVYAGNIGNNKYIIQVSPYGIRLLQGVKQ 225

Query: 640  TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTP 697
             Q L F          S+    +  S+ADPY L+   DG I LL  V DP      +   
Sbjct: 226  LQHLPFD---------SKGPAFVLASVADPYALVMSEDGQILLLTLVNDPYGSGHRLSAK 276

Query: 698  AAIESSKKPVSSCTLYHDKG------------PEPWLRKTSTDAW------LSTGV---- 735
                + K    +   Y D              P P + K + +        + TGV    
Sbjct: 277  KIDMAGKSQAITVCAYRDTSGLFTVSSPSTTTPAPEVEKDAAEPAAEDAVAMETGVDDED 336

Query: 736  ----GEAIDGADGGPLDQGDI-------------------YSVVCYESGALEIFDVPNFN 772
                GE      G  + + ++                   + V+C E+G+LEI+++P+F+
Sbjct: 337  EMLYGEPSAKPSGPAVVREEVKPSTSTVQEPVVKEVEPTHWCVICRENGSLEIYNLPDFS 396

Query: 773  CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
             V+ V  F +G   +VD++   +   +                K+      V E+ M   
Sbjct: 397  LVYLVKNFPTGMKLLVDSFQSTSSASTSQS------------DKQGDQLASVKEILMVGL 444

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
                SRP L A + D  +L Y+A+    P + S    P  T   +    V  + +   R 
Sbjct: 445  GHKGSRPHLLARV-DEDLLIYEAF----PYHLS----PSYTMLKIRFKKVQHNLILRERK 495

Query: 893  SRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQL 949
                  A  +EE+    G+  Q    F +ISG+ G F+ GS P W  M  R  LR+HP  
Sbjct: 496  GGKTKKAGDQEESDGQTGSRIQHFRTFTDISGYSGLFICGSSPHWLFMTSRGALRIHPMS 555

Query: 950  CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
             DG++  F+  HNVNC  GF+Y    G L+I  LP+  +YD  WPV+K+PL+ TPH + Y
Sbjct: 556  IDGAVTCFSPFHNVNCPKGFLYFNRGGELRISVLPTHLSYDAPWPVRKVPLRCTPHFVAY 615

Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
              E  +Y   V+    +  N++  +  D++    ++       D +    ++++ ++++ 
Sbjct: 616  HMECKVY--AVAASTFEMCNRIPRMAGDEKEYDAVEKD-----DRYIYPMLDKFNIQLMS 668

Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRV 1129
            P        TR    MQ  EN               E      +G  +V    +   G++
Sbjct: 669  PVSWEIIPNTRG---MQLEENY-------------AECTCSFLVGINFV----LFVAGQI 708

Query: 1130 LLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
            ++         P     +N + E+Y KE KG +SAL    G+LL A G KI L ++   +
Sbjct: 709  VILDVIEVVPEPGQPLTKNKIKELYGKEQKGPVSALCGCNGYLLSAIGQKIFLWEFRNND 768

Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
            L G+AF D   +Y+ +   +KN+++L D+ KSI  L ++           D   L+ +  
Sbjct: 769  LIGVAFIDTQ-VYIHTAISIKNYVILADVFKSISLLRYQ-----------DMRPLETYCV 816

Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QM 1302
            EF +D + +  +VSD QKN  ++ Y P+  ES+ GQ+L+ RA+F+VG+HV  F R+  ++
Sbjct: 817  EFFVDNAQIGFLVSDAQKNFLLYSYQPEARESYGGQRLVRRADFNVGSHVNTFFRVRCKI 876

Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
            +  S +R   A    K  R   +F TLDG +G + P+ E T+RRL  LQ  L+  +P  A
Sbjct: 877  MDPSGERRRDADTVAK--RHVTMFATLDGGLGALLPMAEKTYRRLLMLQNTLMTHMPFPA 934

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
            GLNP++FR    N ++      +I+D ELL  +  L + E+ E+A + GT+   I  +L 
Sbjct: 935  GLNPKAFRMLKHNHRSLINACRNILDGELLWKFLHLSVVERSELARKIGTSPETITEDLM 994

Query: 1423 DL 1424
            D+
Sbjct: 995  DI 996


>gi|215701517|dbj|BAG92941.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 265

 Score =  323 bits (828), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 161/246 (65%), Positives = 191/246 (77%), Gaps = 1/246 (0%)

Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
           GYIEPV+VILHE+E TWAGR+  KHHTCMISA SIS TLKQHP+IWSA NLPHDAY+LLA
Sbjct: 19  GYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLKQHPVIWSAANLPHDAYQLLA 78

Query: 314 VPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           VP PI GVLV+ AN+IHYHSQS SC+L LNN++   D S E+ +S+F VELDAA ATWL 
Sbjct: 79  VPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQVELDAAKATWLS 138

Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
           ND+ + STK G+++LLTVVYDGRVVQRLDL K+  SVL+S +T+IGNS FFLGSRLGDSL
Sbjct: 139 NDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNSFFFLGSRLGDSL 198

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASN 492
           LVQF+  +  S+L     E   DIE D P +KRL+R  SD LQD+ + EELS     A N
Sbjct: 199 LVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSVEELSFQNIIAPN 258

Query: 493 NTESAQ 498
           + ESAQ
Sbjct: 259 SLESAQ 264


>gi|296414526|ref|XP_002836950.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295632796|emb|CAZ81141.1| unnamed protein product [Tuber melanosporum]
          Length = 1468

 Score =  317 bits (813), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 348/1472 (23%), Positives = 594/1472 (40%), Gaps = 269/1472 (18%)

Query: 57   NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVL----------------MDGI 96
            N++V   ++++I+     E        ++K  G+  RR+L                    
Sbjct: 29   NVLVAKTSLLQIFTTTTYETELNSALADAKQPGDIDRRILDADEEQTFAADIALQRSQVE 88

Query: 97   SAASLELVCHYRLHGNV---ESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
            S   L LV  Y L G+V   + + +LS   GG       ++++ +F+DAK S++E+D   
Sbjct: 89   SVTKLVLVAEYPLSGSVTGLQRIKLLSTRSGG-------EAVLASFKDAKCSLMEWDPET 141

Query: 152  HGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGVLVYGLQMII 203
            + +   S+H +E          RE F   P+V          DP  RC  +   G  + I
Sbjct: 142  NSITTISLHYYE----------REEFC-SPVVSDGLPTELVADPGSRCAALRFSGDMLAI 190

Query: 204  LKASQGG-----------------------------SGLVGDEDTFGSGG---------G 225
            +   Q                               +G    EDT    G         G
Sbjct: 191  IPFRQREDEELSLGRGDADEVMGDEDGDNDDWDPEMAGTARGEDTIMGEGDVKTTDATEG 250

Query: 226  FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
                   S V+++  LD  + HV    F+H Y EP   IL+    TW G ++ +     I
Sbjct: 251  KDRPYHPSFVLSVSQLDDAISHVISLTFLHEYREPTFGILYSPRRTWTGLLAAEGRKDTI 310

Query: 284  SALSISTTLKQHP--LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
            S + I+  L+Q     I S   LP+D +K++ +  P GG L+VG N  IH      +  +
Sbjct: 311  SYIVITLDLEQKASTPILSVSGLPYDIFKVVPLAPPTGGSLLVGGNELIHVDQAGKTTGV 370

Query: 341  ALNNYAVSLDSSQELP-RSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
            A+N +         L  +S   +EL+ +    L+++    LL TK G+ V++    DGR 
Sbjct: 371  AVNPFCRRSTGFAGLADQSDLCLELEGSQVVELESEGGDMLLFTKRGEGVIVGFRMDGRN 430

Query: 398  VQRLDLSKTNP---SVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            V  + ++K N    S++   ++T   +G    F+G   GD+ ++++           G+K
Sbjct: 431  VSGVKITKLNNHPGSIVGGRVSTAVGLGGRRLFVGCIEGDARVLKWRRKGERKKAGEGIK 490

Query: 452  EE----------FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
            EE          +G +E            SS     + NG          N+   +Q  +
Sbjct: 491  EEVLENEDEDDVYGALEDMDDDLYGGGGDSSFRKDSLTNGRR--------NSEAKSQGEY 542

Query: 502  SFAVRDSLVNIGPLKDFSYG------------------LRINADASATGISKQSNYELV- 542
             F   D L N+GP +D + G                  L +   +  +  S+ S   ++ 
Sbjct: 543  IFQTHDRLTNLGPFRDITLGKPTFPEESRERQKGVSPELELVTTSGPSNTSEDSGISIIR 602

Query: 543  -----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE-----YHAYLIISLEAR 586
                       + P C+ +WTV  +S+   NA        DD      +  +L ++    
Sbjct: 603  KSISPTIVGRFDFPQCQALWTVRARSANTSNAAVGLGGEEDDRSVEESFDRFLFVTKNDE 662

Query: 587  TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
            + V    D   EV    D+  +G TI  G +    R++QV     R+ D       +   
Sbjct: 663  SQVFRVGDTFEEV-RGTDFESEGETIEVGVVGNGMRIVQVVSEQVRVYDCDLQLSQII-- 719

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
            P   E  +G E   V    + DPY+LL   DGS  +   D +   ++ +   AI+  K  
Sbjct: 720  PMFDEE-TGEEGPNVHRARVCDPYILLIKVDGSPAVYKMDSTNLELAEERADAIKFDKYQ 778

Query: 707  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ-GDIYSVVCYESGALEI 765
             S C     KG                 +   +D     P++   D    +    G L+I
Sbjct: 779  -SGCIYASTKG-----------------IFIPLDA----PVENVKDYLLFLLTVEGGLQI 816

Query: 766  FDVPN-FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKV 824
            +D+ N    +F+ + F        +T       D+ T   ++ E+      K+ I  + V
Sbjct: 817  YDLSNPVTPLFSAESF--------NTLYPLLRTDNPTSPTANREK---HRSKQLIIEILV 865

Query: 825  VELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS--KSDDPVSTSRSLSVSNV 882
             ++      +    P+L A  ++  +  Y+ ++   P      KS +P   S  LS+S  
Sbjct: 866  ADMG----DSIFKEPYLIARSSNNDLTFYKPFISSSPSTLRFIKSPNPHIASNELSLSAG 921

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRE 941
            + +  R L                        T   N++G+   FL G+ P + +   + 
Sbjct: 922  TKNIFRPL------------------------TAVYNLAGYSAVFLPGADPSFVIKTAKS 957

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
              R+H +L    + + +  H+   + GF+YV S GI+++  +P+  T+D  W  +K+   
Sbjct: 958  SPRIH-KLAGTGVRSLSSFHSAGADRGFVYVDSLGIVRVALMPAEFTFDGNWGYKKVTPG 1016

Query: 1002 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE 1061
                 + YF   N+Y  ++S    +P +      + +E G      N++  D      ++
Sbjct: 1017 EHVQSLAYFPPMNVY--VISTSKRQPFD------LAEEDG------NIAKDDTTLQPEID 1062

Query: 1062 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQG 1120
               +++L P      W          +E AL V+ ++L  +  TKE + L+++GTA  +G
Sbjct: 1063 SGTLKLLSPQT----WTAVDEYKFAHNEIALVVKTISLEVSEHTKERKQLVSVGTAIFRG 1118

Query: 1121 EDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKI 1175
            ED +ARG + +F        P    T      V  +E+KG +SA+  + G+LL A G KI
Sbjct: 1119 EDHSARGGIYVFEVIEVVPEPNRPETNRKLKLVTREEVKGTVSAICGVNGYLLAAQGQKI 1178

Query: 1176 ILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
            ++      + L  +AF D   LYV     +   IL GD  KS++F  + E+  ++ L  K
Sbjct: 1179 MVRGLKEDQSLLPVAFLDMC-LYVSVAKNLDGMILFGDFMKSVWFAGFSEEPYKMTLFGK 1237

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
            D   L+  + EFL DG+ L  VV D + NI    Y P+  +S  GQ+L+ RA+F  G  +
Sbjct: 1238 DTQKLEIISAEFLPDGNQLYFVVVDAESNIHTLQYDPEHPKSLAGQRLIRRADFFSGHEI 1297

Query: 1295 TKFLRLQM----LATSSDRTGAAPGSDKT------------NRFALLFGTLDGSIGCIAP 1338
            +    L      L+ SS+    A  +D +              + +L GT  GS+  I  
Sbjct: 1298 STLTMLPFSPYSLSASSNSHLPADATDTSPLHHHHQNQQQQQEYFVLAGTQTGSLAMIRT 1357

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
            + E  +RRL  +Q ++V+   HVAGLNPR +R
Sbjct: 1358 IPETAYRRLNIVQGQIVNGEEHVAGLNPREYR 1389


>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
 gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
            [Aspergillus oryzae 3.042]
          Length = 1389

 Score =  314 bits (805), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 338/1393 (24%), Positives = 592/1393 (42%), Gaps = 196/1393 (14%)

Query: 131  DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
            ++I+LAF +AK++++E+D   +G+   S+H +E  +        +  + G ++ VDP  R
Sbjct: 88   EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147

Query: 191  CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
            C  V  +G++ + IL   Q G  LV D+      +  GS G                 A 
Sbjct: 148  CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206

Query: 230  IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              SS V+ L  LD  + H     F++ Y EP   IL+ +  T    +  +      +  +
Sbjct: 207  YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
            +    +    + S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++
Sbjct: 267  LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326

Query: 347  VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
              + S     +S  ++ L+      L   N   LL   TG++VL+    DGR V  + + 
Sbjct: 327  RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386

Query: 405  KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
               P           S    +G+   FLGS   DS+L+      G S+ SSG K+   + 
Sbjct: 387  PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440

Query: 455  GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
               E D+       +S  D  +D +     E+ + G   +        ++F   D L+NI
Sbjct: 441  RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500

Query: 513  GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
            GPLKD ++G    +          S  ELV                            +L
Sbjct: 501  GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560

Query: 545  PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
              C  +WT    +S  H  ++   +   A + E H Y+++S +A +   E +++     +
Sbjct: 561  ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613

Query: 602  SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
             +  F        +  TI  G L G+ RV+Q+     R  DG     DL      P   E
Sbjct: 614  ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
              S  E    +S S+ DPYV +   D ++ LL  D S     V+    I +SK   +SC 
Sbjct: 669  DTS--EERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724

Query: 712  LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
            LY DK                TG+  +I  A    L Q  +   +  +   L I+ +P+ 
Sbjct: 725  LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767

Query: 772  NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
              +      + G   +      E  K S T              +E +  + V +L    
Sbjct: 768  KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808

Query: 832  WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
            WS   S P+L        +  Y+ ++      T    +P +    L  +N+   R+    
Sbjct: 809  WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857

Query: 892  FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
             +    D  + EE     P   + I  NISG    F  G  P + +           L  
Sbjct: 858  -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFA 1011
            G   + +      C  GFI + S+G++ +CQ+P G   D  W +Q+IP+      + Y +
Sbjct: 914  GYAQSLSKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQIPIGEQVDHLAYSS 973

Query: 1012 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1071
               +Y +  S            L  D E+  +  N   S         V+   ++++ P 
Sbjct: 974  SSGMYVIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRSSLKVVSPK 1022

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1130
                 W    +  +  +E+ + V+ ++L  +  T E + ++ +GTA+ +GED+A+RG V 
Sbjct: 1023 T----WTVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCVY 1078

Query: 1131 LFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPKIILH--KWT 1181
            +F   +   +P+    +     V  + +KGA++AL+ +  QG L++A G K I+   K  
Sbjct: 1079 VFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKED 1138

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
            G+ L  +AF D        +++VK        ++ D  K ++F  + E+  +++L AKD 
Sbjct: 1139 GSLLP-VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDL 1193

Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
              L+  A +FL DG+ L ++V+D   N+ +  Y P+  +S  G +LLSR++FH G  ++ 
Sbjct: 1194 DYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFIST 1253

Query: 1297 FLRLQMLATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
               L   + SS++      A     K  R  +L  + +GS+G +  + E ++RRL +LQ 
Sbjct: 1254 LTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQS 1313

Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
            +L +++ H  GLNPR+FR   S+G A R     ++D +LL  +  +  + ++EIA + G 
Sbjct: 1314 QLTNTIEHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRKVEIASRVGA 1369

Query: 1413 TRSQILSNLNDLA 1425
               +I ++   ++
Sbjct: 1370 NEWEIKADFEAIS 1382


>gi|336388105|gb|EGO29249.1| hypothetical protein SERLADRAFT_445076 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1424

 Score =  313 bits (801), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 344/1466 (23%), Positives = 619/1466 (42%), Gaps = 186/1466 (12%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
            N+VV  +NV+ I+ VR +E     ++   E  RR             V MDG        
Sbjct: 47   NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105

Query: 96   ----------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKI 142
                       + +    V  + LHG V   E++ I+S     N    D ++++F+DAKI
Sbjct: 106  GTVKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFKDAKI 161

Query: 143  SVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQM 201
            ++LE+ D IH L   S+H +E +P+ + L    +S      ++VDP  RC  + +    +
Sbjct: 162  ALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLPKDAI 217

Query: 202  IILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIE 257
             IL   Q  + L V ++D              S +++L    D +++HV DF+F+ G+  
Sbjct: 218  AILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLPGFNN 274

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            P + +L + E TW+GR+     T  +   ++      +P+I +   LP D   L+   + 
Sbjct: 275  PTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVPCVAS 334

Query: 318  IGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAAHATW 371
            +GGV+++ +NTI Y    S   AL +N ++ S  S   +P      +S ++ L+  HA  
Sbjct: 335  LGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGCHAVL 393

Query: 372  LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLG 430
            + +    +  K G +  + +V DG+ V +L ++     + + S +  I     FLGS +G
Sbjct: 394  VDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLGSIVG 453

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
             S+L++             L      ++A           +  ++  + +         +
Sbjct: 454  ASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH---------S 504

Query: 491  SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG------------ 532
            +N     +     ++ DSL   GP+ D ++ L    D       +ATG            
Sbjct: 505  NNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFTLFQR 564

Query: 533  -ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM-- 588
             +  ++  +L  + G +GIW+    +  R +     R     +  +  +IIS +A     
Sbjct: 565  DLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDANPSPG 624

Query: 589  VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLSFGP 647
            V   A   ++   ++   + G TI AG+ F R  ++ V     R+L+ G  + +DL    
Sbjct: 625  VSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDLD--- 681

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIESSKK 705
                         + + SI DP+VL+   D +I L +G+     +  +  +P   +SS+ 
Sbjct: 682  ------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDKSSRY 735

Query: 706  PV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
                    +SC         P       D  +++ +   ++       +    + ++   
Sbjct: 736  LAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLILVRP 789

Query: 760  SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
             G +EI+ +P     F+          + D+Y   AL   +              RK N 
Sbjct: 790  QGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPRKSN- 837

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
              + V ++ +         P+L   L  G I+ Y+A     P +            S+  
Sbjct: 838  -DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------SIPP 884

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLS 929
            S VS  +++ ++ +    +    EET      ++  I +            S   G F +
Sbjct: 885  SRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSGVFFT 944

Query: 930  GSRPCWCMVFRER-LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
            G RP W +   +  +R++       + +FT          F+  + +G   +  +P    
Sbjct: 945  GDRPSWIVATNKGGIRIYSS-GHHIVHSFTSCSLWESKGDFLVYSDEGPSLLEWMPD-LC 1002

Query: 989  YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
             D+  P + IP       + Y     L   IV+   ++      +     E G+ I    
Sbjct: 1003 LDSVLPSRNIPRSRAYANVVYDPSAML---IVAASSMQ-----ANFASFDEDGNIIWEPE 1054

Query: 1049 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-N 1107
             S+V L +    +   + ++ P+     W T        +E    +  VTL   +T+  +
Sbjct: 1055 ASNVSLPK---CDCSTLELIAPEA----WITMDGYEFAPNEYVNALECVTLETLSTETGS 1107

Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNA-DNPQNL-----VTEVYSKELKGAISALA 1161
            +  +A+GT+  +GED+A +G   LF       D  QNL     +  +   + KG ++AL 
Sbjct: 1108 KDFIAVGTSIDRGEDLAVKGATYLFEIVEVVPDYSQNLKRWYKLKLLARDDAKGPVTALC 1167

Query: 1162 SLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
             + G+L+ + G KI +  +   E L G+AF D   +YV SL +VKNF+L+GD  KSI+F+
Sbjct: 1168 GINGYLVSSMGQKIFIRAFDMDERLVGVAFLDVG-VYVTSLRVVKNFLLIGDAVKSIWFV 1226

Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
            +++E   +L +LAKD         +F     TLS+V  D    ++++ Y P   ES  GQ
Sbjct: 1227 AFQEDPYKLVVLAKDVHRTHVTNADFFFTDDTLSIVTEDGDGILRMYAYDPDDPESKNGQ 1286

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
             LL R EFH  +     L   ++A  +      P +   + F+      DGS+  + P+D
Sbjct: 1287 HLLCRTEFHNHSECRSSL---VIARRTKEESVLPQAKILSAFS------DGSLSSLTPVD 1337

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            + +F+RLQ LQ +L  ++ HVAGLNPR++R    N    +P    I+D +LLS +E LP+
Sbjct: 1338 DASFKRLQLLQGQLTRNIQHVAGLNPRAYRIVR-NDFVSKPLSKDILDGQLLSAFESLPI 1396

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLAL 1426
              Q E+  Q GT R+ +L +  +LA+
Sbjct: 1397 SRQNEMTKQIGTERNIVLHDWMELAI 1422


>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
            [Macaca mulatta]
          Length = 1436

 Score =  312 bits (799), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 184/525 (35%), Positives = 278/525 (52%), Gaps = 37/525 (7%)

Query: 913  RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
            R   F++I G+ G F+ G  P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y
Sbjct: 923  RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLY 982

Query: 972  VTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLN 1029
               QG L+I  LP+  +YD  WPV+KIPL+ T H + Y  E  +Y +  S   P  +   
Sbjct: 983  FNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR--- 1039

Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQS 1087
                  I +  G + +   +   + +     E + ++++ P      W+    A I +Q 
Sbjct: 1040 ------IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQE 1089

Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1146
             E+   ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +T
Sbjct: 1090 WEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLT 1149

Query: 1147 E-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1201
            +     +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   LY+  +
Sbjct: 1150 KNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQM 1208

Query: 1202 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1261
              VKNFIL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  
Sbjct: 1209 ISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRD 1268

Query: 1262 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-- 1319
            +N+ ++ Y P+  ES+ G +LL RA+FHVGAHV  F R       +   GA  G  K   
Sbjct: 1269 RNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSV 1321

Query: 1320 ---NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
               N+    F TLDG IG + P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + 
Sbjct: 1322 VWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDR 1381

Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +  +    +++D ELL+ Y  L   E+ E+A + GTT   IL +L
Sbjct: 1382 RTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1426



 Score =  286 bits (732), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 218/704 (30%), Positives = 344/704 (48%), Gaps = 116/704 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVR---------------- 506
           T           QD V+  E+ +YGS A + T+ A  T+SF VR                
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVRLRQQGPHPSQCPQRPL 482

Query: 507 --------DSLVNIGPLKDFSYG--LRINADASATGISKQSNYELV-------------- 542
                   DS++NIGP  + + G    ++ +      S + + E+V              
Sbjct: 483 TFAVPQVCDSILNIGPCANAAMGEPAFLSEEVPRVVNSPEPDLEIVVCSGHGKNGALSVL 542

Query: 543 ------------ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYLI 580
                       ELPGC  +WTV          +    G   ++    A DD   H +LI
Sbjct: 543 QKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLI 602

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
           +S E  TM   T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G    
Sbjct: 603 LSREDSTM---TGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---V 655

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
             L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 656 NQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 692


>gi|336375160|gb|EGO03496.1| hypothetical protein SERLA73DRAFT_165174 [Serpula lacrymans var.
            lacrymans S7.3]
          Length = 1428

 Score =  311 bits (797), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 344/1470 (23%), Positives = 619/1470 (42%), Gaps = 190/1470 (12%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
            N+VV  +NV+ I+ VR +E     ++   E  RR             V MDG        
Sbjct: 47   NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105

Query: 96   --------------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFE 138
                           + +    V  + LHG V   E++ I+S     N    D ++++F+
Sbjct: 106  GTVKSTGKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFK 161

Query: 139  DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
            DAKI++LE+ D IH L   S+H +E +P+ + L    +S      ++VDP  RC  + + 
Sbjct: 162  DAKIALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLP 217

Query: 198  GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
               + IL   Q  + L V ++D              S +++L    D +++HV DF+F+ 
Sbjct: 218  KDAIAILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLP 274

Query: 254  GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
            G+  P + +L + E TW+GR+     T  +   ++      +P+I +   LP D   L+ 
Sbjct: 275  GFNNPTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVP 334

Query: 314  VPSPIGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAA 367
              + +GGV+++ +NTI Y    S   AL +N ++ S  S   +P      +S ++ L+  
Sbjct: 335  CVASLGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGC 393

Query: 368  HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLG 426
            HA  + +    +  K G +  + +V DG+ V +L ++     + + S +  I     FLG
Sbjct: 394  HAVLVDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLG 453

Query: 427  SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
            S +G S+L++             L      ++A           +  ++  + +      
Sbjct: 454  SIVGASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH------ 507

Query: 487  YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------- 532
               ++N     +     ++ DSL   GP+ D ++ L    D       +ATG        
Sbjct: 508  ---SNNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFT 564

Query: 533  -----ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR 586
                 +  ++  +L  + G +GIW+    +  R +     R     +  +  +IIS +A 
Sbjct: 565  LFQRDLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDAN 624

Query: 587  TM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDL 643
                V   A   ++   ++   + G TI AG+ F R  ++ V     R+L+ G  + +DL
Sbjct: 625  PSPGVSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDL 684

Query: 644  SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIE 701
                             + + SI DP+VL+   D +I L +G+     +  +  +P   +
Sbjct: 685  D---------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDK 735

Query: 702  SSKKPV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSV 755
            SS+         +SC         P       D  +++ +   ++       +    + +
Sbjct: 736  SSRYLAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLI 789

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
            +    G +EI+ +P     F+          + D+Y   AL   +              R
Sbjct: 790  LVRPQGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPR 838

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
            K N   + V ++ +         P+L   L  G I+ Y+A     P +            
Sbjct: 839  KSN--DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------ 884

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQG 925
            S+  S VS  +++ ++ +    +    EET      ++  I +            S   G
Sbjct: 885  SIPPSRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSG 944

Query: 926  FFLSGSRPCWCMVFRER-LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
             F +G RP W +   +  +R++       + +FT          F+  + +G   +  +P
Sbjct: 945  VFFTGDRPSWIVATNKGGIRIYSS-GHHIVHSFTSCSLWESKGDFLVYSDEGPSLLEWMP 1003

Query: 985  SGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI 1044
                 D+  P + IP       + Y     L   IV+   ++      +     E G+ I
Sbjct: 1004 D-LCLDSVLPSRNIPRSRAYANVVYDPSAML---IVAASSMQ-----ANFASFDEDGNII 1054

Query: 1045 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1104
                 S+V L +    +   + ++ P+     W T        +E    +  VTL   +T
Sbjct: 1055 WEPEASNVSLPK---CDCSTLELIAPEA----WITMDGYEFAPNEYVNALECVTLETLST 1107

Query: 1105 KE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNA-DNPQNL-----VTEVYSKELKGAI 1157
            +  ++  +A+GT+  +GED+A +G   LF       D  QNL     +  +   + KG +
Sbjct: 1108 ETGSKDFIAVGTSIDRGEDLAVKGATYLFEIVEVVPDYSQNLKRWYKLKLLARDDAKGPV 1167

Query: 1158 SALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
            +AL  + G+L+ + G KI +  +   E L G+AF D   +YV SL +VKNF+L+GD  KS
Sbjct: 1168 TALCGINGYLVSSMGQKIFIRAFDMDERLVGVAFLDVG-VYVTSLRVVKNFLLIGDAVKS 1226

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            I+F++++E   +L +LAKD         +F     TLS+V  D    ++++ Y P   ES
Sbjct: 1227 IWFVAFQEDPYKLVVLAKDVHRTHVTNADFFFTDDTLSIVTEDGDGILRMYAYDPDDPES 1286

Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
              GQ LL R EFH  +     L   ++A  +      P +   + F+      DGS+  +
Sbjct: 1287 KNGQHLLCRTEFHNHSECRSSL---VIARRTKEESVLPQAKILSAFS------DGSLSSL 1337

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
             P+D+ +F+RLQ LQ +L  ++ HVAGLNPR++R    N    +P    I+D +LLS +E
Sbjct: 1338 TPVDDASFKRLQLLQGQLTRNIQHVAGLNPRAYRIVR-NDFVSKPLSKDILDGQLLSAFE 1396

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
             LP+  Q E+  Q GT R+ +L +  +LA+
Sbjct: 1397 SLPISRQNEMTKQIGTERNIVLHDWMELAI 1426


>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Strongylocentrotus purpuratus]
          Length = 1223

 Score =  309 bits (792), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 209/688 (30%), Positives = 338/688 (49%), Gaps = 53/688 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V C E+G LE++ +P+    F V  F  G   +VD               S S   TG
Sbjct: 560  WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 604

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
               +++    +V E+ +        + ++ A++ D  I+ Y+A+    P NT   +  + 
Sbjct: 605  DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 659

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPL-DAYTREETPHGAP---------CQRITIFKNISG 922
              R   + +    + +  R S+ P  +  T+ ET   A            R+  F N+  
Sbjct: 660  V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKPETETEAESDTKTTSRRVNRLREFHNVQT 718

Query: 923  HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G F+SGS P W  V  R  LR HP   DG+I  F   HNVNC +GF+Y   +  L+IC
Sbjct: 719  YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 778

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
             LPS  +YD  WPV+K+PL+ TPH + Y  E   Y ++ SV       Q     + +  G
Sbjct: 779  VLPSHLSYDAPWPVRKVPLRCTPHFVAYHVETKTYAVVTSV-------QETKTHVWKVTG 831

Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-F 1100
             +I    +   D     T   + +++  P        TR  I  +++EN   ++VV L  
Sbjct: 832  EEIGEEPVERDDRFVPTTKVVFSIQLFSPVSWDAIPNTR--IEYEAAENVTCLKVVNLSC 889

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKG 1155
              T    +  + + T +V  ED+  RG V ++        P     +N +  +Y K  KG
Sbjct: 890  EGTMTGKKGYVVVATTHVYSEDLQTRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKG 949

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
             +SAL  + G LL   G K+ + ++   +L G+AF D   +Y+ +   VK FIL+ D+ K
Sbjct: 950  PVSALCEVMGFLLTCIGQKVYMWQFKDNDLIGLAFIDT-QIYIHNAVSVKQFILITDVMK 1008

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
              YFL ++ Q   L+L+++D   L+ F  EF++D   ++ +VSD  KN+ +F+Y P+  E
Sbjct: 1009 GAYFLQYQAQDRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPE 1068

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLAT--SSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
            S  G  LL R + ++G+ V  F+R++   T  S+++  + P      R  + F TLDGS+
Sbjct: 1069 SHGGAYLLRRGDMNIGSAVNTFVRVRCRLTDPSTEQVLSGP---VLRRQVVFFATLDGSL 1125

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
            G + P+ E T+RRL  LQ  L + +PHV GLNP+S+R   S+ +       +I+D +LL 
Sbjct: 1126 GLLLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLL 1185

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
             Y  L + E+ E A + GT+  QI+S+L
Sbjct: 1186 KYCHLSVVERNEFAKKIGTSVDQIISDL 1213



 Score =  182 bits (461), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 141/456 (30%), Positives = 221/456 (48%), Gaps = 60/456 (13%)

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           RV+ +  TC I ALS++   K HP+IWS  +LP+D  ++ AVP PIGGVL++  N++ Y 
Sbjct: 11  RVAVRQDTCSIVALSLNMAQKVHPIIWSQSSLPYDCMQVQAVPKPIGGVLILAVNSLLYL 70

Query: 333 SQSASCALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGD 385
           +QS      +  Y VSL+S    S   P   +    + +D   AT++  D   LS K G+
Sbjct: 71  NQS------IPPYGVSLNSLTDWSTAFPLKTQEGVKLSMDCTQATFISYDRLALSLKDGE 124

Query: 386 LVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
           + +LT++ DG R V+   L K   SVLT+ I  +G+   FLGSRLG+SLL+++T     +
Sbjct: 125 IYVLTLLVDGMRSVRGFHLDKAAASVLTTCICPMGDGFLFLGSRLGNSLLLKYTEKVSET 184

Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----MVNGEELSLYGSASNNTESAQKT 500
             +   K E      + P+ K     +SD +      + + +EL +YG     T +   +
Sbjct: 185 SPTDASKTEEPKPGEEPPTKKMRSDDASDWMASDTKFLDDPDELEVYGKQVQKTGTQLTS 244

Query: 501 FSFAVRDSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELVE---- 543
           +SF + DSL+NIGP  +   G         + N D      + +G  K     +++    
Sbjct: 245 YSFEICDSLLNIGPCGNMIMGEPAFLSEEFQGNVDPDLELVTTSGYGKNGALSVLQRTIR 304

Query: 544 --------LPGCKGIWTVYHKSSRGHNAD----SSRMAAYDDEYHAYLIISLEARTMVLE 591
                   LPGC  +WTV  KS +   AD     S  +  D + HA+LI+S +  +MVL+
Sbjct: 305 PQVVTTFNLPGCLDMWTV--KSLKEAKADEKSEESEASPEDKDRHAFLILSKQDSSMVLQ 362

Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
           T   +TEV     +  Q  TI A N+   R ++QV  +   +++G    Q +        
Sbjct: 363 TGQEITEVAAG-GFSTQAPTIFASNMGDDRYIVQVMNKSICLMEGVEQIQHMVL------ 415

Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
                  S +   S+ADPY+LL   +G   L+   P
Sbjct: 416 ----DVGSPIKQCSLADPYLLLLTENGDPILMTLKP 447


>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
           norvegicus]
 gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
           isoform CRA_a [Rattus norvegicus]
          Length = 1386

 Score =  309 bits (791), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 219/669 (32%), Positives = 345/669 (51%), Gaps = 75/669 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429

Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
            +    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G       
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485

Query: 522 -------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH---------- 555
                  L I        + + + + K    ++V   ELPGC  +WTV            
Sbjct: 486 ENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 545

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
           K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ AG
Sbjct: 546 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAG 604

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           N+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++  
Sbjct: 605 NIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMS 654

Query: 676 SDGSIRLLV 684
           ++G + + +
Sbjct: 655 AEGHVTMFL 663



 Score =  285 bits (730), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 202/678 (29%), Positives = 317/678 (46%), Gaps = 88/678 (12%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 778  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 833

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 834  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 883

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 884  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 943

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 944  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1003

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1004 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1056

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T     
Sbjct: 1057 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1107

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
                    V G                                LKG ++A   L QG  +
Sbjct: 1108 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1127

Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
               G +I L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++E+   
Sbjct: 1128 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1185

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL RA+F
Sbjct: 1186 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1245

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
            HVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P+ E T
Sbjct: 1246 HVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKT 1298

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            +RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L   E+
Sbjct: 1299 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1358

Query: 1404 LEIAHQTGTTRSQILSNL 1421
             E+A + GTT   IL +L
Sbjct: 1359 SELAKKIGTTPDIILDDL 1376


>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
           musculus]
          Length = 1388

 Score =  308 bits (789), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 219/671 (32%), Positives = 344/671 (51%), Gaps = 77/671 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
                    L I        + + + + K    ++V   ELPGC  +WTV          
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545

Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
             K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ 
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654

Query: 674 GMSDGSIRLLV 684
             ++G + + +
Sbjct: 655 MSAEGHVTMFL 665



 Score =  286 bits (732), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 202/678 (29%), Positives = 317/678 (46%), Gaps = 88/678 (12%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 780  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 836  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 886  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 945

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 946  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1006 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1058

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T     
Sbjct: 1059 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1109

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
                    V G                                LKG ++A   L QG  +
Sbjct: 1110 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1129

Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
               G +I L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++E+   
Sbjct: 1130 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1187

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL RA+F
Sbjct: 1188 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1247

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
            HVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P+ E T
Sbjct: 1248 HVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKT 1300

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            +RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L   E+
Sbjct: 1301 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1360

Query: 1404 LEIAHQTGTTRSQILSNL 1421
             E+A + GTT   IL +L
Sbjct: 1361 SELAKKIGTTPDIILDDL 1378


>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Nasonia vitripennis]
          Length = 1415

 Score =  308 bits (788), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 207/688 (30%), Positives = 348/688 (50%), Gaps = 61/688 (8%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
            V  ++G LE++ +P     + +  F  G+  + D+            +  ++ +G+ Q  
Sbjct: 773  VYRDNGTLEVYSLPELRLSYLIKNFGFGQNILHDS------------MEFTTIQGSQQNE 820

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
              N   ++V E+A+     H +RP L   L D  +  YQ Y +  P+   K         
Sbjct: 821  PVN-PEVQVREIAVVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGHLK--------- 867

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSR 932
             L    +  + +  + FSR       +EE        R+ +   F NI+G+ G F+ G  
Sbjct: 868  -LRFKKIDHNFI--VGFSRIG----PKEEDMPSMNDTRLCMMRYFSNIAGYNGVFIGGDY 920

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W  +  R  LR HP   DG + +F   +NVNC  GF+Y   +  L+IC LP+  +YD 
Sbjct: 921  PHWIFLTGRGELRAHPMNIDGPVKSFAPFNNVNCPQGFLYFNRKDELRICVLPTHLSYDA 980

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLS 1050
             WPV+K+PL+ TPH +TY  E   Y ++ S    +PL         D+E   +  N    
Sbjct: 981  PWPVRKVPLRCTPHFVTYHLESKTYCVVTSTA--EPLKSYYRFNGEDKEFTEEERNERF- 1037

Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1107
               L+ T   E++ + +  P      W T     I +   E+   ++ V+L    T+   
Sbjct: 1038 ---LYPTQ--EQFSIVLFSPVS----WDTIPNTKIDLDQWEHVTCLKNVSLAYEGTRSGL 1088

Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1162
            +  + IGT Y  GED+ +RGR+ +F        P     +N   ++Y+KE KG ++A+  
Sbjct: 1089 KGYIVIGTNYNYGEDITSRGRIFIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPVTAITQ 1148

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            + G L+ A G KI + +    +L G+AF D   +YV  +  +K+ IL+ D++KS+  L +
Sbjct: 1149 VSGFLVSAIGQKIYIWQLKDNDLVGVAFIDTQ-IYVCQMLSIKSLILVADVYKSVSLLRF 1207

Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
            + +   L+L+++DF + + +A E+ I  + L  +V+D + NI IF Y P+ S+S  GQKL
Sbjct: 1208 QPEYKTLSLVSRDFRTTEIYAIEYFIQNNELGFIVADGESNISIFSYQPESSQSLGGQKL 1267

Query: 1283 LSRAEFHVGAHVTKFLRLQMLAT-SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
            + +A+ H+G  +  F R++   T S++ T    G+DK  R   ++ TLDGS+G I P+ E
Sbjct: 1268 IRKADIHLGQKINTFFRIKCKTTDSANPTKQFSGADK--RHVTMYATLDGSLGYILPVPE 1325

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
             T+RRL  LQ  LV  + H+AGLNP++FR + S  +        I+D +L+  Y  LP+ 
Sbjct: 1326 KTYRRLLMLQNVLVSHIYHIAGLNPKAFRTYKSCVRMQGNPARGIIDGDLVRKYLDLPVN 1385

Query: 1402 EQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            E++EIA + GT   +I+ +++++   TS
Sbjct: 1386 EKIEIAKKIGTGAQEIMDDMHEIYKQTS 1413



 Score =  295 bits (754), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 210/667 (31%), Positives = 334/667 (50%), Gaps = 78/667 (11%)

Query: 58  LVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           LVV  AN+I ++ ++   + G KE        +           LE +  Y LHGNV S+
Sbjct: 30  LVVAGANIIRVFRLIPDVDPGKKEKFTESRPPK---------MRLECLAQYTLHGNVMSM 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G+     RDS++L+F +AK+SV+E+D  IH LR  S+H FE  E   +K G  +
Sbjct: 81  QAVQLIGSP----RDSLLLSFREAKLSVVEYDPEIHSLRTVSLHYFEEEE---IKDGWTN 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P+V+VDP+GRC  +L+YG ++++L   +      GD             I SS++I
Sbjct: 134 HHHVPIVRVDPEGRCAVMLIYGRKLVVLPFRKDPILDEGDLIENPKSSSHKTPILSSYMI 193

Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            L+ L+  M ++ D  F+HGY EP ++IL+E   T+AGR++ +  TC + A+S++   K 
Sbjct: 194 VLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFAGRIAVRQDTCAMVAISLNIQQKV 253

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQ 353
           HP+IWS  NLP D Y+ +AV  P+GG L++  N++ Y +QS     ++LN+   +  +  
Sbjct: 254 HPIIWSVSNLPFDCYQAVAVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLTDNCTNFP 313

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
             P+    + L+++   ++  D  ++S KTG+L +L++  D  R V+     K   SVLT
Sbjct: 314 LKPQEGVKISLESSQVAFISPDRLVISLKTGELYVLSLFADSMRSVRGFHFDKAAASVLT 373

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEEFGDIEADAPSTKRLRRS- 470
           S +    ++  FLGSRLG+SLL++FT      +   S L+       +    TK+++   
Sbjct: 374 SCVCLCDDNYLFLGSRLGNSLLLRFTEKESEKINDISMLEMSLNSSNSQEQPTKKIKLDY 433

Query: 471 -----SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG---- 521
                +SD L D+ + EEL +YGS +  T     ++ F V DSL+NIGP  + S G    
Sbjct: 434 LEDWMASDVL-DIKDPEELEVYGSET-QTSIQITSYIFEVCDSLLNIGPCGNISMGEPAF 491

Query: 522 ----LRINAD-----ASATGISKQSNYELVE------------LPGCKGIWTVYHKSSRG 560
                  N++      + +G  K     +++            LPG + IWTV   +   
Sbjct: 492 LSEEFSNNSEPDVELVTTSGYGKNGALCVLQRSIRPQVITTFDLPGYENIWTVIDSTVSD 551

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
           + A +          H +LI++ +  TMVL+T   + EV +   +  QG TI AGNL   
Sbjct: 552 NRAKTETEGT-----HGFLILTQDDSTMVLQTGQEINEVVDQSGFSTQGTTIFAGNLGSN 606

Query: 621 RRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
           R +IQV + G R+L G     +M  DL                 ++  S ADPYV L   
Sbjct: 607 RYIIQVTQMGVRLLQGLEQIQHMPMDLG--------------CPIVHASCADPYVSLLSE 652

Query: 677 DGSIRLL 683
           DG + LL
Sbjct: 653 DGQVVLL 659


>gi|74212803|dbj|BAE33365.1| unnamed protein product [Mus musculus]
          Length = 741

 Score =  308 bits (788), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667


>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
           musculus]
          Length = 1417

 Score =  307 bits (786), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 56  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 108

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 109 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 161

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 162 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 218

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 219 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 278

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 279 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 338

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 339 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 398

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 399 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 456

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 457 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 512

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 513 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 572

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 573 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 631

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 632 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 681

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 682 VIMSAEGHVTMFL 694



 Score =  285 bits (730), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 202/678 (29%), Positives = 317/678 (46%), Gaps = 88/678 (12%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 809  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 864

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 865  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 914

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 915  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 974

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 975  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1034

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1035 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1087

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T     
Sbjct: 1088 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1138

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
                    V G                                LKG ++A   L QG  +
Sbjct: 1139 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1158

Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
               G +I L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++E+   
Sbjct: 1159 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1216

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL RA+F
Sbjct: 1217 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1276

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
            HVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P+ E T
Sbjct: 1277 HVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKT 1329

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            +RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L   E+
Sbjct: 1330 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1389

Query: 1404 LEIAHQTGTTRSQILSNL 1421
             E+A + GTT   IL +L
Sbjct: 1390 SELAKKIGTTPDIILDDL 1407


>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Pan troglodytes]
          Length = 1296

 Score =  306 bits (783), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNEALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                 +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668



 Score =  305 bits (781), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 195/579 (33%), Positives = 292/579 (50%), Gaps = 41/579 (7%)

Query: 860  GPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA-PCQRITIFK 918
            G E +   DD        S S  S S+    R S+ P D   R+  P  A P     + +
Sbjct: 732  GSETSPTVDDEEEMLYGDSGSLFSPSKEEARRSSQPPAD---RDPAPFRAEPTHWCLLVR 788

Query: 919  NISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
                    F+ G  P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG 
Sbjct: 789  ENGTMXXXFICGPSPPWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGE 848

Query: 978  LKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLL 1035
            L+I  LP+  +YD  WPV+KIPL+ T H + Y  E  +Y +  S   P  +         
Sbjct: 849  LRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR--------- 899

Query: 1036 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALT 1093
            I +  G + +   +   + +     E + ++++ P      W+    A I +Q  E+   
Sbjct: 900  IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTC 955

Query: 1094 VRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE----- 1147
            ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +T+     
Sbjct: 956  MKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKV 1015

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   LY+  +  VKNF
Sbjct: 1016 LYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNF 1074

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
            IL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++
Sbjct: 1075 ILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVY 1134

Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRF 1322
             Y P+  ES+ G +LL RA+FHVGAHV  F R       +   GA  G  K      N+ 
Sbjct: 1135 MYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKH 1187

Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
               F TLDG IG + P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +  
Sbjct: 1188 ITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNA 1247

Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
              +++D ELL+ Y  L   E+ E+A + GTT   IL +L
Sbjct: 1248 VRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1286


>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
            [Harpegnathos saltator]
          Length = 1214

 Score =  306 bits (783), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 204/697 (29%), Positives = 343/697 (49%), Gaps = 76/697 (10%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            +V  +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E      
Sbjct: 570  LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYMLHDSMESTTLQSAPINETLNPE------ 623

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                   ++V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 624  -------LQVREVLMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGHLK-------- 665

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
              L    +    +     SR P      E+ P  A   RI +   F NI+G+ G F+   
Sbjct: 666  --LRFKKLDHGIIPG-HLSRKP----KEEDVPVNANETRICMMRYFSNIAGYNGVFICSD 718

Query: 932  RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             P W  +  R  LR HP   DGS+ +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 719  YPHWIFLTGRGELRTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYD 778

Query: 991  NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
              WPV+K+PL+ TPH +TY  E   Y +I S    +PL                  +  +
Sbjct: 779  APWPVRKVPLRCTPHFVTYHLESKTYCVITSTS--EPLKSY---------------YRFN 821

Query: 1051 SVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVTLF 1100
              D  + +T E+   R L P +        +   W+T     I +   E+   ++ V+L 
Sbjct: 822  GED--KEFTEEDRPERFLYPSQEQFCIVLFSPVSWETIPNTKIELDQWEHVTCLKNVSLA 879

Query: 1101 NTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELK 1154
               T+   +  + +GT Y  GED+ +RGR+L+F        P     +N   ++Y+KE K
Sbjct: 880  YEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQK 939

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            G I+A+  + G L+ A G KI + +    +L GIAF D   +Y+  +  +K+ IL+ D++
Sbjct: 940  GPITAITQVSGFLVTAVGQKIYIWQLKDNDLVGIAFIDTQ-IYIHQMLSIKSLILIADVY 998

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            KSI  L ++E+   L+L+++DF   + +  E+LID + L  +++D + N+ +F Y P+  
Sbjct: 999  KSISLLRFQEKCRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLIADGESNLALFMYQPESR 1058

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT--SSDRTGAAPGSDKTNRFALLFGTLDGS 1332
            ES  GQKL+ +A+FH+G  +  F R++   T  +SD+      SD   +   ++ +LDGS
Sbjct: 1059 ESLGGQKLIRKADFHLGQKINTFFRIKCRVTDVASDKKHF---SDADKKHVTMYASLDGS 1115

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +G + P+ E T+RRL  LQ  LV  + H+AGLNP+++R + S  +        I+D +L+
Sbjct: 1116 LGYVLPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYVRNQGNPARGIIDGDLV 1175

Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
              Y  LP  E+ ++A + GT   +I+ ++ ++   T+
Sbjct: 1176 WRYLSLPNNEKADVAKKIGTRVQEIIEDITEIDRQTA 1212



 Score =  216 bits (549), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 152/475 (32%), Positives = 243/475 (51%), Gaps = 52/475 (10%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
           M +V D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + HP+IWS  
Sbjct: 1   MDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVHPIIWSVS 60

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQELPRSSFS 361
           NLP D Y+ + V  P+GG L++  N++ Y +QS     ++LN+ A +  +    P+    
Sbjct: 61  NLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTNFPLRPQDGVK 120

Query: 362 VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGN 420
           + L+ A   +L  D  ++S K+G+L +L++  D  R V+     K   SVLTS +    +
Sbjct: 121 ISLEGAQVAFLSADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASVLTSCVCMCED 180

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRRS------SSD 473
           +  FLGSRLG+SLL++FT     ++ S    E    D + + P  K+ ++       +SD
Sbjct: 181 NYLFLGSRLGNSLLLRFTEKEPETIKSLDDGEINIEDNDNEEPPAKKAKQDFLGDWMASD 240

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----------R 523
            L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S G            
Sbjct: 241 VL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEPAFLSEEFAHN 298

Query: 524 INAD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
            N D    + +G  K     ++            ELPGC+ +WTV      G   +  ++
Sbjct: 299 QNPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI-----GSLNNDEQV 353

Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
            +  +  HA+LI+S E  TMVL+T   + EV +S  +  QG T+ AGNL   R ++QV +
Sbjct: 354 KSETEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLGANRYIVQVTQ 412

Query: 629 RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
            G R+L G    Q +                 ++  S ADPYV+L   DG + LL
Sbjct: 413 MGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVILLSEDGQVMLL 457


>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
           mulatta]
          Length = 1428

 Score =  305 bits (781), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 219/677 (32%), Positives = 347/677 (51%), Gaps = 86/677 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T           QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                 +    G   ++    A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 543 RKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668



 Score =  265 bits (676), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 168/556 (30%), Positives = 274/556 (49%), Gaps = 35/556 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 783  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 839  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 889  VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 949  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S              I +  G + +   +  
Sbjct: 1009 PWPVRKIPLRCTAHYVAYHVESKVYAVATS-------TNTPCARIPRMTGEEKEFETIER 1061

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T    +
Sbjct: 1062 DERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLK 1117

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1118 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1177

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1178 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1236

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1237 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1296

Query: 1284 SRAEFHVGAHVTKFLR 1299
             RA+FHVGAHV  F R
Sbjct: 1297 RRADFHVGAHVNTFWR 1312


>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Strongylocentrotus purpuratus]
          Length = 1439

 Score =  304 bits (779), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 205/688 (29%), Positives = 333/688 (48%), Gaps = 53/688 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V C E+G LE++ +P+    F V  F  G   +VD               S S   TG
Sbjct: 776  WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 820

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
               +++    +V E+ +        + ++ A++ D  I+ Y+A+    P NT   +  + 
Sbjct: 821  DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 875

Query: 873  TSRSLSVSNVSASRLRNLRFSRTP----------LDAYTREETPHGAPCQRITIFKNISG 922
              R   + +    + +  R S+ P                +         R+  F N+  
Sbjct: 876  V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKTETETEAESDTKTQTRRVNRLREFHNVQT 934

Query: 923  HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G F+SGS P W  V  R  LR HP   DG+I  F   HNVNC +GF+Y   +  L+IC
Sbjct: 935  YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 994

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
             LPS  +YD  WPV+K+PL+ TPH + Y  E   Y ++ SV       Q     + +  G
Sbjct: 995  VLPSHLSYDAPWPVRKVPLRCTPHFVAYHVETKTYAVVTSV-------QETKTHVWKVTG 1047

Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-F 1100
             +I    +   D     T   + +++  P        TR  I  +++EN   ++VV L  
Sbjct: 1048 EEIGEEPVERDDRFVPTTKVVFSIQLFSPVSWDAIPNTR--IEYEAAENVTCLKVVNLSC 1105

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKG 1155
              T    +  + + T +V  ED+  RG V ++        P     +N +  +Y K  KG
Sbjct: 1106 EGTMTGKKGYVVVATTHVYSEDLQTRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKG 1165

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
             +SAL  + G LL   G K+ + ++   +L G+AF D   +Y+ +   VK FIL+ D+ K
Sbjct: 1166 PVSALCEVMGFLLTCIGQKVYMWQFKDNDLIGLAFIDT-QIYIHNAVSVKQFILITDVMK 1224

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
              YFL ++ Q   L+L+++D   L+ F  EF++D   ++ +VSD  KN+ +F+Y P+  E
Sbjct: 1225 GAYFLQYQAQDRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPE 1284

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLAT--SSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
            S  G  LL R + ++G+ V  F+R++   T  S+++  + P      R  + F TLDGS+
Sbjct: 1285 SHGGAYLLRRGDMNIGSAVNTFVRVRCRLTDPSTEQVLSGP---VLRRQVVFFATLDGSL 1341

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
            G + P+ E T+RRL  LQ  L + +PHV GLNP+S+R   S+ +       +I+D +LL 
Sbjct: 1342 GLLLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLL 1401

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
             Y  L + E+ E A + GT+  QI+S+L
Sbjct: 1402 KYCHLSVVERNEFAKKIGTSVDQIISDL 1429



 Score =  286 bits (732), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 219/726 (30%), Positives = 349/726 (48%), Gaps = 105/726 (14%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +A Y+ +H PTG+ +C      H  +                  P ++      NLVV  
Sbjct: 2   YAFYREIHPPTGVEHC---VYCHFFS------------------PDQQ------NLVVAK 34

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
            + + +Y + +  + +K +    + K +          LE    + + G V S+    Q 
Sbjct: 35  GSELTVYSM-ITVDSNKPTDKESKPKNK----------LEEAATFHIFGKVMSM----QS 79

Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
                  RD+++L+F +AK+S++E+D ++H L+  SMH FE  E    K G       P+
Sbjct: 80  AQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDE---TKEGVYRNIFHPV 136

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           VKVDP  RC  +L YG ++++L   +   GLV D D   S       +  S+VI L ++D
Sbjct: 137 VKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEMD 194

Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
             + +V D  F+HGY EP ++IL+E   TWAGRV+ +  TC I ALS++   K HP+IWS
Sbjct: 195 DPICNVLDIQFLHGYYEPTLLILYEPLRTWAGRVAVRQDTCSIVALSLNMAQKVHPIIWS 254

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS----SQELP 356
             +LP+D  ++ AVP PIGGVL++  N++ Y +QS      +  Y VSL+S    S   P
Sbjct: 255 QSSLPYDCMQVQAVPKPIGGVLILAVNSLLYLNQS------IPPYGVSLNSLTDWSTAFP 308

Query: 357 ---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + +D   AT++  D   LS K G++ +LT++ DG R V+   L K   SVLT
Sbjct: 309 LKTQEGVKLSMDCTQATFISYDRLALSLKDGEIYVLTLLVDGMRSVRGFHLDKAAASVLT 368

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           + I  +G+   FLGSRLG+SLL+++T     +  S   K E      + PS K     +S
Sbjct: 369 TCICPMGDGFLFLGSRLGNSLLLKYTEKVSETSPSDASKTEEPKPGEEPPSKKMRSDDAS 428

Query: 473 DALQD----MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
           D +      + + +EL +YG     T +   ++SF + DSL+NIGP  +   G       
Sbjct: 429 DWMASDTKFLDDPDELEVYGKQVQKTGTQLTSYSFEICDSLLNIGPCGNMIMGEPAFLSE 488

Query: 522 -LRINAD-----ASATGISKQSNYELVE------------LPGCKGIWTV--YHKSSRGH 561
             + N D      + +G  K     +++            LPGC  +WTV    K+    
Sbjct: 489 EFQGNVDPDLELVTTSGYGKNGALSVLQRTIRPQVVTTFNLPGCLDMWTVKSLKKAKADE 548

Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
            ++ S  +  D + HA+LI+S +  +MVL+T   +TEV     +  Q  TI A N+   R
Sbjct: 549 KSEESETSPEDKDRHAFLILSKQDSSMVLQTGQEITEVAAG-GFSTQAPTIFASNMGDDR 607

Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
            ++QV  +   +++G    Q +               S +   S+ADPY+LL   +G   
Sbjct: 608 YIVQVMNKSICLMEGVEQIQHMVL----------DVGSPIKQCSLADPYLLLLTENGDPI 657

Query: 682 LLVGDP 687
           L+   P
Sbjct: 658 LMTLKP 663


>gi|389740693|gb|EIM81883.1| hypothetical protein STEHIDRAFT_65512 [Stereum hirsutum FP-91666 SS1]
          Length = 1438

 Score =  304 bits (778), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 369/1498 (24%), Positives = 636/1498 (42%), Gaps = 210/1498 (14%)

Query: 47   PSKRGIGPVPNLVVTAANVIEIYVVRV------------QEEGSKESKNSGETKRRVLMD 94
            P  +   P+ NLVV  +N++ I  VR             +E      K +   +  V MD
Sbjct: 31   PDSQKALPLFNLVVARSNLLRILEVREVPTLRPIHLDDERERRGNVRKGTEPVEGEVEMD 90

Query: 95   ---------GISAAS----------LELVCHYRLHGNV---ESLAILSQGGADNSRRRDS 132
                     G S AS             V  YRLHG V   E++ I+S          D 
Sbjct: 91   EQGEGYVNMGASTASNGAPRPTVLRFYFVRDYRLHGTVTGLETVRIMSS----LEDEMDR 146

Query: 133  IILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRC 191
            ++++F+DAKI++LE+    H L   S+H +E +P+ L L    +S      ++ DP  +C
Sbjct: 147  LLVSFKDAKIALLEWSTDTHSLSTVSIHTYERAPQLLSL----DSNMFTAQLRTDPLSQC 202

Query: 192  GGVLVYGLQMIILKASQGGSGL-VGDED-TFGSGGGFSARIESSHVINLR---DLDMKHV 246
              + +      IL   Q    L V D+D T      +S     S +++L    D  +++V
Sbjct: 203  AALSLPKDAFAILPFYQTQVDLDVMDQDQTRARDVPYSP----SFILDLAAEVDERIRNV 258

Query: 247  KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
             DF+F+ G+  P + +L + + TW GR+     T  +   +++   + +P+I S   LP+
Sbjct: 259  VDFVFLPGFSHPTVAVLFQAQQTWTGRLKEYKDTMRLFIFTLNVVTRSYPIITSVEGLPY 318

Query: 307  DAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS---- 361
            D   ++  P+ +GGV+V+ +N+ IH    S   ALA+N +   +    ++P ++ +    
Sbjct: 319  DCLSVVPCPAALGGVVVLTSNSVIHIDQASRRVALAVNGW---MPRVSDMPVTALAQGDQ 375

Query: 362  --VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSD 414
              +EL+ +  T++ +    +  K G +  +    DG+VV +L +S      T PSV    
Sbjct: 376  GRLELEGSRMTFVDDKTLFIVLKDGTIHPVEFFVDGKVVSKLSISPPLAQTTTPSV---- 431

Query: 415  ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
            I  I N  FF+GS  G S L++           SG++E+  D + +    K    +  D+
Sbjct: 432  IRKITNEHFFVGSTAGPSALLKV----------SGVEEDIQD-DVEEIDGKTAPAAVVDS 480

Query: 475  LQDMVNGEELSLYGSA--------------SNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
            +  M   ++  LYGS+              + +T   +     ++ DSL   GP+ D ++
Sbjct: 481  VDGMDIDDDDDLYGSSKADPTPTANGNAVETTSTTRKRTVIHLSLCDSLPAHGPISDMTF 540

Query: 521  GLRINAD------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGH 561
             +  N D       +ATG      + L +  LP           G +G+W++  + +   
Sbjct: 541  SMTKNGDRAVPELVAATGSGLLGGFTLFQRDLPIRTKRKLHAIGGARGVWSLPVRQAVRV 600

Query: 562  NADSSRMAAYD-DEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQG-RTIAAGNL 617
            N  S +         +  +IIS +A     +   A   ++   ++   + G  T+ A   
Sbjct: 601  NGVSYQTPQNPLRSDNDTIIISTDATPSPGISRIATRSSKTDLNITTRIPGVTTVGAAPF 660

Query: 618  FGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
            F    ++ V     R+L  DGS         P     G+ +  + + + SI DPYV +  
Sbjct: 661  FQGTAILHVLSNAIRVLEPDGSERQ------PIKDMDGN-NYRAKIKNCSICDPYVFVLR 713

Query: 676  SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKGP-----EPWLRKTSTD 728
             D +I L +G+     +  +  +P   ++S+  ++ C      G         L  ++T 
Sbjct: 714  EDETIGLFIGETERGKIRRKDMSPMGDKTSRY-IAGCFFSDTTGTFQAHVNSSLNGSNTT 772

Query: 729  AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
               +T   +++  A      Q   + ++    G +EI+ +P    VF+     + +  +V
Sbjct: 773  KQNATSTLQSVMNAG-----QKTQWLLLVRPQGVMEIWTLPKLTLVFSTTALATLQPLLV 827

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D+    AL          S     Q RK     + + ++ +        RP LF +L  G
Sbjct: 828  DSLDPPAL----------SSLPQDQPRKP--QELDIDQILVAPLGETSPRPHLFVLLRSG 875

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--P 906
             +  Y+A  FE P     + DP   SR  S+  V   ++ +  F     D   +E++   
Sbjct: 876  QLAIYEAVSFELP-----TGDPEPASRP-SILPVKLVKVLSRAFDIQHPDEQPQEKSVLA 929

Query: 907  HGAPCQRITI-FKNISGHQ----GFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
                 QR+ I F      +    G F +G RPCW +   +  +RVH       + +FT  
Sbjct: 930  ELKKIQRLFIPFVTSPAPEKTFTGVFFTGDRPCWILGTDKGGIRVHSS-GHAVVHSFTPC 988

Query: 961  HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY--FAEKNLYPL 1018
               +    F+  T +G   +  +P    +       ++P +  P    Y          L
Sbjct: 989  SLWDSKGDFLLYTDEGPCLLEWMPDVQLH------TELPSRFMPRSRAYTNVVFDPFTCL 1042

Query: 1019 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1078
            IV    LK   Q  S   D     + D  N+S        T +   + ++ PD     W 
Sbjct: 1043 IVGAASLK--AQFTSFDEDGNQTWEPDAPNISYP------TTDCSTLELITPDA----WL 1090

Query: 1079 TRATIPMQSSENALTVRVVTLFNTTTKENE-TLLAIGTAYVQGEDVAARGRVLLFSTGRN 1137
            T       S+E    V  V L   +T   + + +A+GT   +GED+A +G   +F     
Sbjct: 1091 TMDGYEFASNEIVNAVECVMLETQSTDSGQKSFIAVGTTINRGEDLAVKGATYIFEIVEV 1150

Query: 1138 ADNPQNLVTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAF 1190
              +P   V   +   +      KG ++AL  + G+L+ + G KI +      E L G+AF
Sbjct: 1151 VPDPSFGVKRWFKLRMRCRDDAKGPVTALCGMDGYLVSSMGQKIFVRALDLDERLVGVAF 1210

Query: 1191 YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLID 1249
             D   +YV SL  +KN +++ D  KS++F++++E   +L +LAKD   + CF + +F   
Sbjct: 1211 LDVG-VYVTSLRALKNLLIISDAVKSVWFVAFQEDPYKLTVLAKDAQQV-CFTSADFFFA 1268

Query: 1250 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
               LS+V  DE+  +++++Y P   ES  GQ+LL  AEFH        L +        R
Sbjct: 1269 NQQLSIVTCDEEGILRMYHYNPHDPESKNGQRLLCHAEFHGQIEYRSSLTIA-------R 1321

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
                P ++   +  L+ G+ DGS+  + P++E  F+RL  LQ +L  +V H A LNPR+F
Sbjct: 1322 RTKGPDTE-IPQAKLICGSPDGSLSALVPVEEAAFKRLHLLQGQLTRNVQHTAALNPRAF 1380

Query: 1370 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
            R    N    +      +D  LL  +E LP+  Q+EI  Q GT R  +L +   +ALG
Sbjct: 1381 RAVR-NEYVSKTLHKGFLDGLLLRSFEDLPVSRQIEITRQIGTERRLVLKDW--VALG 1435


>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
            [Saccoglossus kowalevskii]
          Length = 984

 Score =  304 bits (778), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 287/1106 (25%), Positives = 465/1106 (42%), Gaps = 238/1106 (21%)

Query: 423  FFLGSRLGDSLLVQFT--CGSGTSMLSSGLKE---EFGDIEADAPSTKRLRRSSSDALQD 477
             FLGSRLG+SLL+++       T  +++G K+   +    + + P+ K+    +SD +  
Sbjct: 6    LFLGSRLGNSLLLKYVEKAQESTDSVTNGAKKTEEDEETNKEEPPNKKKRTDDASDWIAS 65

Query: 478  MV-----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRI 524
             V     + +EL +YGS +    +   +++F V DS++NIGP      G         + 
Sbjct: 66   DVALLAEDVDELEVYGSQTQ-AGTQLTSYTFEVCDSIMNIGPCTKAVMGEPVFLSEEFQT 124

Query: 525  NAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSR 567
            N D      + +G SK     ++            ELPGC  +WTV     +  N D  +
Sbjct: 125  NPDPDMELVALSGYSKNGALSVLQRSIRPQVVTTFELPGCIDMWTVVGPPEK-ENKDQPK 183

Query: 568  MAAYDD---------EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
                ++           HA+LI+S +  +M+L T   + E+  S  +  QG T+ AGNL 
Sbjct: 184  EKTEEEGDKKPDALTNGHAFLILSRDDSSMILSTGQEIMELDHS-GFSTQGPTVYAGNLG 242

Query: 619  GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
                ++QV   G R+L+G    Q +               S ++  S++DPY LL    G
Sbjct: 243  NNAYILQVSPMGVRLLEGVNQLQHIPL----------DLGSPIVLCSVSDPYALLMSEKG 292

Query: 679  SI--------------RLLVGDP---------STCTVSVQTPAAIESSKKPVSSCTLYHD 715
             +              RL +  P         + C     +     SSK   +S      
Sbjct: 293  ELVLLTLKPDGFAGGHRLAISRPQIPQISRILTLCAYKDTSGMFTTSSKMESTSDETEEK 352

Query: 716  KGPEPWLRKTSTDAWLSTG-------VGEAIDGADGGPLDQGD----------------- 751
            K  +P +   S  + +S          GE+ D +   P  + +                 
Sbjct: 353  KITKPSVADISMTSEISNVDDEDEMLYGES-DASLFSPTKKEEKSSFLQTREVLSETKPT 411

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
             +  +  E+G LEI+ +P+F   F V  F  G   +VD+Y          ++ +S+  G+
Sbjct: 412  YWCAMSRENGVLEIYSLPDFKLAFLVKNFPMGFKVMVDSY----------QMTASAPGGS 461

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
             +  +++     V EL +      + +  L A + D  +  Y+A+               
Sbjct: 462  SKSDQQHDMMPIVKELLLIGLGHKNKKTHLLARV-DEDLYIYEAF--------------- 505

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
             T    S+ N     LR LRF +                                F+ G 
Sbjct: 506  -THDQSSLDN----HLR-LRFRKV-------------------------------FVCGP 528

Query: 932  RPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             P W  M  R  LR HP   DGS+  F   HN+NC  GF+Y    G L+IC LP+  +YD
Sbjct: 529  YPHWLFMTSRGALRSHPMHIDGSVTCFAPFHNINCPKGFLYFNKHGELRICVLPTHLSYD 588

Query: 991  NYWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHN 1048
              WPV+K+PL+ TPH I+Y  E   Y ++ SV  P L+         I +  G   +  +
Sbjct: 589  ALWPVRKVPLRCTPHFISYHIESKTYAVVTSVSEPCLR---------ICKMTGDDKEFED 639

Query: 1049 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1106
            +   D     T+E++ +++  P      W+      I  +  E+   ++ V L +  T  
Sbjct: 640  VERDDRFIFPTIEKFSLQLFSP----LSWEAIPNTKIDTEDWEHITGLKTVFLKSEGTVS 695

Query: 1107 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISAL 1160
              +  +A+ T  V GE+V  RGR+L+F        P     +N +  +Y KE KG ++ L
Sbjct: 696  GLKGFIAVSTTIVYGEEVTCRGRILIFDVIEVVPEPGQPLTKNKLKLLYDKEQKGPVTTL 755

Query: 1161 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
              ++G L  A G KI L  +   +L G+AF D   +++ +L  +KNFIL  DI KS+  L
Sbjct: 756  CDIEGLLAAAIGQKIFLWAFRNNDLIGVAFIDT-QIHIHTLCTIKNFILAADIRKSVSLL 814

Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
             + ++   L+L+ +                                        ES+ GQ
Sbjct: 815  RFSDEDRSLSLVTR----------------------------------------ESFGGQ 834

Query: 1281 KLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
            +LL RA+F+ G+HV  F R+  ++   ++++    P      R   +F TLDGSIG + P
Sbjct: 835  RLLRRADFNAGSHVCSFFRMRSKLSDPATEKLLTGPME---RRHVTMFATLDGSIGYLIP 891

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L     H AGLNP+ FR      K+      +I+D +LL  Y  L
Sbjct: 892  MTEKTYRRLLMLQNALTTQTLHTAGLNPKGFRMVKHQTKSLENTHKNILDGDLLWKYTFL 951

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDL 1424
             + E+ E+A + GT+  QIL +L D+
Sbjct: 952  SVNERTELAKKIGTSVEQILDDLMDV 977


>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Apis mellifera]
          Length = 1415

 Score =  303 bits (776), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 204/697 (29%), Positives = 338/697 (48%), Gaps = 77/697 (11%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            +V  +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E      
Sbjct: 772  LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                   M+V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 826  -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868

Query: 875  RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
                       R + L     P  L    R+E        R   +  F NI+G+ G F+ 
Sbjct: 869  -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917

Query: 930  GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
               P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 918  SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977

Query: 989  YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
            YD  WPV+K+PL+ TPH +TY  E   Y +I S+   +PL                  + 
Sbjct: 978  YDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YR 1020

Query: 1049 LSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVT 1098
             +  D  + +T EE   R + P +        +   W+T     I +   E+   ++ V+
Sbjct: 1021 FNGED--KEFTEEERPDRFIFPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVS 1078

Query: 1099 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1152
            L    T+   +  + +GT Y  GED+ +RGR+L+F        P     +N   ++Y+KE
Sbjct: 1079 LAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKE 1138

Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
             KG I+A+  + G L+ A G KI + +    +L G+AF D   +Y+  +  +K+ IL+ D
Sbjct: 1139 QKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIAD 1197

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            ++KSI  L ++E+   L+L+++DF   + +  E+LID + L  +V+D + NI +F Y P+
Sbjct: 1198 VYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNIALFMYQPE 1257

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
              ES  GQKL+ +A+FH+G  V  F R++    S         SD   R   ++ +LDG+
Sbjct: 1258 SRESLGGQKLIRKADFHLGQKVNTFFRIR-CRISDPANDKKHFSDADKRHVTMYASLDGN 1316

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +G I P+ E T+RRL  LQ  LV  + H+AGLNP+++R + S+ +        I+D +L+
Sbjct: 1317 LGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLV 1376

Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
              Y  LP  E++++A + GT   +I+ +L ++   T+
Sbjct: 1377 WRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDRQTA 1413



 Score =  293 bits (751), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 209/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV  AN+I ++ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30  LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82  AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
              P+V+VDP+GRC  +L+YG ++++L   +  S   GD             I SS++I 
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194

Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           P+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  E 
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308

Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
                  P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K  
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPSTKR 466
            SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   +    + E +    K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPAKK 428

Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +++       +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S 
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486

Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
           G           N D      + +G  K     ++            ELPGC+ +WTV  
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
               G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           NL   R ++QV + G R+L G    Q +                 ++  S ADPYV L  
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650

Query: 676 SDGSIRLL 683
            DG + LL
Sbjct: 651 EDGQVMLL 658


>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 2 [Cricetulus griseus]
          Length = 1388

 Score =  303 bits (775), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 77/671 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  ++ 
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429

Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------HK 556
                    L I        + + + + K    ++V   ELPGC  +WTV         +
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545

Query: 557 SSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
           + R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ 
Sbjct: 546 APRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654

Query: 674 GMSDGSIRLLV 684
             ++G + + +
Sbjct: 655 MSAEGHVTMFL 665



 Score =  285 bits (730), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 202/678 (29%), Positives = 317/678 (46%), Gaps = 88/678 (12%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 780  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 836  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 886  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 946  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1006 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1058

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T     
Sbjct: 1059 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1109

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
                    V G                                LKG ++A   L QG  +
Sbjct: 1110 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1129

Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
               G +I L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++E+   
Sbjct: 1130 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1187

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL RA+F
Sbjct: 1188 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1247

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
            HVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P+ E T
Sbjct: 1248 HVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKT 1300

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            +RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L   E+
Sbjct: 1301 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1360

Query: 1404 LEIAHQTGTTRSQILSNL 1421
             E+A + GTT   IL +L
Sbjct: 1361 SELAKKIGTTPDIILDDL 1378


>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
          Length = 1532

 Score =  302 bits (774), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 207/680 (30%), Positives = 334/680 (49%), Gaps = 75/680 (11%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            +V  +SG LEI+ +P+    + +  F  G+  + D+     L+   T IN          
Sbjct: 740  LVYRDSGTLEIYSLPDLRLSYLIRNFGFGQYVLHDSMESTTLQS--TPINEIPHP----- 792

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                   M+V E+ M     H +RP L   L D  +  YQ Y +  P+   K        
Sbjct: 793  ------DMQVREILMVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGYLK-------- 835

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
              L    +    +   R S  P      E+ P      RI +   F NI+G+ G F+   
Sbjct: 836  --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNTSDTRICVMRYFSNIAGYNGVFICSD 888

Query: 932  RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             P W  +  R  LR HP   DGS+ +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 889  YPHWIFLTGRGELRTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYD 948

Query: 991  NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
              WPV+K+PL+ TPH +TY  E   Y +I S    +PL                  +  +
Sbjct: 949  APWPVRKVPLRCTPHFVTYHLESKTYCVITSTA--EPLKSY---------------YRFN 991

Query: 1051 SVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVTLF 1100
              D  + +T EE   R L P +        +   W+T     I +   E+   ++ V+L 
Sbjct: 992  GED--KEFTEEERPDRFLYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVSLA 1049

Query: 1101 NTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELK 1154
               T+   +  + +GT Y  GED+ +RGR+L+F        P     +N   ++Y+KE K
Sbjct: 1050 YEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQK 1109

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            G I+A+  + G L+ A G KI + +    +L G+AF D   +Y+  +  +K+ IL+ D++
Sbjct: 1110 GPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIADVY 1168

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            KSI  L ++E+   L+L+++DF   + +  E+LID + L  +V+D + N+ +F Y P+  
Sbjct: 1169 KSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFIVADGESNLALFMYQPESR 1228

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSI 1333
            ES  GQKL+ +A+FH+G  V  F R++   T  ++      G+DK  R   ++ +LDGS+
Sbjct: 1229 ESLGGQKLIRKADFHLGQKVNTFFRIRCRVTDPANDKKQFSGADK--RHVTMYASLDGSL 1286

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELL 1392
            G I P+ E T+RRL  LQ  LV  + H+AGLNP+S+R  + +   ++  P   I+D +L+
Sbjct: 1287 GYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKSYRHTYKSYIRNQGNPARGIIDGDLV 1346

Query: 1393 SHYEMLPLEEQLEIAHQTGT 1412
              Y  LP  E+ ++A + GT
Sbjct: 1347 WRYLFLPNNEKADLAKKIGT 1366



 Score =  293 bits (749), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 206/621 (33%), Positives = 328/621 (52%), Gaps = 63/621 (10%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LE +  Y LHGN+ S+  +   G+    +RDS++L+F DAK+SV+E+D  IH LR  S+
Sbjct: 32  KLECLAQYTLHGNIMSMQAVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSL 87

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-D 218
           H FE  E   +K G  +    P+V+VDP+GRC  +L++G ++++L   +  S   GD  D
Sbjct: 88  HYFEEEE---IKDGWTNHHHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPSLDDGDLLD 144

Query: 219 TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
           T        A I SS++I L+ L+  M +V D  F+HGY EP ++IL+E   T+AGR++ 
Sbjct: 145 TAKLTSSNKAPILSSYMIVLKSLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFAGRIAV 204

Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS- 335
           +  TC + A+S++   + HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS 
Sbjct: 205 RQDTCAMVAISLNIQQRVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAFNSLIYLNQSI 264

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
               ++LN+ A +  +    P+    + L+ A   ++  D  ++S K+G+L +L++  D 
Sbjct: 265 PPYGVSLNSLADTSTNFPLKPQEGVKMSLEGAQVAFISADRLVISLKSGELYVLSLFADS 324

Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEE 453
            R V+     K   SVLTS +    ++  FLGSRLG+SLL++FT     ++ + +G +  
Sbjct: 325 MRSVRGFHFDKAAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNGGEIT 384

Query: 454 FGDIEADAPSTKRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
             + E++    K+ ++       +SD L D+ + EEL +YGS + +T     ++ F V D
Sbjct: 385 IEENESEETPAKKAKQDFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCD 442

Query: 508 SLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV------------ 542
           SL+NIGP  + S G           N D      + +G  K     ++            
Sbjct: 443 SLLNIGPCGNISMGEPAFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTF 502

Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
           ELPGC+ +WTV        N D  +  A  +  HA+LI+S E  TM+L+T   + EV +S
Sbjct: 503 ELPGCEDMWTVIGTL----NNDEIKTEA--EGSHAFLILSQEDSTMILQTGQEINEVDQS 556

Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
             +  QG T+ AGNL   R ++QV + G R+L G    Q +                 ++
Sbjct: 557 -GFSTQGSTVFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIV 605

Query: 663 SVSIADPYVLLGMSDGSIRLL 683
             S ADPYV L   DG + LL
Sbjct: 606 HASCADPYVTLLSEDGQVMLL 626


>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
 gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
          Length = 1403

 Score =  301 bits (770), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 358/1478 (24%), Positives = 627/1478 (42%), Gaps = 214/1478 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V  + +++++ +     GS   ++  +T+ +          L LV  Y L G +  L
Sbjct: 28   NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++   +AK+S++E+D   H +  TS+H +E  + +H+  
Sbjct: 81   GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133

Query: 173  GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
               + A  P  + VDP  RC  VL +G + + IL   Q G  LV                
Sbjct: 134  WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192

Query: 215  -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
                  DE    +G  F     SS V+ +  L+  M H     F++ Y EP   IL+ + 
Sbjct: 193  HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252

Query: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
             T +  +  +      S  ++    +    + S   LP+D +K++A+P P+GG L++G N
Sbjct: 253  ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312

Query: 328  T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
              +H      + A+ +N +A    S     +S   + L+ +    L  +N   LL    G
Sbjct: 313  ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372

Query: 385  DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
             + +L+   DGR V  + L            K  PS        +G    F GS   DS+
Sbjct: 373  KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428

Query: 434  LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
            L+      G S  S+  K+       D         SS +  +D  +  E  LY      
Sbjct: 429  LI------GWSRPSTRPKDPPVQGAGD---DNIAELSSDEEEEDDEDIYEDDLYATPVPT 479

Query: 488  GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
            G+ +  + S + T    ++F + D L N+GP++D + G      + D      S  +N E
Sbjct: 480  GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 539

Query: 541  LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
            LV   G                            G W+V+ K  +  +   S        
Sbjct: 540  LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 599

Query: 575  YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
            Y  YL++S      + +++V   +    E T++ ++   + RTI  G L G  RV+QV +
Sbjct: 600  YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 659

Query: 629  RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
               R  D G  + Q       +      SE   V+  S ADPYVL+   D S+ LL  D 
Sbjct: 660  GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 714

Query: 688  STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
            S     ++    I S+     S +LY DK       +T+    LS  V            
Sbjct: 715  SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 760

Query: 748  DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
                +  ++  ES  L IF +PN    VFT +                   D   +I S+
Sbjct: 761  ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 798

Query: 807  SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
                     +E++  + V ++      +    P+L    ++  ++ Y+ Y      +T+ 
Sbjct: 799  EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 848

Query: 867  SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
            S +  S       S++   +  N  F +    +   + +  GA  + + +  ++ G++  
Sbjct: 849  STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 901

Query: 927  FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
            F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++++C+ P  
Sbjct: 902  FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 961

Query: 987  STYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1046
            + +D  W  +KI L      + Y +    Y +  S  V        +L  D E+  +  N
Sbjct: 962  THFDGSWATRKIGLGEQVDIVEYSSSSETYVIGTSQKV------DFNLPEDDEIHPEWRN 1015

Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTK 1105
              +S +       +++  V++L P      W    +  ++++E  + V+ + L  +  T 
Sbjct: 1016 EVISFLP-----QIDQGSVKLLSPRT----WSIIDSHTLRTAERIMCVKCLDLEVSEITH 1066

Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKELKGAISAL 1160
            E   ++A+GTA  +GED+AARG + +F         D P+    +  +  +E+KGA+++L
Sbjct: 1067 ERRDMIAVGTAVTRGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSL 1126

Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
            + +  QG L+ A G K I+   K  G+ L  +AF D    YV  L  +K     ++GD  
Sbjct: 1127 SGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKELKGTGMCIMGDAL 1184

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            K I+F  + E+  +L+L +KD G+L   A +FL DG  L ++V+D+  NI +  Y P+  
Sbjct: 1185 KGIWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGKRLYILVADDDCNIHVLQYDPEDP 1244

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDRTGAAPGSDKTNRFALLFGTL---- 1329
             S KG +LL R+ FH G   +    L + +  S+    A P   + +    L+  L    
Sbjct: 1245 GSSKGDRLLHRSTFHTGHFASTMTLLPRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSE 1304

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
             GSI  I PL E  +RRL +LQ +L++++ H  GLNPR+FR   S+G   R     +VD 
Sbjct: 1305 TGSIALITPLSETAYRRLSALQSQLINTLEHPCGLNPRAFRAIESDGIGGR----GMVDG 1360

Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
            +LL  +  L  + + EIAH+ G    +I ++L  +  G
Sbjct: 1361 DLLHRWLDLGTQRKAEIAHRVGADIWEIRADLEAIGKG 1398


>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
 gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
          Length = 1402

 Score =  301 bits (770), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 359/1478 (24%), Positives = 627/1478 (42%), Gaps = 215/1478 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V  + +++++ +     GS   ++  +T+ +          L LV  Y L G +  L
Sbjct: 28   NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++   +AK+S++E+D   H +  TS+H +E  + +H+  
Sbjct: 81   GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133

Query: 173  GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
               + A  P  + VDP  RC  VL +G + + IL   Q G  LV                
Sbjct: 134  WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192

Query: 215  -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
                  DE    +G  F     SS V+ +  L+  M H     F++ Y EP   IL+ + 
Sbjct: 193  HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252

Query: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
             T +  +  +      S  ++    +    + S   LP+D +K++A+P P+GG L++G N
Sbjct: 253  ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312

Query: 328  T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
              +H      + A+ +N +A    S     +S   + L+ +    L  +N   LL    G
Sbjct: 313  ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372

Query: 385  DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
             + +L+   DGR V  + L            K  PS        +G    F GS   DS+
Sbjct: 373  KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428

Query: 434  LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
            L+      G S  S+  K    D          +   SSD  +D  +  E  LY      
Sbjct: 429  LI------GWSRPSTRPK----DPPVQGAGDDNIAELSSDEEEDDEDIYEDDLYATPVPT 478

Query: 488  GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
            G+ +  + S + T    ++F + D L N+GP++D + G      + D      S  +N E
Sbjct: 479  GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 538

Query: 541  LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
            LV   G                            G W+V+ K  +  +   S        
Sbjct: 539  LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 598

Query: 575  YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
            Y  YL++S      + +++V   +    E T++ ++   + RTI  G L G  RV+QV +
Sbjct: 599  YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 658

Query: 629  RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
               R  D G  + Q       +      SE   V+  S ADPYVL+   D S+ LL  D 
Sbjct: 659  GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 713

Query: 688  STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
            S     ++    I S+     S +LY DK       +T+    LS  V            
Sbjct: 714  SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 759

Query: 748  DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
                +  ++  ES  L IF +PN    VFT +                   D   +I S+
Sbjct: 760  ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 797

Query: 807  SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
                     +E++  + V ++      +    P+L    ++  ++ Y+ Y      +T+ 
Sbjct: 798  EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 847

Query: 867  SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
            S +  S       S++   +  N  F +    +   + +  GA  + + +  ++ G++  
Sbjct: 848  STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 900

Query: 927  FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
            F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++++C+ P  
Sbjct: 901  FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 960

Query: 987  STYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1046
            + +D  W  +KI L      + Y +    Y +  S  V        +L  D E+  +  N
Sbjct: 961  THFDGSWATRKIGLGEQVDIVEYSSSSETYVIGTSQKV------DFNLPEDDEIHPEWRN 1014

Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTK 1105
              +S +       +++  V++L P      W    +  ++++E  + V+ + L  +  T 
Sbjct: 1015 EVISFLP-----QIDQGSVKLLSPRT----WSIIDSHTLRTAERIMCVKCLDLEVSEITH 1065

Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKELKGAISAL 1160
            E   ++A+GTA  +GED+AARG + +F         D P+    +  +  +E+KGA+++L
Sbjct: 1066 ERRDMIAVGTAVTRGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSL 1125

Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
            + +  QG L+ A G K I+   K  G+ L  +AF D    YV  L  +K     ++GD  
Sbjct: 1126 SGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKELKGTGMCIMGDAL 1183

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            K I+F  + E+  +L+L +KD G+L   A +FL DG  L ++V+D+  NI +  Y P+  
Sbjct: 1184 KGIWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGKRLYILVADDDCNIHVLQYDPEDP 1243

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDRTGAAPGSDKTNRFALLFGTL---- 1329
             S KG +LL R+ FH G   +    L + +  S+    A P   + +    L+  L    
Sbjct: 1244 GSSKGDRLLHRSTFHTGHFASTMTLLPRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSE 1303

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
             GSI  I PL E  +RRL +LQ +L++++ H  GLNPR+FR   S+G   R     +VD 
Sbjct: 1304 TGSIALITPLSETAYRRLSALQSQLINTLEHPCGLNPRAFRAIESDGIGGR----GMVDG 1359

Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
            +LL  +  L  + + EIAH+ G    +I ++L  +  G
Sbjct: 1360 DLLHRWLDLGTQRKAEIAHRVGADIWEIRADLEAIGKG 1397


>gi|171695066|ref|XP_001912457.1| hypothetical protein [Podospora anserina S mat+]
 gi|170947775|emb|CAP59938.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1441

 Score =  300 bits (768), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 362/1491 (24%), Positives = 607/1491 (40%), Gaps = 232/1491 (15%)

Query: 57   NLVVTAANVIEIYVVRV--------QEEGSKESKNSGETKRRVLMD--GISAA------- 99
            NLVV  +++++I+  ++        Q+     ++N+G  + R+  D  G+ A+       
Sbjct: 28   NLVVAKSSLLQIFRTKIVSTEIDASQQGSGARTRNAGRYESRLANDDDGLEASFLGGDSL 87

Query: 100  ----------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFD 148
                       L LV    L G +  LA +    + N R   D++++AF+DA++S++E+D
Sbjct: 88   AFKTDRTNNTKLVLVSEISLSGTITGLAKIK---SQNLRSGGDALLVAFKDARLSLVEWD 144

Query: 149  DSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII 203
               H L   S+H +E  E     W        +F     +  DP GRC  +   G+ + I
Sbjct: 145  AERHDLSTVSIHYYEQDELQGSPWAPPLSNFTNF-----LAADPGGRCAALKFGGMNLAI 199

Query: 204  LKASQG---------------GSGLVGDEDTFGSGGGF--SARIESSHVINLRDLD--MK 244
            L   Q                G   V  E    +GG          S V+ L +LD  + 
Sbjct: 200  LPFKQADEDIDMDDDWDEDLDGPRPVKQEAAVVNGGSSIKETPYSPSFVLRLSNLDPSLL 259

Query: 245  HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
            H     F+H Y EP   IL    +  +  +  K H   +   ++    +    I S   L
Sbjct: 260  HPVHLAFLHEYREPTFGILAST-VNASNSLGRKDHLAYM-VFTLDLQQRASTTILSVPGL 317

Query: 305  PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
            P D +++  +P+P+GG L+VGAN  IH         +A+N       S     +S  ++ 
Sbjct: 318  PQDLFRVQPLPAPVGGALLVGANELIHIDQSGKPNGVAVNPLTKQCTSFGLSDQSDLNLR 377

Query: 364  LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT-- 417
            L+      L  +  L+    G + L+T   DGR V  LD+    S+T  S++   ++T  
Sbjct: 378  LEECTIDVLSAEELLVILSDGRMALVTFRIDGRTVSGLDVKLLPSETGGSLIPGRVSTLS 437

Query: 418  -IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDAL 475
             IG S+ F GS  GDSL+  +T     S       ++ G DI+              D  
Sbjct: 438  RIGKSVMFAGSEEGDSLVFGWTKKQNQSGRKKSRLQDVGLDIDMADEEDLDEDEDEDDLY 497

Query: 476  QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ 529
             +    ++ ++  +ASN  E      +F + D L++I P++  +YG  ++A  S      
Sbjct: 498  AEEPTPKQQAV-ATASNVKEG---DLTFRIHDRLLSIAPIQSMTYGQPVDAPGSEEEQNS 553

Query: 530  -----------ATGISKQSNYELV------------ELPGCKGIWTVYHKSS--RGHNAD 564
                         G +K S   ++            E P  +G WTV  K    +    D
Sbjct: 554  AGVRSELQLVCGVGRNKSSAMAIMNLAIPPKVIGRFEFPEARGFWTVCAKKPVPKSLQGD 613

Query: 565  SSRMAAYDD-----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIA 613
                A  +D     +Y  ++I++       E   +   TA     +T +      G TI 
Sbjct: 614  KGPGAIGNDYGTSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLTGTEFDPAAGFTIE 673

Query: 614  AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
            AG +    R+IQV +   R  DG      +   P   E       +T  S SIADPY+L+
Sbjct: 674  AGTMGKDNRIIQVLKSEVRCYDGDLGLSQIV--PMMDEETGAEPRAT--SASIADPYLLI 729

Query: 674  GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
              +D S+ +          S+     +E  +K         DK         +T  WL+ 
Sbjct: 730  IRNDQSVFI---------ASIHDDNELEEVEK--------EDK-------TLATTKWLTG 765

Query: 734  GVGEAIDGADGGPLDQGD--------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
             +    +G  G   + GD        I   +   SGAL I+ +P+               
Sbjct: 766  CLYTDTNGVFGE--ESGDKKAKLPESILMFLLSASGALYIYRLPDL-------------- 809

Query: 786  HIVDTYMREALKDSETEINS--SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
                 Y+ E L    T +++  ++ +GT    KE +  + V +L           P+L  
Sbjct: 810  -CKPVYVAEGLSYIPTGLSADYAARKGTA---KETVSEILVADLG----DTTAKSPYLIL 861

Query: 844  ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
               +  +  Y+ Y ++           +   ++L    +  S L     +++P +    E
Sbjct: 862  RHANDDLTMYEPYRYQLGAG-------LEFPKTLFFQKIPNSVL-----AKSPAEETDDE 909

Query: 904  ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
            E  H A C  +    NI G+   FL G  P + +   + +     L   ++ A +  H  
Sbjct: 910  EVTHQAKCLALRRCNNIGGYSTVFLPGPSPSFIIKSSKSMPKVLPLQGAAVTAISSFHTE 969

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
             C HGFIY  S  I+++ QLP   ++ +    V+KIP+      + Y      Y +  + 
Sbjct: 970  GCEHGFIYADSHNIVRVSQLPKDWSFAETGLAVKKIPIGEDIVAVAYHPPSQSYVVACNT 1029

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD-LHRTYTVEEYEVRILEPDRAGGPWQTRA 1081
            P  +P           E+    D H   + + L    T+E   ++++ P      W    
Sbjct: 1030 P--EPF----------ELPRDDDYHKEWAREVLPFKPTLERGTLKLIGPIT----WTVVD 1073

Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
            TI M+  EN L V  + L  +  T E + L+ +GTA  +GED+  RG V +++       
Sbjct: 1074 TIVMEPCENVLCVETLNLEVSEATNERKLLIGVGTAITKGEDLPTRGAVYVYNVADVIPE 1133

Query: 1141 PQNLVT----EVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAF 1190
            P    T    ++ +KE   +GA++AL+ +  QG +L+A GPK ++   K  GT L  +AF
Sbjct: 1134 PGKPETGKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGPKCMVRGLKEDGTLLP-VAF 1192

Query: 1191 YDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
             D    YV S   +      L+ D  K ++F  + E+  ++ L  K    L+    +FL 
Sbjct: 1193 MDM-NCYVTSAKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSNTRLEVLNADFLP 1251

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRLQMLATSS 1307
            +G  LS+V  D + +I I  + P+  +S +G  LL R  F  GA HVTK L L    +  
Sbjct: 1252 NGKELSIVACDAEGHIHILQFDPEHPKSLQGHLLLHRTSFSTGAHHVTKSLLLPSTLSPD 1311

Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
            ++         +    LL  +  G +  + PL E  +RRL SL  +L +S+ H AGLNP+
Sbjct: 1312 NKEDNEENGATSRPHILLLASPTGVLAALRPLSETAYRRLSSLAAQLTNSLTHAAGLNPK 1371

Query: 1368 SFRQFHSNGKAHRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
             +R    +      G D+     IVD  +L+ +  L   ++ E+A + G T
Sbjct: 1372 GYRM--PSATCPPAGVDAGIGRHIVDGTILARFSELGRAKRGEVAGRAGYT 1420


>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
          Length = 1393

 Score =  300 bits (767), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 337/1463 (23%), Positives = 628/1463 (42%), Gaps = 198/1463 (13%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +L+V   ++++IY +  +     E  ++ +   ++L++            Y L G V  L
Sbjct: 28   DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++AF +AK+S++E+D    G+   S+H +E  +      
Sbjct: 77   CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130

Query: 173  GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
              +    G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS         
Sbjct: 131  VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187

Query: 223  ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
               GGG           +      S V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 188  HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247

Query: 267  ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
              T +  +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+
Sbjct: 248  VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307

Query: 327  NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
            N  +H      + A+ +N ++  + S     +S  ++ L+      L +     LL   T
Sbjct: 308  NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367

Query: 384  GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
            G++ ++    DGR V  + +         + I       T IG+   FLGS  GDS+L+ 
Sbjct: 368  GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427

Query: 437  FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
            ++  S ++       ++  D  AD     +   S  D  +D +     + +L G   +  
Sbjct: 428  YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484

Query: 495  ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
             SA   + F + D L+NIGPL+D + G R++ +   TG    S    +++   +G     
Sbjct: 485  SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544

Query: 551  -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
               V  +    H   S  + + D  + A L    EA             R  V+ T    
Sbjct: 545  GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604

Query: 597  TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
            ++  ES+ + V G                 TI+ G    R+RV+QV +   R  D    +
Sbjct: 605  SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664

Query: 640  TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
            TQ       ++     ++    +S S+AD  + +   D ++  L  D S     V     
Sbjct: 665  TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719

Query: 700  IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
            + S K    SC LY DK                TG+  +ID     P+ + D++  +   
Sbjct: 720  VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760

Query: 760  SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
               L ++ V +   + ++ +   G + ++                 SSE     G +EN+
Sbjct: 761  DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
                V +L  + WSA    P+L        ++ Y+ ++             VST     +
Sbjct: 803  IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
             ++  S+  N    R P    + + +      + + I  +ISG    F+ G+   + +  
Sbjct: 846  HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRT 905

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
                    +L   +  + + L    C+ GFIY+ SQ  ++ C+LP  + +D  W ++++ 
Sbjct: 906  SASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLPPMTRFDYQWTLKRVH 965

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRTY 1058
            L      + Y     +Y       VL   +     L  D E+  +  N  +S     R  
Sbjct: 966  LGEQVDHLAYSTSSGMY-------VLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGS 1018

Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAY 1117
             +     +++ P+     W    +  + + E  + ++ ++L  +  T E + ++ +GTA+
Sbjct: 1019 FI-----KLVSPNT----WSIIDSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAF 1069

Query: 1118 VQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIA 1170
             +GED+ +RG + +F   +   +P +  T+     +  + +KGA++AL+ +  QG +L+A
Sbjct: 1070 ARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVA 1129

Query: 1171 SGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQG 1226
             G K ++   K  G+ L  +AF D    YV  +  +K     +LGD  K ++F  + E+ 
Sbjct: 1130 QGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGVWFAGYSEEP 1187

Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
             +++L AKD   L+  A EFL DG  L +VV+D   NI +  Y P+  +S  G +LLSR+
Sbjct: 1188 YKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRS 1247

Query: 1287 EFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LFGTLDGSIGCIAPLDEL 1342
            +FH+G   +    L     SS++   ++ G D  N+  L   L  T +GS+G I  + E 
Sbjct: 1248 KFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEE 1307

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
            ++RRL +LQ +L +++ H  GLNPR+FR   S+G A R     ++D  LL  +  +  + 
Sbjct: 1308 SYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSKQR 1363

Query: 1403 QLEIAHQTGTTRSQILSNLNDLA 1425
            + EIA + G    +I ++L  ++
Sbjct: 1364 KTEIAGRVGAREWEIKADLEAIS 1386


>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1 [Acromyrmex
            echinatior]
          Length = 1412

 Score =  300 bits (767), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 202/672 (30%), Positives = 332/672 (49%), Gaps = 63/672 (9%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            +V  +SG LEI+ +P+    + +  F  G+  + D+     L+   T IN          
Sbjct: 771  LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQ--STPINEIPHP----- 823

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                   M+V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 824  ------DMQVREILMVALGHHGNRPMLLVRL-DSDLQIYQAYRY--PKGYLK-------- 866

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
              L    +    +   R S  P      E+ P      RI +   F NI+G+ G F+   
Sbjct: 867  --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNRNITRICVMRYFSNIAGYNGVFICSD 919

Query: 932  RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 920  YPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979

Query: 991  NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
              WPV+K+PL+ TPH +TY  E   Y +I S    +PL        + +V          
Sbjct: 980  APWPVRKVPLRCTPHFVTYHLESKTYCVITSTA--EPLKSYYRFNGEDKV---------- 1027

Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1107
               L + Y + ++    +    +   W+T     I +   E+   ++ V+L    T+   
Sbjct: 1028 ---LTKLYYLFQFSRIFMNLLFSPVSWETIPNTKIELDQWEHVTCLKNVSLAYEGTRSGL 1084

Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1162
            +  + +GT Y  GED+ +RGR+L+F        P     +N   ++Y+KE KG I+A+  
Sbjct: 1085 KGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAITQ 1144

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            + G L+ A G KI + +    +L G+AF D   +Y+  +  +K+ IL+ D++KSI  L +
Sbjct: 1145 VSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIADVYKSISLLRF 1203

Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
            +E+   L+L+++DF   + +  E+LID S L  +V+D + N+ +F Y P+  ES  GQKL
Sbjct: 1204 QEEYRTLSLVSRDFRPAEVYTIEYLIDNSNLGFIVADGESNLALFMYQPESRESLGGQKL 1263

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
            + +A+FH+G  +  F R++   T  ++      G+DK  R   ++ +LDGS+G I P+ E
Sbjct: 1264 IRKADFHLGQKINTFFRIKCRITDPANDKKQFSGADK--RHVTMYASLDGSLGYILPVPE 1321

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHYEMLPL 1400
             T+RRL  LQ  LV  + H+AGLNP+++R  + +   ++  P   I+D +L+  Y  LP 
Sbjct: 1322 KTYRRLLMLQNVLVTHICHIAGLNPKAYRHTYKSYVRNQGNPARGIIDGDLVWRYLFLPN 1381

Query: 1401 EEQLEIAHQTGT 1412
             E+ ++A + GT
Sbjct: 1382 NEKADLAKKIGT 1393



 Score =  290 bits (741), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 214/665 (32%), Positives = 342/665 (51%), Gaps = 76/665 (11%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV  ANVI ++ +    + ++  K + ET+            LE +  Y LHGN+ S+ 
Sbjct: 30  LVVAGANVIRVFRLIPDVDMTRREKYT-ETRP-------PKMKLECLTQYTLHGNIMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +   G+    +RDS++L+F DAK+SV+E+D  IH LR  S+H FE  E   +K G  + 
Sbjct: 82  AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
              P+V+VDP+GRC  +L++G ++++L   +  S  + D D   S    S     I SS+
Sbjct: 135 HHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPS--LDDGDLLDSAKLTSTNKTPILSSY 192

Query: 235 VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
           +I L+ L+  M +V D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
           + HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS     ++LN+ A S  +
Sbjct: 253 RVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADSSTN 312

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
               P+    + L+ +   ++  D  ++S K+G+L +L++  D  R V+     K   SV
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM--LSSGLKEEFGDIEADAPSTKRLR 468
           LTS +    ++  FLGSRLG+SLL++FT     ++  L+        +   + P+ K  +
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENENEETPAKKTKQ 432

Query: 469 R-----SSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
                  +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S G  
Sbjct: 433 DFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEP 490

Query: 522 ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
                    N D      + +G  K     ++            +LPGC+ +WTV     
Sbjct: 491 AFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFQLPGCEDMWTVIGIV- 549

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
              N D  R    ++  HA+LI+S E  TMVL+T   + EV +S  +  QG T+ AGNL 
Sbjct: 550 ---NNDEIRT---EEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLG 602

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
             R ++QV + G R+L G    Q +                 ++  S ADPYV L   DG
Sbjct: 603 ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVALLSEDG 652

Query: 679 SIRLL 683
            + LL
Sbjct: 653 QVMLL 657


>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Apis florea]
          Length = 1583

 Score =  300 bits (767), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 210/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV  AN+I ++ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30  LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82  AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
              P+V+VDP+GRC  +L+YG ++++L   +  S   GD             I SS++I 
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194

Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           P+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  E 
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308

Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
                  P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K  
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKR 466
            SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   E    + E +    K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIVLEENETEETPAKK 428

Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +++       +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S 
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486

Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
           G           N D      + +G  K     ++            ELPGC+ +WTV  
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
               G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           NL   R ++QV + G R+L G    Q +                 ++  S ADPYV L  
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650

Query: 676 SDGSIRLL 683
            DG + LL
Sbjct: 651 EDGQVMLL 658



 Score =  295 bits (754), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 199/679 (29%), Positives = 327/679 (48%), Gaps = 77/679 (11%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            +V  +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E      
Sbjct: 772  LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                   M+V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 826  -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868

Query: 875  RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
                       R + L     P  L    R+E        R   +  F NI+G+ G F+ 
Sbjct: 869  -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917

Query: 930  GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
               P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 918  SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977

Query: 989  YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
            YD  WPV+K+PL+ TPH +TY  E   Y +I S+   +PL                  + 
Sbjct: 978  YDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YR 1020

Query: 1049 LSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVT 1098
             +  D  + +T EE   R + P +        +   W+T     I +   E+   ++ V+
Sbjct: 1021 FNGED--KEFTEEERPDRFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVS 1078

Query: 1099 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1152
            L    T+   +  + +GT Y  GED+ +RGR+L+F        P     +N   ++Y+KE
Sbjct: 1079 LAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKE 1138

Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
             KG I+A+  + G L+ A G KI + +    +L G+AF D   +Y+  +  +K+ IL+ D
Sbjct: 1139 QKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIAD 1197

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            ++KSI  L ++E+   L+L+++DF   + +  E+LID + L  +V+D + NI +F Y P+
Sbjct: 1198 VYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNIALFMYQPE 1257

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
              ES  GQKL+ +A+FH+G  V  F R++    S         SD   R   ++ +LDG+
Sbjct: 1258 SRESLGGQKLIRKADFHLGQKVNTFFRIR-CRISDPANDKKHFSDADKRHVTMYASLDGN 1316

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +G I P+ E T+RRL  LQ  LV  + H+AGLNP+++R + S+ +        I+D +L+
Sbjct: 1317 LGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLV 1376

Query: 1393 SHYEMLPLEEQLEIAHQTG 1411
              Y  LP  E++++A +  
Sbjct: 1377 WRYLYLPNNEKIDVAKKIA 1395


>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1-like [Ailuropoda
           melanoleuca]
          Length = 1469

 Score =  298 bits (763), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 210/631 (33%), Positives = 333/631 (52%), Gaps = 76/631 (12%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 104 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 159

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE PE   L+ G       P V+VDP GRC  +L+YG ++++L   +     + +E   
Sbjct: 160 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 213

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
             G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ + 
Sbjct: 214 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 273

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
            TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS   
Sbjct: 274 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 333

Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
             +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ DG R
Sbjct: 334 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 393

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
            V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        +S ++E    
Sbjct: 394 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA--- 449

Query: 457 IEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDS 508
            + + P +K+ R  S+          QD V+  E+ +YGS A + T+ A  T+SF V DS
Sbjct: 450 ADKEEPPSKKKRVDSTVGWSGGKSMPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDS 505

Query: 509 LVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---E 543
           ++NIGP  + + G                L I        + + + + K    ++V   E
Sbjct: 506 ILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFE 565

Query: 544 LPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLETA 593
           LPGC  +WTV         ++ +G  A+   S + A DD   H +LI+S E  TM+L+T 
Sbjct: 566 LPGCYDMWTVIAPVRKEQEETPKGEGAEQEPSALEADDDGRRHGFLILSREDSTMILQTG 625

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
             + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +    
Sbjct: 626 QEIMELDTS-GFATQGPTVFAGNIGDSRYIVQVSPLGIRLLEG---VNQLHFIPVDL--- 678

Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                S ++  ++ADPYV++  ++G + + +
Sbjct: 679 ----GSPIVQCAVADPYVVIMSAEGHVTMFL 705



 Score =  290 bits (743), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 214/707 (30%), Positives = 336/707 (47%), Gaps = 105/707 (14%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 820  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 875

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 876  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 918

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
                L   N+       +RF + P +   RE               E   GA  +  R  
Sbjct: 919  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 968

Query: 916  IFKNISGHQG-------FFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNH 967
             F++I G+ G        F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  
Sbjct: 969  YFEDIYGYSGGGGACPQVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPR 1028

Query: 968  GFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1027
            GF+Y   QG L+I  LP+  +YD  WPV+KIPL+ T H + Y  E  +Y +  S  +  P
Sbjct: 1029 GFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNM--P 1086

Query: 1028 LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPM 1085
              +     I +  G + +   +   D +     E + ++++ P      W+    A I +
Sbjct: 1087 CTR-----IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIEL 1137

Query: 1086 QSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1144
            +  E+   ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   
Sbjct: 1138 EEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQP 1197

Query: 1145 VTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
            +T+     +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   LY+ 
Sbjct: 1198 LTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIH 1256

Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
             +  VKNFIL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD
Sbjct: 1257 QMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSD 1316

Query: 1260 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT 1319
              +N+ ++ Y P+  ES+ G +LL RA+FHVGAHV  F R       +   GAA G  K 
Sbjct: 1317 RDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKK 1369

Query: 1320 -----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
                 N+    F TLDG IG + P+ E T  RLQ                   S R  H 
Sbjct: 1370 SVVWENKHITWFATLDGGIGLLLPMQEKT-NRLQPAX----------------SPRMLHV 1412

Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            + +  +    +++D ELL+ Y  L   E+ E+A + GTT   IL +L
Sbjct: 1413 DRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 1459


>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
 gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
          Length = 1396

 Score =  297 bits (761), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 342/1477 (23%), Positives = 615/1477 (41%), Gaps = 229/1477 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS------------------- 97
            N+V    N + IY VR +   SK    +   K+   M+G+                    
Sbjct: 38   NVVTARGNTLSIYEVREETATSKSPTEAKSQKKDDAMEGVKEERQTPVVQVRSLSKKTYP 97

Query: 98   ---------AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFD 148
                     +    LV  +RLHG V  L  +    +      D ++++F+DAKI++LE+ 
Sbjct: 98   DSDSHSQPLSTKFHLVREHRLHGVVTGLQAVKIISSLEDHL-DRLLVSFKDAKIALLEWS 156

Query: 149  DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
             +   L   S+H +E    + +     +F     ++VDPQ RC  + +      IL   Q
Sbjct: 157  TATQDLLTVSIHTYERAIQM-VATDISAFTSE--LRVDPQSRCAALSLPKDAFAILPPCQ 213

Query: 209  GGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHE 265
                +  D                S ++NL    +  +++V DF F+ G+  P + +L+E
Sbjct: 214  VSDSVCRD-----------VPYSPSFILNLPSEVESGIRNVIDFTFLPGFSNPTVAVLYE 262

Query: 266  RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
               TW GR++ +  T  ++  ++    +++P+I  A  LP D   +LA PS  GGV+VV 
Sbjct: 263  TYQTWTGRLNEQKDTVKMAFFTLDIVNRRYPVIGLATGLPCDCLSVLACPS-TGGVMVVA 321

Query: 326  ANTIHYHSQSASCALALNNYAVSLDSSQELP------RSSFSVELDAAHATWLQNDVALL 379
            +N+I Y  QS    +   N  +   S   LP        + ++EL+ + + ++ +  A +
Sbjct: 322  SNSIIYVDQSGRKVVLPVNAWIPRMSDIALPTNLTPEEQARTLELEGSRSIFVDDKTAFI 381

Query: 380  STKTGDLVLLTVVYDGRVVQRLDL-----SKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
              K G +  + +V  GRVV +L L       T PS+L      I N    +GS  GDS  
Sbjct: 382  ILKDGTIYPVELVTAGRVVSKLALGTPLAKTTIPSILRR----INNDYLLVGSASGDS-- 435

Query: 435  VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL--QDM-VNGEELSLYGSAS 491
                     ++LS+   EE  D + D  +      +S  AL  QD+ ++ ++  +YG + 
Sbjct: 436  ---------ALLSTSWVEEVIDDDVDMEAN-----TSVAALEQQDIEMDDDDDDIYGPSI 481

Query: 492  NNTESAQK------------TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGI 533
              T ++QK                +  D+L   GP+ D ++ +  N D       +ATG 
Sbjct: 482  IKTGTSQKESAAPMSKKTRSVLRLSFCDALPAYGPIADLTFTVGKNGDRPVAELVTATGS 541

Query: 534  SKQSNYELVE--LP-----------GCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYL 579
                 + L +  LP           G +G+W++   +SS    A+           H  L
Sbjct: 542  GHLGGFTLFQKDLPLRKKKKLPIISGARGVWSLPIRRSSSAAVAE-----------HDTL 590

Query: 580  IISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            IIS +A       R  V  T   L+ V+      V G TI AG  F R  ++ V     R
Sbjct: 591  IISTDANPSPGFSRLAVRATKGDLSVVSR-----VNGMTIGAGPFFQRTAILHVMTNAIR 645

Query: 633  IL--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
            +L  DG+   + +D+               + + S SI DPYVL+   D +I L +G+ +
Sbjct: 646  VLEPDGNERQIIKDME---------GNVPRAKIKSCSICDPYVLIFREDDTIGLFIGETT 696

Query: 689  TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGP 746
               +  +  + +       ++   + D      +   + DA   T +     +D +    
Sbjct: 697  RGKIRRKDMSPMGEKSSRYTAGGFFTDTASVFRVYHQNADANTETTIPMHSVVDASSKSQ 756

Query: 747  LDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
                  + V+    G +EI+ +P    VF+     + +  + D+    AL   +      
Sbjct: 757  ------WLVLVRPQGVVEIWTLPKLTLVFSTTLLATLQNVLTDSQEPPALSPPQDPPRKP 810

Query: 807  SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
             E             + + ++ +        +P L  +L  G +  Y+A+      N + 
Sbjct: 811  QE-------------LDIEQILLTNLGQSDPKPHLLVLLRSGHLAIYEAFA----TNQAP 853

Query: 867  SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR------ITIFKNI 920
              +P    R+      S+ +++ ++ +    +    +ET  G   ++         F   
Sbjct: 854  IVEPPLKPRA------SSLQIQFVKIASKAFEMQRTDETEKGILAEQKKALRTFVPFACA 907

Query: 921  SGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
                G F +G RP W +   +  ++++P     ++ AF+       +  F+  + +G   
Sbjct: 908  GAPAGVFFTGDRPHWIVATDKGGVQMYPS-GHAAVYAFSACTLWERSTEFLIYSEEG-QT 965

Query: 980  ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
            +C+  +        P++ IP       I Y    +   +IV+   L+            E
Sbjct: 966  LCEWITEYEIGRPLPMRHIPRGRAYSNIVYEPASS---MIVAAASLRARFASF-----DE 1017

Query: 1040 VGHQI---DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
             G+QI   D   ++        TVE   + ++ P+     W T       ++E   T+  
Sbjct: 1018 DGNQIWAPDGPGITEP------TVECSTLELISPEV----WATVDGYEFATNEFVNTMEC 1067

Query: 1097 VTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL-- 1153
            V L   +T+   +  +A+GT+ V+GED+A +G   +F       +  N     Y  +L  
Sbjct: 1068 VPLETVSTEAGVKHFIAVGTSIVRGEDLAVKGATYIFEVVEVVPDQSNGPKRWYRLKLRC 1127

Query: 1154 ----KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFI 1208
                KG ++AL  +  +L+ + G KI +  +   E L G+AF D   +YV SL  +KN +
Sbjct: 1128 RDDAKGPVTALCGINNYLVSSMGQKIFVRAFDLDERLVGVAFMDV-GVYVTSLRALKNLL 1186

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            L+GD+ + I F++++E   +L  L +D   +     +F      L++V +DE   + ++ 
Sbjct: 1187 LIGDVVRGIQFVAFQEDPYKLVTLGRDVSRMCATTVDFFFAEEALAIVTTDENGVMSMYN 1246

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
            Y P+  +S  G+ LL + EF++    T F    ++A  +      P         L+FG 
Sbjct: 1247 YDPEAPDSHDGRLLLKQTEFNLH---TDFRTSTLIARRTKDDPIIPQG------ILIFGG 1297

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
             DG++ C+ P+ +   +RLQ LQ +L  ++ HVAGLNP++ R    N    RP    I+D
Sbjct: 1298 TDGTLSCLTPVPDDAAKRLQPLQLQLTRNMQHVAGLNPKALRIVR-NEHVSRPLSKGILD 1356

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
              L++++E LP+  Q E+  Q GT R+ IL +   L+
Sbjct: 1357 GNLIAYFEHLPITRQDEMTRQIGTERATILRDWMSLS 1393


>gi|393220097|gb|EJD05583.1| cleavage factor protein [Fomitiporia mediterranea MF3/22]
          Length = 1450

 Score =  297 bits (760), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 363/1458 (24%), Positives = 627/1458 (43%), Gaps = 207/1458 (14%)

Query: 76   EGSKESKNSGETKRRVLMDGISAASLEL--VCHYRLHGNV---ESLAILSQGGADNSRRR 130
            EG  E    GE    +    +S  + +   +  +RLHG V   E + ILS          
Sbjct: 89   EGEVEMDTQGEGFVNMASKPLSMTTYQFHFIREHRLHGIVTGLEPVKILSS----TEDSL 144

Query: 131  DSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQG 189
            D ++++F+DAK+++LE+   +H L   S+H +E +P+   L   + +   G L +VDP  
Sbjct: 145  DRLLVSFKDAKLALLEWSPELHDLVTVSIHTYERAPQMTFLDPSKFT---GQL-RVDPLS 200

Query: 190  RCGGVLVYG--LQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
            RC  + +    L ++    SQ    LV  + T      +S       + N  D  +++V 
Sbjct: 201  RCAALSLPCDCLAILPFYHSQVDLDLVDADQTVSRDIPYSPSF-ILDLFNQVDHRIRNVI 259

Query: 248  DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
            DF F+ G+  P + +L + + TW GR+     TC +   ++      +P+I S  NLPHD
Sbjct: 260  DFAFLPGFNNPTLAVLFQTQHTWTGRLKEFKDTCNLFIFTLDLVTHMYPIITSVENLPHD 319

Query: 308  AYKLLAVPSPIGGVLVVGANTIHYHSQ-SASCALALNNYA--VSLDSSQELPRSSFS--V 362
             + +L   S +GGV+++  N++ Y  Q S    L +N +A  VS    Q+L     +  +
Sbjct: 320  CFAMLPCDSSLGGVVIISCNSLIYVDQASRKTVLPVNGWAARVSDMPMQQLRPEEMNRDL 379

Query: 363  ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI 418
             L+ AHAT++ +    + T+ G ++ + +V DGR   RL L    S+T    L  ++   
Sbjct: 380  HLEGAHATFVDSRTFFIITRDGLVLPVEIVMDGRTALRLALHPAMSQTTTPALVRNVAFR 439

Query: 419  GNS--------LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF------GDIEAD-APS 463
              S        + F+GS +G S+L++ T           ++EE       GDI A  A +
Sbjct: 440  SASGDQAPRSQILFVGSTVGPSVLLRVTW----------VEEEIQKDKQQGDIPAAVADN 489

Query: 464  TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS---FAVRDSLVNIGPLKDFSY 520
               +     D +   V  E  + +G  +  +++A +T S    ++ DSL   GP+   ++
Sbjct: 490  PMAVDFDDEDDIYGDVAKETQTTHGQPTAASQAAVETKSVIHLSLCDSLSAYGPINSMAF 549

Query: 521  GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTV-YHKSSRG 560
             L  N D       +ATG ++   + L +             + G +GIW +   +S + 
Sbjct: 550  ALTRNGDRPTAELVAATGYARLGGFTLFQRDVPTRSKRKLHAVGGARGIWCIPVRQSLKV 609

Query: 561  HNADSSR-MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TIAAG 615
            + ++ SR +     E    +I+S +A      T       +   D  +  R    TI A 
Sbjct: 610  NGSERSRNLLPGSSEVVDTVIVSTDANPSPGLTR--FAAKSSRNDIAITARRTETTIGAA 667

Query: 616  NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
              F R  +I V     R+L+      D S      +    ++   +    I+DP++L+  
Sbjct: 668  PFFQRTAIIHVTTDLIRVLE-----PDCSERQCIRDMDGSNKRPKIRFCCISDPFILVIR 722

Query: 676  SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKG------PEPWLRKTST 727
             D S+ L VGD     +  +  TP   E   +  + C      G       E      ++
Sbjct: 723  EDESLGLFVGDAERGRIRRKDMTPMG-EKVSRYSAGCFFLDQSGIFELHMSESSPTTGTS 781

Query: 728  DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
            D     G G      D     +G  + V+C   G +EI+ +P    VF+        + +
Sbjct: 782  DDKQRMGTGSLESAVDA---QRGTQWLVLCRPQGVVEIWTLPKLALVFSTSSLKDLPSVV 838

Query: 788  VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
             D++   AL        S  E+   + ++ +I  ++  ++        +  P L  +L  
Sbjct: 839  SDSFDPPAL--------SLPEDPPRKPQEADIELLQFAQIG-----ELYPHPHLIVMLRC 885

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSV----------------------SNVSAS 885
            G +  YQA   +      K D P ST R+ ++                      S+V A 
Sbjct: 886  GQLAIYQAVAVD------KDDFPESTVRTSTLKIKFIKMGTRSFEPRQLEPAEKSSVIAE 939

Query: 886  RLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLR 944
            + R LR S  P       E             K +S   G F++G  PCW +   ++ L+
Sbjct: 940  QRRALR-SLVPFIVSPNSE-------------KRVS---GVFVTGDEPCWIVATDKDGLK 982

Query: 945  VHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQ--GILKICQLPSGSTYDNYWPVQKIPLK 1001
            +H   C    V +FT     +    F+  T +  G   +  +P  +   +  P + + + 
Sbjct: 983  IHS--CSFQTVNSFTSCSVWDSKCDFLMHTDEAFGPCLLGWIPEFNLGTDM-PSKTVTVG 1039

Query: 1002 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL-HRTYTV 1060
             T   +T+ A   L  ++ S  V  P       + D+E G+++   +  +++  H   + 
Sbjct: 1040 RTYTNVTFDAASGL--MVASSVVPNPFT-----IFDEE-GNKLWEPDAPNINYPHSVMSA 1091

Query: 1061 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQ 1119
             E     L     G  +        Q +E    +  V L   +T+   +  + +GT   +
Sbjct: 1092 LELFHSDLSCVMDGYEF--------QPNEFVTALDCVQLETQSTESGTKEFIVVGTTVNR 1143

Query: 1120 GEDVAARGRVLLFSTGRNADNPQNLVTEVY------SKELKGAISALASLQGHLLIASGP 1173
            GED+A +G   +F       +P+  +   +        E KG ++AL  + G+L+ + G 
Sbjct: 1144 GEDLAVKGVTYVFEIVEIVPDPEGGLARQFKLRLLCKDEAKGPVTALCGMNGYLVSSMGQ 1203

Query: 1174 KIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1232
            KI +      E L G+AF D   +YV SL  +KN +++GD  KS++ ++++E   +L ++
Sbjct: 1204 KIFVRALDLDERLVGVAFLDV-GVYVTSLRTIKNLLIIGDAVKSVWLVAFQEDPFKLVIV 1262

Query: 1233 AKDFGSLDCFATEFLI--DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH- 1289
            AK+   LD    +FL   DG    + VSDE+  I++  Y     ES  GQ LL R E+H 
Sbjct: 1263 AKEVQRLDVMTADFLFASDGD-FYIAVSDEEGIIRLLEYDTSDPESHSGQYLLRRTEYHA 1321

Query: 1290 -VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD-ELTFRRL 1347
             V +H T      ++A  S   G  P +       L+   +DGS+  + P+D + + +RL
Sbjct: 1322 QVESHTTV-----LIARRSQNDGLVPQA------RLISAAVDGSMYALTPVDADESAKRL 1370

Query: 1348 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
            Q LQ +L  ++ HVAGLNPR+FR   S+G A RP    I+D  LL+ +E LP+  Q EIA
Sbjct: 1371 QLLQGQLTRNMQHVAGLNPRAFRAVRSDGVA-RPLTKGILDGNLLAGFEQLPIPRQNEIA 1429

Query: 1408 HQTGTTRSQILSNLNDLA 1425
               GT R  +L +  +L+
Sbjct: 1430 RPIGTDRLAVLRDRRELS 1447


>gi|336276223|ref|XP_003352865.1| hypothetical protein SMAC_04980 [Sordaria macrospora k-hell]
 gi|380092984|emb|CCC09221.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1486

 Score =  294 bits (753), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 359/1428 (25%), Positives = 596/1428 (41%), Gaps = 208/1428 (14%)

Query: 94   DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
            D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 95   DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVDRNT 154

Query: 154  LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
            L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 155  LETISIHYYEKEELVGSPWVAPLHHYPTLLLADPASRCAALKFSERNLAILPFKQPDEDM 214

Query: 214  VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
              D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 215  DMDNWDEELDGPRPKKDLSGAIANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 274

Query: 255  YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
            Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 275  YRDPTIGVLSSTKTASNSLGHRDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 332

Query: 315  PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
            P+P+GG L+VGAN  IH      S  +A+N       S   + +S   + L+      L 
Sbjct: 333  PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 392

Query: 374  NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---IGNSLFF 424
             ++   LL    G L L+T   DGR V  L +    P    SV+ S +T+   +G S  F
Sbjct: 393  AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRVGRSTVF 452

Query: 425  LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
            +GS  GDS+L+ +T   G +      K    DI+ D                        
Sbjct: 453  VGSEEGDSVLLGWTRRQGQTQKR---KSRIQDIDLDLDLDDEDLEDDD-----------D 498

Query: 485  SLYGSASNNTESA--------QKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------- 529
             LYG  S + E A            +F + D L++I P++  +YG  +    S       
Sbjct: 499  DLYGEESTSPEQAISAAKAVKSGELNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSE 558

Query: 530  ----------ATGISKQSNYELV------------ELPGCKGIWTVYHKSS--RGHNADS 565
                      A G  K S   ++            E P  +G WTV  K    +    D 
Sbjct: 559  GVRSDLQLVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDK 618

Query: 566  SRMAA-YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
              M+  YD   ++H ++I++       E   +   TA     +T +      G T+ AG 
Sbjct: 619  GPMSNDYDTSGQHHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGT 678

Query: 617  LFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
            +    R++QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL  
Sbjct: 679  MGKDCRILQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIR 733

Query: 676  SDGSIRLLVGDPSTCTV-SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
             D S+ +    P    +  V+    + +  K ++ C LY D                +TG
Sbjct: 734  DDFSVFVAEMSPKLLELDEVEKEDQMLTGTKWLAGC-LYTD----------------TTG 776

Query: 735  VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMRE 794
            V  A + A  G  D  +I   +   SG L I+ +P+      V + +S        Y+  
Sbjct: 777  V-FADEAAGKGTKD--NILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPP 825

Query: 795  ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
             L       + ++ +GT    KE++  + V +L        H  P+L     +  +  YQ
Sbjct: 826  GLS-----ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQ 873

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----P 910
             Y  +     + +  P S S       +   ++ N  F++ P +    ++  H A    P
Sbjct: 874  PYRVK-----ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPVEDDELHNAQRFLP 921

Query: 911  CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFI 970
             +R T   NISG+   FL GS P + +   +       L    + A +  H   C HGFI
Sbjct: 922  MRRCT---NISGYSTVFLPGSSPSFILKTAKSSPRVLGLQGSGVQAMSSFHTEGCEHGFI 978

Query: 971  YVTSQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
            Y  + GI ++ Q+P+ S++      V+KIP+      + Y      Y  +V     +P  
Sbjct: 979  YADTNGIARVTQIPTDSSFAELGLSVKKIPVGVDTQSVVYHPPTQAY--VVGCNNAEPFE 1036

Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
                L  D +   +    N++   +     V+   +++L    +G  W    T+ M+  E
Sbjct: 1037 ----LPKDDDYHKEWARENITFKPM-----VDRGMLKLL----SGITWTVIDTVEMEPCE 1083

Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEV 1148
              L V  + L  + +T E + L+A+GTA  +GED+  RGRV +F        P    T  
Sbjct: 1084 TVLCVETLNLEVSESTNERKQLIAVGTALTKGEDLPTRGRVYVFDIADVIPEPGKPET-- 1141

Query: 1149 YSKELK---------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPP 1195
             SK+LK         GA++AL+ +  QG +L+A G K ++   K  GT L  +AF D   
Sbjct: 1142 -SKKLKLVAKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN- 1198

Query: 1196 LYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
             YV S+  +      L+ D  K ++F  + E+  ++ L  K    ++    +FL DG  L
Sbjct: 1199 CYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKEL 1258

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRLQML--ATSSDRT 1310
             +V SD   +I I  + P+  +S +G  LL R  F+ GA H T  L L  +   T+S  +
Sbjct: 1259 YIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPTTTSPNS 1318

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
             +  G +  +   LL  +  G +  + PL E  +RRL SL  +L +++PH AGLNP+ +R
Sbjct: 1319 NSEVGENPPH--ILLLASPTGLLATLRPLQENAYRRLSSLAIQLTNALPHPAGLNPKGYR 1376

Query: 1371 QFHSNGKA--HRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTG 1411
                +  A    PG D+     IVD ++L  +  L   ++ EIA + G
Sbjct: 1377 LPSPSASASMQLPGVDAGIGRNIVDGKILERFMELGTGKRQEIAGRAG 1424


>gi|148886831|sp|Q7SEY2.2|CFT1_NEUCR RecName: Full=Protein cft-1; AltName: Full=Cleavage factor two
            protein 1
          Length = 1456

 Score =  294 bits (752), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 345/1418 (24%), Positives = 594/1418 (41%), Gaps = 189/1418 (13%)

Query: 94   DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
            D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96   DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154  LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
            L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156  LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214  VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
              D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216  DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255  YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
            Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276  YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315  PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
            P+P+GG L+VGAN  IH      S  +A+N       S   + ++   + L+      L 
Sbjct: 334  PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393

Query: 374  NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
             ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394  AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453

Query: 425  LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
            +GS  GDS+L+ +T   G +      ++    ++              D   D + GEE 
Sbjct: 454  VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485  SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
            +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508  ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567

Query: 530  ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
               A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568  LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574  ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
                +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628  DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624  IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
            +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+ +
Sbjct: 688  LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSVFI 742

Query: 683  LVGDPSTCTVSVQTPA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG 741
                P    +        I +S K ++ C LY D          ++  +    VG+    
Sbjct: 743  AEMSPKLLELEEVEKEDQILTSTKWLAGC-LYTD----------TSGVFADETVGKGT-- 789

Query: 742  ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
                   + +I   +   SG L I+ +P+      V + +S        Y+   L     
Sbjct: 790  -------KDNILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPPGLS---- 830

Query: 802  EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
              + ++ +GT    KE++  + V +L        H  P+L     +  +  YQ Y  +  
Sbjct: 831  -ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-- 880

Query: 862  ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIF 917
               + +  P S S       +   ++ N  F++ P +    ++ PH A    P +R +  
Sbjct: 881  ---ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS-- 928

Query: 918  KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
             NISG+   FL GS P + +   +       L    + A +  H   C HGFIY  + GI
Sbjct: 929  -NISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGI 987

Query: 978  LKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
             ++ Q+P+ S+Y      V+KIP+      + Y      Y  +V    ++P      L  
Sbjct: 988  ARVTQIPTDSSYAELGLSVKKIPIGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPK 1041

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            D +   +    N++   +     V+   +++L    +G  W    T+ M+  E  L V  
Sbjct: 1042 DDDYHKEWARENITFKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVET 1092

Query: 1097 VTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK- 1154
            + L  + +T E + L+A+GTA ++GED+  RGRV +F        P    T   SK+LK 
Sbjct: 1093 LNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKL 1149

Query: 1155 --------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLN 1202
                    GA++AL+ +  QG +L+A G K ++   K  GT L  +AF D    YV S+ 
Sbjct: 1150 VAKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVK 1207

Query: 1203 IVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
             +      L+ D  K ++F  + E+  ++ L  K    ++    +FL DG  L +V SD 
Sbjct: 1208 ELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDA 1267

Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
              +I I  + P+  +S +G  LL R  F+ GAH      L + A   + +  +  S++ +
Sbjct: 1268 DGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENS 1326

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA-- 1378
               LL  +  G +  + PL E  +RRL SL  +L + +PH AGLNP+ +R    +  A  
Sbjct: 1327 PHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASM 1386

Query: 1379 HRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTG 1411
              PG D+     IVD ++L  +  L   ++ E+A + G
Sbjct: 1387 QLPGVDAGIGRNIVDGKILERFLELGTGKRQEMAGRAG 1424


>gi|301628217|ref|XP_002943254.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Xenopus (Silurana) tropicalis]
          Length = 628

 Score =  292 bits (748), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 204/626 (32%), Positives = 318/626 (50%), Gaps = 74/626 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E S + +   E K            LEL+  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNPNCESSSKGEKGSEVKGH-------KEKLELMASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F++AK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG Q+++L       ++   GLVG+        G  +   
Sbjct: 135 NVHNPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHDGLVGE--------GQKSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
              K HP+IWS  NLP+D  + LAVP PIGGV++   N++ Y +QS     ++LN+    
Sbjct: 247 IMQKVHPVIWSLTNLPYDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVSLNSLTNG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             S    P+    V LD + AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTSFPLKPQEGLRVTLDCSQATFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ +T +     FLGSRLG+SLL+++T     S        +    + D P  K+ 
Sbjct: 367 ASVLTTSMTPMEPGYLFLGSRLGNSLLLRYTEKVQDSPAGPSKDPD----KQDEPPNKKK 422

Query: 468 RRSSSDALQ-----DMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           R  SS A       +MV+  +E+ +YGS    + +   T+SF V DS++NIGP    S G
Sbjct: 423 RVDSSLARPGGSKGNMVDEIDEIEVYGSEM-QSGTQLSTYSFEVCDSILNIGPCATASMG 481

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 482 EPAFLSEEFQESPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVISN 541

Query: 557 SSRGHNADSSRM------AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
             +               A  D   H +LI+S +  TM+L+T   + E+  S  +  Q  
Sbjct: 542 HKKEEQEGEKEGETPPVEAEEDTNRHGFLILSRDDSTMILQTGQEIMELDTS-GFATQDP 600

Query: 611 TIAAGNLFGRRRVIQVFERGARILDG 636
           T+ AGN+   + ++QV  RG R+L+G
Sbjct: 601 TVYAGNIGDNKYIVQVSPRGIRLLEG 626


>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1407

 Score =  291 bits (746), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 350/1524 (22%), Positives = 618/1524 (40%), Gaps = 296/1524 (19%)

Query: 57   NLVVTAANVIEIYVVRVQEEGS-----KESKNSGETKRRVLM-----DGISAAS------ 100
            NLVV   ++++I+ +   E+        E+K+ G + RRV       D  +  S      
Sbjct: 31   NLVVAKTSLLQIFRLVEYEDAEGEFALDEAKDEGGSDRRVFEGRDHEDSFTVESGMHLQR 90

Query: 101  --------LELVCHYRLHGNVESLAIL----SQGGADNSRRRDSIILAFEDAKISVLEFD 148
                    L+LV  Y L+G+V S+  +    S+ G D       ++++F+ AKIS+LE+D
Sbjct: 91   ETIEKTTKLDLVAQYHLYGSVTSMVKIRIPTSKSGGD------CLLVSFDSAKISLLEWD 144

Query: 149  DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGV-LVYGL 199
             + H +   S+H +E  E+           R PL           DP+ RC      + L
Sbjct: 145  PAAHSISTISLHYYEGDEF-----------RSPLTPEFPINYLISDPKSRCAAFKFNHDL 193

Query: 200  QMII-LKASQGGSGLVGDEDTF-----------------------GSGGGFSARIESSHV 235
              I+  + ++     + D D+F                       G G         S V
Sbjct: 194  VAILPFRQTEDEDLEIPDNDSFTYDLEDDDDAEKPKKDVEMKDNTGEGKPSDTPYHPSFV 253

Query: 236  INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
            ++   LD  ++ + D +F+H Y EP   I+++ +    G +  +        +++    +
Sbjct: 254  LSASQLDESVERIIDIVFLHEYREPTFGIVYQPQQGSVGMLERRKDPTHFIVVTLDLDQR 313

Query: 294  QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSS 352
                I SA NLP D +K +A+P PIGG L++G + I H        A+A+N+YA    + 
Sbjct: 314  ASTSIMSAKNLPFDIWKAVALPPPIGGTLLLGEHEIVHVDQAGKMSAVAVNSYAQQYSAF 373

Query: 353  QELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP-- 408
                +S   + L++  A  L N+    L+ T  GD  +L+   +GR +  L + +     
Sbjct: 374  NMTDQSDLELNLESCSAISLPNENGDVLIVTIAGDFAILSFKAEGRSISSLSVRRIQSKD 433

Query: 409  -----SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
                 S     +  +GN  FFLGS   D++L  +      + LS               S
Sbjct: 434  GYPFTSAPCETLVEVGNRRFFLGSLDSDAMLWGYKRKGEKTSLSQK-------------S 480

Query: 464  TKRLRRSSSDALQDMVNGEELSLYG------------SASNNTESAQKTFSFAVRDSLVN 511
              +L R  ++      + +E  LYG            S+ N    +   + F   D L N
Sbjct: 481  EVKLERDDAEDNV-EDDDDEDDLYGESTVTPITPRKASSGNIGRGSSGEYVFRRHDRLQN 539

Query: 512  IGPLKDFSYG--------------------LRINADASATGISKQSNYELV-------EL 544
            +GP +  ++G                    L         G     N  +        + 
Sbjct: 540  VGPCRQMAFGRPAMLPEKLKLHQGVLPELELMATTGRGVEGAVTVFNTSICPRVSATFDF 599

Query: 545  PGCKGIWTVYHKSSRGHNA--DSSRMAAYDDE------YHAYLIISLEARTMV------- 589
              C+ +W V+ K  +   +   SS    Y+++      Y  YL  S  + T+V       
Sbjct: 600  KDCQRLWAVHSKQVKKGQSMIPSSVSKGYEEQIGATEDYSTYLFASNTSETLVYKVGTKF 659

Query: 590  --LETADL-LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
              LE  D+  TEV  ++++         G      R+ QV E   ++ D     Q +   
Sbjct: 660  EPLEGTDIETTEVCPTLEF---------GTFQDGLRIAQVCETNVKVYDSEL--QLIQII 708

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKK 705
             +N E   G  +  ++S S ADPY+LL   D SI        T  +  ++ PA I+ +K 
Sbjct: 709  STNDEDPDGGPH--IVSASFADPYMLLICGDSSILACQCHERTLELDRIELPATIKDTK- 765

Query: 706  PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY---ESGA 762
                                 T+  L T   E            G    V+C+   E G 
Sbjct: 766  --------------------YTNGCLYTSSSEV--------FGLGTKSQVLCFLLTEEGT 797

Query: 763  LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG-QGRKENIHS 821
            L++F +PNF    T++ F                 D   ++ S  E        ++ I  
Sbjct: 798  LQVFTLPNFELKATLEHF-----------------DMSLQLVSPDETALRFHTARDEIEE 840

Query: 822  MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
            + V +L      A    P+L        I+ Y+ ++  G                     
Sbjct: 841  IIVADLGDNISKA----PYLIVKTKRDDIIIYEPFISNG--------------------- 875

Query: 882  VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
            +   ++ N   +  P  + + +++P G P  +I    ++ G+   F++G  P +     +
Sbjct: 876  ICFKKIYN---TVLPTVSLSEQKSPSG-PLVKI---DDLGGYSVAFMAGDTPTFITKSSK 928

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
             L    +L  G + + +  +      GF+Y+ S+G  ++C  P  S  ++ W  Q+IPL+
Sbjct: 929  TLPKLYKLQGGMVRSLSPFNTKETERGFLYIDSKGTARVCHFPEVSM-EHTWLSQRIPLE 987

Query: 1002 ATPHQITYFAEKNLYPLIV---SVPVLKPLN-QVLSLLIDQEVGHQIDNHNLSSVDLHRT 1057
             TP  +TY+  KN+Y + V   S P +   + Q+   L+D+ +  +++  +L  +     
Sbjct: 988  RTPTSLTYYDPKNVYVVSVLSTSKPEVDDEDFQMEEGLVDETLLPELETGHLVMISPVTW 1047

Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY 1117
             T + YE  + E      P+  +A + ++ SE              TKE + L+A+GT  
Sbjct: 1048 TTTDRYEFPVHEV-----PFVVKA-VELEISE-------------VTKERKVLIAVGTGL 1088

Query: 1118 VQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
            ++GE+  ARG V +F         G+     +  +  +  +E+KG +S LA + G+LLI 
Sbjct: 1089 LRGENSPARGAVYVFDVIDVVPEIGKPETGKKFKL--ISREEVKGVVSTLAGMDGYLLIT 1146

Query: 1171 SGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
             G K ++   K  G+ L  +AF D      V+  + K  ++ GD+ K + F+ + E+  +
Sbjct: 1147 HGQKCMIRGLKEDGSLLP-VAFMDMNTHTTVAKTLEK-MVMFGDVLKGVSFVGFSEEPYK 1204

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            + L  KD   L   A +FL  G+    VV+D Q NI +  Y P+  +S  G +LL + E 
Sbjct: 1205 MILFGKDPRQLSITAGDFLPAGTACYFVVADAQSNIHVLQYDPENPKSIHGNRLLPKGEI 1264

Query: 1289 HVGAHVTKFLRL-QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
            + G  V     L +  +  ++        D+   F  +F T+ G  G ++ + E  +RRL
Sbjct: 1265 YCGHEVKSICILPKKKSLFTEPDEDDMDEDEDEEFLCMFSTMTGVFGTVSSITESMYRRL 1324

Query: 1348 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
              +Q ++ ++  H+AGLNPR++R       +  P   +I+D +LL  + ML    + E+A
Sbjct: 1325 NVIQGQITNTGEHIAGLNPRAYRAAKFRNTSSEPM-RAILDGKLLVRWLMLGAGRRKELA 1383

Query: 1408 HQTGTTRSQILSNLNDLALGTSFL 1431
             + GT+   +  +L  L   T+F 
Sbjct: 1384 GRAGTSEEMLREDLWFLQDATAFF 1407


>gi|19112233|ref|NP_595441.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe
            972h-]
 gi|74582544|sp|O74733.1|CFT1_SCHPO RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|3738146|emb|CAA21247.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe]
          Length = 1441

 Score =  291 bits (745), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 336/1443 (23%), Positives = 598/1443 (41%), Gaps = 205/1443 (14%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV   ++ G +  ++ L   G++     D +I+  + AK+S LE+D         S+H
Sbjct: 92   LRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTNSLH 148

Query: 161  CFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
             +E      +K      +  P  + VDP   C  +L +   M+ +        L  +E  
Sbjct: 149  YYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDMEEAA 202

Query: 220  F-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
               S    S   + S V+    LD  +  + D  F++GY EP + IL+  E T    +  
Sbjct: 203  IENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVTLPL 262

Query: 277  KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-HSQS 335
            +  T + S +++    +   +I +  +LP+D Y  +++P+P+GG L++G N + Y  S  
Sbjct: 263  RKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSAG 322

Query: 336  ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDLVLLT 390
             +  + +N+Y           +S F++EL+   A  L     +    +L   +G    L 
Sbjct: 323  RTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFFYLD 382

Query: 391  VVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGT 443
             + DG+ V+ L L     + N   L S IT     G +L FLGS+  DS L++++     
Sbjct: 383  FLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWS----- 437

Query: 444  SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
                    EE    E D      L  ++   + DM++  E      +             
Sbjct: 438  ---RRTTNEEVRLDEGD----DTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLRL 490

Query: 504  AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY---ELV------------------ 542
             + D L NIGP+ DF+ G      A +     Q N+   ELV                  
Sbjct: 491  EICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAGADGAGGLVVFRRNI 545

Query: 543  --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETA 593
                    +  GC+ +WTV   S +  N  S   A Y + E   YL++S E  + +    
Sbjct: 546  FPLIAGEFQFDGCEALWTV-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIFLAG 604

Query: 594  DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSES 652
            +   EV  S D+    +T+  G+L    R++Q+     R+ D +  +TQ  +F       
Sbjct: 605  ETFDEVQHS-DFSKDSKTLNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNF------- 656

Query: 653  GSGSENSTVLSVSIADPYVLLGMSDGSI----------RLLVGDPSTCTVSVQTPAAIES 702
               S+   V+S SI DP +++    G I          RL+  D       V+T A++ S
Sbjct: 657  ---SKKQIVVSTSICDPCIIVVFLGGGIALYKMDLKSQRLIKTDLQNRLSDVKT-ASLVS 712

Query: 703  SKKPVSSCTLY----------------HDKGPEPWL-----RKTSTDAWLSTGVGEAIDG 741
                     L+                +D   E  L      KTS +  +  G  +++  
Sbjct: 713  PDSSALFAKLFTYNETLNAKGQIANGMNDSASETDLDIQPNHKTSNNDQM--GYDQSV-S 769

Query: 742  ADGGP--------------LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
            AD  P              LDQ  +          + G L+++++ +F+ +   D F   
Sbjct: 770  ADDVPEVDNTIVTEKNVSNLDQESLEKHPILFALTDEGKLKVYNLADFSLLMECDVFDLP 829

Query: 784  RTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
             T      +   ++   T  N  S             S ++VEL +         P LF 
Sbjct: 830  PT------LFNGMESERTYFNKES-------------SQELVELLVADLGDDFKEPHLFL 870

Query: 844  ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
                  I  Y+A+L+    NT K  + ++ ++   V   + +R        TP DA +  
Sbjct: 871  RSRLNEITVYKAFLYS---NTDKHKNLLAFAK---VPQETMTREFQANVG-TPRDAESTM 923

Query: 904  ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
            E    +     ++T  + +  H   F++G +P   +       +  P   +  I++    
Sbjct: 924  EKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSVAPF 983

Query: 961  HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIV 1020
            H  +   G+IYV     ++IC+      YDN WP +K+ L    + I Y   K +Y +  
Sbjct: 984  HAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKKVSLGKQINGIAYHPTKMVYAVGS 1043

Query: 1021 SVPVLKPL-----NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG 1075
            +VP+   +     N+  ++  D +    +   N  S+DL    T                
Sbjct: 1044 AVPIEFKVTDEDGNEPYAITDDNDY---LPMANTGSLDLVSPLT---------------- 1084

Query: 1076 PWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
             W    +   Q  E  L+V +V L  + TTK  +  +A+GT+  +GED+A RG   LF  
Sbjct: 1085 -WTVIDSYEFQQFEIPLSVALVNLEVSETTKLRKPYIAVGTSITKGEDIAVRGSTYLFEI 1143

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGI 1188
                  P     ++ +  V  +E+KG ++ +  + G+LL   G K+I+      + L G+
Sbjct: 1144 IDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLSGQGQKVIVRALEDEDHLVGV 1203

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            +F D    Y +S   ++N +L GD+ +++ F+ + E+  ++ L +K   +L+  A +FL+
Sbjct: 1204 SFIDLGS-YTLSAKCLRNLLLFGDVRQNVTFVGFAEEPYRMTLFSKGQEALNVSAADFLV 1262

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
             G  L  VV+D   N+++  Y P+  ES  G++L++R +FH+G  +T    + +L     
Sbjct: 1263 QGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDFHIGNVITA---MTILPKEKK 1319

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
               A  G D  + F+ +    DG +  + P+ +  +RRL  +Q  L + V  + GLNP+S
Sbjct: 1320 HQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLNIIQNYLANRVNTIGGLNPKS 1379

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
            +R   S      P    I+D  L+ ++  + +  + E+AH+ G   S I+++L +L    
Sbjct: 1380 YRLITSPSNLTNPT-RRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDEAL 1438

Query: 1429 SFL 1431
            S++
Sbjct: 1439 SYM 1441


>gi|426194401|gb|EKV44332.1| hypothetical protein AGABI2DRAFT_187183 [Agaricus bisporus var.
            bisporus H97]
          Length = 1413

 Score =  291 bits (745), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 342/1447 (23%), Positives = 611/1447 (42%), Gaps = 166/1447 (11%)

Query: 57   NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD-------GIS 97
            N+VV  +N++ I+ VR +      +   E +  G+T+R        V MD        I+
Sbjct: 49   NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108

Query: 98   AASLE-----------LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
             ++++            +  +RLHG V   E + I+    A    + D ++++F+DAKI+
Sbjct: 109  KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164

Query: 144  VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            +LE+ D+IH L   S+H +E +P+ + L        R  L +VDP  RC  + +    + 
Sbjct: 165  LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220

Query: 203  ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
            IL   Q  + L V ++D   S          S +++L    + ++++V DF+F+ G+  P
Sbjct: 221  ILPFYQTQAELDVMEQDQSQSK---DVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277

Query: 259  VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
             + IL + + TW GR+     T  +   ++    +   +I S   LP+DA+ LL   + I
Sbjct: 278  TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337

Query: 319  GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
            GGV+V+  N++ Y  QS+   +L +N +A  +      P    ++  + L+   +  + +
Sbjct: 338  GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDAALKLHLEGCRSAMVDD 397

Query: 375  DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
                L  K G +  + ++ DG+ V +L ++   P++  + I T+   +     F+GS +G
Sbjct: 398  KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
             S+L++            G K     +  D          + D   D + G+        
Sbjct: 455  PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505

Query: 491  SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
            +N     +KT      ++RD L   GP+   ++ L +N +       +ATG      + L
Sbjct: 506  ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565

Query: 542  VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
             +  LP  K    +Y   SRG  +   R     +  H +    LI+S +       +   
Sbjct: 566  FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625

Query: 596  LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
            +  +    S++    G TI A   F R  ++ V     R+L  DG+          +  +
Sbjct: 626  VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
                     +   SIADP+VL+   D SI + V  D         +P   +SS+  ++ C
Sbjct: 679  KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737

Query: 711  TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
                  G    L + + D  +  +T   +   GA          + ++    G LEI+ +
Sbjct: 738  FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786

Query: 769  PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
            P  +  F+     S ++ + DT+   A                 Q        + + ++ 
Sbjct: 787  PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833

Query: 829  MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
            +         P L   L  G +  Y+A +    +N    D P +TS  +    ++A    
Sbjct: 834  LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
              R            +  +      +T  +    + G F +G RP W +   R  ++V+P
Sbjct: 892  IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951

Query: 948  QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQI 1007
                  + AFT          F+  T  G + +  +P    +D   P++ IP       +
Sbjct: 952  S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPDFQ-FDGPLPMRSIPRGRAYSNV 1009

Query: 1008 TYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1067
             +    +L   IV+   L+  +   S   D     + D  N+SS       +V+   + +
Sbjct: 1010 LFDPSTSL---IVAASSLQ--STFTSFDEDGNNIWEPDAPNISSP------SVDCSALEL 1058

Query: 1068 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAAR 1126
            + PD     W T       ++E    + +VTL    T+   +  +A+GT   +GED+A +
Sbjct: 1059 IAPDI----WATMDGFEFATNEYINDMTIVTLETAATETGTKDFIAVGTTIDRGEDLAVK 1114

Query: 1127 GRVLLFSTGRNADNPQNLVTEVYSKEL--------KGAISALASLQGHLLIASGPKIILH 1178
            G   +F        P   V++    +L        KG ++A+  L  +L+ + G KI + 
Sbjct: 1115 GATYIFEIAEVV--PDQAVSQRRWYKLRLRCRDDAKGPVTAVCGLSDYLVSSMGQKIFVR 1172

Query: 1179 KWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
             +   E L G+AF D   +YV SL  +KN +L+GD  KS+ F++++E   +L LL+KD  
Sbjct: 1173 AFDSDERLVGVAFMDVG-VYVTSLQTLKNLLLIGDAVKSVQFVAFQEDPYKLVLLSKDIQ 1231

Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
            S+     +FL   + L LV  DE+  I+I+ Y P+  +S +G+ LL   EFH G    ++
Sbjct: 1232 SVCVTRADFLFSENDLRLVTGDEEGIIRIYEYNPQDPDSREGRHLLLETEFH-GQR--EY 1288

Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
                ++A       + P S       LL G+ DGS+  +  ++E  F+RL  LQ +L+ +
Sbjct: 1289 RTSVLVAHRIKEDQSIPNS------RLLTGSADGSLASLTIVEEEAFKRLGLLQGQLMRN 1342

Query: 1358 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
            + H+A LNP++FR    N    +P    I+D  LL  YE LP+  Q E   Q G  R  +
Sbjct: 1343 IQHMAALNPKAFR-IVKNEYVSKPLTRGILDGNLLGQYESLPINRQSEATQQIGADRVNV 1401

Query: 1418 LSNLNDL 1424
            L +  +L
Sbjct: 1402 LRDWIEL 1408


>gi|409076059|gb|EKM76433.1| hypothetical protein AGABI1DRAFT_108759 [Agaricus bisporus var.
            burnettii JB137-S8]
          Length = 1413

 Score =  291 bits (744), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 341/1447 (23%), Positives = 610/1447 (42%), Gaps = 166/1447 (11%)

Query: 57   NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD-------GIS 97
            N+VV  +N++ I+ VR +      +   E +  G+T+R        V MD        I+
Sbjct: 49   NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108

Query: 98   AASLE-----------LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
             ++++            +  +RLHG V   E + I+    A    + D ++++F+DAKI+
Sbjct: 109  KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164

Query: 144  VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            +LE+ D+IH L   S+H +E +P+ + L        R  L +VDP  RC  + +    + 
Sbjct: 165  LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220

Query: 203  ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
            IL   Q  + L V ++D              S +++L    + ++++V DF+F+ G+  P
Sbjct: 221  ILPFYQTQAELDVMEQD---QSQAKDVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277

Query: 259  VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
             + IL + + TW GR+     T  +   ++    +   +I S   LP+DA+ LL   + I
Sbjct: 278  TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337

Query: 319  GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
            GGV+V+  N++ Y  QS+   +L +N +A  +      P    ++  + L+   +  + +
Sbjct: 338  GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDATLKLHLEGCRSAMVDD 397

Query: 375  DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
                L  K G +  + ++ DG+ V +L ++   P++  + I T+   +     F+GS +G
Sbjct: 398  KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
             S+L++            G K     +  D          + D   D + G+        
Sbjct: 455  PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505

Query: 491  SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
            +N     +KT      ++RD L   GP+   ++ L +N +       +ATG      + L
Sbjct: 506  ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565

Query: 542  VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
             +  LP  K    +Y   SRG  +   R     +  H +    LI+S +       +   
Sbjct: 566  FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625

Query: 596  LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
            +  +    S++    G TI A   F R  ++ V     R+L  DG+          +  +
Sbjct: 626  VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
                     +   SIADP+VL+   D SI + V  D         +P   +SS+  ++ C
Sbjct: 679  KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737

Query: 711  TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
                  G    L + + D  +  +T   +   GA          + ++    G LEI+ +
Sbjct: 738  FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786

Query: 769  PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
            P  +  F+     S ++ + DT+   A                 Q        + + ++ 
Sbjct: 787  PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833

Query: 829  MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
            +         P L   L  G +  Y+A +    +N    D P +TS  +    ++A    
Sbjct: 834  LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
              R            +  +      +T  +    + G F +G RP W +   R  ++V+P
Sbjct: 892  IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951

Query: 948  QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQI 1007
                  + AFT          F+  T  G + +  +P    +D   P++ IP       +
Sbjct: 952  S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPDFQ-FDGPLPMRSIPRGRAYSNV 1009

Query: 1008 TYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1067
             +    +L   IV+   L+  +   S   D     + D  N+SS       +V+   + +
Sbjct: 1010 LFDPSTSL---IVAASSLQ--STFTSFDEDGNNIWEPDAPNISSP------SVDCSALEL 1058

Query: 1068 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAAR 1126
            + PD     W T       ++E    + +VTL    T+   +  +A+GT   +GED+A +
Sbjct: 1059 IAPDI----WATMDGFEFATNEYINDMTIVTLETAATETGTKDFIAVGTTIDRGEDLAVK 1114

Query: 1127 GRVLLFSTGRNADNPQNLVTEVYSKEL--------KGAISALASLQGHLLIASGPKIILH 1178
            G   +F        P   V++    +L        KG ++A+  L  +L+ + G KI + 
Sbjct: 1115 GATYIFEIAEVV--PDQAVSQRRWYKLRLRCRDDAKGPVTAVCGLSDYLVSSMGQKIFVR 1172

Query: 1179 KWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
             +   E L G+AF D   +YV SL  +KN +L+GD  KS+ F++++E   +L LL+KD  
Sbjct: 1173 AFDSDERLVGVAFMDVG-VYVTSLQTLKNLLLIGDAVKSVQFVAFQEDPYKLVLLSKDIQ 1231

Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
            S+     +FL   + L LV  DE+  I+I+ Y P+  +S +G+ LL   EFH G    ++
Sbjct: 1232 SVCVTRADFLFSENDLRLVTGDEEGIIRIYEYNPQDPDSREGRHLLLETEFH-GQR--EY 1288

Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
                ++A       + P S       LL G+ DGS+  +  ++E  F+RL  LQ +L+ +
Sbjct: 1289 RTSVLVAHRIKEDQSIPNS------RLLTGSADGSLASLTIVEEEAFKRLGLLQGQLMRN 1342

Query: 1358 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
            + H+A LNP++FR    N    +P    I+D  LL  YE LP+  Q E   Q G  R  +
Sbjct: 1343 IQHMAALNPKAFR-IVKNEYVSKPLTRGILDGNLLGQYESLPINRQSEATQQIGADRVNV 1401

Query: 1418 LSNLNDL 1424
            L +  +L
Sbjct: 1402 LRDWIEL 1408


>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor FP-101664
            SS1]
          Length = 1431

 Score =  290 bits (741), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 346/1404 (24%), Positives = 607/1404 (43%), Gaps = 181/1404 (12%)

Query: 103  LVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC 161
            LV  +RLHG V  L A+ +    ++  + D ++++F+DAKI++LE+ D+IH +   S+H 
Sbjct: 123  LVREHRLHGTVTGLEAVRTVHSLED--KLDRLLVSFKDAKIALLEWSDAIHDVMTVSIHT 180

Query: 162  FE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK--ASQGGSGLVGDED 218
            +E +P+ + L        RG L +VDP  RC  + +    + IL    SQ    L+  E 
Sbjct: 181  YERAPQLMALD---SPLFRGEL-RVDPLSRCAALSLPKDSLAILPFYQSQAELDLMEQES 236

Query: 219  TFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
            +      +S     S V++L  D+D  +++V DF F+ G+  P + +L + + TW GR+ 
Sbjct: 237  SQARDVPYSP----SFVLDLANDVDQRIRNVIDFAFLPGFNNPTVAVLCQYQQTWTGRLK 292

Query: 276  WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ- 334
                T  +   ++      +PLI +   LP+D   L    + IGGV ++ +N I +  Q 
Sbjct: 293  EYKDTVGLFIFTLDLVTNNYPLITAVDGLPYDCLSLTPCSTAIGGVFILASNAIIFVDQA 352

Query: 335  SASCALALNNY---AVSLDSSQELPRSSF-SVELDAAHATWLQNDVALLSTKTGDLVLLT 390
            S    L +N +      L      P+    +++L+ A  T++ +    +  K G +  + 
Sbjct: 353  SRRVILPVNGWPPRTSDLTMPSLTPQEQLRNLQLEGARFTFVDDKTLFVILKDGTVHPVE 412

Query: 391  VVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
            +V DG+ V RL ++      T P+V    +  + +   F+GS +G S+L++      T+ 
Sbjct: 413  LVLDGKTVSRLSMADALARTTIPAV----VARVRDDYLFVGSMVGPSVLLR------TAH 462

Query: 446  LSSGLKEEFGDIEAD-----APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK- 499
            +   +KEE  D++A      AP+         D      NGE+ S  G+ +   +S +K 
Sbjct: 463  VEEVIKEEDVDMDAGPATVVAPADTMDLDDDDDLYGPSGNGEQPSANGATNGTVDSVKKR 522

Query: 500  -TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------------ISKQSNY 539
                 ++ D+L   G + D ++GL  N D       +ATG             +  +S  
Sbjct: 523  TVVRLSLCDALPAHGAISDMAFGLARNGDRVVPELIAATGSGELGGFHLFQRDMPTRSKR 582

Query: 540  ELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLL 596
            +L  + G +G+W++   ++ +       R ++ +D     +IIS +A     +   A   
Sbjct: 583  KLHAIGGARGMWSLAVRQAMKVSGGTLERPSSQNDS----VIISTDANPSPGLSRIATRS 638

Query: 597  TEVTESVDYFVQGRTIAAGNLFGRRRVIQVF---ERGARIL--DGS--YMTQDLSFGPSN 649
                 ++   + G T+ A   F    ++ +        R+L  DG+   + +DL      
Sbjct: 639  AHSDIAITTRIPGTTLGAAPFFQGTAILHILFNVTNAIRVLEPDGTERQIIKDLE----- 693

Query: 650  SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
                  +    + S SI DP+VL+   D +I L +G+     +  +  + +        +
Sbjct: 694  ----GTAPRPKIKSCSICDPFVLIIREDDTIGLFIGELERGKIRRKDMSPMGDKTSRYVA 749

Query: 710  CTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI-------YSVVCYESGA 762
               + D           T   L T V E     +     QG +       + ++    G 
Sbjct: 750  GGFFTD-----------TSGLLQTFVNEQAPAENVTSTLQGAMNAGNKSQWLILVRPQGV 798

Query: 763  LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
            +E++ +P     F+     +    + D+Y   AL        S  ++   + ++ +I  +
Sbjct: 799  VELWTLPKLTLAFSTTLLATLDPILTDSYDGPAL--------SLPQDPPRKPQELDIDQI 850

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
             +  L   R      RP L  +L  G +  Y+A     P     S    + S +L V  V
Sbjct: 851  VIAPLGESR-----PRPHLIVLLRSGQLAVYEAVAIPPPPEPLPS----TRSSTLLVKFV 901

Query: 883  S-ASRLRNLRFSRTPLDAYTREE----------TPHGAPCQRITIFKNISGHQGFFLSGS 931
              AS+  +++       +   E+              AP Q  +         G F +G 
Sbjct: 902  KVASKAFDIQHPEEEQKSVLAEQKRISRLLVPFVTSPAPGQTFS---------GVFFTGD 952

Query: 932  RPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
            RP W +   +  ++V P      + AFT          F+  + +G   +  +P     D
Sbjct: 953  RPSWILSTDKGGVKVFPS-GHSVVQAFTTSSLWESRGDFLLYSEEGPSLVEWMPD-VQLD 1010

Query: 991  NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
             + P + +P ++ P+    F       LIV+    +  N+  S   D  V  + D+ N+S
Sbjct: 1011 GHLPARSVP-RSRPYSNVVFDAST--SLIVAASSFQ--NRFASYDEDGNVVWEPDSPNIS 1065

Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ET 1109
            S  L    T+E     ++ PD     W T        +E    +  + L   +T+   + 
Sbjct: 1066 S-PLCECSTLE-----LISPDG----WITMDGYEFAPNEFVNCIVSIPLETMSTESGMKD 1115

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASL 1163
             +A+GT   +GED+A +G V +F       +P   V   +  +L      KG +S L  +
Sbjct: 1116 FIAVGTTINRGEDLAVKGAVYIFEIVEVVPDPSTHVKRWWRLKLLCRDDAKGPVSFLCGI 1175

Query: 1164 QGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
             G+L+ + G KI +  +   E L G+AF D   +YV SL  VKN +++GD  KS++F+++
Sbjct: 1176 NGYLVSSMGQKIFVRAFDLDERLVGVAFLDV-GVYVTSLRAVKNLLVIGDAVKSVWFVAF 1234

Query: 1223 KEQGAQLNLLAKDFGSLDCF--ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
            +E   +L +L KD   L C   A  F  DG  LS+V  DE+  ++++ Y P   ES  GQ
Sbjct: 1235 QEDPYKLVVLGKD-PQLCCITRADLFFADGQ-LSIVTCDEEGIVRLYAYDPHDPESKSGQ 1292

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
             LL R EFH  +      R  ML     +     G  +  +  L+ G++DGS+  +  +D
Sbjct: 1293 HLLRRTEFHGQSE----YRSSMLVARRPKN----GDPEIPQARLVCGSVDGSLSTLTYVD 1344

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            E   +RL  LQ +L+ +V HVA LNP++FR    N    RP    I+D  LL+ +E LP+
Sbjct: 1345 EAASKRLHLLQGQLIRTVQHVAALNPKAFRMVR-NEYVSRPLSKGILDGNLLATFEDLPI 1403

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDL 1424
              Q E+  Q GT R+ +L +   L
Sbjct: 1404 ARQNEVTRQIGTDRATVLKDWASL 1427


>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1393

 Score =  290 bits (741), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 337/1401 (24%), Positives = 590/1401 (42%), Gaps = 208/1401 (14%)

Query: 131  DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
            ++I+LAF +AK++++E+D   +G+   S+H +E  +        +  + G ++ VDP  R
Sbjct: 88   EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147

Query: 191  CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
            C  V  +G++ + IL   Q G  LV D+      +  GS G                 A 
Sbjct: 148  CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206

Query: 230  IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              SS V+ L  LD  + H     F++ Y EP   IL+ +  T    +  +      +  +
Sbjct: 207  YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
            +    +    + S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++
Sbjct: 267  LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326

Query: 347  VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
              + S     +S  ++ L+      L   N   LL   TG++VL+    DGR V  + + 
Sbjct: 327  RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386

Query: 405  KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
               P           S    +G+   FLGS   DS+L+      G S+ SSG K+   + 
Sbjct: 387  PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440

Query: 455  GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
               E D+       +S  D  +D +     E+ + G   +        ++F   D L+NI
Sbjct: 441  RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500

Query: 513  GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
            GPLKD ++G    +          S  ELV                            +L
Sbjct: 501  GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560

Query: 545  PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
              C  +WT    +S  H  ++   +   A + E H Y+++S +A +   E +++     +
Sbjct: 561  ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613

Query: 602  SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
             +  F        +  TI  G L G+ RV+Q+     R  DG     DL      P   E
Sbjct: 614  ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
              S  E    +S S+ DPYV +   D ++ LL  D S     V+    I +SK   +SC 
Sbjct: 669  DTS--EERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724

Query: 712  LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
            LY DK                TG+  +I  A    L Q  +   +  +   L I+ +P+ 
Sbjct: 725  LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767

Query: 772  NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
              +      + G   +      E  K S T              +E +  + V +L    
Sbjct: 768  KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808

Query: 832  WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
            WS   S P+L        +  Y+ ++      T    +P +    L  +N+   R+    
Sbjct: 809  WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857

Query: 892  FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
             +    D  + EE     P   + I  NISG    F  G  P + +           L  
Sbjct: 858  -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGI------LKICQLPSGSTYDNYWP--VQKIPLKAT 1003
            G   + +      C  GFI + S+ +      L  C L   +   +Y+P  +Q+IP+   
Sbjct: 914  GYAQSLSKFQTSECGEGFILLDSKVLCFILLCLTYCILSFHTGCHSYYPWTIQQIPIGEQ 973

Query: 1004 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY 1063
               + Y +   +Y +  S            L  D E+  +  N   S         V+  
Sbjct: 974  VDHLAYSSSSGMYVIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRS 1022

Query: 1064 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGED 1122
             ++++ P      W    T+    +E+ + V+ ++L  +  T E + ++ +GTA+ +GED
Sbjct: 1023 SLKVVSPKT----W----TVIDSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGED 1074

Query: 1123 VAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPKI 1175
            +A+RG V +F   +   +P+    +     V  + +KGA++AL+ +  QG L++A G K 
Sbjct: 1075 IASRGCVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKC 1134

Query: 1176 ILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQ 1228
            I+   K  G+ L  +AF D        +++VK        ++ D  K ++F  + E+  +
Sbjct: 1135 IVRGLKEDGSLLP-VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYK 1189

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            ++L AKD   L+  A +FL DG+ L ++V+D   N+ +  Y P+  +S  G +LLSR++F
Sbjct: 1190 MSLFAKDLDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKF 1249

Query: 1289 HVGAHVTKFLRLQMLATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
            H G  ++    L   + SS++      A     K  R  +L  + +GS+G +  + E ++
Sbjct: 1250 HTGNFISTLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESY 1309

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
            RRL +LQ +L +++ H  GLNPR+FR   S+G A R     ++D +LL  +  +  + ++
Sbjct: 1310 RRLSALQSQLTNTIEHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRKV 1365

Query: 1405 EIAHQTGTTRSQILSNLNDLA 1425
            EIA + G    +I ++   ++
Sbjct: 1366 EIASRVGANEWEIKADFEAIS 1386


>gi|170102106|ref|XP_001882269.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642641|gb|EDR06896.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1406

 Score =  288 bits (738), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 337/1469 (22%), Positives = 606/1469 (41%), Gaps = 226/1469 (15%)

Query: 57   NLVVTAANVIEIYVVR-----VQEEGSKESKNSGETKR-------RVLMD---------- 94
            N+VV  +N++ I+ VR     +Q +   E +   + +R        V MD          
Sbjct: 51   NVVVARSNLLRIFEVREEPCPIQNQADDERERRSKVRRGTEAVEGEVAMDEQGDGFINIA 110

Query: 95   --------GISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFEDAKIS 143
                      +      V  + LHG    +E + I+S        R D ++++F+DAKI+
Sbjct: 111  KSQKCPTHTPTVTRFYFVREHHLHGIVTGIEGVKIMSS----LEDRLDRLLISFKDAKIA 166

Query: 144  VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            +LE+ D++H L   S+H +E +P+ + +     S  R  L + DP  RC  + +    + 
Sbjct: 167  LLEWSDAVHDLITVSIHTYERAPQLMSID---SSLFRTEL-RTDPISRCAALSLPRHALA 222

Query: 203  ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
            IL   Q  + L V D+D              S +++L    D ++++V DF F+ G+  P
Sbjct: 223  ILPFYQSQAELEVMDQD---QSQAKDVPYSPSFILDLPAQVDQNIRNVIDFAFLPGFNNP 279

Query: 259  VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
             + +L + + TW GR+     T  +   ++    + +P+I S   LPH+   LL   + +
Sbjct: 280  TIAVLFQTQQTWTGRLREFKDTVRLVIFTLDIVTQNYPIITSVEGLPHECLALLPCGTSL 339

Query: 319  GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPRSSFSVE-------LDAAHAT 370
            GGV+++ +N I Y  QS+    L +N +   +    ++P  S + E       L+ + A 
Sbjct: 340  GGVVIITSNAIIYTDQSSKRVVLPVNGWVSRI---SDIPLPSLTPEEQLRNICLEGSRAV 396

Query: 371  WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLG 426
            ++ +    +  K G +  L +V DG+ V +L +S   P +  + I ++   L    F +G
Sbjct: 397  FVDDRNLFVILKDGTVYPLEIVVDGKTVSKLTMS---PPLAQTSIPSVLRKLDDDHFLVG 453

Query: 427  SRLGDSLLVQFTCGSGTSMLSSGLKEEFG---DIEADAPSTKRLRRSSSDALQDMVNGEE 483
            S +G S+L++          ++ ++EE     D+EA AP+T        +   D  N   
Sbjct: 454  SSVGPSVLLK----------AAHIEEEVAEDHDMEA-APATVVYDADDMEFDDDDGNLPR 502

Query: 484  LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQS 537
            ++          +       ++RDSL   GP+ D ++ L  N D       +ATG     
Sbjct: 503  VA-------QPMAKPTVIHLSLRDSLPAYGPISDMTFSLAKNGDRPVPELVAATGSGFLG 555

Query: 538  NYELVE--LP-----------GCKGIWTV------------YHKSSRGHNADSSRMAAYD 572
             + L +  LP           G +G+W++            Y K+     A++  +    
Sbjct: 556  GFTLFQRDLPVRTKRKLHVIGGARGLWSLPIRQPVKASGISYEKAVNPFQAENDSLIIST 615

Query: 573  DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            D  +    +S   +  V+ TA             + G TI A   F R  V+ V     R
Sbjct: 616  D-INPSPGLSRAGKNDVMITAR------------IPGTTIGAAPFFQRTTVLHVMTNALR 662

Query: 633  ILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            +L+ G  + +D+                 + + SI+DP+VL+   D SI L +G+     
Sbjct: 663  VLEPGMQIIKDMD---------GNMPRPRIRACSISDPFVLILREDDSIGLFIGETERGK 713

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGP-EPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
            +  +  + +        SC      G  E     ++T   +++ +  A++    G     
Sbjct: 714  IRRKDMSPMGDK----VSCFYTDTTGLLESNFENSTTPVGVTSTLSAAVNAGSKGQ---- 765

Query: 751  DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
              + ++    G +E++ +P     F+ D   S +  +VD++   A               
Sbjct: 766  --WLILVRPQGIVELWTLPKLTLGFSADGLTSLQNVLVDSHDPPA-------------PS 810

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
              Q          V ++ +        RP L   L  G +  Y+                
Sbjct: 811  LPQDPPRKPQEFDVEQILVAPIGESSPRPHLCVFLRSGQLTIYEVLPLG----------- 859

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGH------- 923
              T+ +L     +  +++ ++ S    +    EE   G   ++  I++            
Sbjct: 860  -RTTEALPKVRPAHVKIKFVKISSMAFEIQRPEEGEKGIIAEQKRIYRMFVPFVTSASPG 918

Query: 924  ---QGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
                G F +G RP W     +  ++++P      + AFT          F+  T +    
Sbjct: 919  VTFSGVFFTGDRPNWIFGTDKGGVQIYPS-GHAVVNAFTPCSLFESKGDFLMYTEEA--S 975

Query: 980  ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
            + +      YD   P++ +P       + +    +L   +V+   L+   +  S   D  
Sbjct: 976  VSKWLPDFHYDGPLPLRSVPRGRAYSSLVFDPSTSL---LVAASSLQ--AKFASYDDDDN 1030

Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
               + +  N+ +  +  T T+E     ++ PD     W T       ++E    V  VTL
Sbjct: 1031 KIWEPETPNIGN-PMCDTSTLE-----LISPDM----WITMDGFEFATNEYINDVACVTL 1080

Query: 1100 FNTTTK-ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL----- 1153
                T+  ++  +A+GT   +GED+AARG   ++       +P       Y   L     
Sbjct: 1081 ETAGTEVGSKDFIAVGTTIDRGEDLAARGATYIYEIVEVVPDPAISPKRWYKLRLRCRDD 1140

Query: 1154 -KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
             KG ++A+    G+L+ + G KI +  +   E L G+AF D   +YV SL  +KN +L+G
Sbjct: 1141 AKGPVTAVCGFHGYLVSSMGQKIFVRAFDSDERLVGVAFMDVG-VYVTSLRTLKNLLLVG 1199

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D  KS+ F++++E   +L LL KD   +     +F      LSLV  DE+  ++++ Y P
Sbjct: 1200 DAVKSLSFIAFQEDPYKLVLLGKDTQHVCVTNADFFFTDGELSLVTGDEEGIMRMYEYNP 1259

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1331
            +  +S  G+ LL R EFH  +        +   T + R    P   +     L+ G  DG
Sbjct: 1260 QDPDSKDGRYLLLRTEFHGQSE------YRTSTTIARRLKDDPSIPQAK---LIIGGTDG 1310

Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
             +  + P++E  F+RLQ LQ +L  ++ HVAGLNP++FR    N    +P    I+D  L
Sbjct: 1311 CLSSLTPVEEHAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVR-NDFVSKPLSKGILDGNL 1369

Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
            L+HYE LP+  Q E+  Q GT R  +L +
Sbjct: 1370 LAHYESLPIIRQNEMTRQIGTDRVTLLRD 1398


>gi|449543656|gb|EMD34631.1| hypothetical protein CERSUDRAFT_116804 [Ceriporiopsis subvermispora
            B]
          Length = 1440

 Score =  288 bits (736), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 358/1487 (24%), Positives = 625/1487 (42%), Gaps = 226/1487 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDGISAASLE- 102
            N+VV  ++++ I+ VR +E     S+   E +RR             V MDG     L  
Sbjct: 49   NVVVARSSLLRIFEVR-EEPAPISSQKEDERERRASVRKGTEAVEGEVEMDGSGEGFLNM 107

Query: 103  ---------------------LVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFE 138
                                 L+  +RLHG    +E + I++        R D ++++F+
Sbjct: 108  GSVKSTAQNGSVQPPTINRFYLIREHRLHGIVTGIEGVRIVTS----LEDRLDRLLVSFK 163

Query: 139  DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
            DAKI++LE+ D++H L   S+H +E +P+ + L    +S    P ++ DP  RC  +L+ 
Sbjct: 164  DAKIALLEWSDAVHDLVTVSIHTYERAPQLMAL----DSSLFRPTLRADPLSRCAALLLP 219

Query: 198  GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
               + IL   Q  + L V ++DT             S +++L    D  +++V DF+F+ 
Sbjct: 220  RDSIAILPFYQSQAELDVVEQDT---SQLRDVPYSPSFIVDLSAEVDDRIRNVIDFVFLP 276

Query: 254  GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
            G+  P + +L +++ TW GR+     T  +   ++    + +P+I S   LP+D + + A
Sbjct: 277  GFNNPTIAVLFQKQQTWTGRLREYKDTVSLYIFTLDLVTRNYPVITSTEGLPYDCFAVAA 336

Query: 314  VPSPIGGVLVVGANTIHYHSQSA-SCALALNNY-------AVSLDSSQELPRSSFSVELD 365
              + +GGV+++ +N I Y  QS+   AL +N +        V   S+QE  R    + L+
Sbjct: 337  CSTALGGVVILASNAIIYVDQSSRRVALPVNGWPPRVSDMPVQALSAQEQLR---DLRLE 393

Query: 366  AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGN 420
             +H  ++ +    +  K G +  + +V DG+ V +L +S      T P+V    +  +  
Sbjct: 394  GSHFVFVDDRTLFIILKDGTVYPVELVLDGKSVSKLTMSSAVARTTIPTV----VRRVQT 449

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
               F+GS +G S+L++      T+ +   + +E  D+E     T  +     D+   M  
Sbjct: 450  DHLFIGSTVGPSVLLK------TARVEEDIADE--DVEMSVAPTAVV-----DSTDTMDL 496

Query: 481  GEELSLYGSASNNTE------------SAQKT-FSFAVRDSLVNIGPLKDFSYGLRINAD 527
             +E  LYGS    T             S ++T    ++ DSL   GP+ D ++ L  N D
Sbjct: 497  DDEDDLYGSTKETTHRVDGLVNGAADASKKRTVVHLSLCDSLPAHGPIADMTFALAKNGD 556

Query: 528  ------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRM 568
                   +ATG      + L +  LP           G +G+W++  + +   N  +   
Sbjct: 557  RAVPELVAATGSGTLGGFTLFQRDLPTRVKRKLHAIGGGRGMWSLPVRQAVKVNGSTYEK 616

Query: 569  AAYDDEYHAY---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
             A  + +H+    +IIS +A     +   A        ++   + G TI A   F    +
Sbjct: 617  PA--NPFHSVNDSVIISTDANPSPGLSRIASRNQNGDITITTRIPGTTIGAAPFFQGTAI 674

Query: 624  IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
            + V      ++    +  D S      +         + + SI DP+VL+   D +I L 
Sbjct: 675  LHVMYNVTNVI--RVLEPDGSERQIIKDVDGNVARPKIRACSICDPFVLIIREDDTIGLF 732

Query: 684  VGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGP-EPWLRKTSTDAWLSTGVGEAIDG 741
            +G+P    +  +  + + + + + ++ C      G  +  L   +     +T   ++   
Sbjct: 733  IGEPERGKIRRKDMSPMGDKTSRYLTGCFFTDTTGTFQTHLNPLAAGTEAATSTLQS--A 790

Query: 742  ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
             + G   Q   + ++C   G LEI+ +      F+     S  + +VDTY    L     
Sbjct: 791  INAGSRSQ---WLILCRPQGTLEIWTLSKLTLAFSTTLIPSLESVVVDTYDVPHL----- 842

Query: 802  EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
               S  ++   + ++ +I  + V  L          RP+L   L  G +  Y+      P
Sbjct: 843  ---SLPQDPPRKPQELDIEQIVVAPLG-----ESSPRPYLTVFLRSGQLAVYETIPVAPP 894

Query: 862  ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
                   DP+  SRS ++          +RF +    A+  ++         +   K IS
Sbjct: 895  A------DPLPNSRSCTIL---------VRFRKVLSKAFDIQQQNEEVEKSVLAEQKRIS 939

Query: 922  ------------GH--QGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIV-AFTVLHNVNC 965
                        G    G F +G RPCW +   +  ++V P     S+V AFT       
Sbjct: 940  RLLIPFVTSPNPGQTLSGVFFTGDRPCWILSTDKGGVKVFPS--GHSVVHAFTASSVWES 997

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
               F+  + +G   +  +P G   D + P + +P       + Y    +L      V   
Sbjct: 998  KSDFLLYSEEGPSLLEWIP-GVQLDGHLPSRTVPRNKAYSNVVYDPSTSLI-----VAAS 1051

Query: 1026 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1085
               ++  S   D  +  + D  N+ S+    T T+E     +L PD     W T      
Sbjct: 1052 SSQSRFASYDEDGNIVWEPDASNI-SLPFCETSTLE-----LLSPDG----WVTLDGYEF 1101

Query: 1086 QSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1144
              +E    +  VTL  ++T+   +  + +GT   +GED+A +G   +F       +P   
Sbjct: 1102 APNEFVNCLDCVTLETSSTESGTKDYIVVGTTINRGEDLAVKGAAYVFEIIEVVPDPTAQ 1161

Query: 1145 VTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLY 1197
            +   +  +L      KG ++A+  + G+L+ + G KI +  +   E L G+AF D   +Y
Sbjct: 1162 MKRWHRLKLHCRDDAKGPVTAMCGMNGYLVSSMGQKIFVRAFDLDERLVGVAFLDV-GVY 1220

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            V SL  VKN +++ D  KS++F++++E   +L +L KD   L     +F      +S++ 
Sbjct: 1221 VTSLCAVKNLLVISDAVKSVWFVAFQEDPYKLVILGKDPYPLYVTKADFFFAEGRVSIIS 1280

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH--VGAHVTKFL--RLQMLATSSDRTGAA 1313
             DE   ++I  Y P   ES  GQ LL R EFH  V    +  L  RL+ +     R    
Sbjct: 1281 CDEDGVMRILEYDPHDPESKNGQHLLRRTEFHGQVEYRTSAILARRLKGVDIPQSR---- 1336

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
                      L+ G  DGS+  +  ++E   +RL  LQ +L  +V HVAGLNPR FR   
Sbjct: 1337 ----------LICGLTDGSLITMTYVEEAASKRLHLLQGQLTRNVQHVAGLNPRGFRIVR 1386

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
             N    RP    I+D  LL  YE LP+  Q E+  Q GT R+ IL +
Sbjct: 1387 -NDYVSRPLTRGILDGNLLMAYEDLPIVRQDEVTRQIGTDRTTILKD 1432


>gi|407929511|gb|EKG22329.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
            phaseolina MS6]
          Length = 1418

 Score =  287 bits (735), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 343/1434 (23%), Positives = 584/1434 (40%), Gaps = 216/1434 (15%)

Query: 99   ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
            + L LV  Y L G V SLA +     D     D++++AF DAK+S++E+D + H L   S
Sbjct: 81   SKLVLVAEYPLEGTVLSLARIK--ALDTKSGGDALLIAFRDAKMSLVEWDPANHALSTIS 138

Query: 159  MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-GDE 217
            +H +E  E        +       +  DP  RC  +      + IL   Q G  LV GD+
Sbjct: 139  IHYYEGEELHGAPWDADLGHYHNFLAADPSSRCAALKFGARHLAILPFRQLGDDLVEGDD 198

Query: 218  ---------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
                           +   +G       +SS  ++L  +D  + H     F+H Y EP  
Sbjct: 199  YDPDFDEPMDAPAAKEKATNGDVAQTPYKSSFALSLPQIDPALTHPVHLDFLHEYREPTF 258

Query: 261  VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
             I+   +   A  +  +      +  ++    K    + S   LP+D +K++ +P P+GG
Sbjct: 259  GIISANKAAAASLLYERRDLLTYTVFTLDLEEKASTALLSVAGLPYDTHKVIPLPLPVGG 318

Query: 321  VLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVA 377
             L++GAN  IH      + A+A+N++A    S     +S  ++ L+ A    L  +N   
Sbjct: 319  ALLLGANQFIHVDQAGKTSAVAVNDFAKQCSSFPMSDQSELAMRLEGASIELLSPENGDL 378

Query: 378  LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLG 430
            L+  K G L +++   DGR V  L + K +            S  T++G +  F+GS  G
Sbjct: 379  LVVLKDGSLAVISFKLDGRSVSGLSIRKISEEKGGHVVPTAASCTTSLGRNRMFIGSEDG 438

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
            DS+L+ +T     + LS   K    ++ AD            D      +G   +   SA
Sbjct: 439  DSVLLGWT--KKAAQLSR--KRSHAEMLADDAELSFDEEDLEDDDDLYGDGPSTAKTASA 494

Query: 491  SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN-YELV------- 542
            S+   S    ++F + D ++++ P+KD +       D +   + + ++  +LV       
Sbjct: 495  SSEA-SDPSNYTFRIHDIMLSLAPIKDVALASHKVTDTAIGTLERAADQLDLVVSTGRGA 553

Query: 543  -------------------ELPGCKGIWTVYHK--SSRGHNADSSRMA----AYDDEYHA 577
                               E    + +W+V+ K  + +G  A  S+ A    A D +Y  
Sbjct: 554  AGGLALMRREIDPVILRKGEFSNARAVWSVHAKKPAPKGMVAAGSQDAEAKLAADVDYDQ 613

Query: 578  YLIISL------EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +LI+S       E   +   TA    E  +         TI  G + G  R++QV +   
Sbjct: 614  FLIVSRSNGDGGEESAIFNITATGFEETNKGDFEREDAATINVGTIAGGTRIVQVLKAEI 673

Query: 632  RILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC 690
            R  D    + Q L   P   E+GS      ++S S ADPY+L+   D S+ +L  D +  
Sbjct: 674  RSYDSELGLDQIL---PMEDENGS---ELRIISASFADPYILVIRDDSSVIVLQADANGE 727

Query: 691  TVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
               +     + S+K                         WLS  + ++    +       
Sbjct: 728  MEEIDRGDTLLSTK-------------------------WLSGCIHQSQSTGEKA----- 757

Query: 751  DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
               + +    G L IF++P+ +    V   +         ++   L    T   SSS   
Sbjct: 758  --LAYLLSAEGGLHIFELPDLSKPVYVAASLG--------FLPPTLTADFTPRRSSS--- 804

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                 K  +  + V EL      + +  P+L    +   ++ YQ Y F   E        
Sbjct: 805  -----KAALTEVIVAELG----DSTYKTPYLIVRTSSNDLVIYQPYHFPAHEVVKP---- 851

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
                 +L    +   RL    FS  P  A   E+T  G      TI  N+ G+   F++G
Sbjct: 852  --FFENLRWLKIPQPRLPE--FSEEP--ALESEDTGIGKESILTTI-ANVGGYSAVFMAG 904

Query: 931  SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY- 989
            + P + +     L    ++   S+   +  H   C+ GF Y+ + G L++CQLP G  Y 
Sbjct: 905  TSPSFILKESSSLPRVIKMRTKSVKNLSSFHRAECDRGFAYINADGNLRVCQLPRGYRYG 964

Query: 990  DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
            D  W V+KI +      + Y   K++  L++ V   KP       L + E  H+    N+
Sbjct: 965  DAGWAVKKISINQDVQAMCYHPPKDV--LVLGVGDKKPFT-----LPEDEHHHEWLEENI 1017

Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATI---PMQSSENALTVRVVTL-FNTTTK 1105
            +   +     VE+  +++L+        Q+ A I    +++ E  LT++V+ L  +  T 
Sbjct: 1018 TFKPM-----VEQGMIKVLDT-------QSLAVIDTYELEAFEVVLTIKVLNLEVSENTH 1065

Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1160
            E + L+A+GT +++GED+ +RG + +F        P    T      +  +E++G+++A+
Sbjct: 1066 ERKQLVAVGTGFIRGEDLPSRGCIYVFEVINVVPEPGRPETNRRLKLIAKEEVRGSVTAI 1125

Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV-KNFILLGDIHK 1215
              +  QG LL+A G K ++   K  GT L  +AF D      V+  +     +L+GD  K
Sbjct: 1126 TDVGSQGFLLMAQGQKCMVRGLKEDGTLLP-VAFMDMQCYVTVAKELNGSGMLLMGDAAK 1184

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
              +F+ + E   ++ L  K    ++  A +FL     L L+V+D   N+    Y P   +
Sbjct: 1185 GAWFVGYTEDPYKMILFGKSRSKMEVMAADFLPHDKQLYLMVADGDCNLHALQYDPDHPK 1244

Query: 1276 SWKGQKLLSR---------------------------AEFHVGAHVTKFLRLQMLATSSD 1308
            S  GQ+LL +                           A+ H   HV+     +  A   D
Sbjct: 1245 SLSGQRLLHKSTFHTGHFTTTMTLLPSSLSPTVSPSSADEHANGHVSPSPSPENDAMDID 1304

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                AP     +   +L  T  GS+  + PL E  +RRL +LQ  L+ ++ H  GLNPR+
Sbjct: 1305 ---PAPAGTVQH---ILLTTQTGSLALLTPLSEQQYRRLGALQTYLIGALEHWCGLNPRA 1358

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
            +R   S G   R     IVD  LL+ +  L  + + E A + G     + S+L 
Sbjct: 1359 YRAVESEGFGSR----GIVDGALLARWCELGSQRRAEGAAKVGVEEWVVRSDLE 1408


>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1673

 Score =  286 bits (732), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 219/774 (28%), Positives = 362/774 (46%), Gaps = 125/774 (16%)

Query: 753  YSVVCYESGALEIFDVPNFN--CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
            +  V  ++G L ++ +P+F   C F +  F +     +D  +  +     T  N++ +E 
Sbjct: 930  WCFVYTDTGHLLVYTLPDFKECCAFPL--FSTLPVLAMDVPLWRSRSIDSTFANTTGDE- 986

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                       + VV L     S     P+L  +  +G +  Y+  +F  P  TS +DD 
Sbjct: 987  --------FEEILVVNLGN---SKDRQTPYLVCLAANGDLAVYK--IFVCP--TSSNDDD 1031

Query: 871  VS--------TSRSLSVSNVSASRLRN---LRFSRTPLDAYTRE---------------E 904
             S         SR+ +   + A  L+    +R  R P D  TR+               +
Sbjct: 1032 TSFVNSGTFKQSRTPAELELDAQNLKKRLAIRLVRIPHDQITRDLQFYTDNEGDKIDLVQ 1091

Query: 905  TPHGAPC----QRITIFKNI--SG---HQGFFLSGSRPCWCMVFRER------------- 942
             P   P     Q +  F  I  SG   + G  ++GSRPCW MV  +              
Sbjct: 1092 EPQHQPTFLKRQHLKPFDAIGWSGGNMYSGVVVTGSRPCWIMVALQSRQQDLDVISFDNS 1151

Query: 943  -----------------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
                             LR HP   DG +  F  LHNVN  HGF+Y+  +G+ +ICQLP 
Sbjct: 1152 VACSTKLPPVPLLGTNMLRFHPMPVDGPMKCFAPLHNVNVAHGFLYINWKGLFRICQLPP 1211

Query: 986  GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS------VPVLKPLNQVLSLLIDQE 1039
               +D+ WPV K+P+  T H++ Y      Y +  S      +P  +  + V + +ID+ 
Sbjct: 1212 QFNFDHDWPVCKVPIHKTVHKVAYHYSSQTYAIATSTPERFDIPHAQYASAVAAAVIDE- 1270

Query: 1040 VGHQIDNHNLSSVDLHR---------TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1090
             G ++ +       +             TV+ Y++ ++    +   W+T  +I +  +E 
Sbjct: 1271 -GDEMPDAERKVTGIRELSEIKPGMYEATVDRYKIELV----SSVTWETVDSIELSEAET 1325

Query: 1091 ALTVRVVTLFNTTTKENETL-LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
             + +  V L +  T   + L LAIGT Y +GED+++RG++ L+       +P N  T   
Sbjct: 1326 VMALEAVDLSSKETISGKKLYLAIGTGYSRGEDLSSRGKLHLYDVIEVVPDPNNPQTNRK 1385

Query: 1148 ---VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
               V S++ +   SA+ ++  +LL A GPKII+++    E+ G+AF D   ++V SL+ V
Sbjct: 1386 FKHVDSEDDRSPFSAICTVNDYLLAAIGPKIIMYQLEDGEITGVAFLDVN-VFVTSLSSV 1444

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
            KN I + DI KS++F++++E+ A+L +L +D   L  +A   LID + L+L+V+D  KN+
Sbjct: 1445 KNLIQICDIQKSVWFVAFQEEPAKLAVLGRDVHPLQGYAANMLIDDNQLALLVADGDKNL 1504

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
                YAP   +S  G++L+ + E H+G HV+KF+R++       R  A   S +    A 
Sbjct: 1505 HTMIYAPDNVQSLGGERLIRKGEIHLGQHVSKFIRMRRKPLL--RNDAIVFSKQYLNVA- 1561

Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS-------NGK 1377
               TLDG++  I P+ E  F+RL  L  ++V S+ H+AGLNPR FRQ          +G 
Sbjct: 1562 --ATLDGALEIITPVSERIFKRLYGLYSRMVTSIEHIAGLNPRGFRQAQHRVRPITLSGF 1619

Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
               PGP  I+D +LL  Y  L   +Q  +A   G+   +++ +L ++  G  F 
Sbjct: 1620 IGPPGPRGILDGDLLYEYVRLSRTQQRGLAKAIGSKDDRLMDDLLEVLTGLDFF 1673



 Score =  197 bits (502), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 181/744 (24%), Positives = 330/744 (44%), Gaps = 152/744 (20%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           AA LEL   +R+HGN+ SL ++     + S + D+++L+F++AK+S++E+      L   
Sbjct: 87  AACLELAAQFRVHGNITSLGVVPM---NYSGKADALLLSFKEAKMSLVEYSQFTQKLVTV 143

Query: 158 SMHCFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
           SMH FE  E+  L     S  R P  +KVDPQG C  + +YG ++ IL   Q G+ L+ D
Sbjct: 144 SMHYFEREEFKKLG----SIDRPPPEIKVDPQGYCAAMRIYGDRLAILPFKQDGADLLND 199

Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
            +   S   F   I    V+   DLD  ++++ DF F+ GY  P + I+++ E TW  R+
Sbjct: 200 LNDANSKYPFRPSI----VLPFLDLDKSIRNIIDFTFLFGYAVPTIAIMYQTEQTWTARL 255

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--- 331
             +  T  I+ +S+ T  + +P+++    LP++   L++VP+PIGG++V+  N I +   
Sbjct: 256 GIRKDTVSIAVISLDTAEESYPVLYKIEKLPYNCTMLVSVPTPIGGLIVLSHNAIIFTDQ 315

Query: 332 -HSQSASCA----------LALNNYAVSLDSSQELP---------------RSSFSVELD 365
            H+   +C           + L  Y + LD  Q  P                   ++ LD
Sbjct: 316 IHAPGIACIVNAYFDSETNIMLTPYELQLDMVQPRPPRPPSVFFAQNKYTDYKELAISLD 375

Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRV----------VQRLDLSK---------- 405
            +   ++  D+ LL  + G+++ + ++ +  V          V+   L++          
Sbjct: 376 GSRGMFISPDIFLLVLRDGEMIQVDLIGEEGVGRSWKRRKGGVKSFQLTRLGIRMTAPVH 435

Query: 406 -------TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE 458
                  +NP  L+   +++     FLGSR G  L   +   S  +  +  L  +F ++E
Sbjct: 436 LFPLADASNPLSLSGRNSSVPLGGSFLGSR-GSKLRYNYLFASSRTTDACLL--QFVEVE 492

Query: 459 ADAPSTKRLRRSSSDALQDMVNGE----ELSLYGSASNNTES------------------ 496
             A S+  +  +++  + +  NGE    +  LYG ++   ++                  
Sbjct: 493 EFAKSSVSMNGAAN--MNNTDNGEDDELDKDLYGDSTTAKQTDTDMSALLSSDEHGHGEI 550

Query: 497 -AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD---------------ASATG-------- 532
            +++T  F + DS+  + PL+DF+ GL                     +ATG        
Sbjct: 551 VSEQTLRFRLCDSVTVVSPLRDFAVGLPAETSEHRFSPKIGGCDLEIVAATGHGPHGHLA 610

Query: 533 -ISKQSNYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
            +++    ++V   ELP  + +WT+        + D   ++   D +H Y+I+S  + T 
Sbjct: 611 ILNRSVRPQIVTTFELPQIEEMWTI---RCAKFDKDYRLVSEPTDAFHKYVILSHSSGTS 667

Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF--- 645
           +L+  +  TE+ ++  ++  G T+  G L     ++QV   G  + D  +   D +    
Sbjct: 668 ILKAGEAFTEMDDTT-FYQAGPTVGVGALLDETIIVQVHPNGVILFD--FSKYDFTIIDR 724

Query: 646 --------------GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
                         G    E   G ++  V+S S  DPY +L M+ G I LL  D +T  
Sbjct: 725 LNTNRMHALYIFVEGTKLQEMRVGDDDIWVISCSFMDPYAMLLMNTGHIVLLSLDETTHQ 784

Query: 692 VSVQTPAAIESSKKPVSSCTLYHD 715
           ++  +    E  K+ VS+ +LY D
Sbjct: 785 ITQIS----EYKKRLVSTFSLYCD 804


>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
            musculus]
          Length = 1311

 Score =  285 bits (729), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 202/678 (29%), Positives = 317/678 (46%), Gaps = 88/678 (12%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 703  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 758

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 759  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 808

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 809  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 868

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 869  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 928

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 929  PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 981

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T     
Sbjct: 982  DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1032

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
                    V G                                LKG ++A   L QG  +
Sbjct: 1033 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1052

Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
               G +I L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++E+   
Sbjct: 1053 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1110

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL RA+F
Sbjct: 1111 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1170

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
            HVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P+ E T
Sbjct: 1171 HVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKT 1223

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            +RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L   E+
Sbjct: 1224 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1283

Query: 1404 LEIAHQTGTTRSQILSNL 1421
             E+A + GTT   IL +L
Sbjct: 1284 SELAKKIGTTPDIILDDL 1301



 Score =  283 bits (724), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 196/601 (32%), Positives = 310/601 (51%), Gaps = 68/601 (11%)

Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
           +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G       P V+VDP 
Sbjct: 11  KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQNVHTPRVRVDPD 67

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HV 246
           GRC  +L+YG ++++L   +     + +E     G G  +    S++I++R LD K  ++
Sbjct: 68  GRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNI 124

Query: 247 KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
            D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K HP+IWS  +LP 
Sbjct: 125 IDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPF 184

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELD 365
           D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +     +    + LD
Sbjct: 185 DCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLD 244

Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFF 424
            A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT+ + T+     F
Sbjct: 245 CAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLF 304

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-----SSDALQDMV 479
           LGSRLG+SLL+++T        SS    E  D E      KR+  +          QD V
Sbjct: 305 LGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVGWTGGKTVPQDEV 362

Query: 480 NGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------L 522
           +  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G                L
Sbjct: 363 D--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDL 418

Query: 523 RI------NADASATGISKQSNYELV---ELPGCKGIWTVYH----------KSSRGHNA 563
            I        + + + + K    ++V   ELPGC  +WTV            K+      
Sbjct: 419 EIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAESTEQE 478

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
            S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R +
Sbjct: 479 PSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYI 537

Query: 624 IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
           +QV   G R+L+G      L F P +         + ++  ++ADPYV++  ++G + + 
Sbjct: 538 VQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMF 587

Query: 684 V 684
           +
Sbjct: 588 L 588


>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
          Length = 1408

 Score =  285 bits (729), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 339/1431 (23%), Positives = 604/1431 (42%), Gaps = 196/1431 (13%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV  Y L G +  L  +     D+    +++++A  +AK+S++E+D   H +  TS+H
Sbjct: 65   LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122

Query: 161  CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
             +E  + +++     + A  P  + VDP  RC  VL +G + + IL   Q G  LV    
Sbjct: 123  YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180

Query: 215  -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                              DE    +G  F     SS V+ +  L+  M H     F++ Y
Sbjct: 181  DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240

Query: 256  IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
             EP   IL+ +  T +  +  +      S  ++    +    + S   LP+D +K++ +P
Sbjct: 241  REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300

Query: 316  SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
             P+GG L++G+N  +H      + A+ +N +A    S     +S   + L+ +    L  
Sbjct: 301  PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360

Query: 373  QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
            +N   LL    G + +L+   DGR V  + L     +   S+L +  +    +     F 
Sbjct: 361  ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
            GS  GDS+L+ ++  S  +      +   G+I   +           D            
Sbjct: 421  GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480

Query: 478  MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
            M  G    + +S+ G+  N+       + F + D L N+GP++D + G      + D   
Sbjct: 481  MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533

Query: 531  TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
               S  +N ELV   G                            G  +VY K  +  +  
Sbjct: 534  PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593

Query: 565  SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
             S        Y  YL++S      + +++V   +    E T++ ++   + RTI  G L 
Sbjct: 594  GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653

Query: 619  GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
               RV+QV +   R  D G  + Q       +      SE  +V+  S ADPYVL+   D
Sbjct: 654  SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708

Query: 678  GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
             SI LL  D S      +T   I S+     S +LY DK                     
Sbjct: 709  QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDKY-------------------R 747

Query: 738  AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
            + +  +G P + Q D +   +      L +F +PN    VFT +                
Sbjct: 748  SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793

Query: 795  ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
               D   +I S+         +E I  + V +L      +    P+L    ++  +  Y+
Sbjct: 794  ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
             Y +     TS ++   S  R + ++N    +      S + ++ +    T    P +  
Sbjct: 847  PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895

Query: 915  TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             +  ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +
Sbjct: 896  -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
              ++++C+ P  + +D  W  +KI L      + Y +    Y +  +  V        +L
Sbjct: 955  DNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEYSSSSETYVIGTNQKV------DFNL 1008

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
              D E+  +  N  +S +       +++  V++L P      W    +  ++++E  + V
Sbjct: 1009 PEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMCV 1059

Query: 1095 RVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEV 1148
            + + L  +  T E +  + +GTA  +GED+AARG + +F   +     D P+    +  +
Sbjct: 1060 KCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEVIKVVPEVDRPETNRKLKLI 1119

Query: 1149 YSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV 1204
              +E+KGA+++L+ +  QG L+ A G K I+   K  G+ L  +AF D    YV  L  +
Sbjct: 1120 AKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKEL 1177

Query: 1205 KN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
            K     ++GD  K ++F  + E+  +L+L +KD G+L   A +FL DG+ L ++V+D+  
Sbjct: 1178 KGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGNRLYILVADDDC 1237

Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
            NI +  Y P+   S KG +LL R+ F  G   +    L   ATSS + G     D  +  
Sbjct: 1238 NIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQ-GPDADPDMMDLD 1296

Query: 1323 A------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
            +      +L  +  GSI  I P+ E ++RRL +LQ +L +++ H  GLNPR+FR   S+G
Sbjct: 1297 SSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLANTLEHPCGLNPRAFRAVESDG 1356

Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
               R     +VD +L+  +  L  + + EIA++ G    +I ++L  +  G
Sbjct: 1357 IGGR----GMVDGDLVKRWLDLGTQRKAEIANRVGADVWEIRADLEAIGKG 1403


>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Pongo abelii]
          Length = 1388

 Score =  285 bits (728), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 207/702 (29%), Positives = 332/702 (47%), Gaps = 87/702 (12%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 731  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 787  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
                L   N+       +RF + P +   RE              T  GA  +    R  
Sbjct: 830  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFR 879

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   
Sbjct: 880  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNR 939

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVL 1032
            Q   ++   PS +      P   + L    H         +Y +  S   P  +      
Sbjct: 940  QEPQRLSGSPSRTXXXXPTPPGLLGLPG--HWCVTPTNPQVYAVATSTNTPCAR------ 991

Query: 1033 SLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSEN 1090
               I +  G + +   +   + +     E + ++++ P      W+    A I +Q  E+
Sbjct: 992  ---IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEH 1044

Query: 1091 ALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
               ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +T+  
Sbjct: 1045 VTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNK 1104

Query: 1148 ---VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
               +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D   LY+  +  V
Sbjct: 1105 FKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISV 1163

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
            KNFIL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+
Sbjct: 1164 KNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNL 1223

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT----- 1319
             ++ Y P+  ES+ G +LL RA+FHVGAHV  F R       +   GAA G  K      
Sbjct: 1224 MVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGLSKKSVVWE 1276

Query: 1320 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
            N+      ++ G IG + P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  
Sbjct: 1277 NKHITWLVSVRGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTL 1336

Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +    +++D ELL+ Y  L   E+ E+A + GTT   IL +L
Sbjct: 1337 QNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1378



 Score =  173 bits (438), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 102/269 (37%), Positives = 152/269 (56%), Gaps = 29/269 (10%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            T K HP+IWS  +LP D  + LAVP PI
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPI 275



 Score =  115 bits (288), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 101/347 (29%), Positives = 167/347 (48%), Gaps = 56/347 (16%)

Query: 379 LSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           +S K G++ +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++
Sbjct: 285 ISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY 344

Query: 438 TCG----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASN 492
           T        +++  +  KEE    +    +T     +     QD V+  E+ +YGS A +
Sbjct: 345 TEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVD--EIEVYGSEAQS 402

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASA 530
            T+ A  T+SF V DS++NIGP  + + G                L I        + + 
Sbjct: 403 GTQLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGAL 460

Query: 531 TGISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHA 577
           + + K    ++V   ELPGC  +WTV          +    G   + S   A DD   H 
Sbjct: 461 SVLQKSIRPQVVTTFELPGCYDMWTVIAPLRKEEEDNPKGEGTEQEPSTPEADDDGRRHG 520

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
           +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G 
Sbjct: 521 FLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG- 578

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 579 --VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616


>gi|431908147|gb|ELK11750.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
           alecto]
          Length = 671

 Score =  285 bits (728), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 212/661 (32%), Positives = 328/661 (49%), Gaps = 114/661 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRSAEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 309

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT+V DG R V+     K  
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLVTDGLRSVRAFHFDKAA 369

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC----GSGTSMLSSGLKEEFGDIEADAPS 463
            SVLTS + T+     FLGSRLG+SLL+++T        +++  +  KEE        P 
Sbjct: 370 ASVLTSSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPASTVREAADKEE--------PP 421

Query: 464 TKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPL 515
           +K+ R  S+          QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP 
Sbjct: 422 SKKKRVDSTVGWSGGKSVAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPC 477

Query: 516 K-------------------------DFSYGLRINADASATGISKQSNYELV-------- 542
                                     + + GL  +   +    S + + E+V        
Sbjct: 478 ANAAMGEPAFLSEEVPVWEVQGGGGVECTVGLWPHPSLAQFQNSPEPDLEIVMCSGYGKN 537

Query: 543 ------------------ELPGCKGIWTVY-------HKSSRGHNAD---SSRMAAYDDE 574
                             ELPGC  +WTV         ++ +G   +   S+  A  D  
Sbjct: 538 GALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEQEETPKGEAVEPEPSAPDADDDGR 597

Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
            H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L
Sbjct: 598 RHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLL 656

Query: 635 D 635
           +
Sbjct: 657 E 657


>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 1408

 Score =  284 bits (727), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 339/1431 (23%), Positives = 604/1431 (42%), Gaps = 196/1431 (13%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV  Y L G +  L  +     D+    +++++A  +AK+S++E+D   H +  TS+H
Sbjct: 65   LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQICTTSIH 122

Query: 161  CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
             +E  + +++     + A  P  + VDP  RC  VL +G + + IL   Q G  LV    
Sbjct: 123  YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180

Query: 215  -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                              DE    +G  F     SS V+ +  L+  M H     F++ Y
Sbjct: 181  DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240

Query: 256  IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
             EP   IL+ +  T +  +  +      S  ++    +    + S   LP+D +K++ +P
Sbjct: 241  REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300

Query: 316  SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
             P+GG L++G+N  +H      + A+ +N +A    S     +S   + L+ +    L  
Sbjct: 301  PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360

Query: 373  QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
            +N   LL    G + +L+   DGR V  + L     +   S+L +  +    +     F 
Sbjct: 361  ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
            GS  GDS+L+ ++  S  +      +   G+I   +           D            
Sbjct: 421  GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480

Query: 478  MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
            M  G    + +S+ G+  N+       + F + D L N+GP++D + G      + D   
Sbjct: 481  MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533

Query: 531  TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
               S  +N ELV   G                            G  +VY K  +  +  
Sbjct: 534  PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593

Query: 565  SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
             S        Y  YL++S      + +++V   +    E T++ ++   + RTI  G L 
Sbjct: 594  GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653

Query: 619  GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
               RV+QV +   R  D G  + Q       +      SE  +V+  S ADPYVL+   D
Sbjct: 654  SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708

Query: 678  GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
             SI LL  D S      +T   I S+     S +LY DK                     
Sbjct: 709  QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDKY-------------------R 747

Query: 738  AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
            + +  +G P + Q D +   +      L +F +PN    VFT +                
Sbjct: 748  SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793

Query: 795  ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
               D   +I S+         +E I  + V +L      +    P+L    ++  ++ Y+
Sbjct: 794  ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLILYE 846

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
             Y +     TS ++   S  R + ++N    +      S + ++ +    T    P   +
Sbjct: 847  PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTTLSKP---L 894

Query: 915  TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             +  ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +
Sbjct: 895  RVLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
              ++++C+ P  + +D  W  +KI L      + Y +    Y +  +  V        +L
Sbjct: 955  DNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEYSSSSETYVIGTNQKV------DFNL 1008

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
              D E+  +  N  +S +       +++  V++L P      W    +  ++++E  + V
Sbjct: 1009 PEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMCV 1059

Query: 1095 RVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEV 1148
            + + L  +  T E +  + +GTA  +GED+AARG + +F         D P+    +  +
Sbjct: 1060 KCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLI 1119

Query: 1149 YSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV 1204
              +E+KGA+++L+ +  QG L+ A G K I+   K  G+ L  +AF D    YV  L  +
Sbjct: 1120 AKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKEL 1177

Query: 1205 KN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
            K     ++GD  K ++F  + E+  +L+L +KD G+L   A +FL DG+ L ++V+D+  
Sbjct: 1178 KGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGNRLYILVADDDC 1237

Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
            NI +  Y P+   S KG +LL R+ F  G   +    L   ATSS + G     D  +  
Sbjct: 1238 NIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQ-GPDADPDMMDLD 1296

Query: 1323 A------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
            +      +L  +  GSI  I P+ E ++RRL +LQ +L +++ H  GLNPR+FR   S+G
Sbjct: 1297 SSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDG 1356

Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
               R     +VD +L+  +  L  + + EIA++ G    +I ++L  +  G
Sbjct: 1357 IGGR----GMVDGDLVKRWLDLGTQRKAEIANRVGADVWEIRADLEAIGKG 1403


>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
          Length = 1468

 Score =  284 bits (727), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 204/693 (29%), Positives = 334/693 (48%), Gaps = 70/693 (10%)

Query: 760  SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
            +G L I+ +P+F   F   +F      IVD        DS              G K  I
Sbjct: 811  TGILRIYSLPDFKEHFACPQFSIAPDLIVD--------DS--------------GVKSRI 848

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
             +  + E+ M         P L        I+ Y+A+ +    +  +     S  +   V
Sbjct: 849  PTNNIQEILMTHIGKERKDPHLVVRTDTNDIIIYKAFTYLDESSPDRLALRFSRVQHEYV 908

Query: 880  SNVSASRLRNLRFSRTPLDAYTREET--------------PHGAPCQR--ITIFKNISGH 923
            S  S+S     +  R  +D +   +T                    QR  +  F +++G+
Sbjct: 909  SRKSSSHESKPKKKRGIIDEFEIPDTDLNEEEEDLKLSTKKMDKKIQRKLLIPFTDVAGY 968

Query: 924  QGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQ 982
             G F++G++P W M   +  +RVHP   +  IV FT  HNVNC HGFI V S+  +++ +
Sbjct: 969  AGVFVAGAQPAWLMCSCKSFVRVHPMKTEHEIVGFTQFHNVNCQHGFITVDSKSTIQLSR 1028

Query: 983  LPS-GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
            L + G  YD  W +QK+ L  T H+I Y     +Y ++VS  V   +    +  ID   G
Sbjct: 1029 LRTEGINYDLDWVIQKVLLGQTVHKIQYHPVMRVYAVLVSSSVPTRMKNDDNQYID---G 1085

Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-F 1100
             + D              +E++ + ++ P      W+    +  +  E   ++    L  
Sbjct: 1086 KETDERGPGEF----LPEMEQFSMILVSP----VTWEIVDKVEFEEFEQCFSLECALLDS 1137

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQ--NLVTEVYSKELKG 1155
              T+   +  + IGT  ++GED   +G + ++         DNPQ  +    V ++++KG
Sbjct: 1138 KQTSTGRKYYMIIGTGTLKGEDTTMKGSIRMYDIIEVVPEPDNPQTNHKFKPVLTEDVKG 1197

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            A++A+ ++ GHL    G K+I+      E L G+AF D   +YV S++ +KNFIL+GD  
Sbjct: 1198 AVTAMCTVSGHLAACIGSKVIVWSLEDDERLVGVAFIDVQ-IYVTSMSSIKNFILIGDAQ 1256

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            KSI+FL ++ + A+L LL KD+ S D    +F+ID  +L L+V D  +NI ++ YAP   
Sbjct: 1257 KSIWFLGFQLEPAKLTLLGKDYQSFDVGCVDFIIDDKSLYLIVGDTNENIDLYQYAPFNL 1316

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            +S+ GQKL+ R +FHVG+ V   +RL  +  +        G + + R   L GT +GSI 
Sbjct: 1317 QSFGGQKLMRRGDFHVGSQVQTMVRLPQIEKTEK------GFEYSRRHFCLCGTFNGSIA 1370

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD---SIVDCEL 1391
             I+ + E TF+RL +L   LV+++ HVAGLNPR+FR     G   R   +   +++D +L
Sbjct: 1371 VISSISEKTFKRLNTLYGHLVNNLQHVAGLNPRAFRLI--KGPKQRMSTNRTKAVLDGDL 1428

Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +  +  L +EEQ E   Q GTT ++I+ +L D+
Sbjct: 1429 IFEFAGLSIEEQKETTKQIGTTVTRIMEDLVDI 1461



 Score =  179 bits (453), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 163/650 (25%), Positives = 277/650 (42%), Gaps = 97/650 (14%)

Query: 88  KRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEF 147
           K+  ++   +   LELV  ++++G + ++  +           DS++L F DAK+S+LE+
Sbjct: 87  KKGGMISDTTLGRLELVAQFKMNGIITTMGTVRTNSPRGREGCDSLLLGFSDAKMSLLEW 146

Query: 148 DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL---VKVDPQGRCGGVLVYGLQMIIL 204
             S + +   S+H +E  E+      ++ F   P    + +DPQ RC     Y  ++ +L
Sbjct: 147 SSSTNSIITVSIHYYERDEF------KKEFLTNPYPSAIHIDPQQRCAVFNFYDNKLAVL 200

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
              Q  S  + +    G           S +I+L  LD  +K+V D  F+  Y EP + I
Sbjct: 201 PFRQ--SDKLDERQGEGEEDEEKWPYYPSFIIDLATLDSRIKNVIDMTFLSDYYEPTLAI 258

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
           L + E TW GR+     T  +  +S+  T K +P+I+S   LP+D +KL+A+P P+ G+L
Sbjct: 259 LFQPEQTWTGRLGNNKDTVSLVVISLDITAKIYPIIYSIDKLPYDCFKLVAMPKPVTGML 318

Query: 323 VVGANTIHYHSQ-SASCALALNNYAVSLDSSQELPRSSFS-------VELDAAHATWLQN 374
           V+ AN+I + SQ S    +A+N Y      + + P   +        + L+ A A     
Sbjct: 319 VIAANSILHVSQGSPGMGVAVNGYT---KKTTDFPGMIYEPSLIELGLSLEGAKALAFGG 375

Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN---------------PSVLTSDITTIG 419
           D  L+  + G   L+ V  DG  V  + +S+                 P +L S  + + 
Sbjct: 376 DRCLIFMQNGHWALVEVRRDGNKVVGMAISEIKHDLPVMEKKPPRFDTPPLLASVPSCVT 435

Query: 420 N----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
           N      FFLGSR+GDSLL+++          +       D   +      +     D +
Sbjct: 436 NVKAGEYFFLGSRVGDSLLIKYDANRVNHQSVAPPVFRVCDTMLNTGPIVDMAVGDVDTV 495

Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
           +   +  +L L  S+ +    A   F         +I P   F++      D+ A     
Sbjct: 496 EQQEDWPQLELVSSSGHGKNGALCVFQ-------RHIYPQTSFAFH---QFDSQA----- 540

Query: 536 QSNYELVELPGCKGIWTVY-HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                         IW++   K+ +  N         DD++   L IS    T+VL   D
Sbjct: 541 --------------IWSIKCRKNDQQQNE--------DDDFDKLLFISKSKSTLVLSAGD 578

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSES 652
            L EV     ++ +G TIA   LF   R++QV+  G  +L  +G  +           ++
Sbjct: 579 ELQEV--KTGFYTRGSTIAVSTLFDATRIVQVYATGVMVLTPEGKRI-----------QT 625

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-TVSVQTPAAIE 701
                 + ++  SI DPY+LL + +  I  L GD ST   + +Q P  I+
Sbjct: 626 VPIPRGAKIVEASIHDPYILLTLDNNKILALQGDASTKDIIHIQLPNHIK 675


>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Amphimedon queenslandica]
          Length = 1408

 Score =  284 bits (727), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 176/529 (33%), Positives = 273/529 (51%), Gaps = 42/529 (7%)

Query: 917  FKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
            F NI+G+ G F+ G  P W  M  R  L +HP   DG + +F    NVNC  GF+Y   +
Sbjct: 894  FSNIAGYSGVFVCGPYPHWIFMAARGHLSIHPMYIDGPVQSFAPFDNVNCPSGFLYFNKE 953

Query: 976  GILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1035
              L+I  LP+  +YD+YWPV+K+PLKATPH + Y  E  ++ +I S P      Q ++++
Sbjct: 954  SELRISVLPTQLSYDSYWPVRKVPLKATPHFVGYHMESKVHVIIASTP------QPVTVI 1007

Query: 1036 IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
             D   G   D       D    Y+ EE Y +++L P      W+   TIP    E     
Sbjct: 1008 PDPN-GETEDALETVERDGRFVYSQEETYYLQLLSPTS----WE---TIPHSKYEMEAHY 1059

Query: 1095 RVVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QN 1143
             V  +     +  ETL      + +GT    GE+++A+G+VL+F        P     Q 
Sbjct: 1060 HVTDMKVMRLRSQETLSGRKEYIVVGTMATFGEELSAKGKVLIFDVSVVIPEPGKPFSQY 1119

Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLN 1202
             +  +Y +E K  ++ L  + G +L A G KI + ++    +L  +AF DA   Y+ +  
Sbjct: 1120 RLKNLYDQEQKWPVTGLECVNGLILTAMGQKIFMWQFKDNKDLLAVAFIDA-ETYIHTAQ 1178

Query: 1203 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
             +K FIL GD+ +SI  L + E    L+L+++D   ++ F+T F+IDG  L  +VSD  +
Sbjct: 1179 SIKGFILTGDVTRSIQLLHYNEDRRSLSLISQDPNPMEVFSTTFMIDGKALGFLVSDSDR 1238

Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
            NI +F Y P+   S  G  L+   + HVG+ V  FL ++       +T A  G+ +  + 
Sbjct: 1239 NITLFQYQPENPASSGGANLVRCGDIHVGSLVNVFLNIRC------KTSAGLGASREMKI 1292

Query: 1323 AL-------LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
            AL        FGTLDG IGC+ P+ E  +RRL  LQ K+   + H+AGLNP++FR F + 
Sbjct: 1293 ALADKRQCTFFGTLDGGIGCLLPIPEKVYRRLSMLQVKMTQGMRHMAGLNPKAFRTFQTR 1352

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +       +I+D  LL  Y  L  +E+ + + Q GTT +QI+ +L ++
Sbjct: 1353 HQYLHNAQRNILDGTLLYQYLSLTAKEKFDFSKQIGTTVAQIMEDLKEI 1401



 Score =  224 bits (571), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 235/880 (26%), Positives = 376/880 (42%), Gaps = 207/880 (23%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +A Y+ +H PTG+ +C S    HS  + V                            V +
Sbjct: 2   YAVYREVHPPTGVEHCTSCHFVHSEKEQV---------------------------AVAS 34

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
            +++ I+ V      ++  +N G+ K            L     +  HGN++SL  +   
Sbjct: 35  TSLLRIFDV------AQLQRNDGKAK------------LVQCLEFSFHGNIQSLDKVRLR 76

Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
            +D    RD ++L+F DAK+S++E++   +GL+  SMH FE  E   ++ G       P+
Sbjct: 77  HSD----RDCLLLSFNDAKLSIVEYNPETNGLKTVSMHQFEDEE---IRGGILHNDSRPV 129

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           VKVDP+GRC  +L++G  + +    Q    L  D     S    +  I  ++ I+LRDL 
Sbjct: 130 VKVDPEGRCAVMLLFGSHLAVCPFQQD---LSIDTPLSPSPSLDTHDILPTYTISLRDLP 186

Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
             +  +KD  F+ GY  P ++ L E   TWAGR+S +  + M+  LS++T+ K H +IW+
Sbjct: 187 EPLPVIKDMTFIEGYTSPTLLFLSEVSPTWAGRISLRQDSMMLLGLSLNTSDKSHTVIWT 246

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALN---NYAVSLDSSQELP 356
             NLP D+  L  VP P+GGVLV GANT+ Y +QS+    L+LN   +Y        E  
Sbjct: 247 LKNLPFDSSYLHPVPKPLGGVLVFGANTLIYLNQSSPPYGLSLNSITDYTTRFLLKNE-- 304

Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSD 414
             S  + LD + + ++ N+  L+S ++GD+ ++T+  D   R V+R+   K   S+L+S 
Sbjct: 305 -GSLGIRLDCSQSVFISNEQLLVSLQSGDIYIVTLFPDSGMRGVKRITFDKAASSILSSC 363

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           I +I     FLGSRL +SLL+++     ++ +   + E  G                  A
Sbjct: 364 ICSIKPHFLFLGSRLANSLLLRY-----STTVKQNIVEPIG-----------------GA 401

Query: 475 LQDMVNGEELSLYGSASNNTESAQK----TFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
           + D+   +++ +YG ++ +  ++       +S  V DSL+ IGP+   + G    A  S 
Sbjct: 402 ILDL---DDIEVYGESAVSQSTSSSSLLTNYSLEVCDSLLCIGPVVKATIGE--PAFLSE 456

Query: 531 TGISKQS-NYELV--------------------------ELPGCKGIWTV---------- 553
             + K   + ELV                          ELPGC  +WTV          
Sbjct: 457 EFVDKSDLDLELVLCSGHGKNGALSVLQRTIRPQVVTTFELPGCIDMWTVKSEGEEEEKG 516

Query: 554 ----YHKSSRGHNADSSRMAAYDDE-YHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
                   + G   D SR         H YLI+S    TMVL+T   +TE+ +S  +  Q
Sbjct: 517 EETKEEGQNEGGEKDQSREKEEKGSGQHDYLILSRSDSTMVLQTGQEITELDQS-GFATQ 575

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSV 664
             T+ AGN+     ++Q      R+L G     Y+  D+  G              V  V
Sbjct: 576 SATVFAGNV--GSFIVQATRTDIRLLKGIKQLCYVALDMGGG--------------VKCV 619

Query: 665 SIADPYVLLGMSDGSIRLL--------VGDPS--------------------TCTVSVQ- 695
            +  PYV++ + +G I LL        +  PS                      T S+Q 
Sbjct: 620 DVCSPYVIVLLMEGEIGLLKLVDESLVLSWPSLGNNTPVNHISAYTDTSGLFDVTSSLQF 679

Query: 696 ----------TPAAIESSKKPVSSCTLYHDK-----GPEPWLRKTSTDAWLSTGVGEAID 740
                      P A    K+P  S +L +D+     GP     K    + +   +     
Sbjct: 680 EGDGSEKEEEVPIAPPPVKRPHLSSSLLYDEDELLYGPVKTEVKEENASPMEASLAAE-- 737

Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
             +  P      + ++C E GALEI+ VP F  VF V  F
Sbjct: 738 -PEAPPPITPTHWCLLCKEDGALEIYSVPEFQFVFAVRNF 776


>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
            owczarzaki ATCC 30864]
          Length = 1725

 Score =  283 bits (725), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 177/519 (34%), Positives = 284/519 (54%), Gaps = 38/519 (7%)

Query: 920  ISGHQ---GFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
            + GHQ   G F+ G RP W ++   R+ LR H  L DGS+ AF+  +N  C  GF+Y T+
Sbjct: 1218 LGGHQLCSGVFVCGRRPLWLLMSPTRKALRAHLMLTDGSVSAFSAFNNNACPGGFVYFTT 1277

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
            QG L+ CQL   + +DN WPV+++PL+AT H I Y      Y L+ S P  KP   +  L
Sbjct: 1278 QGTLRFCQLAPTTNHDNPWPVRRVPLRATAHYIGYHEVFRTYVLVTSHP--KPYFNLPRL 1335

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
              D+            ++      T + + ++++ P      W++  +  + + E   +V
Sbjct: 1336 TNDETYTPVPYTPKPRAI----PATFDTFSLQLISPVT----WESIHSFDLPAFERVTSV 1387

Query: 1095 RVVTLFNTTTKENETLL----AIGTAYVQGEDVAARGRVLLFS---TGRNADNPQN--LV 1145
             +  +   T++E  T L     IGT  ++GEDV   GR+++F         + PQ    +
Sbjct: 1388 DIAAI---TSQETVTGLKDYVVIGTTVIEGEDVTCHGRIIVFEIIDVVPEVNRPQTNRKL 1444

Query: 1146 TEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIV 1204
              +  +E KGAI+AL+ + GHL+   G KII+ ++   + ++G+AF D    +VVS++ +
Sbjct: 1445 KYLMEREQKGAITALSHVCGHLVSCIGQKIIIWQFASDDTMDGVAFIDTQ-TFVVSVSAI 1503

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
            KNFIL+GD++ S++ L + E    L  +A+DF  +   +T+FL+DGS+L  + +D  +N+
Sbjct: 1504 KNFILVGDLNNSVFLLRFNETTKHLGFIARDFDHMSVASTQFLVDGSSLGFLATDSHQNL 1563

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRF 1322
             +F Y P   ES  GQ+LL + +FHVG+HV + LR+  + L  S DR GA+       R 
Sbjct: 1564 VVFAYNPLNRESNNGQRLLRQLDFHVGSHVQQVLRMVPRSLPVSVDR-GAS-----VKRH 1617

Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
              L  TL+GS+  +AP+ E TFRRL+ LQ++LV  +   AGLNP  +R +    K     
Sbjct: 1618 IDLLATLEGSLNALAPIGETTFRRLEWLQRQLV-GLQQRAGLNPIGYRAYRFPRKMTTTR 1676

Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
              +++D ELLS +  L L EQ E+A Q   T   ++ ++
Sbjct: 1677 AGNVIDGELLSRFLYLGLAEQRELARQRRNTPEDLIDDI 1715



 Score =  109 bits (272), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 76/261 (29%), Positives = 135/261 (51%), Gaps = 23/261 (8%)

Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKHHTCMISA 285
           R+  S+ I L +L   + HV D  F+ GY EP + +L E    +W GR   +  TC + A
Sbjct: 294 RLRPSYEIKLTELQRHIHHVIDIEFLTGYFEPTLALLFEPNAPSWTGRTVQRKDTCSMVA 353

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNN 344
           LSI+T+   HP++WS   LP ++ +++AVP P+ G ++V  + I + SQS+ +  ++LN 
Sbjct: 354 LSINTSSHSHPVVWSVDKLPFNSMRVMAVPRPVCGTVIVTPDAILHLSQSSPTVGVSLNE 413

Query: 345 Y-AVSLDSSQELPR------SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV 397
             ++S +    +P       SS    +      +L  +  L  T+ G++ + T++ +GR 
Sbjct: 414 LSSMSTELRLGIPENKHPDGSSVVYNMQEGRCCFLTPETLLAVTEGGEMFVATLLTEGRT 473

Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLF-FLGSRLGDSLLVQF----TCGSGTSMLSSGLKE 452
           V R+ +     SVL   +T++ N  + F+GSR  DS+L++     T  +    L+S   +
Sbjct: 474 VVRIRIEPAGASVLPCCMTSLYNGQYCFIGSRASDSVLLRVMNNATAAADKRRLASAALD 533

Query: 453 EFGDIEADAPSTKRLRRSSSD 473
           +F        + KR R S ++
Sbjct: 534 DFS-------ANKRSRSSDTN 547



 Score = 82.4 bits (202), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 66/253 (26%), Positives = 120/253 (47%), Gaps = 30/253 (11%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHS--RADYVPQIPLIQTEELDSELPSKRGIGPVPNL-- 58
           FA ++  H PT + +C     T++  R   V +  L++   +D+   S  G G    L  
Sbjct: 2   FAYFRQQHPPTAVEHCVEASFTNAAERQLVVARANLLEVYRIDAATAS--GSGWRSELSS 59

Query: 59  --VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAA---------SLELVCHY 107
              +TA     +++ R    G  +   S +    +    + +A          LELV  +
Sbjct: 60  GSALTAQTAGAMHLGRAAGYGGNDGGRSDDAATEINTRSLHSAPATPPALQHKLELVASF 119

Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            L GNVES+ +          +RDS++LAF++AK++V+++D +   L+  S+H +E    
Sbjct: 120 NLSGNVESIGVARLAHC----KRDSLLLAFKEAKVAVVDYDPATLDLKTISLHMYED--- 172

Query: 168 LHLKRGRESFARG----PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
           + ++ GR++ A      P+++VDP  +C   LVYG ++IIL   Q     + ++D + S 
Sbjct: 173 IEMRGGRDATALQAVWPPVIRVDPMRQCAAFLVYGTKLIILPFRQESH--LDEDDDYQSA 230

Query: 224 GGFSARIESSHVI 236
              +A +  S  I
Sbjct: 231 QAPAASVPPSAQI 243



 Score = 67.8 bits (164), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 59/198 (29%), Positives = 100/198 (50%), Gaps = 27/198 (13%)

Query: 537 SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET-ADL 595
           S++E  EL G +G+W+V+   S   +   + +++ D   H+ L+ S +  T+V  T  + 
Sbjct: 735 SSFE--ELTGGRGLWSVF---STALDPSLAALSSLDGASHSLLVASRDDSTLVFTTTGEE 789

Query: 596 LTEVTESVDYFVQGRTIAAGNLF---GRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
           L ++ ES  +F  G TIA GN+F   G+  ++ VF  G R++DG  + Q+L     +S  
Sbjct: 790 LEQIAES-GFFTAGATIAIGNVFAANGKILIVDVFAHGIRLVDGVNLRQELLLAQLSSV- 847

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSC 710
                 S ++  SIA+  VL   +DG++  +   GD      S  T AA     +PV + 
Sbjct: 848 ------SEIIHASIAESSVLALHADGAVSFVQFTGDTQELVASTATVAA----GQPVVAV 897

Query: 711 TLYHDKG----PEPWLRK 724
           +LY D+     PE  L++
Sbjct: 898 SLYADRSGLFVPEAVLQR 915


>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Anolis carolinensis]
          Length = 1294

 Score =  283 bits (725), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 213/680 (31%), Positives = 336/680 (49%), Gaps = 89/680 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E + +S  S E K            LELV  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNHDSESTTKSDRSSEGKSH-------KEKLELVAAFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + DE     G G  +    S++I
Sbjct: 135 NVHIPKVRVDPDGRCAVMLIYGTRLVVLPFRRD---TLTDEHEGVVGEGQKSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R+LD K  ++ D  F++GY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192 DIRELDEKLLNIIDMQFLYGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
           HP+IWS  NLP D  + LAVP PIGGV++   N++ Y +QS         Y VSL+S   
Sbjct: 252 HPVIWSLSNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSLNSLTN 305

Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
            +   P   +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K 
Sbjct: 306 GTTVFPLRIQEGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKA 365

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
             SVLT+ + T+     FLGSRLG+SLL+++T       +++  K+     E      KR
Sbjct: 366 AASVLTTCMITMDPGYLFLGSRLGNSLLLRYTEKLQEPPVNAA-KDATEKTEEPPVKKKR 424

Query: 467 LRRSSS-----DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           + + ++      A QD V+  E+ +YGS + +  +   T+SF V DS++NIGP  + + G
Sbjct: 425 VEQQANWAGGKSAPQDEVD--EIEVYGSEAQSG-TQLSTYSFEVCDSILNIGPCANAAMG 481

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 482 EPAFLSEEFQNSLEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 541

Query: 557 SSRGHNADSSRMAAY----------DDEYHAYLIISLEARTMVLETADLLTEVTESVDY- 605
                  D+   +A           D + H +LI+S E  TMV        +    +D  
Sbjct: 542 QKAEQEEDAQGESAEKEPSPPEPPDDGKRHGFLILSREDSTMVNPANGPTGQEIMELDTS 601

Query: 606 -FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
                 T  AGN+   R ++QV   G R+L+G      L F P +         S ++  
Sbjct: 602 GLAPRSTQDAGNIGENRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQC 651

Query: 665 SIADPYVLLGMSDGSIRLLV 684
           ++ADPYV++  S+G + + V
Sbjct: 652 AVADPYVVIMSSEGQVTMFV 671



 Score =  168 bits (425), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 139/559 (24%), Positives = 243/559 (43%), Gaps = 89/559 (15%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V+  E+G +EI+ +P +  VF V  F  G+  +VD+   +    +E    +  EE   
Sbjct: 786  WCVLVRENGTMEIYQLPEWRLVFLVKNFPMGQRVLVDSSFGQPASQAE----AKKEEVIR 841

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            Q     +  + +V L  ++     SRP+L  +  D  +L Y+A+      + S+      
Sbjct: 842  QTEMPLVKEVLLVALGNRQ-----SRPYLL-VHVDQELLIYEAF-----NHDSQLGQTNL 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREE--TPHGAPCQRITIFKNISGHQGFFLSG 930
              R   V +    R +  R S+   ++   EE   P G    R   F++I G+ G F+ G
Sbjct: 891  KVRFKKVPHNINFREKKPRPSKKKTESAGGEEASVPRGR-VARFRYFEDIYGYSGVFICG 949

Query: 931  SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
              P W +                                  VTS+G L++  +      +
Sbjct: 950  PSPHWLL----------------------------------VTSRGALRLHPMTIDGPIE 975

Query: 991  NYWPVQKIPLKATPHQITYFAEKN----LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1046
            ++ P   +     P    YF  +     ++     +P +   + +    I++ V   +  
Sbjct: 976  SFAPFHNV---NCPKGFLYFNRQGTGGGIHNACSRIPRMTGEDDMEFETIERGVLKCVPG 1032

Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1106
                  DL  ++ ++                       ++  E+   ++ V+L +  T  
Sbjct: 1033 EGFGHPDLILSFKID-----------------------LEEWEHVTCMKTVSLKSEETVS 1069

Query: 1107 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1160
              +  +A+GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL
Sbjct: 1070 GLKGYIAVGTCLMQGEEVTCRGRILIMDIIEVVPEPGQPLTKNKFKVLYEKEQKGPVTAL 1129

Query: 1161 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
                G+L+ A G KI L      +L G+AF D   LY+  +  VKNFIL  D+ KSI  L
Sbjct: 1130 CHCNGYLVSAIGQKIFLWSLKDNDLTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLL 1188

Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
             ++E+   L+L+++D   L+ +  +F++D   L  +VSD  +N+ ++ Y P+  ES+ G 
Sbjct: 1189 RYQEESKTLSLVSRDAKPLEVYCVDFMVDSCQLGFLVSDRDRNLLVYMYLPEAKESFGGM 1248

Query: 1281 KLLSRAEFHVGAHVTKFLR 1299
            +LL RA+FHVGAHV  F R
Sbjct: 1249 RLLRRADFHVGAHVNAFWR 1267


>gi|367052335|ref|XP_003656546.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
 gi|347003811|gb|AEO70210.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
          Length = 1460

 Score =  282 bits (722), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 371/1518 (24%), Positives = 614/1518 (40%), Gaps = 271/1518 (17%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
            NLVV  +++++++  +V     + S +SG   R             DG+ A+        
Sbjct: 28   NLVVAKSSLLQVFRTKVVSTELEASPDSGHRSRNAARYESRLANDDDGLEASFLGGDSLA 87

Query: 100  ---------SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDS 150
                      L LV    L G V  LA +    A +    DS+++A +DA++S++E+D  
Sbjct: 88   LRTDRANVTKLVLVAETPLAGTVTGLARIKTPHARHGC--DSLLIALKDARLSLVEWDAE 145

Query: 151  IHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMIIL 204
             H L   S+H +E  E       + S    PL      ++ DP  RC  +      + IL
Sbjct: 146  RHALATVSIHYYEQEEL------QGSPWAAPLSHYVNFLEADPGSRCAALKFGARNLAIL 199

Query: 205  KASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN-----------------LRDLD--MK 244
               Q    + +GD D     G   A+ +SS VI+                 L +LD  + 
Sbjct: 200  PFRQADEDIDMGDWDG-ELDGPRPAKDQSSAVIDGASNIEDTPYSPSFVLRLSNLDPSLL 258

Query: 245  HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
            H     F+H Y EP   IL              H T M+  L +    K    I S   L
Sbjct: 259  HPVHLAFLHEYREPTFGILASTASASNSLGRKDHFTYMVFTLDLQQ--KASTTILSVGGL 316

Query: 305  PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
            P D ++++ +P+P+GG L+VG+N  IH      +  +A+N       S   + +S  ++ 
Sbjct: 317  PQDLFRVVPLPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTRQCTSFGLVDQSELNLR 376

Query: 364  LDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT 417
            L+      L  D+   L+    G + L+T   DGR V  L+L    + +  S++   +TT
Sbjct: 377  LEGCVVDVLTADLGELLVILNDGRMALVTFRIDGRTVSGLELRMLPASSGGSIIPGRVTT 436

Query: 418  ---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
               +G +  F G   GDS+L                   FG  +  + + +R  R+    
Sbjct: 437  LSRVGRNAMFAGLEEGDSVL-------------------FGWAKKQSQAGRRRPRAKDAV 477

Query: 475  LQ------------DMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGPLK 516
            LQ            +  + ++L     A+    S+  +       +F + D LV+I P++
Sbjct: 478  LQMDEEAGEEEEEEEDEDEDDLYGEEPAARQQPSSTASSLMTGDLTFRIHDRLVSIAPIQ 537

Query: 517  DFSYGLRINADAS-----------------ATGISKQSNYELV------------ELPGC 547
              +YG  +    S                 A G  K ++   +            E P  
Sbjct: 538  AMTYGQPVWLPGSEEERNSAGVHSDLQLVCAVGRDKSASLATINLAIAPKVIGRFEFPEA 597

Query: 548  KGIWTV-----YHKSSRGHNADSSRMAAYD--DEYHAYLII------SLEARTMVLETAD 594
            +G WT+       KS +G  A +S    YD   +Y  ++I+        E   +   TA 
Sbjct: 598  RGFWTMCAKKPIPKSLQGDKAGASLGNGYDTSGQYDKFMIVGKVDLDGYEKSDVYALTAA 657

Query: 595  LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
                +  +      G TI AG +    R+IQV +   R  DG + ++Q L   P   E  
Sbjct: 658  GFESLGGTEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIL---PMQDEE- 713

Query: 654  SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
            +G+E   V S S+ADP++L+   D S+ +   D S     +     + S+ K ++ C LY
Sbjct: 714  TGAEPRAV-SASVADPFLLIIRDDSSVFIARIDSSNELEELDKDDPVLSTTKWLTGC-LY 771

Query: 714  HDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
             D          S   +    +G+    A         +   +   SGAL I+ +P+   
Sbjct: 772  AD----------SAGVFAEESMGKPASTAQC-------VLMFLLSASGALYIYRLPDLAR 814

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
               V + +S        Y+   L       + +  +GT    KE +  + V +L      
Sbjct: 815  PIYVAEGLS--------YIPPGLS-----ADYAGRKGTA---KETLAEILVADLG----D 854

Query: 834  AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
            + H  P+L     +  +  YQ +        S+     + S +L    V      N   +
Sbjct: 855  STHKSPYLILRHANDDLTLYQPF-------RSRKATEQAFSETLFFQKVP-----NTALA 902

Query: 894  RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDG 952
            ++P +A   +E  H      +    N+ G+   F+ G+ P + +   + + RV P L   
Sbjct: 903  KSPQEA-DEDEASHQPRFLSMRRCDNVGGYSTVFVPGASPSFIIASSKSMPRVMP-LQGS 960

Query: 953  SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFA 1011
             ++A +  H   C HGFIY  S+ I ++CQ P G  Y +    V+KIP+      + Y  
Sbjct: 961  GVIAMSPFHTEGCEHGFIYADSRRIARVCQFPDGCIYAETGVAVRKIPIGEDIAAVAY-- 1018

Query: 1012 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1071
                +P + S  V    ++   L  D +   +    NLS        TV+   +++L P 
Sbjct: 1019 ----HPPMQSYVVGCNTSEPFELPKDDDYHKEWARENLSF-----KPTVDRGILKLLSPI 1069

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1130
                 W     + M+  E  L V  + L  +  T E + L+A+GTA  +GED+  RGRV 
Sbjct: 1070 T----WTVVDAVQMEPCETILCVETLNLEVSEFTNERKQLIAVGTALTKGEDLPTRGRVY 1125

Query: 1131 LFSTGRNADNPQNLVT----EVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KW 1180
            ++        P    T    ++ +KE   +GA++AL+ +  QG +L+A G K ++   K 
Sbjct: 1126 VYDIADVIPEPGRPETGKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKE 1185

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
             G+ L  +AF D    YV +   +      LL D  K ++F  + E+  ++ L  K    
Sbjct: 1186 DGSLLP-VAFMDM-NCYVTAAKELPGTGLCLLADAFKGVWFTGYTEEPYKMMLFGKSSTK 1243

Query: 1239 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKF 1297
            L+    +FL DG  LS VVSD    I I  + P+  +S +G  LL R  F+ GA H TK 
Sbjct: 1244 LEVLNADFLPDGKELSFVVSDADGYIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHATKS 1303

Query: 1298 LRL-------------------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
            L L                        S ++    P + + +   LL  +  G +  + P
Sbjct: 1304 LLLPASTPADKEKNDGNAANAQAKAKASDNKQPREPAAQRPH--VLLLASPTGVLAALRP 1361

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLS 1393
            L E  +RRL SL  +L +S+PH AGLNPR +R   +  +    G D     SIVD  +L 
Sbjct: 1362 LSESAYRRLSSLAAQLTNSLPHPAGLNPRGYRA--AGAECPPAGVDAGLGRSIVDGTVLE 1419

Query: 1394 HYEMLPLEEQLEIAHQTG 1411
             +  L +  ++E+A + G
Sbjct: 1420 RFAELGMARRVELAGRAG 1437


>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces stipitatus ATCC 10500]
 gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces stipitatus ATCC 10500]
          Length = 1382

 Score =  282 bits (722), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 359/1466 (24%), Positives = 608/1466 (41%), Gaps = 223/1466 (15%)

Query: 57   NLVVTAANVIEIYVV-------RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
            NLVV   ++++IY +        V E G + + N    KR           L+L   Y L
Sbjct: 28   NLVVIKTSLLQIYNLVTETVTPSVLENGQRANDNE---KRN------ETTKLQLFAEYDL 78

Query: 110  HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
            HG V  +   S+    NSR   D+++L+F +AK+S++E++  I  +   S+H +E  +  
Sbjct: 79   HGTVTDI---SRINILNSRSGGDALLLSFRNAKLSLIEWNPEIQNISTVSIHYYEKEDIT 135

Query: 169  HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-------DTF 220
                  +       + VDP  RC  VL +G++ + IL   Q G  LV DE       D F
Sbjct: 136  LSPWAPDLSQCDSHLTVDPSSRCA-VLNFGVRNLAILPFHQAGDDLVMDEYDPDLDMDDF 194

Query: 221  GSGGGFSARIES----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
                  ++  +S                S V+ L  LD  + H     F+H Y EP   I
Sbjct: 195  TGQDKNTSHTDSKKGTEKDHTHQTPYAASFVLPLTALDPTLIHPIGLTFLHEYREPTFGI 254

Query: 263  LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
            L+    T A  +  +    + S  ++    +    + S   LP D   ++A+P+P+GG L
Sbjct: 255  LYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALPAPVGGAL 314

Query: 323  VVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
            ++G+N  IH      + A+A+N +A  + S   + +S   + L+ +    +  +    LL
Sbjct: 315  LIGSNELIHVDQSGKASAVAVNEFAKQVSSFPMIDQSDLGLRLENSVVEVINKECGDILL 374

Query: 380  STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT--------TIGNSLFFLGSRLGD 431
            +  TG+LVL+    DGR V    +    P+    D+         ++G+   F+GS   D
Sbjct: 375  TLSTGELVLVHFKIDGRSVSGPVVCPV-PTNSGGDVVGATASCSISLGSGKVFIGSEDTD 433

Query: 432  SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGEELSLYGS 489
            SLL+     S  S  S    E+  D + +      +      S A ++ VN        +
Sbjct: 434  SLLLDCYVSSAVSKKSKDHGEDQFDEDMNDEDDDDMYEDDLYSSAPKEAVNK-------A 486

Query: 490  ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
             SN   SA + +SF V D L ++  L+  + G   + D+ A  +S QS +EL EL    G
Sbjct: 487  VSNG--SASEDYSFRVLDKLPSLASLRSVTVGKPASRDSDAGNVS-QSVHEL-ELAAAYG 542

Query: 550  ---------IWTVYH----KSSRGHNADS------SRMAAYDDEYHAYLIISLEARTMVL 590
                     +    H     +  G  ADS      S  +  +D          E+ + V 
Sbjct: 543  SGRNGGVALLQRALHLDGISTMNGETADSVWNINTSTKSGRNDPSEG------ESPSYVF 596

Query: 591  ETADLLTEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARIL 634
             T    T+  E++ Y V G                 T+  G L G  RV+QV     R+ 
Sbjct: 597  LTKSNSTDNEETLVYAVNGSNLEPFSAPDVNPNGDPTVDIGTLAGNSRVVQVLTGEVRVY 656

Query: 635  DGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
            D +  M Q     P   E   G E   V S S ADPY+L+   D S+ LL  D S     
Sbjct: 657  DTNLGMAQ---IYPVWDED-EGDERFAV-STSFADPYLLIIRDDSSVLLLHSDESGDLDE 711

Query: 694  VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 753
            +  P  I SS+  +  C LY DK                  V E  D A       G+ Y
Sbjct: 712  LSKPETI-SSQSWLCGC-LYTDK----------------HNVFE--DNA------TGNTY 745

Query: 754  SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 813
              +  +   L +F +P    V   +                   D  + I SS +     
Sbjct: 746  MFLLNQECKLFMFRLPTRELVSVTEGV-----------------DYVSSILSSDQPAKRL 788

Query: 814  GRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVST 873
              +E I  + V +L         + P+L        ++ Y+               PV  
Sbjct: 789  NSRETIAELLVADLG----EISTASPYLIIRSATDDLIIYK---------------PVRE 829

Query: 874  SRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            +     + V+   ++  N    + P++A   +        +R+    +I G+    +SG+
Sbjct: 830  NSKDEKTGVTLKYIKESNHFLPKVPIEAAATDTQQRMPGLRRLA---DIGGYAAVLMSGA 886

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
             P   +   + L     +   SI   +   +  C  G IYV ++ +++ C+L   +  D 
Sbjct: 887  SPSLVVRTSKSLPRVFSIQSDSIRGISGFDSAGCEKGLIYVDNEHVVRTCRLHDNTQLDF 946

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WP++KIPL     ++ Y A    Y  +    V+   ++    L D +  H       ++
Sbjct: 947  SWPIRKIPLN---EEVDYLA----YSTVSGTYVVGTTHEQDFKLPDNDELHP----EWAN 995

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETL 1110
             D+     V +  +++L P      W+   +    ++E    +  + L  +  T E + +
Sbjct: 996  EDISLRPKVAQGSIKLLNPKT----WKVIDSYTFNAAERITAIENINLEISEKTSERKDM 1051

Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL-- 1163
            + +GT + +GED+AARG V +F       +P     NL  ++  +E ++GA++A++ +  
Sbjct: 1052 IVVGTTFAKGEDIAARGNVYVFDVINVVPDPDEPGTNLKLKLIGEESVRGALTAVSGIGG 1111

Query: 1164 QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYF 1219
            QG L++A G K ++   K  G+ L  +AF D    YV  +  +K     L+GD  K ++F
Sbjct: 1112 QGFLIVAQGQKCMVRGLKDDGSLL-PVAFIDVQ-CYVSVIKELKGTGMCLIGDALKGLWF 1169

Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
              + E+  ++ L  KD   L+    +FL DG  L ++V+D   N+ +  Y P+  +S  G
Sbjct: 1170 TGYSEEPYKMTLFGKDLDELEVVTADFLPDGKKLYILVADSDCNLHVLQYDPEDPKSSNG 1229

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF----ALLFGTLDGSIGC 1335
             +LL+R +FH+G   +    L   A SS+       S   + +      L  T  G +  
Sbjct: 1230 DRLLNRCKFHMGHFASTITLLPRTAVSSELAVMNSDSMDIDSYIPLHQALITTQSGLMAL 1289

Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
            +  L E ++RRL +LQ +L +++ H  GLNPR++R   S+G   R     ++D +LL  +
Sbjct: 1290 VTSLSEESYRRLSALQSQLSNTLEHPCGLNPRAYRAVESDGVVGR----GMIDGKLLMRW 1345

Query: 1396 EMLPLEEQLEIAHQTGTTRSQILSNL 1421
              L    +LEIA + G    +I ++L
Sbjct: 1346 LDLSRPRKLEIAGRVGADEWEIRADL 1371


>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 2 [Cavia porcellus]
          Length = 1387

 Score =  282 bits (721), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 201/678 (29%), Positives = 316/678 (46%), Gaps = 88/678 (12%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 779  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 834

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 835  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 884

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 885  VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 944

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 945  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1004

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1005 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIER 1057

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T     
Sbjct: 1058 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1108

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
                    V G                                LKG ++A   L QG  +
Sbjct: 1109 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1128

Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
               G +I L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++E+   
Sbjct: 1129 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1186

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL RA+F
Sbjct: 1187 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1246

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
            HVGAHV  F R       +   GA  G  K      N+    F TLDG IG + P+ E T
Sbjct: 1247 HVGAHVNTFWR-------TPCRGATEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKT 1299

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            +RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L   E+
Sbjct: 1300 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1359

Query: 1404 LEIAHQTGTTRSQILSNL 1421
             E+A + GTT   IL +L
Sbjct: 1360 GELAKKIGTTPDIILDDL 1377



 Score =  271 bits (693), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 209/676 (30%), Positives = 332/676 (49%), Gaps = 88/676 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   + + A     +      GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +              I      ++SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 83  ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+  + 
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 423

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479

Query: 522 --------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH--- 555
                         L I        + + + + K    ++V   ELPGC  +WTV     
Sbjct: 480 EPAFLSEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVR 539

Query: 556 ------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
                   + G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  Q
Sbjct: 540 KEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQ 598

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
           G T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++AD
Sbjct: 599 GPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVAD 648

Query: 669 PYVLLGMSDGSIRLLV 684
           PYV++  ++G + + +
Sbjct: 649 PYVVIMSAEGHVTMFL 664


>gi|390599704|gb|EIN09100.1| hypothetical protein PUNSTDRAFT_67240 [Punctularia strigosozonata
            HHB-11173 SS5]
          Length = 1439

 Score =  282 bits (721), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 338/1407 (24%), Positives = 600/1407 (42%), Gaps = 176/1407 (12%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
            + A L LV  +RLHG V  L  +    + N    D ++++FEDAKI+VLE+ +  H L  
Sbjct: 118  TVARLRLVREHRLHGMVTGLGRIKILSSLNDGL-DRLLISFEDAKIAVLEWSEEQHDLLT 176

Query: 157  TSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
             S+H +E +P+ + L     S   G L +VDP  RC  + +      I+   Q       
Sbjct: 177  VSIHTYERAPQLMSLN---ASLFHGWL-RVDPISRCAALALPCDAFAIIPFHQTLE---- 228

Query: 216  DEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
                        A    S +++L    D  + +V D  F+ G+  P + +L +   TW G
Sbjct: 229  -----------EAPYAPSFILDLTSEVDQRIHNVVDMSFLPGFNNPTVAVLFQPTQTWTG 277

Query: 273  RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
            R++    T  +   ++    + +P+I S  NLP+D   + A  + +GGV+V+ +N+I + 
Sbjct: 278  RLTEYKDTMKLLVFTLDAVTRNYPVITSVDNLPYDCLSVHACSAAVGGVIVITSNSIIHV 337

Query: 333  SQSAS-CALALNNYAVSLDSSQELP----RSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
            SQS+   AL++N +A  +      P     ++ ++ L+ +   ++ +    L  K G + 
Sbjct: 338  SQSSRRVALSVNGWASRVTDMSLAPVQAEYATRNLALEGSRLAFVDDRTFFLFLKDGTVY 397

Query: 388  LLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
             + +  DG VV  + +      S + + +T +     F+GS  G S+L++ T        
Sbjct: 398  PVELSLDGAVVSTISMGHALAQSAIPAVVTPVTQEHIFVGSTAGTSVLLKIT-------- 449

Query: 447  SSGLKEEFGDIEADAPSTKRLRRSSSDALQDM-------VNGEELSLYGSASNNTESAQK 499
               ++EE  D  +DA +   +  + S  + D        +  +  SL    +N T  + K
Sbjct: 450  --SVEEEVEDNASDAVAAAVVDTADSMVMDDDDDIYGVSMKTDAQSLSNGHANGTHLSVK 507

Query: 500  TFS---FAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP--- 545
              S    ++ DSL   G + D S+ L  N +       +ATG      + L +  LP   
Sbjct: 508  KRSVTHLSLSDSLPGYGSISDMSFSLAKNGEKVVPELVAATGSGSMGGFTLFQRDLPART 567

Query: 546  --------GCKGIW------------TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
                    G +G+W            T Y ++     AD+  +    D   A  +     
Sbjct: 568  KRKLHAIGGGRGMWSLSLRPTVKVNGTSYERAVNPFQADNDTVVVSTDANPAPGLSRFSH 627

Query: 586  RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQ 641
            RT   E          S+   V G+TI A   F R  ++ V     R+L  DGS   + +
Sbjct: 628  RTPRTEI---------SITTRVPGQTIGAAPFFQRTAILHVMSNAIRVLEPDGSERQVIK 678

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI- 700
            DL                 +   SI DP+VL+   D +I L +G+     +  +  + + 
Sbjct: 679  DLD---------GNMARPKIRHCSICDPFVLIVREDDTIGLFIGESERGKIRRKDMSPMG 729

Query: 701  ESSKKPVSSCTLYHDKGPEPWLRKTSTDAW---LSTGVGEAIDGADG-----------GP 746
            + + + ++ C    + G      + + ++     +T   + +  AD            G 
Sbjct: 730  DKTSRYLTGCFFTDNAGVFDLRSQANGNSGADKTATSTLQGVVNADSRSQWLLLVRPQGV 789

Query: 747  LDQGDIYSVV-CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
            L+  D+  +  C      +I+ +P  + VF+V    +    + D+    AL         
Sbjct: 790  LEASDLSPIPGCRRLNEKQIWTLPKLSIVFSVRLASTLDWVLADSGDGPAL--------- 840

Query: 806  SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
             S  G    R +    + V +  +        +P L   L  G +  YQA     P   S
Sbjct: 841  -SMPGESPRRPQE---LDVEQAVIAPLGETAPQPHLLLFLRSGQLAIYQAI----PMQAS 892

Query: 866  KSDDPVS-TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
              D+ +S  S  +  + V+       R   +       ++         +T     +   
Sbjct: 893  SVDESLSRPSLGVRFAKVATRVFEIQRQDDSEKSILAEQKKISRVLIPFLTSPSPTTTFS 952

Query: 925  GFFLSGSRPCWCMVF-RERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKICQ 982
            G F +G  PCW +   R  +R+HP     S+V AFT          F+  + +G   +  
Sbjct: 953  GVFFTGDHPCWILKPDRSGIRIHPS--GHSVVHAFTSCSLWESKGDFLLYSDEGPSLLEW 1010

Query: 983  LPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGH 1042
            +P  +  +   P + IP   +  ++T+ A   L   IV+   L+   +  +   D  +  
Sbjct: 1011 MPD-TDVETELPSRSIPQPRSYSKVTFDASTGL---IVAAAHLE--AEFATYDEDNNIVW 1064

Query: 1043 QIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNT 1102
            + D+ N+S        T+E     ++ PD     W T       ++E   +V  V L  +
Sbjct: 1065 EPDSANVS-FPRSSCSTLE-----LISPDE----WITMDGFEFANNEFVTSVESVPLETS 1114

Query: 1103 TTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK-------ELK 1154
            +T+  ++  +A+GT   +GED+A RG   +F        P+N     + K       + K
Sbjct: 1115 STESGSKDFIAVGTTIDRGEDLAVRGTTYVFEIVEVVP-PENSSLSRWWKLRLRCRDDAK 1173

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
            G ++AL ++ G+L+ + G KI +  +   E L G+AF D   +YV +L  VKN +++GD 
Sbjct: 1174 GPVTALCAMDGYLVSSMGQKIFVRAFDMDERLVGVAFLDVG-VYVTTLRAVKNLLVIGDA 1232

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
             KS++F+ ++E   +L +LAKDF ++     +F+    ++S++ +DE   ++++ Y P+ 
Sbjct: 1233 AKSVWFVGFQEDPYKLVILAKDFQTVCVTTADFIFTEDSMSILTNDENGVMRLYQYDPQD 1292

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
             +S  GQ+L+ R EF    H T     Q     + R G    +    +  ++ G++DGS+
Sbjct: 1293 PDSRNGQQLMCRTEFD--THTT----CQTSIVFARRVGEGEEA-ALPQAKVVAGSIDGSL 1345

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
              +  +DE  F+RLQ LQ +L  ++ HVAGLNP++FR    N    +P    I+D  LLS
Sbjct: 1346 AALTCMDEPAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVR-NDYVSKPLSKGILDGNLLS 1404

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSN 1420
             Y  LP+  Q EI  Q  T R+ +L +
Sbjct: 1405 SYLELPIPRQEEITKQIATERAAVLRD 1431


>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
          Length = 1406

 Score =  282 bits (721), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 333/1478 (22%), Positives = 628/1478 (42%), Gaps = 215/1478 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +L+V   ++++IY +  +     E  ++ +   ++L++            Y L G V  L
Sbjct: 28   DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++AF +AK+S++E+D    G+   S+H +E  +      
Sbjct: 77   CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130

Query: 173  GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
              +    G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS         
Sbjct: 131  VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187

Query: 223  ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
               GGG           +      S V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 188  HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247

Query: 267  ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
              T +  +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+
Sbjct: 248  VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307

Query: 327  NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
            N  +H      + A+ +N ++  + S     +S  ++ L+      L +     LL   T
Sbjct: 308  NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367

Query: 384  GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
            G++ ++    DGR V  + +         + I       T IG+   FLGS  GDS+L+ 
Sbjct: 368  GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427

Query: 437  FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
            ++  S ++       ++  D  AD     +   S  D  +D +     + +L G   +  
Sbjct: 428  YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484

Query: 495  ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
             SA   + F + D L+NIGPL+D + G R++ +   TG    S    +++   +G     
Sbjct: 485  SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNPEKTGDRTNSTSPELQIVASQGSHKSG 544

Query: 551  -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
               V  +    H   S  + + D  + A L    EA             R  V+ T    
Sbjct: 545  GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604

Query: 597  TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSYMT 640
            ++  ES+ + V G                 TI+ G    R+RV+QV +   R  D    T
Sbjct: 605  SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISIGTQESRKRVVQVLKNEVRSYDFGKFT 664

Query: 641  -----QDLSFGPSNSES-------GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
                 ++ + G   S +          ++    +S S+AD  + +   D ++  L  D S
Sbjct: 665  PSRCRRNFADGTDLSLTQIYPIWDDDTNDERMAVSASLADSCLAILRDDSTLLFLQADDS 724

Query: 689  TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
                 V     + S K    SC LY DK                TG+  +ID     P+ 
Sbjct: 725  GDLDEVVFGEDVASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV- 765

Query: 749  QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
            + D++  +      L +       C+      + G   ++      +   S+  I++   
Sbjct: 766  KNDMFLFLLSHDCKLFV------KCLLWSSFALRGWHLMLSKSSGLSRPRSKAAIDN--- 816

Query: 809  EGTGQGRKENIHSMKVVELAM----QRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
                +G +  + S+ ++E  +    + WSA    P+L        I+C+     EG    
Sbjct: 817  ----RGDRRFVASVNLIEAIVADLGETWSAS---PYL--------IVCHH---IEG---- 854

Query: 865  SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
                          + ++  S+  N    R P    + + +      + + I  +ISG  
Sbjct: 855  --------------IHSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLS 900

Query: 925  GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
              F+ G+   + +          +L   +  + + L    C+ GFIY+ SQ  ++ C+LP
Sbjct: 901  AVFMPGASAGFIIRTSASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLP 960

Query: 985  SGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQ 1043
              + +D  W ++++ L      + Y     +Y       VL   +     L  D E+  +
Sbjct: 961  PMTRFDYQWTLKRVHLGEQVDHLAYSTSSGMY-------VLGTCHATDFKLPEDDELHPE 1013

Query: 1044 IDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNT 1102
              N +  ++    +       ++++ P+     W    +  + + E  + ++ ++L  + 
Sbjct: 1014 WRNEDCLAISFFPS--ARGSFIKLVSPNT----WSIIDSFSLGADEYVMAIKNISLEVSE 1067

Query: 1103 TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAI 1157
             T E + ++ +GTA+ +GED+ +RG + +F   +   +P +  T+     +  + +KGA+
Sbjct: 1068 NTHERKDMIVVGTAFARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAV 1127

Query: 1158 SALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLG 1211
            +AL+ +  QG +L+A G K ++   K  G+ L  +AF D    YV  +  +K     +LG
Sbjct: 1128 TALSEIGGQGFVLVAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILG 1185

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D  K ++F  + E+  +++L AKD   L+  A EFL DG  L +VV+D   NI +  Y P
Sbjct: 1186 DAVKGVWFAGYSEEPYKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDP 1245

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LFG 1327
            +  +S  G +LLSR++FH+G   +    L     SS++   ++ G D  N+  L   L  
Sbjct: 1246 EDPKSSNGDRLLSRSKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMT 1305

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T +GS+G I  + E ++RRL +LQ +L +++ H  GLNPR+FR   S+G A R     ++
Sbjct: 1306 TQNGSLGLITCIPEESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GML 1361

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
            D  LL  +  +  + + EIA + G    +I ++L  ++
Sbjct: 1362 DGNLLFKWIDMSKQRKTEIAGRVGAREWEIKADLEAIS 1399


>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1433

 Score =  281 bits (718), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 358/1491 (24%), Positives = 628/1491 (42%), Gaps = 234/1491 (15%)

Query: 57   NLVVTAANVIEIYVVR-------VQEEGSKES-----KNSGETKRRVLMD---------- 94
            N+VV  ++++ I+ VR        Q+E  KE      K +   +  V MD          
Sbjct: 42   NVVVARSSLLRIFEVREEPAPVSTQKEVEKERRAAVRKGTEAVEGEVEMDTSGEGFVNMG 101

Query: 95   ---GISAAS-------LELVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKIS 143
               G++ A+         LV  +RLHG V  L A+ +    D+  + D ++++F+DAKI+
Sbjct: 102  TSAGLNGAAHPPTVNRFYLVREHRLHGTVTGLEAVRTVHSLDD--KLDRLLVSFKDAKIA 159

Query: 144  VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            +LE+  S+H +   S+H +E +P+ + +        R  L + DP  RC  + +    + 
Sbjct: 160  LLEWSLSLHDVITVSIHTYERAPQLIAID---SPLFRSEL-RADPLSRCAALSLPKDSLA 215

Query: 203  ILK--ASQGGSGLVGDEDTFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIE 257
            IL    SQ    ++  E +      +S     S +++L  D+D  +++V DF F+ G+  
Sbjct: 216  ILPFYQSQAELDILEQEASQARDVPYSP----SFILDLANDVDKRIRNVIDFTFLPGFHN 271

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            P + +L + + TW GR+     T  +   ++      +P+I +   LP+D + L    + 
Sbjct: 272  PTVAVLCQYQQTWTGRLKEYKDTVGLYIFTLDFVTNNYPVITAVDGLPYDCFALTPCSTA 331

Query: 318  IGGVLVVGANTIHYHSQSA-SCALALNNYAVSLD-------SSQELPRSSFSVELDAAHA 369
            IGGV+++ +N + +  QS     L +N +   +        ++QE  R    ++L+ A  
Sbjct: 332  IGGVVILASNAVLFVDQSGRRVILPVNGWPPRVSDLPMPPLTAQEQTR---DLQLEGARF 388

Query: 370  TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFF 424
             ++ +    L  K G +  + ++ DGR V +L +S      T P+V    +  IG+   F
Sbjct: 389  VFVDDKKLFLILKDGTVYPIELIQDGRTVSKLTMSDALARTTIPAV----VKRIGDDHIF 444

Query: 425  LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE---ADAPST--------------KRL 467
            +GS +G S+L++          ++ ++EE  D +   A+ P+T                L
Sbjct: 445  IGSIVGPSVLLK----------TARVEEEIHDEDVAMAEGPATVVDTSKTVDMMDDDDDL 494

Query: 468  RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
               S+ A Q   NG        A +N  + +     ++ D++   GP+ D ++GL  N D
Sbjct: 495  YGPSTIADQPAANGTA----NGAVDNVRT-RTVVHLSLCDAIPAHGPISDMTFGLSRNGD 549

Query: 528  ------ASATGISKQSNYELVE--LP-----------GCKGIW------------TVYHK 556
                   +ATG     ++ L +  +P           G +G+W            T + +
Sbjct: 550  RLVPELVAATGSGHLGSFSLFQRDMPTRFKRKLHAIGGGRGMWSLPVRQQVKTGGTTFER 609

Query: 557  SSRGHNADSSRMAAYDDEYHAYLIISLEART--MVLETADLLTEVTESVDYFVQGRTIAA 614
             S   +AD+  +    D   +  +  +  R+    +     +  VT     F QG  I  
Sbjct: 610  PSNPFHADNDTVIISTDANPSPGLSRIATRSSHSDITITTRIPGVTLGAAPFFQGTAILH 669

Query: 615  GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
              +F    VI+V E      DG+          S  +    +    + S SI DP++L+ 
Sbjct: 670  -VMFNVTNVIRVLEP-----DGTERQ-------SIKDLDGNAARPRIKSCSICDPFILII 716

Query: 675  MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
              D +I L +G+     +  +  + +        +   Y D      L +T  +A     
Sbjct: 717  REDDTIGLFIGEIERGKIRRKDMSPMGEKTSKYLAGYFYTDTS---GLFQTFLNA---EA 770

Query: 735  VGEAIDGADGGPLDQGDI--YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
             GEA      G ++ G+   +  +    G +EI+ +P     F+     +    I D+  
Sbjct: 771  PGEAATSTLQGAMNAGNKTHWLTLVRPQGVVEIWTLPKLTLAFSTTTLATLDPVISDSLE 830

Query: 793  REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
              AL                Q        + V +L +      H RP L  +L  G +  
Sbjct: 831  PPALS-------------LPQDPPRKPQELDVDQLVIAPLGESHPRPHLIVLLRSGQLAI 877

Query: 853  YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL-RFSRTPLDAYTREETPHGAPC 911
            Y+A     P       DP+  +RSL++       L NL +      D    EE       
Sbjct: 878  YEAVAASPPA------DPLPPTRSLTL-------LVNLVKVKSKAFDIQHTEEEQKSVLA 924

Query: 912  QRITIFKNI----------SGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
            ++  I + +            + G F +G RP W +   +  +RV P      + AFT  
Sbjct: 925  EQKRISRLLLPFVTSPAPGQTYSGVFFTGDRPSWIVSTDKGGVRVFPS-GHNVVHAFTTC 983

Query: 961  HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIV 1020
                    F+  + +G   +  +P     D + P + +P ++ P+    F   +   LIV
Sbjct: 984  SLWESRGDFLLYSEEGPSLVEWMPD-IILDAHLPARSVP-RSRPYSHVVFDASS--SLIV 1039

Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1080
            +      +N+  S   D  +  + D+ N+S      T T+E     ++ PD     W T 
Sbjct: 1040 AASSF--MNRFASYDEDGNIVWEPDSPNISFPHCE-TSTLE-----LISPDG----WITM 1087

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1139
                  ++E    V  V L   +T+   +  +A+GT   +GED+A +G V +F       
Sbjct: 1088 DGYEFAANEFVSCVVSVPLETVSTESGMKDFIAVGTTINRGEDLAVKGAVYIFEIVEVVP 1147

Query: 1140 NPQNLVTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYD 1192
            +    +   +  +L      KG +S L  + G+L+ + G KI +  +   E L G+AF D
Sbjct: 1148 DASLNIKRWWRLKLLCRDDAKGPVSFLCGMNGYLVSSMGQKIFVRAFDLDERLVGVAFLD 1207

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE---FLID 1249
               +YV SL  VKN +++GD  KS++F++++E   +L +L KD     C  T    F  D
Sbjct: 1208 V-GVYVTSLRAVKNLLVIGDAVKSVWFVAFQEDPYKLVILGKD--PHHCCVTRADLFFAD 1264

Query: 1250 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
            G  LS+V  DE+  ++++ Y P   ES  GQ LL R EFH         R  +L     +
Sbjct: 1265 GH-LSIVTCDEEGVVRLYAYDPHDPESKGGQHLLRRTEFHGQTE----YRSSLLVARRPK 1319

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
                 G  +  +  L+ G++DGS+  +  +DE  F+RL  LQ +L+ +V HVA LNP++F
Sbjct: 1320 A----GDPEIPQARLICGSVDGSLTTLTYVDENAFKRLHLLQGQLIRTVQHVAALNPKAF 1375

Query: 1370 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
            R    N    RP    ++D  LL+ +E LP+  Q E+  Q GT R+ +L +
Sbjct: 1376 RMVR-NEYVSRPLSKGVLDGNLLATFEDLPIGRQNEVTRQIGTDRATVLKD 1425


>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Nomascus leucogenys]
          Length = 1177

 Score =  280 bits (717), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 202/651 (31%), Positives = 321/651 (49%), Gaps = 115/651 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++                          T++L
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKY--------------------------TEKL 400

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           +   + A+++  + EE            S +K                       R++A 
Sbjct: 401 QEPPASAVREAADKEE----------PPSKKK-----------------------RVDAT 427

Query: 528 ASATGISKQSNYELV----ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD- 573
              +G  ++S    V    ELPGC  +WTV          +    G   + S   A DD 
Sbjct: 428 VGWSGEGQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTPEADDDC 487

Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
             H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+
Sbjct: 488 RRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRL 546

Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
           L+G      L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 547 LEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 587



 Score =  165 bits (418), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 94/250 (37%), Positives = 137/250 (54%), Gaps = 26/250 (10%)

Query: 1191 YDAP------PL--------YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
            YDAP      PL        Y V   +  NFIL  D+ KSI  L ++E+   L+L+++D 
Sbjct: 925  YDAPWPVXKIPLRCTAHYVAYHVESKVCPNFILAADVMKSISLLRYQEESKTLSLVSRDA 984

Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
              L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL RA+FHVGAHV  
Sbjct: 985  KPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNT 1044

Query: 1297 FLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
            F R       +   GA  G  K      N+    F TLDG IG + P+ E T+RRL  LQ
Sbjct: 1045 FWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 1097

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
              L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L   E+ E+A + G
Sbjct: 1098 NALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIG 1157

Query: 1412 TTRSQILSNL 1421
            TT   IL +L
Sbjct: 1158 TTPDIILDDL 1167



 Score =  126 bits (317), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 86/289 (29%), Positives = 136/289 (47%), Gaps = 20/289 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 702  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 757

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 758  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 807

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 808  VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 867

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 868  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 927

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYP-LIVSVPVLKPLNQVLSLLIDQE 1039
             WPV KIPL+ T H + Y  E  + P  I++  V+K     +SLL  QE
Sbjct: 928  PWPVXKIPLRCTAHYVAYHVESKVCPNFILAADVMKS----ISLLRYQE 972


>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
            [Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
            nidulans FGSC A4]
          Length = 1339

 Score =  276 bits (706), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 352/1456 (24%), Positives = 603/1456 (41%), Gaps = 234/1456 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V   ++++I+ +R        S ++ +T+ R          L L   Y+L G V  +
Sbjct: 28   NLIVARTSLLQIFSLR------DVSLSALDTEVRPAQHRQETCKLVLEREYQLPGTVTDI 81

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + ++ G D      ++++AF DAK+S++E+D   +GL   S+H +E  +      
Sbjct: 82   CRVKILKTKSGGD------AVLVAFRDAKLSLVEWDPERYGLSTISIHYYERDDMTRSPW 135

Query: 173  GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE- 231
              +    G ++  DP  RC         + I+   Q G  LV D+  FGS   +  R+E 
Sbjct: 136  ASDLSTCGSILSADPGSRCAIFQFGARSLAIIPFHQPGDDLVMDD--FGSEPDYENRVEG 193

Query: 232  --------------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
                                SS V+ L  LD  + H     F++ Y EP   IL+ +  T
Sbjct: 194  NSRSHEAKDKDAAEYQTPYASSFVLPLTALDPSVIHPISLAFLYEYREPTFGILYSQVAT 253

Query: 270  WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
                +  +      + +++    +    + S   LP D +K++A+P P+GG L++G+N  
Sbjct: 254  SHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVALPPPVGGSLLIGSNEL 313

Query: 329  IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
            +H      + A+ +N ++    S     +S  ++ L+        +D    LL+  TG  
Sbjct: 314  VHIDQAGKTNAVGVNEFSRQASSFSMTDQSDLALRLENCVVERFSDDNGDLLLALSTGVF 373

Query: 387  VLLTVVYDGRVVQRLD---LSKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCG 440
             L++   DGR V  +    LS  +   L S  ++   +GN   F GS   DS+L+     
Sbjct: 374  ALVSFKLDGRSVSGISVRPLSGPSKEFLASTASSSAFLGNGKVFFGSESADSVLL----- 428

Query: 441  SGTSMLSSGLKEEF-GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
             G S  SS  K+ F G    D         S  DA +D +     +       N  S   
Sbjct: 429  -GWSSASSATKKSFSGSTSND--------ESEDDAYEDDLYSSAPAAMTDNPQNQPSNSS 479

Query: 500  TFSFA---VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVY 554
              +F    + D L + GP++D   G    A +  T   K    ELV   G    G   + 
Sbjct: 480  VAAFGDLRIHDRLSSPGPIRDIVLGRSSEASSRDT---KDGVLELVAAQGSDEGGTMVIM 536

Query: 555  HK--------SSRGHNADS----SRMAAYDDEYHAYLIISL-------EARTMVLETADL 595
             +        S     A+S    S +   +D+   Y+I+S        E+   VLE  D 
Sbjct: 537  KREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVILSKQEKPDKEESEVFVLE--DK 594

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            L  +T          T+  G L  + RVIQV     R  D  +   D             
Sbjct: 595  LRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVRSYDAVWDEDD------------- 641

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
            S+    ++ ++ DPY+ +   D ++ LL  D S                  +   TL  D
Sbjct: 642  SDERVAVNATLVDPYLAIIRDDSTLLLLQADDS----------------GDLDEVTLSED 685

Query: 716  KGPEPWLRKT--STDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
               + WL     S +A   T    +I                +  +   L ++ +P+F  
Sbjct: 686  VVSQKWLSACFYSDNAGFFTAPFASI--------------LFLLNQDHQLYVYRLPDF-A 730

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
            V +V + V     I+ T   E  K S T              +EN+  + VVEL      
Sbjct: 731  VISVIEGVGCLPPILST---EPPKRSTT--------------RENVLQIAVVELG----D 769

Query: 834  AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
            ++ S PFL     +  ++ Y+ +     E T          R L  +N +  +  N    
Sbjct: 770  SYSSLPFLILRTENDDLVVYKPFFTNSKELTGL--------RFLKEANHTLPKTPNTT-- 819

Query: 894  RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLC 950
                D    E  P       + I  NI+G    F+ G  P    +FR      P   +L 
Sbjct: 820  ----DELQSEMKP-------LRILPNIAGCSSIFMPG--PSAGFIFRAS-TTSPHFIRLR 865

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYF 1010
             G I       +   + GF Y+ S G L + +LP G+     W ++ +P+     ++TY 
Sbjct: 866  GGFIKGLGCFDS--PDKGFAYLDSHG-LHLAKLPEGTQLGYPWIMRTVPIGQQIDKLTYV 922

Query: 1011 AEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
            +  + Y       VL    +    L  D E+  +  N  +S +       V +  ++++ 
Sbjct: 923  SASDTY-------VLGTCQRCEFRLPEDDELHPEWRNEEISFLP-----EVNQSSLKVVS 970

Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
            P      W    + P++ +E+ + ++ ++L  +  T E   ++ +GT+  +GED+ +RG 
Sbjct: 971  PKT----WSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGC 1026

Query: 1129 VLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--K 1179
            + +F       +P+    N   ++  KE +KGA++AL+ +  QG L+ A G K ++   K
Sbjct: 1027 IYVFEVIEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRGLK 1086

Query: 1180 WTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
              G+ L  +AF D    +V  +  +K     + GD  K ++F  + E+  +++L AKD  
Sbjct: 1087 EDGSLLP-VAFMDMQ-CFVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKDLD 1144

Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
             L+  A +FL DG+ L +VV+D   N+ +  Y P+   S  G KLL+R++FH G   +  
Sbjct: 1145 YLEVLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFASTV 1204

Query: 1298 LRLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
              L     SS+R  A  GSDK +         +L  + +GSIG +  + E ++RRL +LQ
Sbjct: 1205 TLLPRTLVSSER--AMSGSDKMDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSALQ 1262

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             +L +++ H  GLNPR++R   S+  A R     ++D  LL  Y  +  + + EIA + G
Sbjct: 1263 SQLTNTLEHPCGLNPRAYRAVESDASAGR----GMLDSNLLLQYLDMSKQRKAEIAGRVG 1318

Query: 1412 TTRSQILSNLNDLALG 1427
             T  +I ++L  ++ G
Sbjct: 1319 ATEWEIRADLEAISGG 1334


>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
            [Aspergillus kawachii IFO 4308]
          Length = 1372

 Score =  276 bits (705), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 335/1464 (22%), Positives = 618/1464 (42%), Gaps = 221/1464 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +L+V   ++++IY +  +   + E  ++ +   ++L++            Y L G V  L
Sbjct: 28   DLIVVRTSLLQIYSLH-KVTSNAEGADAQQELTKLLLEK----------EYSLSGTVTGL 76

Query: 117  AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
              +      NSR   +++++AF +AK+S++E+D     +   S+H +E  +        +
Sbjct: 77   CRVK---VLNSRSGGEAVLVAFRNAKLSLIEWDPERRSISTISIHYYERDDLTRSPWVPD 133

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS------------ 222
                G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS            
Sbjct: 134  LKNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGMSTDHDL 190

Query: 223  GGG---------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
            GGG         +      S V+ L  LD  + H     F++ Y EP   IL+ +  T +
Sbjct: 191  GGGPDKAKEGIAYQTPYAPSFVLPLTALDPSILHPISLAFLYEYREPTFGILYSQVATSS 250

Query: 272  GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
              +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+N  +H
Sbjct: 251  ALLPERKDVVFYTVFTLDLEQQASTILLSVSRLPSDLFRVVALPPPVGGALLIGSNELVH 310

Query: 331  YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVL 388
                  + A+ +N ++  + S     +S  ++ L+      L +     LL   TG++ +
Sbjct: 311  IDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLSTGEMAI 370

Query: 389  LTVVYDGRVVQRLD---------LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
            +    DGR V  +          L+  N +   S  T IG+   FLGS  GDS+L+ ++C
Sbjct: 371  MKFKLDGRSVSGISVHLLPAHAGLTSMNSAAAAS--TFIGDGKIFLGSEDGDSVLLGYSC 428

Query: 440  GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNTESA 497
             S +S       ++  D  AD     +   S  D  +D +     + +L G   +   SA
Sbjct: 429  SSSSSKKHRLQAKQAIDDSADMSEEDQ---SEDDVYEDDLYSTSPDTTLTGRRPSGESSA 485

Query: 498  QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WT 552
               + F + D L+NIGPL+D + G ++  +    G    S    +++   +G        
Sbjct: 486  FGLYDFRMHDKLINIGPLRDITIGRKLPTNQEKGGDRTNSTSPELQIVASQGSHKSGGLV 545

Query: 553  VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLLTEV 599
            V  +    H   S  + + D  + A L    EA             R  V+ T    ++ 
Sbjct: 546  VMAREIDPHVVASISLESVDSIWTASLTWEEEAVSRTSENIGQRSQRCYVIATEAKASDR 605

Query: 600  TESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-MTQD 642
             ES+ + V G                 TI  G    R+RV+QV +   R  D    +TQ 
Sbjct: 606  EESLIFVVDGHDLKPFRAPDFNPNEDVTINIGTQESRKRVVQVLKNEVRSYDIDLGLTQI 665

Query: 643  LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
                  ++     ++    +S S+AD  + +   D ++  L  D S     V     + S
Sbjct: 666  YPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVLGEDVAS 720

Query: 703  SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
             K    SC LY DK                TG+  +ID     P+ + D++  +      
Sbjct: 721  GK--WISCCLYSDK----------------TGLFSSIDRTLSEPV-KNDMFLFLLSHDSK 761

Query: 763  LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
            L ++ V +   + ++ + + G + ++                 SSE     G +EN+   
Sbjct: 762  LFVYRVRD-QKLLSIIEGLDGLSPLL-----------------SSEPPKRSGTRENLVEA 803

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
             V +L  + WSA    P+L     +  ++ Y+ ++             + T  +  +  +
Sbjct: 804  IVADLG-ETWSAS---PYLILRSENDDLIIYKPFV-------------IPTGPTGEIHTL 846

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
              S+  N        D  + + +      + + I  +ISG    F+ G+           
Sbjct: 847  KFSKENNSVLPMISPDVDSTQPSGSDYRVRPLRILPDISGLSAVFMPGAS---------- 896

Query: 943  LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG----ILKICQLPSGSTYDNYWPVQKI 998
                          F +  + +  H F+ +  +      ++ CQLP  + +D  W ++K+
Sbjct: 897  ------------AGFVLRTSASAPH-FLRLRGESPRCSTVRFCQLPPMTRFDYQWTLKKV 943

Query: 999  PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRT 1057
             L      + Y     +Y       VL   +     L  D E+  +  N  +S     R 
Sbjct: 944  HLGEQVDHLAYSTSSGMY-------VLGTCHATDFKLPDDDELHPEWRNEAISFFPSARG 996

Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
              +     +++ P+     W    +  + + E  + ++ ++L  +  T E + L+ +GTA
Sbjct: 997  SFI-----KLVSPNT----WSIIDSYSLGTDEYVMAIKNISLEISENTHERKDLIVVGTA 1047

Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLI 1169
            + +GED+ +RG + +F   +   +P +  T+     +  + +KGA++AL+ +  QG +L+
Sbjct: 1048 FARGEDIPSRGCIYVFEVVQVVPDPDDPETDRKLKLIGKESVKGAVTALSEIGGQGFVLV 1107

Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1225
            A G K ++   K  G+ L  +AF D    YV  +  +K     +LGD  K I+F  + E+
Sbjct: 1108 AQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGIWFAGYSEE 1165

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
              +++L AKD   L+  A EFL DG  L +VV+D   NI +  Y P+  +S  G KLLSR
Sbjct: 1166 PYKMSLFAKDLDYLEVSAAEFLPDGRRLFIVVADSDCNIHVLQYDPEDPKSSNGDKLLSR 1225

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRT-GAAPGSDKTNRFAL---LFGTLDGSIGCIAPLDE 1341
            ++FH G   +    L     SS++    +   D  N+ AL   L  T +GS+G I  + E
Sbjct: 1226 SKFHTGNFASTLTLLPRTMVSSEKMISNSDDMDIDNQSALHQVLMTTQNGSLGLITCMPE 1285

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
             ++RRL +LQ +L +++ H  GLNPR+FR   S+G A R     ++D  LL  +  +  +
Sbjct: 1286 ESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSKQ 1341

Query: 1402 EQLEIAHQTGTTRSQILSNLNDLA 1425
             + EIA + G    +I ++L  ++
Sbjct: 1342 RKTEIAGRVGAREWEIKADLEAIS 1365


>gi|302831157|ref|XP_002947144.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
 gi|300267551|gb|EFJ51734.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
          Length = 2830

 Score =  273 bits (699), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 252/983 (25%), Positives = 419/983 (42%), Gaps = 181/983 (18%)

Query: 575  YHAYLIISL-EARTMVLETADLLTEVTES--VDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +HAYL+I++   RTMVL   D L +VT S   ++ V   T+AAGNLF    ++Q    G 
Sbjct: 1889 FHAYLLITMGRVRTMVLRCTDGLDDVTNSPECEFLVNQPTLAAGNLFHNAVIVQACPMGL 1948

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS-------------IADPYVLLGMSDG 678
            R+L+G  + Q+L      +     ++ S                    ADPYVL+G+SDG
Sbjct: 1949 RVLEGMTLVQELRVSDFQASRPKTAQYSFCCRTKHPIAHRAMGPIPQAADPYVLVGLSDG 2008

Query: 679  SIRLLVGDPSTCTVSVQTPAA-------IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
            +  LL GDP + T+ V T AA         S ++ +++  L+ D+            +W+
Sbjct: 2009 TAVLLEGDPLSLTLGVATAAAEQLMAVPARSRQQRLAAACLHRDE-----------TSWM 2057

Query: 732  STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS-------GR 784
            ++        +         I+  +C  SG LE + +P+   VF      +       G 
Sbjct: 2058 ASATAAEAASS----GSSFSIFLWICRLSGRLECYSLPSMRLVFHSSGLAAAEEVLRMGP 2113

Query: 785  THIVDTY--MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS----- 837
              + D Y         +E E++     G G G  E+     VVEL ++ +    S     
Sbjct: 2114 AVMYDVYDLFGGGGGGAEAELDG----GGGSGIMED----PVVELRVESFLGGGSPAVPD 2165

Query: 838  --RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA----------- 884
              RP L  +   G ++ YQ  L   P ++   + P +   +   S               
Sbjct: 2166 CERPVLLVMAASGNLVAYQIALRRLPLDSLSHEAPAAMGAAAGSSGGGGGIGGGAALGPR 2225

Query: 885  -SRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
             +R  +L ++     +++R +       ++  +    + + G F++GSRP W +  R  L
Sbjct: 2226 MARFDHLAYTDPSSKSHSRTDI------RKYPVASQGTSYSGVFVAGSRPLWLVASRGGL 2279

Query: 944  RVHPQLCDGSIVAFTVLHNVNCNHGFI-YVTSQGILKICQLPSGSTYDNYWPVQKIPLKA 1002
              HP   +G++ A T  HN NC  GFI   +S+G+LK+CQLP  +  D  W  +++PL+ 
Sbjct: 2280 VPHPMFAEGAVAAMTPFHNANCPLGFISACSSRGLLKVCQLPPHTRLDTPWVTRRVPLRV 2339

Query: 1003 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE 1062
            TPH++ +F +  L   I S  V+           D            ++    R    E 
Sbjct: 2340 TPHKLAWFRDAGLMAAITSRVVVSRPRPPEEPGGDAHAAAAYAAAAAAAAGRGRE---EA 2396

Query: 1063 YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA------ 1116
            +E+R+LEP+  G  W +    P    E AL ++V+ L N TT + + LLA+GT       
Sbjct: 2397 WELRLLEPNGCGRLWLSPLLPP---GEQALCLKVIYLQNATTGDTDALLAVGTGSPMGQL 2453

Query: 1117 --------------------------------------YVQGEDVAARGRVLLFST---- 1134
                                                     GED    GR+LL++     
Sbjct: 2454 GGGNWRFRLPRGRVAGSGGLVVHRQCEREGAGRGCRGERPPGEDYPCLGRILLYTISAEV 2513

Query: 1135 ----GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW---------- 1180
                G N     + V  V ++++  A++++   +  LL+  G +I +++W          
Sbjct: 2514 VDLGGGNLTRRWSAVL-VATRDMASAVTSVQEFKSQLLVTCGSRIEMYEWRGPAAGASGG 2572

Query: 1181 ----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
                 G  L   AF+D P L V SL  VK+++L  D  + +YF+ + +    L  ++KDF
Sbjct: 2573 GGGGPGGRLEKRAFFDLPSL-VTSLVAVKDYLLAADASQGLYFVRYSDSARVLEFMSKDF 2631

Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQI--FYYAPKMS-ESWKGQKLLSRAEFHVGAH 1293
               D      +I+   L+ + +D   N+ +  FY +   + E W GQ+L      HV   
Sbjct: 2632 DHRDVLTAGVVINEPKLAFLAADAAGNLALSEFYGSRNTNPEFWAGQRLAPLGLMHVARR 2691

Query: 1294 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQK 1352
            ++  + ++M           P SD  NR ALL G  +G +  IAP+ D    +RL +LQ 
Sbjct: 2692 LSCCVSIKM-----------PTSDGKNRHALLCGAAEGGLSYIAPVPDAEMTQRLLALQN 2740

Query: 1353 KLVDSVPHVAGLNPRSFRQFH-------SNGKAHR----PGPDSIVDCELLSHYEMLPLE 1401
             +   +PHVAGLNPR+FR            G++H     P  + ++D +LL  + +L  +
Sbjct: 2741 HMSRRLPHVAGLNPRAFRHRFCRIPKSLGGGQSHHAPPAPASNGLLDGQLLLGFPLLSRQ 2800

Query: 1402 EQLEIAHQTGTTRSQILSNLNDL 1424
             Q + A   G T  QI+S+L  +
Sbjct: 2801 HQGQAAEALGVTVRQIMSDLRAI 2823



 Score =  104 bits (260), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 120/215 (55%), Gaps = 25/215 (11%)

Query: 228  ARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
            A + + +++NL + + ++ V+D +F+HGY EPV+++LHE + TW G +  +  TC ++A+
Sbjct: 1305 ATLGNGYLLNLNKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWVGMLRERKDTCCLAAI 1364

Query: 287  SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA 346
            SIS  LK+H ++W   +LP+D +KLLAVP     VLV+  N +   SQ++  A ALN+ A
Sbjct: 1365 SISLRLKRHTILWKLASLPYDCFKLLAVPY-RPAVLVISPNLLLLCSQASQHAAALNSNA 1423

Query: 347  VS--------LDSSQELP---------RSSFSVELDAA-----HATWLQN-DVALLSTKT 383
            +         LD S+E P         + + +V  D A     +AT + + +V     ++
Sbjct: 1424 LPGEVPPPLILDPSREPPAATAARLAAQYALNVHPDCAPAAGRNATLMADLEVVAAGLQS 1483

Query: 384  GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            G L+ + + ++G   QR+ + +T    + S +  I
Sbjct: 1484 GTLLAVHLQFEGPADQRITVVRTGGGPIASAMVGI 1518



 Score = 95.9 bits (237), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 8/188 (4%)

Query: 56   PNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD-GISAASLELVCHYRLHGNVE 114
            PNL+V   N +E++ +R     +  +  +           G   A LELV  Y LHG VE
Sbjct: 1078 PNLIVVRTNRLEVHSLRSSAVATNAAAATATAAATASAAVGSGGARLELVVSYHLHGVVE 1137

Query: 115  SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            SLA+LS G   +S RRD+++LAF + K+SV+E++   H LR +S+H FE    +  + GR
Sbjct: 1138 SLAVLSGG---SSSRRDALLLAFREGKLSVVEWNPRTHSLRTSSLHYFEGDPGVQ-REGR 1193

Query: 175  ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG---LVGDEDTFGSGGGFSARIE 231
             +    P V  DP GRC  +     Q+ +L A +  +G    V D    G G G   RI 
Sbjct: 1194 IAVPLPPRVVTDPAGRCAAMSFCFSQLALLPALEVKAGAWQCVDDGGVMGVGRGERERIG 1253

Query: 232  SSHVINLR 239
              H+   R
Sbjct: 1254 GVHINERR 1261


>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
 gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
          Length = 1397

 Score =  273 bits (698), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 350/1483 (23%), Positives = 618/1483 (41%), Gaps = 221/1483 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V   ++++++ +     GS  +   G+  ++   D +  A L L   Y + G +  L
Sbjct: 28   NLIVAKTSLLQVFSLVNVTYGSAPA---GQPDQKGRHDRLQHAKLVLAAEYEVPGTITGL 84

Query: 117  AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
              +      NS+   D+I+++  +AK+S++E+D   HG+   S+H +E  E        +
Sbjct: 85   ERVR---ISNSKSGGDAILVSSRNAKLSLIEWDPQKHGITTISIHYYEGEESHMSPWVPD 141

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-DTFGSGGGFSARIES- 232
              +    + VDP G C  +  +G+  + IL   Q G  LV D+ D   +G   +  +   
Sbjct: 142  LGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDAIPNGDDTTDAVNDA 200

Query: 233  ----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
                            S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201  QKPAPGNAVHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSMSLT 260

Query: 275  SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
              +      S  ++    K    + +   LP D +K++ +P PIGG L++G N  +H   
Sbjct: 261  IDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDIFKVVPLPPPIGGALLIGTNELVHVDQ 320

Query: 334  QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
               + A+ +N +A    +     +S   + L+      L +     LL    G + +LT 
Sbjct: 321  AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCMVEQLGSGAGDVLLILSDGRMAILTF 380

Query: 392  VYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS--- 441
              DGR V  + L     ++  S++ S  +   ++G +  F GS  GDS+L+ ++  S   
Sbjct: 381  KVDGRSVAGISLHFVAEQSGGSIIKSRPSCSASLGRNKLFYGSEEGDSILLGWSKHSSAT 440

Query: 442  -------------GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
                         GT+ LS   +++  D +           +++   + +VNG+     G
Sbjct: 441  KKPSKAAGGGNEDGTANLSDEEEQDDDDDDMYEDDLYSANPTTTQQEKQVVNGD-----G 495

Query: 489  SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK 548
            +A+         F+    D L ++GP +D + G    + +     S       +EL   +
Sbjct: 496  AAN---------FTLRAHDRLWSLGPYRDITLGRPPKSKSKDRQDSVPEISAPLELVAAR 546

Query: 549  GI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA------------RTMVLE 591
            G       TV  +       DS +M   DD Y  + I  ++             R ++L 
Sbjct: 547  GFGKAGGLTVLKREIDPFTIDSLKM---DDVYGVWSIRVIDPKSKDAGLSRSYDRYLLLA 603

Query: 592  TADLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILD 635
             A    +  ESV Y V                +  TI  G L    RV+QV     R  D
Sbjct: 604  KAKG-DDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGSRVVQVLRTEIRSYD 662

Query: 636  GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVS 693
             +     +   P   E  S  E  TV+  S A+PY+L    D S+ +L  D +     V 
Sbjct: 663  CNLGLAQIY--PVWDEDTS--EERTVIQASFAEPYLLTIRDDNSLLILQADKNGDLDEVE 718

Query: 694  VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 753
            +Q  AA   S K VS C LY DK      +  S+D              D       +I 
Sbjct: 719  IQGSAA---SAKWVSGC-LYEDK-----TKIFSSDL-------------DTEHAATPNIL 756

Query: 754  SVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
              +    G L IF +PN    +  VD                 L  S     SSS     
Sbjct: 757  LFLLDSDGNLSIFRLPNITEPLCRVDNL--------------NLLPSNLPYESSSRRPV- 801

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
               +E +  + V +L      A H  P++        ++ Y+ Y   G    SK      
Sbjct: 802  --NRETLTELLVADLG----DAIHKSPYMILRTKHDDLVLYEPYRITGENGRSKLQ--FI 853

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
             + +  V     ++  N   +R+P            +P + +    ++ G++  F+SG  
Sbjct: 854  KAVNHVVMGPRTNQPMNKDINRSP------------SPSKLLRALSDVCGYKTVFMSGQN 901

Query: 933  PCWCM---VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            PC+ +   + R  +    +L   ++ + T  H   C  GF YV    ++++ +LPS + +
Sbjct: 902  PCFILKSAIARPNVL---RLRGKAVQSLTGFHIAACERGFAYVDEDNVIRMSRLPSNTRF 958

Query: 990  DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
            D+ W  +KIPL      I Y +    Y +  S        +   L  D E   +  N  +
Sbjct: 959  DSAWATRKIPLGEQVDCIVYSSASESYVIGTST------KEDFKLPEDDESHTEWRNEFI 1012

Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENE 1108
            + +       ++   V++LEP      W       ++ +E    ++++ L  + TT E +
Sbjct: 1013 TFLP-----QLDRGTVKLLEPKN----WSAIDIYEVEPAERITCIKIIRLEISETTHERK 1063

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFS---TGRNADNPQ-NLVTEVYSKE-LKGAISALASL 1163
             ++ +G+A  +GED+  +G + +F       + D+P+ N   +++++E +KGA++A++ +
Sbjct: 1064 DMVVVGSAVAKGEDIVPKGCIRVFEIIDVVPDPDHPEKNKKLKLFAREEVKGAVTAVSGI 1123

Query: 1164 --QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSI 1217
              QG L++A G K ++   K  G+ L  IAF D    YV  L  +K     ++GD  K +
Sbjct: 1124 GGQGFLIVAQGQKCMVRGLKEDGSLLP-IAFKDTQ-CYVNVLKELKGTGMCIIGDAFKGL 1181

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
            +F  + E+  +L+L  K+  +L     +FL DG+ L ++V+D+  N+ +  Y P+   S 
Sbjct: 1182 WFTGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSS 1241

Query: 1278 KGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT---------NRFALLFGT 1328
            KG +LL R+ FH G     F     L      T ++P  +           +++ +L   
Sbjct: 1242 KGDRLLHRSVFHTG----HFASTMTLLPHGSHTLSSPVDEDAMDTDLPPPPSKYQVLITF 1297

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
              GSIG I+PL+E ++RRL +LQ +LV+++ H  GLNPR +R   S+G   + G   ++D
Sbjct: 1298 QTGSIGVISPLNEDSYRRLLALQSQLVNALEHPCGLNPRGYRAVESDGMGGQRG---MID 1354

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
              LL  +  +  + + EIA + G     I  +L  L  G ++L
Sbjct: 1355 GNLLLRWLDMGAQRKAEIAGRVGADVGAIRIDLEKLHGGLAYL 1397


>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
          Length = 1383

 Score =  271 bits (693), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 331/1464 (22%), Positives = 618/1464 (42%), Gaps = 210/1464 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +L+V   ++++IY +  +     E  ++ +   ++L++            Y L G V  L
Sbjct: 28   DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++AF +AK+S++E+D    G+   S+H +E  +      
Sbjct: 77   CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130

Query: 173  GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
              +    G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS         
Sbjct: 131  VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187

Query: 223  ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
               GGG           +      S V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 188  HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247

Query: 267  ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
              T +  +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+
Sbjct: 248  VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307

Query: 327  NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
            N  +H      + A+ +N ++  + S     +S  ++ L+      L +     LL   T
Sbjct: 308  NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367

Query: 384  GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
            G++ ++    DGR V  + +         + I       T IG+   FLGS  GDS+L+ 
Sbjct: 368  GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427

Query: 437  FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
            ++  S ++       ++  D  AD     +   S  D  +D +     + +L G   +  
Sbjct: 428  YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484

Query: 495  ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
             SA   + F + D L+NIGPL+D + G R++ +   TG    S    +++   +G     
Sbjct: 485  SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544

Query: 551  -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
               V  +    H   S  + + D  + A L    EA             R  V+ T    
Sbjct: 545  GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604

Query: 597  TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
            ++  ES+ + V G                 TI+ G    R+RV+QV +   R  D    +
Sbjct: 605  SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664

Query: 640  TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
            TQ       ++     ++    +S S+AD  + +   D ++  L  D S     V     
Sbjct: 665  TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719

Query: 700  IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
            + S K    SC LY DK                TG+  +ID     P+ + D++  +   
Sbjct: 720  VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760

Query: 760  SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
               L ++ V +   + ++ +   G + ++                 SSE     G +EN+
Sbjct: 761  DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
                V +L  + WSA    P+L        ++ Y+ ++             VST     +
Sbjct: 803  IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
             ++  S+  N    R P    + + +      + + I  +ISG    F+ G+   + +  
Sbjct: 846  HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFII-- 903

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
                         S   F  L   N        +    ++ C+LP  + +D  W ++++ 
Sbjct: 904  ---------RTSASAPHFLRLRGEN--------SRSSTVRFCKLPPMTRFDYQWTLKRVH 946

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHR-T 1057
            L      + Y     +Y       VL   +     L  D E+  +  N  +S     R +
Sbjct: 947  LGEQVDHLAYSTSSGMY-------VLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGS 999

Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
            +    ++  +   D     +    +  + + E  + ++ ++L  +  T E + ++ +GTA
Sbjct: 1000 FIKLVWDHHLQRQDSVILIFHLH-SFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTA 1058

Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLI 1169
            + +GED+ +RG + +F   +   +P +  T+     +  + +KGA++AL+ +  QG +L+
Sbjct: 1059 FARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLV 1118

Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1225
            A G K ++   K  G+ L  +AF D    YV  +  +K     +LGD  K ++F  + E+
Sbjct: 1119 AQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGVWFAGYSEE 1176

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
              +++L AKD   L+  A EFL DG  L +VV+D   NI +  Y P+  +S  G +LLSR
Sbjct: 1177 PYKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSR 1236

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LFGTLDGSIGCIAPLDE 1341
            ++FH+G   +    L     SS++   ++ G D  N+  L   L  T +GS+G I  + E
Sbjct: 1237 SKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPE 1296

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
             ++RRL +LQ +L +++ H  GLNPR+FR   S+G A R     ++D  LL  +  +  +
Sbjct: 1297 ESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSKQ 1352

Query: 1402 EQLEIAHQTGTTRSQILSNLNDLA 1425
             + EIA + G    +I ++L  ++
Sbjct: 1353 RKTEIAGRVGAREWEIKADLEAIS 1376


>gi|409046890|gb|EKM56369.1| hypothetical protein PHACADRAFT_93103 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1417

 Score =  271 bits (692), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 324/1388 (23%), Positives = 594/1388 (42%), Gaps = 160/1388 (11%)

Query: 103  LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
             V  +RLHG V   ES+ I+S          D ++++F+DAKI++LE+ D+++ L   S+
Sbjct: 120  FVREHRLHGTVTGMESIRIVSS----QEDGLDRLLVSFKDAKIALLEWSDAVNDLLTVSI 175

Query: 160  HCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGVLVYGLQMIILKASQGGSGL- 213
            H +E +P+ + L+         PL    ++ DP  RC  +++    + IL   Q  + L 
Sbjct: 176  HTYERAPQMMALE--------APLFHSQLRTDPLSRCAALMLPKDSLAILPFYQSQADLD 227

Query: 214  VGDEDTFGSGGGFSARIESSHVINLR-DLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
            + ++DT  S          S V+++  D+D  +KHV D +F+ G+  P + +L +   TW
Sbjct: 228  IMEQDTQTSCRDIP--YSPSFVLDMTTDVDERIKHVIDLVFLPGFNSPTIAVLFQNTQTW 285

Query: 271  AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
              R+     T  +   ++    +  P++ +  NLP+D   L+   + +GGV++V AN + 
Sbjct: 286  TSRLREYKDTVGLIIFTLDLVTRNCPVLTAVDNLPYDCLYLVPCSAQLGGVVIVSANALI 345

Query: 331  YHSQ-SASCALALNNYAVSLDSSQELPR-----SSFSVELDAAHATWLQNDVALLSTKTG 384
            Y +Q S    L +N +   + S   LP+      S +++L+ ++A ++ ++   +    G
Sbjct: 346  YVAQTSRRVILPVNGWQARV-SDHPLPQLTEEEKSRNLKLEGSYAVFVDDNKLFVLLSDG 404

Query: 385  DLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
             +  + V  DGR V RL + S    + + + +  + +   F+GS  G S+L++      T
Sbjct: 405  TVYPMEVHADGRTVSRLTMGSALAQTTIPAIVRRVTDENLFIGSTAGPSVLLK------T 458

Query: 444  SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
            S +   +KEE  +++  AP+      +  D   D  +GE       A   T         
Sbjct: 459  SHVEEDVKEEDVEMDT-APAAVVDEANEMDLDDD--DGELCHWVHFAKKRT-----VVHL 510

Query: 504  AVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP---------- 545
            ++ DS+   GP+ D ++ L    D       +ATG      + L +  LP          
Sbjct: 511  SLCDSIPAYGPVSDMTFSLTRVGDRPVAELVAATGSGGLGGFTLFQRDLPSRVKRKLHAV 570

Query: 546  -GCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTE 601
             G +G+W++   ++ R + +   R +      +  +IIS +A     +   A   ++   
Sbjct: 571  GGGRGMWSLAVRQAVRVNGSTYERPSNPHHGGNDAVIISTDANPSPGLSRIASRSSKSDI 630

Query: 602  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQDLSFGPSNSESGSGSE 657
             +   + G T+ A + F    ++ V     R+L  DG+   + +DL              
Sbjct: 631  QITTRIPGTTVGAASFFQGTAILHVMSNAIRVLEPDGTERQIIKDLD---------GSVP 681

Query: 658  NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDK 716
               +   S+ DP++++   D S+ L +G+P    +  +  + + E + K ++ C  + D 
Sbjct: 682  RPKIRYCSMCDPFIMVIREDDSLGLFIGEPERGKIRRKDMSPMGEKTSKYIAGC-FFMDT 740

Query: 717  GPEPWLRKTSTDAWLSTGVGEAIDGA-DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
                  R  +  A     V   +    + G   Q   + ++    G LE++ +P    VF
Sbjct: 741  TGIFQSRVNAAAAAADKNVTSTLQTVMNAGTRTQ---WLLLVRPQGVLEVWSLPKLALVF 797

Query: 776  TVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
            +     +  + +VD+    AL                Q        + V ++A+      
Sbjct: 798  STSHVSALESVLVDSGDSPAL-------------SLPQDPPRKPQDLDVEQIAIAPLGES 844

Query: 836  HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
             S+ +L   L  G    Y+A     P   S    P + + +L V  V        +    
Sbjct: 845  SSKLYLLVFLRCGLFAVYEAL----PAPASTDPPPPTRTSTLCVKFV--------KVVTR 892

Query: 896  PLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLSGSRPCWCM-VFRERLR 944
              D    EE       ++  I + +              G FL+G RPCW +   +  ++
Sbjct: 893  AFDIQQSEEVEKSVLAEQKRISRQLIPFVTSPTPGRAFSGVFLTGDRPCWILSTDKGGVK 952

Query: 945  VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATP 1004
            + P      + AFT          F+  + +G   I  +P    ++ + P + IP    P
Sbjct: 953  IMPS-GHQVVHAFTACSLWESKGDFLLYSDEGPSLIEWVPE-IQFEGHLPSRSIP---RP 1007

Query: 1005 HQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1064
               ++   +    L+V+   L+  +   S   D+ V  + D  N+S          E   
Sbjct: 1008 RPYSHVVFEPTTTLLVAASSLQ--STFTSYDEDRNVVWEPDEPNMS------LPVCETSA 1059

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDV 1123
            + ++ PD     W T        +E    +  +TL   +T+   +  +A+ T   +GED+
Sbjct: 1060 LELISPDT----WTTMDGYEFAQNEFVTCMECITLETLSTETGTKDFVAVSTTINRGEDL 1115

Query: 1124 AARGRVLLFSTGRNADNPQNLVTEVYS------KELKGAISALASLQGHLLIASGPKIIL 1177
            A +G V +F       +P       Y        E KG ++AL  +  +L+ + G KI +
Sbjct: 1116 AVKGAVYIFEVVEVVPDPAMGQKRWYRLKLHCRDEAKGPVTALCGMDNYLVSSMGQKIFV 1175

Query: 1178 HKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
                  E L G+AF D   +YV SL  VKN +++GD  K ++ ++++E   +L +LAKD+
Sbjct: 1176 RALDLDERLVGVAFLDVS-VYVTSLRAVKNLLVIGDALKGVWLVAFQEDPYKLVVLAKDY 1234

Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
              +     +        SL+  DE+  +++  Y P   ES  GQ+LL R EFH     T+
Sbjct: 1235 YPIPVACADLFFADGKASLISCDEEGVLRLSEYDPHDPESRHGQRLLCRTEFH---GQTE 1291

Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
            +    ++A    R G    ++   +  L+ G  DGS+  +  +D+   +RL  LQ +L  
Sbjct: 1292 YRTSHLIA----RRGKGLDAE-IPQAKLICGHTDGSLTSLTYVDDAVSKRLHLLQGQLAR 1346

Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1416
            +V HVAGLNP++FR    N +  RP    I+D  LL+ +E LP+  Q+E+  Q  T R+ 
Sbjct: 1347 NVQHVAGLNPKAFRVVR-NDRVARPLTKGILDGNLLAAFEDLPVPRQVEVTRQIATERTT 1405

Query: 1417 ILSNLNDL 1424
            +L +  DL
Sbjct: 1406 VLKDWLDL 1413


>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
 gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
          Length = 1453

 Score =  270 bits (691), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 181/621 (29%), Positives = 329/621 (52%), Gaps = 68/621 (10%)

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
             ++++VE++++    ++S+P+L      G ++ Y+++  E  +   K  +     R LS 
Sbjct: 876  ENLEIVEISLE--ILNNSQPYLLLKNRIGDLIVYKSFKKENGDLRFKKYNHNFILRDLSN 933

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
            ++ S +            D Y +         + I   K  S + G F+ G +P W    
Sbjct: 934  NSKSINS-----------DGYRK---------KSIVNIKLSSKNNGVFIGGQKPVWIFNE 973

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQKI 998
            +  +R+H    DG+IV+    HN +C +GF+Y T  +  +KI  L     ++N + ++++
Sbjct: 974  KGYIRLHSMDFDGAIVSLKPFHNADCPNGFLYYTEDKQHIKIGYLNGLMNFENEYAIRRV 1033

Query: 999  PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1058
            P+K + H+I Y  E   Y ++VS P               +V  +++  +   +      
Sbjct: 1034 PIKLSAHKIAYHNELKCYVVVVSFP---------------QVTQELEEDSKKPI-----L 1073

Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTT--TKENETLLAIGT 1115
            T E+++++I++P      W+   +  +Q  E  L +++V+L F  +  T +++  L IGT
Sbjct: 1074 TDEKFQIKIIDP-TIDWSWRFIDSFSLQDRETVLAMKIVSLKFKESDETIKSKPFLVIGT 1132

Query: 1116 AYVQGEDVAARGRVLLFS--TGRNADNPQNLVTE----VYSKELKGAISALASLQGHLLI 1169
            A+  GED   +GRVL+F   + +      +L T+    +Y KE KG ++AL+S+ G LL+
Sbjct: 1133 AFTFGEDTQCKGRVLVFEIVSHKTQFESDDLGTKRLNLLYEKEQKGPVTALSSVSGLLLM 1192

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
              GPK+ ++++   +L  ++F+DA  +Y+ S++ +K +I++GD++KS+YFL W   G QL
Sbjct: 1193 TIGPKLTVNQFLTGQLVTLSFHDAQ-IYICSISTIKTYIVIGDMYKSVYFLQW--NGKQL 1249

Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
              L+KD+ SL+ F+TEF+++  TLS++VSD  KNI +F + P    S +GQ LL +A+FH
Sbjct: 1250 VPLSKDYQSLNIFSTEFIVNQQTLSILVSDLDKNILLFSFDPADPTSRQGQMLLCKADFH 1309

Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
            +G+++ KF+R  M       +     +D+     + FGTLDGS+  + PLDE  ++    
Sbjct: 1310 IGSNIEKFVRTPMKFNIQSSSNGNNNNDQ----LVFFGTLDGSLNVLRPLDERMYQLFYH 1365

Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS-------IVDCELLSHYEMLPLEE 1402
            LQ KL   +P  AGLN + +R F S  +     P +       I+D +LLS +  L  +E
Sbjct: 1366 LQSKLY-YLPQPAGLNAKQYRAFKSFSQNFHFSPSTIHQLPKYILDGDLLSKFVKLNQKE 1424

Query: 1403 QLEIAHQTGTTRSQILSNLND 1423
            +  +A   G+   +IL+ L +
Sbjct: 1425 RRLLASSVGSNTDEILTALKN 1445



 Score =  248 bits (633), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 198/738 (26%), Positives = 331/738 (44%), Gaps = 119/738 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS-AASLELVCHYRLHGNVES 115
           NLV++  N +++Y +       K  KN   T ++  +  +    SLEL+   +L G +ES
Sbjct: 31  NLVLSKNNTLQVYKI-------KYVKNENTTTQQKQIKKVEIKPSLELLIELKLFGTIES 83

Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
           +A +   G +    +DS++L F DAKISVL+++  I    I S+H +E+ E+   K GR 
Sbjct: 84  MASVRYPGEN----KDSLLLTFRDAKISVLDYNIDIMDFEIRSLHFYENDEF---KNGRI 136

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
            F   P++K+D Q RC  +L+Y   +++L   Q  S L  +++             ++  
Sbjct: 137 HFKHPPILKIDTQQRCATMLLYDRNIVVLPFKQISSILDDEDEEEKDEEDEKENDNANQD 196

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
                 D     +F F++GY EP ++ LHE   TW  R++ K  T  ++A+SI+ + K  
Sbjct: 197 YTEEFDDDDDDNNFCFLYGYYEPTILFLHEPSQTWTSRIAVKRLTSQLTAISINFSTKLA 256

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA-VSLDSSQE 354
            +IW   N+P++  +L++VP P+ G LV+  N + + +Q++   LA+N YA + +    E
Sbjct: 257 SIIWHTSNMPYNCDQLVSVPEPLSGALVITPNIMFHVNQTSKYGLAVNEYANIDIGDKFE 316

Query: 355 LPRS---SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
            P     +    LD ++  +L+ D  + S K G+L++  ++ DGR VQR+ +SK   SVL
Sbjct: 317 FPLDETLNLVFTLDRSNFVFLEADKFIGSLKGGELLIFHLISDGRTVQRIHVSKAGGSVL 376

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
            + +  + ++L FLGSRLGDSLL+Q+T  S T        +E  + E  +   K+ + S 
Sbjct: 377 ATCMCVVSDNLLFLGSRLGDSLLLQYTEKSIT--------DESLEHENFSNPYKKQKTSE 428

Query: 472 SDAL-----------QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
            + L            D V  EE  L+    N  +S Q      + D ++N+GP+ D   
Sbjct: 429 QEKLLNQQQQQQKDEMDEVLDEEDELFKEKKNQLKSYQ----LGICDQILNVGPVGDMVI 484

Query: 521 GLRINADASATGISKQSNY--ELVELPGCKG----------------------------- 549
           G  +N       +     Y    +EL  C G                             
Sbjct: 485 GQALNPTYDLNTLPSDPAYMPRFLELVTCSGYGKNGSISILQNSVKPEIVGAFDSEGVVN 544

Query: 550 -IWTVYHKSSRGHNADSSR------------------------MAAYDDEYHAYLIISLE 584
             WTVY+K+S     D                               +++Y  YL IS+ 
Sbjct: 545 SFWTVYNKASSSIKEDEEEKLIGKKRTINEIIKEEQQYEQQQQKQPIEEDYLDYLYISMS 604

Query: 585 ARTMVLETADLLTEVTESVDYFVQG----RTIAAGNLFGRRRVIQVFERGARIL-DGSYM 639
             T    T  L T  +E      +G    RT+  GNLF +RR++ + E   ++L D + +
Sbjct: 605 NGT----TNILDTTSSEEGKLTFKGEFEYRTLDMGNLFNKRRIVLINENSIKLLNDYNNI 660

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
            Q++                 + S  I DPYVL+  SD SI+L   D     ++    + 
Sbjct: 661 VQEIKLS------------KPIKSTFIQDPYVLVHYSDNSIQLFKCDYKLLKLNQFNFSL 708

Query: 700 IESSKKPVSSCTLYHDKG 717
               +  V + +L+ DK 
Sbjct: 709 NHGDEGKVLTSSLFFDKN 726


>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
            CBS 112818]
          Length = 1398

 Score =  269 bits (688), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 358/1474 (24%), Positives = 606/1474 (41%), Gaps = 202/1474 (13%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V   ++++++ +     GS  +    +  R    D    A L L   Y + G +  L
Sbjct: 28   NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +    + +    D+I+++  +AK+S++E+D   HG+   S+H +E  E  H+      
Sbjct: 85   QRVRISNSKSGG--DAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141

Query: 177  FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
                P  + VDP G C  +  +G+  + IL   Q G  LV D+               D 
Sbjct: 142  LGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSDA 200

Query: 220  FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
              S  G +A  +    S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201  QKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260

Query: 275  SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
              +      S  ++    +    + +   LP D +K++ +P P+GG L++G N  +H   
Sbjct: 261  IDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320

Query: 334  QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
               + A+ +N +A    +     +S   + L+      L +     LL    G + +L+ 
Sbjct: 321  AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380

Query: 392  VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
              DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+ ++  
Sbjct: 381  KVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSRP 436

Query: 441  SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
            S T+   S  K   G  E+ A           D   D +  ++L     AS   E     
Sbjct: 437  SSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLHSVNPASIRQEKQVVN 494

Query: 497  --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
              +   F+F   D L ++GP +D + G    + +     S  +    +EL   +G     
Sbjct: 495  GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKSG 554

Query: 551  -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
              TV  +    +  DS +M   DD Y  + I  ++ +   T +  + D   LL +     
Sbjct: 555  GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTRLSRSYDKYLLLAKAKGDD 611

Query: 601  --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
              ESV Y V                +  T+  G L    RV+QV     R  D  Y    
Sbjct: 612  KEESVVYSVGSSGLDSIDAPEFNPNEDCTVDIGTLATGTRVVQVLRTEIRSHD--YNLGL 669

Query: 643  LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
                P   E  S  E  TV+  S A+PY+L    D S+ +L  D +     V VQ  AA 
Sbjct: 670  AQIYPVWDEDTS--EERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726

Query: 701  ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYES 760
              S K VS C LY DK           + + S    E     + GP    +I   +    
Sbjct: 727  --SGKWVSGC-LYEDK----------MNIFFSDFDIEN----EAGP----NILLFLLDVD 765

Query: 761  GALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
            G L IF +PN +  +  VD                 L  S     SSS        +E +
Sbjct: 766  GNLSIFRLPNISEPLCRVDNL--------------NLLPSNLPYESSSRRPV---NRETL 808

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
              + + +L      A H  P++        ++ Y+ Y   G    S         R L  
Sbjct: 809  TELLIADLG----DAIHKSPYMILRTKHDDLVLYEPYRIAGESGHSGL-------RFLKA 857

Query: 880  SN--VSASRLR---NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
             N  V   R     N   +R+P            + C+ +    ++ G++  F+SG  PC
Sbjct: 858  VNHVVMGPRTDQGVNHDINRSP------------SSCKLLRALPDVCGYKTVFMSGHNPC 905

Query: 935  WCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
            + ++     R H     G  V + +  H   C  GF YV    ++++ +LPS + +D+ W
Sbjct: 906  F-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSGW 964

Query: 994  PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
              +KI L      I Y +    Y +  S        +   L  D E   +  N  ++ + 
Sbjct: 965  ATRKIALGEQVDSIVYSSASECYVIGTSA------KEDFKLPEDDESHTEWRNEFITFLP 1018

Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLA 1112
                  +E   V++LEP      W T  +  ++ +E    + V+ L  +  T E + ++ 
Sbjct: 1019 -----QLERGTVKLLEPKN----WSTIDSHELKPAERITCIEVIRLEISELTHERKDMVV 1069

Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE-LKGAISALASL--QG 1165
            +G++ V+GED+  +G + +F        P    ++   ++++KE +KGA++AL+ +  QG
Sbjct: 1070 VGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQG 1129

Query: 1166 HLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLS 1221
             L++A G K ++   K  G+ L  +AF D    YV  L  +K     ++GD  K ++F+ 
Sbjct: 1130 FLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYVNVLKELKGTGMCIIGDAFKGLWFIG 1187

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            + E+  +L+L  K+  +L     +FL DG+ L ++V+D+  N+ +  Y P+   S KG +
Sbjct: 1188 YSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDR 1247

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSS----DRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1337
            LL R+ FH G   +    L   A +     D       S   +++ +L     GSI  I 
Sbjct: 1248 LLHRSVFHTGHFASTMTLLPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQTGSIAVIT 1307

Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
            PL E ++RRL +LQ +LV+++ H   LNPR +R   S+G   + G   ++D  LL  +  
Sbjct: 1308 PLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDGMGGQRG---MIDGNLLLRWLD 1364

Query: 1398 LPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            +  + + EIA + G     I ++L  L  G ++L
Sbjct: 1365 MGAQRKAEIAGRVGADVGAIRTDLEKLHGGLAYL 1398


>gi|340924328|gb|EGS19231.1| hypothetical protein CTHT_0058560 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1460

 Score =  268 bits (684), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 365/1506 (24%), Positives = 603/1506 (40%), Gaps = 247/1506 (16%)

Query: 57   NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD--GISAA------- 99
            NLVV  +++++++  +        +Q  G+ + +++   + R+  D  G+ A+       
Sbjct: 28   NLVVAKSSLLQVFRTKTVTTEIDTLQTNGASKGRSAARYENRLANDDDGLEASFLGGDSL 87

Query: 100  ----------SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
                       L LV    L G V  L+ +    + +    +S++LAF DAK+S++E+D 
Sbjct: 88   GFRADRTTNTKLVLVYETPLAGTVIGLSKIKTSTSRSGC--ESLLLAFRDAKLSLVEWDA 145

Query: 150  SIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMII 203
              + L   S+H +E  E       + S    PL      +  DP  RC  +      + I
Sbjct: 146  ERNALGTVSIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALKFAARNLAI 199

Query: 204  LKASQGGSGL-VGDEDTFGSG------------GGFSARIES-----SHVINLRDLD--M 243
            L   Q    + +GD D    G               ++ IE      S V+ L +LD  +
Sbjct: 200  LPFRQVDEDIDMGDWDEELDGPRPQKDVSNAAVSNGASNIEDTPYSPSFVLRLSNLDPSL 259

Query: 244  KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
             H     F+H Y EP   IL              H+T M+  L +    K    I S   
Sbjct: 260  LHPVHLAFLHEYREPTFGILASTSSASNALGRKDHYTYMVFTLDLQQ--KASTTILSVSG 317

Query: 304  LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            LP D Y+++ +P+P+GG L+VG N  IH         +A+N       S     +S  ++
Sbjct: 318  LPQDLYRVVPLPAPVGGALLVGCNELIHIDQSGKPNGVAVNPMTKQCTSFGLADQSDLNI 377

Query: 363  ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDIT 416
             L+      L  D+   L+    G +VL+T   DGR V  L+L    P    +++   I+
Sbjct: 378  RLEGCIIDVLTPDLGEFLMILNDGRMVLITFRIDGRTVSGLELRLVPPASGGTIIPGRIS 437

Query: 417  T---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            T   IG ++ F GS  GDSL+  +T        +   + +    + D             
Sbjct: 438  TLSRIGKNVMFAGSEEGDSLVFGWT-----KKQTQAGRRKSKPRDDDFYMDDYEEEEEEV 492

Query: 474  ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------------ 521
               D+   E  S +   S  +       SF + D L++I P++  +YG            
Sbjct: 493  DEDDLYGEETTSHHQPVSAASSLLSGDLSFRIHDRLISIAPIQSMTYGQPVWMPGSEEER 552

Query: 522  --LRINAD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNAD 564
              + ++AD     A G  K S    +            E    +G WT+  K     +  
Sbjct: 553  NSIGVHADLQLVCAVGRDKSSCLATMNLAIQPKVIGQFEFSEARGFWTMCAKKPIPKSLQ 612

Query: 565  SSRMAA------YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGR 610
            S +  +      YD   +Y  ++I++       E   +   TA     +  +      G 
Sbjct: 613  SDKGVSVLGGNDYDTGGQYDRFMIVAKVDLDGYEKSDVYALTAAGFEGLCGTEFDPAAGI 672

Query: 611  TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
            TI AG +    R++Q+ +   R  DG +    +   P   E  +G+E   V + SIADPY
Sbjct: 673  TIEAGTMGKGSRIVQILKSEVRSYDGDFGLSQIV--PMMDEE-TGAEPRAV-TASIADPY 728

Query: 671  VLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
            +L+   D S  +   D S     ++    +  S K +S C LY+D               
Sbjct: 729  LLIIRDDSSAFIAGIDSSNELEELRKEDKVLVSSKWLSGC-LYND--------------- 772

Query: 731  LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVD 789
             ST +          P     I   +   SGAL I+ +P+ +  ++  D           
Sbjct: 773  -STAIFAEETAKSSKPTQS--ILLFLLSSSGALYIYRLPDLSKPIYVTDGLA-------- 821

Query: 790  TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGT 849
             Y+  AL    T       +GT    KE I  + V +L        H  P+L    ++  
Sbjct: 822  -YIPPALSSDFT-----VRKGT---PKEAITEIMVADLG----DTTHKSPYLILRHSNDD 868

Query: 850  ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
            +  YQ Y ++           + T +  S   +   +L N  F+R P +   +++ P   
Sbjct: 869  LTIYQPYRYK-----------LGTGQVFS-KTLFFQKLPNPSFARAP-EETEQDDVPPQP 915

Query: 910  PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNVNCNHG 968
                +    NI+G+   FL G  P + +   + + RV P L    ++A +  H   C+HG
Sbjct: 916  RLLSMRRCNNIAGYSTVFLPGHSPSFILKSAKSMPRVVP-LQGAGVIAMSPFHTEGCDHG 974

Query: 969  FIYVTSQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1027
            FIY  S  I ++ Q+P   +Y      V+K+P+      + Y   +  Y  +V     +P
Sbjct: 975  FIYADSHNIARVTQIPEDWSYAELGLAVKKVPIGEDIAAVAYHPPQQCY--VVGCNASEP 1032

Query: 1028 LNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1086
                       E+    D H   +  +L    T++   ++++ P      W    T+ ++
Sbjct: 1033 F----------ELPKDDDYHKEWARENLVFKPTLDRGLLKLISPIT----WTVIDTVQLE 1078

Query: 1087 SSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1145
              E  L V  + L  + +T E   L+A+GTA  +GED+  RGRV ++        P    
Sbjct: 1079 PCETVLCVETLNLEVSESTNERRQLIAVGTALTKGEDLPTRGRVHVYDIADVIPEPGKPE 1138

Query: 1146 TEVYSKELK---------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYD 1192
            T   SK+LK         GA++AL+ +  QG +L+A G K ++   K  GT L  +AF D
Sbjct: 1139 T---SKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMD 1194

Query: 1193 APPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDG 1250
                YV +   +      L+ D  K ++F+ + E+  ++ L  K    L+    +FL DG
Sbjct: 1195 MS-CYVTAAKELPGTGLCLMADAFKGVWFVGYTEEPYKMMLFGKSSTKLEVLTADFLPDG 1253

Query: 1251 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH-VTKFLRL-QMLATSSD 1308
              L +V  D   +I I  + P+  +S +G  LL R  F+ GAH  TK L L   L T + 
Sbjct: 1254 KELFIVACDADGHIHILQFDPEHPKSLQGHLLLHRTSFNTGAHNPTKSLLLPSTLPTDTP 1313

Query: 1309 RT------------------GAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1350
             T                    AP         LL  +  G I  + PL E ++RRL SL
Sbjct: 1314 STIDGSNPNTNNTNGTPNASNLAPYDATERPHILLLCSPTGLIAALRPLSESSYRRLSSL 1373

Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS-----IVDCELLSHYEMLPLEEQLE 1405
              +LV+S+PH AGLNP+ +R    +      G D+     IVD  +L  +  L +  + E
Sbjct: 1374 AAQLVNSLPHAAGLNPKGYRM--PSADCPPAGVDASVGRNIVDGTVLERFTELGMARRAE 1431

Query: 1406 IAHQTG 1411
            +A + G
Sbjct: 1432 LAGRAG 1437


>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
            melanogaster]
          Length = 813

 Score =  264 bits (674), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 224/823 (27%), Positives = 373/823 (45%), Gaps = 89/823 (10%)

Query: 659  STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD--- 715
            S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D   
Sbjct: 10   SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKDLSG 69

Query: 716  ----KG----------------------PEPWLRKTSTDAWLSTGVGEAI------DGAD 743
                KG                       EP ++    +  L    G A       D A 
Sbjct: 70   LFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMADLAK 129

Query: 744  GGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTY 791
                   D +             VV  +SG LEI+ +P+   V+ V+   +G   + D  
Sbjct: 130  QSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLTDAM 189

Query: 792  MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTIL 851
              E +  S T   +S          ++ +S   +EL++     +  RP L  + T   +L
Sbjct: 190  --EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRVELL 246

Query: 852  CYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP- 910
             YQ  +F  P+   K        R L   N+   +  ++       D     E+    P 
Sbjct: 247  IYQ--VFRYPKGHLKI-----RFRKLDQLNLLDQQPTHIELDEN--DEQEEIESYQMQPK 297

Query: 911  -CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG 968
              Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN  +G
Sbjct: 298  YVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNG 357

Query: 969  FIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
            F+Y  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  E  +Y LI      +P+
Sbjct: 358  FLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTE--EPM 415

Query: 1029 NQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RATIP 1084
             +       D+E+  +       S D    Y +  ++E+ ++ P+     W+    A+I 
Sbjct: 416  TKYYRFNGEDKELSEE-------SRDERFIYPIGSQFEMVLISPET----WEIVPDASIT 464

Query: 1085 MQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1143
             +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++        P  
Sbjct: 465  FEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPEPGK 524

Query: 1144 LVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
             +T+   KE+     KG +SA++ + G L+   G KI + +    +L G+AF D   +YV
Sbjct: 525  PMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDTN-IYV 583

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
              +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ +  EF++D S L  +V+
Sbjct: 584  HQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGFLVT 643

Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1318
            D ++NI ++ Y P+  ES  GQKLL +A++H+G  V    R+Q       +         
Sbjct: 644  DAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL---Y 700

Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
             N+  +++GTLDG++G   PL E  +RR   LQ  L+    H+ GLNP+ +R   S+ K 
Sbjct: 701  ENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQ 760

Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
                   I+D +L+  Y ++   E+ E+A + GT   +IL +L
Sbjct: 761  GINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 803


>gi|55725165|emb|CAH89449.1| hypothetical protein [Pongo abelii]
          Length = 565

 Score =  264 bits (674), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 182/536 (33%), Positives = 282/536 (52%), Gaps = 65/536 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
                          L I        + + + + K    ++V   ELPGC  +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 538


>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
 gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
          Length = 1458

 Score =  263 bits (673), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 359/1511 (23%), Positives = 624/1511 (41%), Gaps = 247/1511 (16%)

Query: 57   NLVVTAANVIEIYVVRVQE-------EGSKESKN---------SGETKRRVLMDGISAAS 100
            NLVV  +N++ I+ VR +        E  +E K           GE       DG    S
Sbjct: 40   NLVVARSNLLRIFEVREEPCAVPHGVEDERERKGGIRRGTEAVEGELAMDAQGDGFINVS 99

Query: 101  ----------------LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
                            L LV  ++LHG V  L+ + +  A    + D ++++F+DAKI++
Sbjct: 100  KGMAMKSDVEHPKTTRLYLVREHKLHGMVTGLSGV-RIIASLEDKLDRLLVSFKDAKIAL 158

Query: 145  LEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVK----VDPQGRCGGVLVYGL 199
            LE+ D++H L   S+H +E +P+   L          PL K    VDPQ RC  + +   
Sbjct: 159  LEWSDAVHDLVPVSIHTYERAPQLTSLT--------APLFKSQLRVDPQSRCAALGLPNH 210

Query: 200  QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYI 256
             + IL               F            S +++L    + ++++V DF F+ G+ 
Sbjct: 211  ALAILP--------------FLDDAVSDVPYSPSFILDLAVSVNPNIRNVADFCFLPGFN 256

Query: 257  EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
            +P + ++ E   TW GR+     T  +   ++      +P+I S   LP D+  L  VP+
Sbjct: 257  KPTLAVMFEPLQTWMGRIGEYKDTVKLVIFTLDIKTSSYPIITSVDGLPMDSLGL--VPA 314

Query: 317  PIGGVLVVGANTIHYHSQSAS--CALALNNYA--VSLDSSQELPRSSFSVELDAAHATWL 372
              GGV++   N++ Y  QS+S   A+ +N +A  ++      LP    ++ L+ +    +
Sbjct: 315  -FGGVVITTPNSLIYIDQSSSRQIAVPVNGWASRITDLPLLPLPSPDLNLTLEGSKTVVV 373

Query: 373  QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGS 427
                  +    G +  + V+ DG+ V +L + K     T PSV+ S              
Sbjct: 374  DEKTLFVILANGIIYPIEVMADGKTVTKLQVGKPLAQATIPSVVES-------------- 419

Query: 428  RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-----GE 482
             LGD  L   +      +L +   EE  D E +  + K +       L D  +      +
Sbjct: 420  -LGDGHLFVGSTVGVGVVLKTAWVEEEVDDEEEGTNAKVVEDDIDMDLYDDDDDLYGDSK 478

Query: 483  ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG---- 532
              +   +   +T+  +     ++RD+L   GP+   ++ L    D       +ATG    
Sbjct: 479  NKTQVTAEVKDTKKYRSVLHLSLRDTLPAYGPISSLTFSLATEGDKPVPELVTATGSGIL 538

Query: 533  ---------ISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----- 578
                     +  ++  +++ + G +G+W++  + S      SS   A +   HA      
Sbjct: 539  GGFTLFQRDLPTRTKKKILAVGGTRGLWSLPIRQSVKKGGSSSSTTAIE---HAKTERDT 595

Query: 579  LIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            LI+S +A       R          TEV  ++   V G T+ A   F R  ++ V     
Sbjct: 596  LILSTDATPSPGVSRIATRAPPGGKTEV--NITTRVPGTTVGAAPFFQRTAILVVMTNSI 653

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSE------NSTVLSVSIADPYVLLGMSDGSIRLLVG 685
            ++L+           P  +E  +  +         + S SI DP+VL+   D S+ L +G
Sbjct: 654  KVLE-----------PDGTERQTIQDMDGKLLRPKIRSCSICDPFVLIIREDDSLGLFIG 702

Query: 686  DPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGA 742
            +     +  +  + + E + K ++ C      G      +TS     +T   + +   G+
Sbjct: 703  ETERGKIRRKDMSPMGEKTSKYLAGCFFTDTSGLFGQQFETSVPVEGATATLQNVVSGGS 762

Query: 743  DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
              G   Q   + ++    G +EI+ +P     F+V    S    +VD++ + AL  S   
Sbjct: 763  TSGGKPQHTQWLLLVRPQGVMEIWTLPKLTLAFSVSAVPSLFNVLVDSHDKPAL--SVPN 820

Query: 803  INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
                 +   G+   E +   +V E           R  LF  L +G +  Y+A     P 
Sbjct: 821  PGDPPQRKPGEFDVEQVCVSRVGE-------DGRGRVCLFVFLRNGQLTIYEAL----PL 869

Query: 863  NTSKSDDPVSTSRSLSV--------SNVSASRLR----NLRFSRTPLDAYTREETPHGA- 909
            +T+ S    S   ++ V          V A + R    N++F +    A+  +    GA 
Sbjct: 870  STTASQPAASVDGAMDVDASSTEPQQQVEAEKKRSQTLNIKFVKISSIAFEIQRHEDGAE 929

Query: 910  ------------------PCQRITIFKNIS----------GHQGFFLSGSRPCWCM-VFR 940
                                QR+ +   +            + G F +G +P W +   +
Sbjct: 930  GGERGSGERASGILAEHKKIQRLFVPFTVKPKTSDGTPSPTYSGVFFTGDKPNWIIGTDK 989

Query: 941  ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL 1000
              ++++P      + +F+          F+  T  G   I  LP   TY +  P + IP 
Sbjct: 990  GGVQIYPS-GHNVVHSFSACSLWEERGEFLVYTEDGPCLIEWLPD-FTYSHPLPARSIP- 1046

Query: 1001 KATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTV 1060
            +   +    F       LIV+   +    Q      D++ G ++   +   VD   T T 
Sbjct: 1047 RGRGYSNVVFDPSTC--LIVAASSM----QARFASYDED-GVRVWEKDGPGVDDPITDT- 1098

Query: 1061 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQ 1119
                + ++ P+     W T       ++E    + +VTL    T+  ++  +A+GT   +
Sbjct: 1099 --SALELISPNS----WITMDGFEFATNEYINDISIVTLETAATETGSKDFIAVGTTIDR 1152

Query: 1120 GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASLQGHLLIASGP 1173
            GED+AA+G   +F       +P    T  Y   L      KG ++A+   QG+L+ + G 
Sbjct: 1153 GEDLAAKGAAYIFEIVEVVPDPAISPTRWYKLRLRCRDDAKGPVTAVCGFQGYLVSSMGQ 1212

Query: 1174 KIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1232
            KI +  +   E L G+AF D   +YV SL ++KN +L+GD  KS+ F++++E   +L LL
Sbjct: 1213 KIFVRAFDSDERLVGVAFMDVG-IYVTSLRVLKNLLLIGDAVKSVMFVAFQEDPYKLVLL 1271

Query: 1233 AKDFGSLDCFATEFLIDGS-TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH-- 1289
            AKD         +F  +    L+L+V DE+  ++I+ Y P   +S  G+ LL R E+H  
Sbjct: 1272 AKDVHLHSVTRADFFFNADGDLALIVGDEEGIMRIYEYNPNDPDSRDGRYLLLRTEYHGQ 1331

Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
            V  H +         T + R    P   +++   LL G+ DGS+  + P+DE  F+RLQ 
Sbjct: 1332 VPYHTS--------TTIARRDKEDPSIPQSH---LLIGSADGSLSSLVPVDEYAFKRLQL 1380

Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
            LQ +L  ++ HVAGLNP++FR    N    +P    I+D +LL+ YE LP+  Q E+  Q
Sbjct: 1381 LQGQLTRNIQHVAGLNPKAFR-IVKNDYVSKPLSKGILDGQLLAQYESLPIPRQNEMTKQ 1439

Query: 1410 TGTTRSQILSN 1420
             GT R  +L +
Sbjct: 1440 IGTERGVVLRD 1450


>gi|396471273|ref|XP_003838832.1| similar to cleavage and polyadenylation specificity factor subunit A
            [Leptosphaeria maculans JN3]
 gi|312215401|emb|CBX95353.1| similar to cleavage and polyadenylation specificity factor subunit A
            [Leptosphaeria maculans JN3]
          Length = 1402

 Score =  263 bits (672), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 339/1426 (23%), Positives = 584/1426 (40%), Gaps = 230/1426 (16%)

Query: 57   NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
            NLVV   ++++I+ ++     V  EG  E  N+      E          + A L LV  
Sbjct: 28   NLVVAKNSLLQIFEIKSTTTEVTPEGGDEVDNAAANLDTEAADVQFQRTENTAKLVLVAE 87

Query: 107  YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
            + L G V SLA +    A N++ R +++++AF DAK+S++E+D   + L   S+H +E+P
Sbjct: 88   FPLAGTVISLARIK---ALNTKSRGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144

Query: 166  E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGG-------SG 212
            +      W    +   +F     +  DP  RC  +      + IL   Q         S 
Sbjct: 145  DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQSDLVEDDYDSD 199

Query: 213  LVGDEDT------FGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
            L G  DT        SGG  + +    SS V+ L +LD  + H     F+H Y EP   I
Sbjct: 200  LDGPRDTKPDQAEAPSGGETTHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHQYREPTFGI 259

Query: 263  LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
            +          ++ +      S  ++    K    + S   LP+D  +++ +P PIGG L
Sbjct: 260  IAASRAAAPSLLANRKDILTYSVFTLDLEQKASTTLLSVTGLPYDISRVVPLPHPIGGAL 319

Query: 323  VVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
            ++G N  IH      +  +A+N +A +  S     +S  ++ L+  +   L  D    +L
Sbjct: 320  LLGNNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALHLEGCNVELLSQDTGDVVL 379

Query: 380  STKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLTSDITTIGNSL---FFLGSRLGDS 432
                G L+++T   +GR V  + +          VL +  +   N +    F+GS  G+S
Sbjct: 380  VLNNGRLLIMTFTLEGRTVSGMTIQTVAADHGGHVLKAGSSCTSNLVRGRLFIGSEDGES 439

Query: 433  LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG--SA 490
            +L+      G S  ++ L+    ++  D   T        D L D +  +        +A
Sbjct: 440  VLL------GWSSATASLRRRHSNVGLDGDGTSEEEEEDIDDLDDDLYNDTAPAVQKITA 493

Query: 491  SNNTESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATG 532
            + +  +   T+SF + D+L +I P++D                   S G    A  + TG
Sbjct: 494  AASEPTPPGTYSFRIHDTLPSIAPIRDAVLHPGKVTDSLNRGEIMLSTGR--GAAGAITG 551

Query: 533  ISKQ---SNYELVELPGCKGIWTVYHKSS------RGHNADSSRMAAYDDEYHAYLIISL 583
            + ++    +    ELP   GIW V+ +             D+    + D +Y  YL++S 
Sbjct: 552  LDRELHPVSLAASELPSTHGIWAVHARKQAPGGVVTAFGEDTEANMSTDVDYDQYLVVSK 611

Query: 584  EAR-----TMVLET-ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
             +      T+V E   + L+E  +      +G T+  G L    +V+QV     R  D S
Sbjct: 612  TSEDGSESTVVYEVHGNELSETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEVRTYD-S 670

Query: 638  YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP 697
             +  D      + E+G+      V++ S ADPY+L+         L  D S   +     
Sbjct: 671  ELNMDQILPMEDEETGN---ELRVINASFADPYLLV---------LREDSSVKILKASGD 718

Query: 698  AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVC 757
              +E  +                  R  S+  WLS  + ++    +        I++ + 
Sbjct: 719  GELEDLEA-----------------RGLSSTKWLSASLFKSATFTE--------IFAFLL 753

Query: 758  YESGALEIFDV--PNFNC-VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
               G L IF +  P   C V     F+     +     R A K + TEI           
Sbjct: 754  TPEGGLHIFAMSEPEKPCYVAEALGFLPPLLTVDFVPRRSAAKATITEI----------- 802

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                        LA          P L    +   ++ Y+A+ F                
Sbjct: 803  ------------LAADLGDVTTRSPHLIIRTSSDDLVIYKAFHFP--------------- 835

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRP 933
             S S +++    LR ++ ++  +  Y  +     A  +   +   ++ G+   F  G+ P
Sbjct: 836  -SRSAADLWTKNLRWIKLAQQHVPRYVEDAGSEDAGVESTLLALDDVCGYSTVFQRGASP 894

Query: 934  CWCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             +  +F+E     P+   L    +   T  H  +C  GF YV S   L+I QLPS + + 
Sbjct: 895  SF--IFKEA-SSSPRVIGLSGKPVKGLTTFHTSSCERGFAYVDSTDTLRISQLPSRTHFG 951

Query: 991  NY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVLSLLIDQEVGHQIDN 1046
            +  W  +++P+ A  + + Y     LY +    P   VL P           E  H    
Sbjct: 952  HLGWATRRLPMDAEVYALAYHP-AGLYVVGTGQPEDFVLDP----------SETYH---- 996

Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTK 1105
            + L   D+    +VE   +++++     G W    T      E  L ++ + L  + TT 
Sbjct: 997  YELPKEDISFKPSVERGVIKLIDE----GTWSIIDTHVFDPQEVVLCIKALNLEVSETTH 1052

Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1160
            + + L+A+GT+ V GED+A +G + +F        P    T      +   E+KGA+SA+
Sbjct: 1053 QRKDLIAVGTSIVHGEDLATKGCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAI 1112

Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
            + L  QG L++A G K ++   K  GT L  +AF D    YV +L  + N   +L+GD +
Sbjct: 1113 SELGTQGFLIMAQGQKCMVRGLKEDGTLLP-VAFMDMQ-CYVTTLKTLPNTGMLLMGDAY 1170

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            + ++F  + E+  +++L  +   +L+  A EFL     L ++V+D   NIQ+  + P+  
Sbjct: 1171 RGVWFTGYTEEPYKMSLFGRSKHNLEAMAVEFLPFNGELHIIVADADMNIQVLQFDPENP 1230

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGA----APGSDKTNRFAL--- 1324
            +S +G +LL +A FH G   T    LQ    +  S+   G     AP S  +    L   
Sbjct: 1231 KS-EGSRLLHKATFHTGHFPTTTHLLQSHLQMPESASTFGTTDTFAPDSTPSAPLPLHQV 1289

Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
            L  +  G++  I PL E ++RRL +L   L++++    GLNP +FR
Sbjct: 1290 LITSQSGTLALITPLSESSYRRLSNLAAYLINTLESPCGLNPVAFR 1335


>gi|451849663|gb|EMD62966.1| hypothetical protein COCSADRAFT_92785 [Cochliobolus sativus ND90Pr]
          Length = 1405

 Score =  261 bits (667), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 333/1424 (23%), Positives = 587/1424 (41%), Gaps = 223/1424 (15%)

Query: 57   NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
            NLVV   ++++++ ++     V   G  E++N+      E     L    S A L LV  
Sbjct: 28   NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87

Query: 107  YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
            + L G V SLA +         R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88   FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYNLHTISIHYYENPD 145

Query: 167  ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ-------GGSGL 213
                  W    +   +F     +  DP  RC  +      + IL   Q         S  
Sbjct: 146  LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200

Query: 214  VGDEDTF----GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
             G +++      + G  +    SS V+ L +LD  + H     F+H Y EP   I+    
Sbjct: 201  DGPKESKLEQQAASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAASR 260

Query: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
             T    ++ +      S  ++    K    + S   LP+D  +++ +PSPIGG L+VG+N
Sbjct: 261  DTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVGSN 320

Query: 328  -TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTG 384
              IH      +  +A+N +A +  S     +S  ++ L+      L ++    L+    G
Sbjct: 321  EIIHVDQGGKTSGVAVNEFAKTCTSFPLSDQSDMALRLEGCSVELLSHEAGDVLIVLNNG 380

Query: 385  DLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
             L++LT   DGR V  + +                S  + +G    F+GS  GDS+++ +
Sbjct: 381  RLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVMLGW 440

Query: 438  TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTES 496
            T        +S L+ +  +   D            D   D+ N    ++   +A+ +  +
Sbjct: 441  TS------TASHLRRKQSNANIDTDEDMSDEEDMDDMEDDLYNDTAPAVQKITAAASEPT 494

Query: 497  AQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL--- 541
            A  T++F + D L +I P+K+            + G + ++    A       N EL   
Sbjct: 495  APGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEIMLSTGRGAAAAITALNRELHPV 554

Query: 542  ----VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR----- 586
                 +LP  +G W V+ +       +     D     A + +Y  YL++S         
Sbjct: 555  TAATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGEDGTES 614

Query: 587  TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
            T+V E   + LTE  +      +G T+  G L    +V+QV     R  D S +  D   
Sbjct: 615  TVVYEVNGNELTETDKGDFEREEGSTLFVGILAAGTKVVQVMRTEIRTYD-SELNMDQIL 673

Query: 646  GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
               + ESG+      V++ S ADPY+L+   D S+++               A  +   +
Sbjct: 674  PMEDEESGN---ELNVINASFADPYLLVLREDSSVKIFR-------------ATGDGELE 717

Query: 706  PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
             V +  L             S   WLS  + ++    +        +++ +    G L +
Sbjct: 718  DVEATGL-------------SNSQWLSASLFKSASFTE--------VFAFLLTPEGGLRV 756

Query: 766  FDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKENIHSM 822
            F V +      V + +S    ++   Y+  R A+K + TEI                   
Sbjct: 757  FAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI------------------- 797

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
                LA     A    P L    +   I+ Y+A+                 S S S +++
Sbjct: 798  ----LAADLGDATTKSPHLIIRTSSDNIVIYKAF----------------HSPSRSAADL 837

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRE 941
                LR ++ S+  +  YT +     +  +   +   +I G+   F  G+ P +  +F+E
Sbjct: 838  WTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLALSDIGGYSTVFQRGTTPAF--IFKE 895

Query: 942  RLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-WPVQK 997
                 P+   L    + + T  H  +C  GF Y+ S   L+I QLP  + Y +  W  ++
Sbjct: 896  SSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGWATRR 954

Query: 998  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1057
            +P+ A  H + Y +   LY +    P    L+         E  H    + L   D+   
Sbjct: 955  MPMDAEIHALAYHS-SGLYIIGAGQPEEYQLDP-------SETYH----YELPKEDMSFK 1002

Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
             T+E   +++L+       W    T  +   E  L+++ + L  +  T + + L+A+GTA
Sbjct: 1003 PTIERGIIQLLDEKT----WAIIDTHVLDPQEVVLSIKTLNLEVSENTHQRKDLIAVGTA 1058

Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLI 1169
             + GED+A +G + +F        P    T      +   E+KGA+SA++ L  QG +++
Sbjct: 1059 ILHGEDLATKGCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFMIM 1118

Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQ 1225
            A G K ++   K  GT L  +AF D    YV  L  +     + + D ++ ++F  + E+
Sbjct: 1119 AQGQKCMVRGLKEDGTLLP-VAFMDMQ-CYVSDLKNLPGTGMLAMSDAYRGVWFTGYTEE 1176

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
              +++L A+   SL+  A +F+     L L+V+D   N+Q+  + P   +S  G +LL +
Sbjct: 1177 PYRMSLFARSKHSLEAIAVDFIPFEEQLHLLVADADMNLQVLQFDPDNPKSEAGSRLLHK 1236

Query: 1286 AEFHVGAHVTKFL-----RLQMLATSSDRTGA-------------APGSDKTNRF-ALLF 1326
            + FH G H    L     RL+M  ++SD  GA             +P    T     +L 
Sbjct: 1237 STFHTG-HFPATLHVVHSRLKM-PSASDFAGANNTENGDFEMDTSSPDDKATQPLHQILC 1294

Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
             T  G++  + PL E T+RRL +L   L +++   AGLNPR+FR
Sbjct: 1295 TTQSGTLALVTPLSEDTYRRLSNLSAYLSNTLDATAGLNPRAFR 1338


>gi|452001482|gb|EMD93941.1| hypothetical protein COCHEDRAFT_1129958 [Cochliobolus heterostrophus
            C5]
          Length = 1385

 Score =  261 bits (667), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 330/1414 (23%), Positives = 585/1414 (41%), Gaps = 223/1414 (15%)

Query: 57   NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
            NLVV   ++++++ ++     V   G  E++N+      E     L    S A L LV  
Sbjct: 28   NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87

Query: 107  YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
            + L G V SLA +         R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88   FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYSLHTISIHYYENPD 145

Query: 167  ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ------------ 208
                  W    +   +F     +  DP  RC  +      + IL   Q            
Sbjct: 146  LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200

Query: 209  -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
             G      ++ T  + G  +    SS V+ L +LD  + H     F+H Y EP   I+  
Sbjct: 201  DGPKESKPEQQT--ASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 258

Query: 266  RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
               T    ++ +      S  ++    K    + S   LP+D  +++ +PSPIGG L+VG
Sbjct: 259  SRDTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVG 318

Query: 326  AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
            +N  IH      +  +A+N +A +  S     +S  ++ L+      L ++    L+   
Sbjct: 319  SNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALRLEGCSVELLSHEAGDVLVVLN 378

Query: 383  TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
             G L++LT   DGR V  + +                S  + +G    F+GS  GDS+++
Sbjct: 379  NGRLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVML 438

Query: 436  QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
             +T        +S L+ +  +   D            D   D+ N    ++   +A+ + 
Sbjct: 439  GWTS------TASHLRRKQSNANIDTDEDMSDEEDMEDMEDDLYNDTAPAVQKITAAASE 492

Query: 495  ESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATGISKQ 536
             +A  T++F + D L +I P+K+                   S G    A AS T ++++
Sbjct: 493  PTAPGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEVMLSTGR--GAAASITALNRE 550

Query: 537  SNYELV---ELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR- 586
             +   V   +LP  +G W V+ +       +     D     A + +Y  YL++S     
Sbjct: 551  LHPVTVATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGED 610

Query: 587  ----TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
                T+V E   + LTE  +      +G T+  G L    +V+QV     R  D S +  
Sbjct: 611  GTESTVVYEVNGNELTETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEIRTYD-SELNM 669

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
            D      + ESG+      V++ S ADPY+L+   D S+++               A  +
Sbjct: 670  DQILPMEDEESGN---EVNVINASFADPYLLVLREDSSVKIFR-------------ATGD 713

Query: 702  SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
               + V +  L             S   WLS  + ++            ++++ +    G
Sbjct: 714  GELEDVEATGL-------------SNSQWLSASLFKSASFT--------EVFAFLLTPEG 752

Query: 762  ALEIFDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKEN 818
             L +F V +      V + +S    ++   Y+  R A+K + TEI               
Sbjct: 753  GLRVFAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI--------------- 797

Query: 819  IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
                    LA     A    P L    +   ++ Y+A+                 S S S
Sbjct: 798  --------LAADLGDATTKSPHLIVRTSSDNLVIYKAF----------------HSPSRS 833

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCM 937
             +++    LR ++ S+  +  YT +     +  +  +    +I G+   F  G+ P +  
Sbjct: 834  AADLWTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLTLSDIGGYSTVFQRGTTPAF-- 891

Query: 938  VFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-W 993
            +F+E     P+   L    + + T  H  +C  GF Y+ S   L+I QLP  + Y +  W
Sbjct: 892  IFKESSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGW 950

Query: 994  PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
              +++P+ A  H + Y +   LY +    P    L+         E  H    + L   D
Sbjct: 951  ATRRMPMDAEIHALAYHS-SGLYIVGTGQPEEYQLDP-------SETYH----YELPKED 998

Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLA 1112
            +    T+E   +++L+       W    T  +   E  L+++ + L  +  T + + L+A
Sbjct: 999  MSFKPTIERGIIKLLDEKT----WTIIDTHVLDPQEVVLSIKTLNLEVSENTHQRKDLVA 1054

Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QG 1165
            +GTA + GED+A +G + +F        P    T      +   E+KGA+SA++ L  QG
Sbjct: 1055 VGTAILHGEDLATKGCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQG 1114

Query: 1166 HLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLS 1221
             +++A G K ++   K  GT L  +AF D    YV  L  +     + + D ++ ++F  
Sbjct: 1115 FMIMAQGQKCMVRGLKEDGTLLP-VAFMDM-QCYVSDLKNLPGTGMLAMSDAYRGVWFTG 1172

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            + E+  +++L A+   SL+  A +F+     L L+V+D   N+Q+  + P   +S  G +
Sbjct: 1173 YTEEPYRMSLFARSKHSLEAIAIDFIPFEEQLHLLVADADMNLQVLQFDPDNPKSEAGSR 1232

Query: 1282 LLSRAEFHVGAHVTKFL-----RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
            LL ++ FH G H    L     RL+M  ++SD     P         +L  +  G++  +
Sbjct: 1233 LLHKSTFHTG-HFPATLHVVHSRLKM-PSASDFAATQP------LHQILCTSQSGTLALV 1284

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
             PL E T+RRL +L   L +++   AGLNPR+FR
Sbjct: 1285 TPLSEDTYRRLSNLSAYLSNTLDATAGLNPRAFR 1318


>gi|440637976|gb|ELR07895.1| hypothetical protein GMDG_02777 [Geomyces destructans 20631-21]
          Length = 1495

 Score =  261 bits (666), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 350/1450 (24%), Positives = 594/1450 (40%), Gaps = 216/1450 (14%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
            S   L LV  Y L G V SLA +    +++    +S++L+F+DAK+S++E+D   HGL  
Sbjct: 147  STTKLVLVGEYALAGTVTSLARIKI--SESKSGGESLLLSFKDAKLSLVEWDPERHGLST 204

Query: 157  TSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS 211
             S+H +E  E     W        ++     +  DP+GRC  +      + IL   QG  
Sbjct: 205  VSIHYYEQEEIGGSPWDPYLSNCFNY-----LTADPRGRCAALKFGARNLAILPFRQGDE 259

Query: 212  GLVGDE--------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                D+               T  + G        S V+ L  LD  + H     F++ Y
Sbjct: 260  DTTMDDWDEELDGPRPTTAIITSENKGHEDTPYAPSFVLRLSSLDPTLIHTVHLAFLYEY 319

Query: 256  IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
             EP   IL       +  +  +         ++    K    I     LP+D +K++ +P
Sbjct: 320  REPTFGILSSTLSPSSSLLDERKDQLSYMVFTLDLNQKASTTILVVTGLPYDLFKVIPLP 379

Query: 316  SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
            SPIGG L+VG N  IH      +  +A+N  A S  S   + +SS  + L+      L  
Sbjct: 380  SPIGGALLVGGNELIHIDQSGKANGVAVNALAKSCTSFGLVDQSSLQMRLEGCAVEQLSA 439

Query: 375  DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFL 425
            D    L+   TG+L +L+   DGR V  L+L +  PS          S  + I ++  F+
Sbjct: 440  DNGEMLIILNTGELAVLSFRMDGRSVSGLNLRRV-PSESGICMGAQASCTSLINHNSMFI 498

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE-- 483
            GS   DS+++ ++  S  +      +      +          +   D  +D + GE   
Sbjct: 499  GSEDTDSIVLGWSRKSKQAGRRRS-QPTIDAGDDADVDGTDEDQEDEDEDEDDLYGESTA 557

Query: 484  -LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----RINADASATGISKQSN 538
             + L G  + +  S    ++F + DSLVNI PL+D +       R + D  AT IS +SN
Sbjct: 558  AIPLKGEVAADANSKAGDYAFRIHDSLVNIAPLRDVTLSKPETPREDEDEEAT-ISTRSN 616

Query: 539  YELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
            +ELV                          E P  +GIWT+  K       +  +  A  
Sbjct: 617  FELVGVTGRNTSGSLAFLRREIEPNVIGRFEFPEARGIWTLCAKRPLIKGLEPEKSEAIL 676

Query: 573  D-------EYHAYLIISL-------EARTMVLETADLLTEVTESVDYF-VQGRTIAAGNL 617
            D       ++   +I+S        E+   VL +A    E     ++    G TI  G +
Sbjct: 677  DPESELGAQFDRLMIVSKSTEDTPEESSVYVLTSAGF--EALADTEFEPAAGATIKCGTV 734

Query: 618  FGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
                RV+Q+ +   R  DG   + Q L     + E+G+      +++ SI DPYVLL   
Sbjct: 735  GNGMRVVQILKSEVRSYDGDLGLAQILPM--FDDETGA---EPKIVAASIVDPYVLLIRD 789

Query: 677  DGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG 736
            D SI +   D       ++       + K +S C LY+D                STG+ 
Sbjct: 790  DASIFVASCDSDNDLEEIERGDDSLLTNKWLSGC-LYND----------------STGMF 832

Query: 737  EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--R 793
                 ++G    +  I S++  E GAL ++ +P+ +  ++  +      T I   Y   R
Sbjct: 833  AETALSNGTVSKKSVIMSLLNSE-GALFMYALPDLSKPIYQANGVSFIPTTISPDYATRR 891

Query: 794  EALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCY 853
              + ++ TE+                       L      A    P+L    ++  +  Y
Sbjct: 892  STVAETLTEV-----------------------LLADLGDATSKSPYLIFRASNDDLTIY 928

Query: 854  QAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR 913
            +   F+ P     S+ P   S+SL    +    +       T + A   E    G+P + 
Sbjct: 929  EP--FQVP-----SEAPRPLSKSLHFQKIHNPHVAKTANPETEV-AADAESAKRGSPMRA 980

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT 973
            I    N+ G    FL G  P + +   +       L    + + +  H   C+ GFIYV 
Sbjct: 981  IA---NVGGLSSVFLPGDSPSFVVKSSKSTPRVVGLRGHGVRSLSGFHTEGCDRGFIYVD 1037

Query: 974  SQGILKICQL-PSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPL--IVSVPVLKPLNQ 1030
            S+GI ++ QL P  +  D    ++K+ +      +TY   K++Y +  +V  P   P + 
Sbjct: 1038 SKGIARVSQLEPETNVTDIGLTLRKVKIGEEVQAVTYHPPKDVYVIGTVVKEPFELPKDD 1097

Query: 1031 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE---------VRILEPDRAGGPWQTRA 1081
                                  D HR +  E+           +++L P      W    
Sbjct: 1098 ----------------------DYHREWAKEDITFKPLTGRGFLKLLNPSN----WSVID 1131

Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF---STGRN 1137
             + + S E  + ++ + L  +  T E + L+ +GTA  +GED+A RGRV ++   +    
Sbjct: 1132 KVELDSHEIIMCIKTLNLEVSENTHERKQLITVGTAISKGEDLAIRGRVYVYEVITVVPF 1191

Query: 1138 ADNPQ-NLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAF 1190
             D P+ N   ++ +KE   +GAI+ ++ +  QG +++A G K ++   K  GT L  +AF
Sbjct: 1192 PDRPETNKKLKLIAKEEIPRGAITGISEIGTQGFMIVAQGQKSMVRGLKEDGTLLP-VAF 1250

Query: 1191 YDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
             D    YV ++  +      L  D  K ++F  + E+  ++ +  K    ++    + L 
Sbjct: 1251 IDMN-TYVTTVKSLPGTGMCLFADAIKGVWFAGYSEEPYKMTIFGKQSQGMEVITADLLP 1309

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
             G  L ++V+D   N+ +  + P+  +S  GQ LL R  F +G H+   + L  L T++ 
Sbjct: 1310 IGDELYIIVADSDCNLHVLQFDPEHPKSLHGQLLLQRTTFSLGGHMPTTMTLLPLTTTTQ 1369

Query: 1309 RTGAA---PGSDKTNRFALLFGTL-DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
                A     S+ TN  + L  TL  G +  + PL E  +RRL +L   L + + H  GL
Sbjct: 1370 TPTPAVTSTASEPTNPASGLLMTLSSGVVAILTPLSEQQYRRLNALSNHLSNLLYHPGGL 1429

Query: 1365 NPRSFRQFHSNGKA---HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            NP++ R  ++  +A    RP    IVD  +L  +  L  +++ E+A + G     I  +L
Sbjct: 1430 NPKAHRISNTAPEAVIGGRP----IVDGSVLWRWLELGSQKRAEVAGRVGVDGETIREDL 1485

Query: 1422 NDLALGTSFL 1431
             ++A G  +L
Sbjct: 1486 QEIAAGLGYL 1495


>gi|406865186|gb|EKD18229.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
            MB_m1]
          Length = 1443

 Score =  260 bits (665), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 343/1512 (22%), Positives = 616/1512 (40%), Gaps = 233/1512 (15%)

Query: 57   NLVVTAANVIEIYVVRVQ------EEGSKESKNSGETKRRVLMDGISAA----------- 99
            NL+V   ++++++  ++       EEG+  SK + +    +  DG+ A+           
Sbjct: 28   NLIVAKTSLLQVFTTKITSIELGIEEGA--SKQNDKWDPSLDNDGLDASFIGADSLLRPD 85

Query: 100  -----SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
                  L LV  Y L G + SLA +    + +    +++++ F DAK+S++E+D +  G+
Sbjct: 86   RARRTKLVLVAEYTLSGTITSLARIKTLSSKSGG--EALLVGFRDAKLSLVEWDPARPGI 143

Query: 155  RITSMHCFESPEWLHLKRG------RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
               S+H +E  E   L+R       +ES      +  DP  RC  +   G  + I+   Q
Sbjct: 144  STISIHYYEQDE---LQRSPWAPNLKESVN---YLIADPGSRCAALKFGGRNLGIIPFKQ 197

Query: 209  GGSGLVGDE----------------DTFGSGGGFSARIESSHVINLRDLDMKHVKD--FI 250
                +  D+                    S          S V+ L  LD   +      
Sbjct: 198  DDEDVNMDDWDEEIDGPRPADKVITKATNSSNDKETPYGPSFVLRLATLDPNLINPIHLA 257

Query: 251  FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
            F++ Y EP   IL   ++  +  +  +  H T M+  L +    +    I S   LP+D 
Sbjct: 258  FLYEYREPTFGILSSSQMPASSLLFERRDHLTYMVFTLDLQQ--RASTTIMSVTGLPYDL 315

Query: 309  YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
            ++++ + +P+GG L++G N  IH      +  +A+N +A    S   + +S   + L+ +
Sbjct: 316  FEVVPLDAPVGGALLIGTNELIHIDQAGKANGVAVNVFAKQCTSFGLVDQSGLDMRLEGS 375

Query: 368  HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---I 418
                L  Q+   ++  +TG++ +L+   DGR V  L + + +     SV+ + ++T   I
Sbjct: 376  KIEQLSIQSGEMIIFLQTGEIAILSFHMDGRSVSSLSVRRVSAEAGGSVIPARVSTLSHI 435

Query: 419  GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
            G +  F+GS   DS+++ +   S  S   S  K     IE    ++        D   D 
Sbjct: 436  GQNTLFVGSACADSMVLGW---SRKSNQVSRRKPRVEVIEDADDASLDELDDEDDDADDD 492

Query: 479  VNGEELSLYGSASN------NTESAQKTFSFAVRDSLVNIGPLKDFSYG---LRINADAS 529
            + GE  S+   A+N         S    + F V DSLVNI P+ + ++G   L  N D  
Sbjct: 493  LYGEGPSIIQDATNGVAKSDTVNSKAGDYVFQVHDSLVNIAPIVNITFGNASLSQNEDEK 552

Query: 530  ATGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGH 561
               +  +   ELV                          E P  +GIWT+  K  + +G 
Sbjct: 553  LDSVGVRGYLELVASVGKQRAGALAVIHQNIQPKVIGRFEFPEARGIWTMSAKRPAEKGL 612

Query: 562  NADSSRMA-----AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQG 609
             A   + +     A D +Y   +I+S +A +   ET+D+    + + +           G
Sbjct: 613  EAKKEKSSTSGDYAIDAQYDRLMIVS-KALSDGTETSDVYALTSANFEALTGTEFEPAAG 671

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
             TI AG L    RVIQV +   R  DG+  + Q L       +  +G+E   ++S S AD
Sbjct: 672  STIEAGTLGNGNRVIQVLKSEVRSYDGNLGLAQILPM----YDDDTGAE-PKIVSASFAD 726

Query: 669  PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
            PY+LL   D SI +   D +     ++       + K ++ C LY D             
Sbjct: 727  PYLLLFRDDSSIFVAQSDENNELEEIEREDDALLATKWLTGC-LYAD------------- 772

Query: 729  AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHI 787
               S GV   +    G  +++ ++   +    GAL I+ +P+  N V+  +         
Sbjct: 773  ---SRGVFAPVQSDKGQKVEE-NVMMFLLSAGGALHIYALPDLSNAVYVAEGLC------ 822

Query: 788  VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-PFLFAILT 846
               ++   L  +     S++ E              + EL +       +R P+L    +
Sbjct: 823  ---FVPPVLSAAYAARRSAARE-------------TITELVVADLGDETARSPYLILRPS 866

Query: 847  DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
               +  Y+      P +TS      S S     S +   ++ N   +R P    +  ET 
Sbjct: 867  TDDLTIYE------PFHTS------SESSGGLASTLQFLKIHNPHLARNP--DVSAAETA 912

Query: 907  HGAPCQR---ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
             G    R   + +  N+ G+   FL G  P + M   +       L    +   +  H  
Sbjct: 913  DGIQETRDEPMRVISNLGGYCTVFLPGGSPSFIMKSAKSTPKVISLQGLGVRGMSSFHTE 972

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
             C+ GFIY    G+ ++ QLP  +T+      +QKI L    H + Y      Y    S 
Sbjct: 973  GCDRGFIYTDVDGLARVSQLPKDTTFAELGVSLQKIELGQEIHGVAYHPPTECYVAATST 1032

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY--TVEEYEVRILEPDRAGGPWQTR 1080
                          + E+  + DNH+        T+  T+E+  +R++ P      W   
Sbjct: 1033 EA------------EFELPKEDDNHHPQWAKEQITFKPTMEQGRLRLINPVN----WTVV 1076

Query: 1081 ATIPMQSSENALTVRVVTLFNT-TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1139
              + +   E  + ++ + L  +  T E + L+A+GT   +GED+A +GR+ ++       
Sbjct: 1077 DEVELDPFEVIMCIKTLILETSEITNERKQLIAVGTGISKGEDLAIKGRIHVYDVINVVP 1136

Query: 1140 NPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIA 1189
             P    T    K +      +GAI+ ++ +  QG ++++ G K ++   K  GT L  +A
Sbjct: 1137 EPDRPETNKRLKLIATEDIARGAITCISEIGTQGFMIVSQGQKCMVRGLKEDGTLLP-VA 1195

Query: 1190 FYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
            F D    Y+ S+  +K     +  D  K ++   + E+  ++ L  K   +++    + L
Sbjct: 1196 FMDMN-CYITSIKELKGTGICVFSDAVKGVWVAGYTEEPYKMMLFGKSAKNMEIMQADLL 1254

Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
             DG  L +V +D   N+ I  + P+  +S +G  LL R+ F +G H+   + L  L  + 
Sbjct: 1255 PDGKELYIVAADSDCNLHIMQFDPEHPKSLQGHLLLHRSTFALGGHLPTSMTL--LPRTK 1312

Query: 1308 DRTGAAPGSDKTNRFA--------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP 1359
              T   P  D  +  A        +L  +  G I  + PL E  +RRL +L   L++++ 
Sbjct: 1313 SATLLPPSPDAMDTAADATIPEHEILITSSTGCISLLTPLSEAQYRRLSTLTSHLINTLY 1372

Query: 1360 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
            H  GLNPR++R    +      G  +++D  +L  +  L  + + E+A + G    ++  
Sbjct: 1373 HACGLNPRAYR-VDKDAPEGMVGSRTVIDGNILMRWMELGSQRRAEVAGRVGVDVLEVRE 1431

Query: 1420 NLNDLALGTSFL 1431
            +L  L  G  +L
Sbjct: 1432 DLASLMGGLGYL 1443


>gi|403411348|emb|CCL98048.1| predicted protein [Fibroporia radiculosa]
          Length = 1437

 Score =  260 bits (664), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 334/1468 (22%), Positives = 615/1468 (41%), Gaps = 187/1468 (12%)

Query: 57   NLVVTAANVIEIYVVRVQE------------------------EGSKESKNSGE------ 86
            N+VV  +N++ I+ VR +                         EG  E   SGE      
Sbjct: 45   NVVVARSNLLRIFEVREEPAPFSTQKEDERDRRASMRKGTEAVEGEVEMDASGEGFVNMG 104

Query: 87   ----TKRRVLMDGISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFED 139
                T +  ++   +     L+  +RLHG    +E + I++    ++S   D ++++F+D
Sbjct: 105  SVKSTGQNGILHQPTVNRFYLIREHRLHGIVTGIEGVRIIT--SIEDSF--DRLLVSFKD 160

Query: 140  AKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGV 194
            AKI++LE+ +++H L   S+H +E +P+ + +          PL    ++ DP  RC  +
Sbjct: 161  AKIALLEWSEAMHDLITVSIHTYERAPQLMAID--------APLFRSQLRADPLSRCAAL 212

Query: 195  LVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIF 251
             +    + IL   Q  + L  D     +          S +++L    D  +++V DF+F
Sbjct: 213  SLPKDSIAILPFYQSQAEL--DIMEHETSQARDVPYSPSFILDLSADVDTRIRNVIDFVF 270

Query: 252  VHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKL 311
            + G+  P + +L + + TW GR+     T  +   ++    + +P+I +   LPHD + +
Sbjct: 271  LPGFNSPTIAVLFQYQQTWTGRLKEYKDTVGLILFTLDLVTRHYPVITAIDGLPHDCFAM 330

Query: 312  LAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPRSSFS-------VE 363
                + +GGV+V+ +N+I Y  Q+     L ++ +   L    +LP  S S       ++
Sbjct: 331  APCSTALGGVVVLASNSIIYVDQATRRVILPVSGW---LPRISDLPIPSLSHQDQQRDLQ 387

Query: 364  LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDITTIGNSL 422
            L+ +   ++ +    +  K G +  + ++ DG+ V RL ++     + + S +  + +  
Sbjct: 388  LEGSQFVFVDDRTLYVVLKDGTVYPVEIIVDGKTVSRLSMAPPVARTTMPSLVRKMQDDY 447

Query: 423  FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD--APSTKRLRRSSSDALQDMVN 480
             F+GS +G S+L++ T          G   E   + A   AP+         D       
Sbjct: 448  LFVGSIIGPSVLLKTT---RVEEDIEGDDVEMASVPATVVAPNNAMDLDDDDDLYGGSAV 504

Query: 481  GEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASAT 531
             E+  + G   N + +  K       +  DSL   GP+ D ++ L  N D       +AT
Sbjct: 505  IEQPHMNGITQNGSTAISKKRTVVQLSFCDSLPAYGPIADMTFTLAKNGDRAVPELVAAT 564

Query: 532  GISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
            G      + L++  LP           G +GIW++  + +   N  +    A  + YHA 
Sbjct: 565  GSGMLGGFTLLQRDLPTRTKRKMHAIGGGRGIWSLLVRQAVKVNGSTYERPA--NPYHAE 622

Query: 579  ---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
               ++IS +A     +   A    +    +   + G TI A   F    ++ V      +
Sbjct: 623  NDSIVISTDANPSPGLSRIASRNAQGDIQITTRIPGTTIGAAPFFQGTAILHVMINVTNV 682

Query: 634  LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
            +    +  D +      +         +   SI DP+VL+   D SI L +G+     + 
Sbjct: 683  I--RVLEPDGTERQVIKDWDGNIPRPKIRFCSICDPFVLIIRDDDSIGLFIGESERGKIR 740

Query: 694  VQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
             +  + + E + + ++ C  + D      + + +  A +           + G   Q   
Sbjct: 741  RKDMSPMGEKTSRYLAGC-FFTDTSGIFQVHQNAQAAGIEGATSTLQSVMNAGNRTQ--- 796

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++C   G +EI+ +P     F+        + + D Y   AL        S  ++   
Sbjct: 797  WLILCRPQGVIEIWTLPKLGLAFSTTHAAGLESVLTDLYDPPAL--------SVPQDPPR 848

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            + ++ +I  + V  L          RP L   L  G +  Y+ +    P        P +
Sbjct: 849  KPQELDIEQLLVAPLG-----ESSPRPHLMLFLRSGQLAVYEVHSTPVPAEPL----PAA 899

Query: 873  TSRSLSVSNVSA-SRLRNLRFSRTPLDAYTREETPHGAPCQRIT-----IFKNISGHQ-- 924
             S +L V  V   SR  N++ S     +   E+       +RI+        + S  Q  
Sbjct: 900  RSSTLLVKFVKVLSRAFNIQHSDEVEKSVLAEQ-------KRISHLLIPFATSPSPGQTF 952

Query: 925  -GFFLSGSRPCWCMVF-RERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKIC 981
             G FL+G RP W +   +  ++V P     S+V AFT        + F+  + +G   + 
Sbjct: 953  SGVFLTGDRPSWLLCTDKGGVKVLPS--GHSVVHAFTASSVWESKNDFLLYSEEGPSLME 1010

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
             LP     D + P + +P   +   + Y    +L   IV+           S    +   
Sbjct: 1011 WLPD-VQLDGHLPSRSVPRPRSYSNVVYDPSTSL---IVAA----------SSQQSKFAS 1056

Query: 1042 HQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1100
            +  D + +   D + ++ + E   + ++ P+     W T        +E    +  +TL 
Sbjct: 1057 YDEDGNIVWEPDTNISFPSCECSALELISPEG----WVTMDGYEFAQNEFVNCLDCITLE 1112

Query: 1101 NTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL------ 1153
              +T+   +  +A+GT   +GED+A +G V +F       +  + +  +Y  +L      
Sbjct: 1113 TMSTETGTKDFIAVGTTINRGEDLAVKGAVYIFEIVEVVPDTNSGLKRLYRLKLQCRDDA 1172

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
            KG ++AL  +  +L+ + G KI +  +   E L G+AF D   ++V SL  VKN +++GD
Sbjct: 1173 KGPVTALCGMDNYLVSSMGQKIFVRAFDLDERLVGVAFLDVG-VFVTSLRSVKNLLVIGD 1231

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
              KS++F++++E   +L +L KD         +     + +SL+V DE   I++  Y P 
Sbjct: 1232 AVKSVWFVAFQEDPYKLVILGKDPYHTCVTCADLFFAENRVSLLVCDEDGVIRLLEYDPH 1291

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
              ES  GQ LL R EFH     T++    ++A   D+    P +       L+ G+ DGS
Sbjct: 1292 DPESRGGQHLLRRTEFH---GQTEYRTSVLIARRKDKDIDIPQA------KLVCGSTDGS 1342

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +     ++E  F+ L  LQ +L  +V HVAGLNPR+FR    N    RP    I+D  LL
Sbjct: 1343 LVSFTFVEEAAFKGLHLLQGQLTRNVQHVAGLNPRAFRIVR-NDYVSRPLSKGILDGNLL 1401

Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
            + +E LP+  Q E+  Q GT R+ +L +
Sbjct: 1402 TTFEELPIARQNEMTRQIGTERATVLKD 1429


>gi|414587797|tpg|DAA38368.1| TPA: hypothetical protein ZEAMMB73_143443 [Zea mays]
          Length = 153

 Score =  259 bits (663), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 127/159 (79%), Positives = 139/159 (87%), Gaps = 6/159 (3%)

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
            M ESWKGQKLLSRAEFHVGAHV+KFLRLQML T     G A  S+KTNRFAL+FGTLDG 
Sbjct: 1    MVESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ----GLA--SEKTNRFALVFGTLDGG 54

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            IGCIAP+DELTFRRLQSLQ+KLVD++PHV GLNPRSFR F SNGKAHRPGPD+I+D ELL
Sbjct: 55   IGCIAPVDELTFRRLQSLQRKLVDAIPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELL 114

Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            SHYEM+ LEEQLEIA Q GTTRSQILSN +D +LGTSFL
Sbjct: 115  SHYEMMSLEEQLEIAQQIGTTRSQILSNFSDFSLGTSFL 153


>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
            C-169]
          Length = 1137

 Score =  259 bits (661), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 198/627 (31%), Positives = 293/627 (46%), Gaps = 80/627 (12%)

Query: 839  PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
            PFL  +L DGT L Y+A  F  P                    V   RL     +  P  
Sbjct: 553  PFLLLLLADGTFLAYRA--FHTPRG-----------------RVCFKRLSLPAHAHCPPQ 593

Query: 899  AYTREETPHGAPCQRITIFKNISGHQ-----GFFLSGSRPCWCMVFRERLRVHPQLCDGS 953
                + T   AP   +T F  +   +     G F+SG RP W +  R  L  H    +G 
Sbjct: 594  DRRSKTT---APSSSMTRFDGLGESKEHVNSGMFVSGERPLWLVASRGTLVAHAMDVEGR 650

Query: 954  IVAFTVLHNVNCNHGFIYV----TSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
            +   T  HN+NC  GFI           LKICQLP  +  D  WP+QKI ++ATPH++ Y
Sbjct: 651  VSGMTPFHNINCPLGFITACMAENDGETLKICQLPMRTRLDTPWPLQKIAVRATPHRLAY 710

Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS---VDLHRTYTVEEY--E 1064
            +AE  LY L+VS PV  P  +       QE     D H   S    D     +  E   E
Sbjct: 711  YAEARLYVLLVSRPV--PYRE------HQEEASDGDPHASYSYICADAAAKASGTELGGE 762

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            VR+LEP R    +QT A   +   E   +V    L N  T   E  + +GTA   GED  
Sbjct: 763  VRLLEPGR----YQTVARHALDPGEEPCSVAADWLRNAQTGALEPYITVGTALNYGEDYP 818

Query: 1125 ARGRVLLFSTGRNADNPQN--------LVTEVYSKELKGAISALASLQGHLLIASGPKII 1176
              GR+LLF   R + +            +T V++      +  LA + G L+ A G  + 
Sbjct: 819  CSGRILLFKATRTSTSGAEQADPTISWQLTLVHASGFSRPVQGLAVMDGRLVAAVGNNMQ 878

Query: 1177 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL--SWKEQGAQLNLLAK 1234
            + +  G+ L+ I+F+ A  L++ S+  +K FILLGD+HK + F+    K     L  L+K
Sbjct: 879  VMELRGSSLHMISFFHA-QLFITSVATIKTFILLGDVHKGLTFVYADKKANYTALTQLSK 937

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY--APKMSESWKGQKLLSRAEFHVGA 1292
            D+  +D  A EFL++G  L L+  D  +N+++F Y    +   +W+G+KLL     HVG 
Sbjct: 938  DYNDVDVEAAEFLVNGKKLFLLACDAAQNLRLFAYDGGKEQQATWQGKKLLPLGAIHVGQ 997

Query: 1293 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQ 1351
            ++   L  ++  T +  TG           A +FG+  GSI  +AP  D L    L +LQ
Sbjct: 998  NICSSLSHRI--TPASATG-------VQLRAAVFGSAAGSIASLAPTWDGLPAEELLALQ 1048

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG---------PDSIVDCELLSHYEMLPLEE 1402
            +++V +VP VAGLNP SFR+ + +G     G          D ++D + L+ ++ LPL E
Sbjct: 1049 REMVLAVPQVAGLNPVSFRRRYKHGVKALAGGQSFEAPVSDDRVLDLDQLNRFQWLPLTE 1108

Query: 1403 QLEIAHQTGTTRSQILSNLNDLALGTS 1429
            Q+ +A +   +R Q+L  L ++ +  S
Sbjct: 1109 QVALAAKCNLSRQQVLHALREMVMAIS 1135



 Score =  171 bits (433), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 148/477 (31%), Positives = 227/477 (47%), Gaps = 68/477 (14%)

Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
             S  + +S+++ L  L +  V+D +F+H Y EPV+++LHE + +W G++     T  ++
Sbjct: 18  ALSTTVGNSYMLKLAKLGISEVRDAVFLHRYSEPVLLVLHETKPSWGGQLRNSKDTMEVT 77

Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
           A S++   K+H  +WS  NLP DA+KL+ VP   GG LV+  N + Y SQ A+ A A   
Sbjct: 78  AFSLNVAHKRHTRLWSIGNLPSDAFKLIEVPG--GGGLVICQNLLIYVSQEAAAAAASGA 135

Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
                          F ++L      WL ++  LL   +G L+L+ V  +G   +RL +S
Sbjct: 136 PRA----------EGFELDLTDCSGAWLADNSLLLGLASGQLILVNVQLEGS--KRLKVS 183

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
           K   +   S +  +G  L FLGS + +SLL++     G ++L  G +E+    EADA   
Sbjct: 184 KAQGAPPPSCMCRLGPELLFLGSWVANSLLIR-AVPEGQTLLLGGPEEQAS--EADATHA 240

Query: 465 KRLRRSSSDALQDMVN--GEELSLY-----GSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
            +  R   DA  D+ N   +E+SL        A  +T  A K +S  V DSLV+IG ++D
Sbjct: 241 SKRPRLDPDA-ADLGNEDEDEVSLIYRTDAQPALPSTTGASK-YSLQVVDSLVSIGIVQD 298

Query: 518 FSYGLRINADASATGISKQSN-------------------------YEL---VELPGCKG 549
              G   +  A    ++K                             EL   V LPG   
Sbjct: 299 LVTG-EASTSAPQEWVAKTERGPPKLLAAVGSDKFGAVAVLRSSLVPELVTEVPLPGVDQ 357

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES-VDYFVQ 608
           +W V H    G   D S +      YHA+L ++ ++ T VL T + L E   S VD+ + 
Sbjct: 358 MWAV-HFQPEGLPVDDSLL------YHAFLFLNEKSGTKVLRTGEELDETDSSQVDFILS 410

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
            RT+ AGNL G  R++QV  RG  +L GS   QDL       +   G  N+T+++ S
Sbjct: 411 SRTVFAGNLLGNSRIVQVHARGVVLLSGSSRVQDLPV-----QDLIGVSNTTIVAAS 462


>gi|281205270|gb|EFA79463.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
          Length = 1395

 Score =  255 bits (652), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 209/734 (28%), Positives = 340/734 (46%), Gaps = 109/734 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLV+   +++++Y +R      ++ +     +++   D +    LEL    +L   +ESL
Sbjct: 31  NLVIAKTSLLQVYTIRYDRIEQQQQQQQQTNEQQSQQDTLKPW-LELNLELQLFSIIESL 89

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G D     DS+IL+F DAK+S+++++ +   L I S+H FE      LK GR++
Sbjct: 90  NCVRLPGDD----IDSLILSFRDAKVSIVKYNKATEKLDIRSLHYFEGNS--ELKGGRKT 143

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG-------------------DE 217
           F   PL++VD Q RC  +L+Y   + +L   +  S L                     DE
Sbjct: 144 FRTPPLIRVDYQQRCAVMLLYDRHLAVLPFPRSFSILDDEEEEEEEEAAVVADQQQQHDE 203

Query: 218 D-----------TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
           +              S      +   S+VI+L  L +++VKDF F+H Y EP ++ LHE 
Sbjct: 204 NEQQQPQDDQQQQQTSEKNKKKKQSESYVISLNSLGIENVKDFCFLHTYYEPTLLFLHEP 263

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
             TW  R+S K  T +++A+S++   +Q P+IWS  +LP++  +L+ VP P+GG +V+  
Sbjct: 264 SQTWTSRISSKKFTNVLTAVSLNIAQRQQPVIWSIEHLPYNCERLVPVPDPLGGAMVLTP 323

Query: 327 NTIHYHSQSASCALALNNYA-VSLDSSQELPRSSFSVE----LDAAHATWLQNDVALLST 381
           N + Y +QS+   L  N YA +      + P  S S      LD A+  +L  D  L S 
Sbjct: 324 NILFYFNQSSRYGLECNEYAQIDTGDQFQFPIDSSSTNLVFTLDCANFIFL-GDRLLGSL 382

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--- 438
           K G+L++  ++ DGR VQR+ ++K   SVL+S    + ++L FLGSRLGDSLL+Q+T   
Sbjct: 383 KGGELLIFHLISDGRNVQRISITKAGASVLSSTSCVLTDNLLFLGSRLGDSLLLQYTEKI 442

Query: 439 ----CGSGTSMLSSGLKE----EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
                      LS+  K+    E  D+  D     +   S +D     +  +E  ++   
Sbjct: 443 IDVDSSDNVENLSNPYKKKKTSEVFDLFDDEERNSKTGASDADGNGQSLFDDEDDIF--- 499

Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN-ADASATGISKQSNYELV------- 542
            N+ ++  K++   + D + NIGP+ D   G+  + A  S     +Q + ELV       
Sbjct: 500 -NDKKNQLKSYRLNICDHITNIGPVSDLITGVSYDHASVSNDESFEQRSLELVACSGHGK 558

Query: 543 -------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
                              ELPG +  WT+Y+      +   S  +       +      
Sbjct: 559 NGALTILQYGVRPELNTSFELPGVRQSWTLYYDDPLAASQSGSSASNAAASAASKKRQHE 618

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
           E  T+V +T   L EV +         TI   N+FGRRR+  V + G ++L G S +TQ+
Sbjct: 619 EDSTLVFQTGGQLKEVAK-----FDHATITVANMFGRRRIALVHQNGIKLLSGHSNITQE 673

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS-TCTVSVQTPAAIE 701
           +                +V    I DPYVL+   DG+I L  G+   T  +  + P    
Sbjct: 674 IKL-------------KSVKMAYIVDPYVLILHKDGTISLYQGNTGITQLLEYELPQP-- 718

Query: 702 SSKKPVSSCTLYHD 715
             K  V SC+++HD
Sbjct: 719 --KDGVMSCSMFHD 730



 Score =  198 bits (504), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 135/445 (30%), Positives = 224/445 (50%), Gaps = 58/445 (13%)

Query: 823  KVVELAMQRW-SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
            K+VE+ +    ++ HS P+L  +   G IL Y+A          K  D +  ++ L    
Sbjct: 872  KIVEIVIHYLHNSPHSSPYLMILNEFGDILIYKAI---------KYKDSMDNTKEL---- 918

Query: 882  VSASRLRNLRFSRTPLDAYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPC 934
                 +R ++ +   L +  RE +    P        ++I  F NI GH+G F+ G R  
Sbjct: 919  -----IRFIKHTDQNLHSKQREYSYGIDPSSESSFYIRKIVAFDNIGGHKGVFMCGKRSL 973

Query: 935  WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
            W    +  LR HP      + +FT  HN+NC++GFIY T +G+L+I QL +   ++N W 
Sbjct: 974  WFFCEKNYLRAHPMNFKDPVTSFTCFHNINCSYGFIYFTEKGVLRINQLSNMMNFENEWA 1033

Query: 995  VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1054
            ++KIPL+ T H+I++  E   Y L++S P                   Q D         
Sbjct: 1034 IRKIPLRMTCHKISFHQEFKCYVLVISYP----------------QAPQSDEEEEEKEKS 1077

Query: 1055 HRTYTVEE-YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL--FNTTTKENETLL 1111
             +   +EE ++V++++P      W    +  M   E  L  ++V L   +    + +  L
Sbjct: 1078 KKPLILEEKFQVKLIDPSMN---WSIVDSFSMSEKETVLCAKIVHLKYADVDGIKLKPYL 1134

Query: 1112 AIGTAYVQGEDVAARGRVLLF------STGRNADNPQNLVTEVYSKELKGAISALASLQG 1165
             +GTAY  GED   +GR+L+F          +    +  +  +Y K+ KG ++ALA L G
Sbjct: 1135 CVGTAYTHGEDTVCKGRILVFEIISHREVQDDTGEEKKRLNLLYEKDQKGPVTALAGLNG 1194

Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
             LL++ GPK+I++ ++   L GIAFYD   +++VSL+ VKN+IL+GD++KS+ F   K+Q
Sbjct: 1195 LLLMSIGPKLIVNNFSSGSLVGIAFYDT-QIFIVSLSTVKNYILVGDMYKSVSFFKLKDQ 1253

Query: 1226 GAQLNLLAKDFGSLDCFATE--FLI 1248
              QL LL KD+  ++ F+ +  FL+
Sbjct: 1254 -KQLILLGKDYEEMNTFSNQVHFLV 1277


>gi|310789917|gb|EFQ25450.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1439

 Score =  254 bits (649), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 353/1471 (23%), Positives = 585/1471 (39%), Gaps = 248/1471 (16%)

Query: 69   YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
            Y  R+ ++   ES   G     V  D      L LV  Y + G V  LA +    S+ G 
Sbjct: 66   YDRRLNDDDGLESSFLGGDGMLVRADRAVNTKLVLVAEYPIFGVVTGLARIKIQHSKSGG 125

Query: 125  DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
            +      ++++A   A++S+++++   H L   S+H +E  E       + S   GPL  
Sbjct: 126  E------ALLIATRVARLSLVQWNSEKHALEDISIHYYEKEEL------QGSPFDGPLAN 173

Query: 183  ----VKVDPQGRCGGVLVYGLQMI-ILKASQG--------------GSGLVGDEDTFGSG 223
                +  DP  RC   L +G + I  L   Q               G     +  T  + 
Sbjct: 174  YRTHLAADPGSRCAA-LSFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPPTTAAT 232

Query: 224  GGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
             G S        +S+V+ L  LD  + H     F+H Y EP   I+   +          
Sbjct: 233  NGTSNIADVPYSTSYVLPLPQLDPSLLHPVYLAFLHEYREPTFGIISSTQRRSNTLPRKD 292

Query: 278  HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
            H +  +  L +    +    I S  NLP D +K++A+P P+GG L+VG N  IH      
Sbjct: 293  HFSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVVALPGPVGGALLVGTNELIHIDQSGK 350

Query: 337  SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYD 394
               +A+N +     +     +S   + L+  H   +  +N   L+    G L ++T   D
Sbjct: 351  PNGVAVNAFTKETTNFPLADQSDLDLRLEHCHIELMSAENGELLMVLSDGRLAIITFKID 410

Query: 395  GRVVQRLDLSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
            GR V  + +      V         S I+ +  ++FF+GS   DSL++ +T         
Sbjct: 411  GRTVSGVSVKPVAAEVGGNIVQCSVSTISKLSRNVFFVGSTGSDSLVLGWT--------- 461

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS---------------LYGSASN 492
                         A + +R  R   D+ +  +  E++                +   A+ 
Sbjct: 462  ----------RKQAQNARRKTRLVDDSFEYDLEDEDMDDGDDDDLYGETTTTMIQPGATA 511

Query: 493  NTESAQKTFSFAVRDSLVNIGPLKDFSYGLR-INAD------------------------ 527
            N  S     +F V DSL++I P+KD + G +  N D                        
Sbjct: 512  NGVSKGGDLTFRVHDSLLSIAPVKDMTSGKQAFNPDSEEANNSVGVVADLQLACVVGRGN 571

Query: 528  ASATGISKQSNYELV----ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHAYLII 581
            A A  I  Q+    V    E P  +G WT  V     +    D    AA   E+ A    
Sbjct: 572  AGAVAILNQNIQPKVIGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVGSEFDAS--- 628

Query: 582  SLEARTMVLETADLLTEVTESVDYFV-----------------QGRTIAAGNLFGRRRVI 624
            S+  + M++   DL  +  E+ D +                   G T+ AG +    R+I
Sbjct: 629  SIYDKFMIVSKVDL--DGYETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRII 686

Query: 625  QVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
            QV +   R  DG   ++Q L     + E+G+      V+S SIADPY+LL   D SI + 
Sbjct: 687  QVLKSEVRCYDGDLGLSQILPM--LDEETGA---EPRVVSASIADPYLLLVRDDSSIMVA 741

Query: 684  VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGAD 743
              D +     V+       S K ++ C LY D                +TG    +    
Sbjct: 742  QIDNNCELEEVEKQDDAILSTKWLAGC-LYAD----------------TTGRFAPVQTDK 784

Query: 744  GGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEI 803
            G P  Q +I+  +   +GAL I+ +P+ +    V    +G T++                
Sbjct: 785  GTPEGQ-NIFMFLLSAAGALYIYALPDLSKPVYV---AAGLTYVPPLL----------SA 830

Query: 804  NSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPEN 863
            + +   GT Q   E +  + V +L         + P+L     +  +  Y+    E  + 
Sbjct: 831  DYAVRRGTVQ---ETLTELLVADLG----DTTTTSPYLILRHANDDLTIYEPIRLESQDK 883

Query: 864  TSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNIS 921
            T      V  S++L    ++     N   +++P++    E  E P   P +      NI+
Sbjct: 884  T------VGLSKTLHFQKIT-----NPALAKSPVEVADDEANEQPRFVPLRPC---PNIN 929

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            G+   FL G+ P + +   +       L    +   +  H   C  GFIY  S+G  ++ 
Sbjct: 930  GYSTVFLPGASPSFIIKSSKSSPKVIGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVT 989

Query: 982  QLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
            QLP+ + +      V+KIP+      I Y      Y +  SV            L   E+
Sbjct: 990  QLPADTNFTELGVAVRKIPIGDNVGLIAYHPPMETYAVACSV------------LERFEL 1037

Query: 1041 GHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
                D H   + +   +Y   E   ++++ P      W    T+ ++  E A+ ++ + L
Sbjct: 1038 PKDDDYHKEWAKEATTSYPQTERGIIKLMSPTT----WSVIDTVELEPHEVAMCMKTLHL 1093

Query: 1100 -FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE-- 1152
              +  TKE   L+ IGTA  +GED+  RGR+L++        P     N   ++ +KE  
Sbjct: 1094 EVSEETKERRMLITIGTAINRGEDLPIRGRILVYDVVPVVPQPGRPETNKKLKLVAKEEI 1153

Query: 1153 LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVK--N 1206
             +GA++ L  +  QG +L+A G K ++   K  GT L  +AF D    YV ++  V+   
Sbjct: 1154 PRGAVTGLCEVGSQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDM-NCYVTAVREVRGTG 1211

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            + L+ D  K ++F+ + E+  ++ L  K  G  +    +F+I G  L +VV D+   I +
Sbjct: 1212 YCLMTDAFKGVWFVGYAEEPYKMMLFGKSTGKFEVLTADFIIAGDELHIVVCDKDGVIHV 1271

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
              + P+  +S +G  LL+RA F     H T  L L     S+  T A      T    LL
Sbjct: 1272 MQFDPEHPKSLQGHLLLNRASFSAAPNHPTTTLSLPRTPASTATTSATKNPPTT----LL 1327

Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD- 1384
              +  G++  + PL E  +RRL SL   +  ++PH A  NP++ R    +  A  PG D 
Sbjct: 1328 LASPTGALASLTPLSEQAYRRLTSLANSIAGALPHAAATNPKAHRLQPLD--ARTPGVDT 1385

Query: 1385 ----SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
                SIVD  LL+ +  L    + E+A + G
Sbjct: 1386 SAGRSIVDGALLARWNELGAGRRSEVAGKGG 1416


>gi|440466842|gb|ELQ36086.1| hypothetical protein OOU_Y34scaffold00669g71 [Magnaporthe oryzae Y34]
 gi|440481991|gb|ELQ62520.1| hypothetical protein OOW_P131scaffold01068g7 [Magnaporthe oryzae
            P131]
          Length = 1475

 Score =  248 bits (634), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 342/1516 (22%), Positives = 607/1516 (40%), Gaps = 235/1516 (15%)

Query: 57   NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
            NLVV  +++++I+  R+   + +G+ +S  +       L D  G+ A+            
Sbjct: 51   NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 110

Query: 100  ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
                 L LV  + L G +  LA +       S            D +++AF+DAK+S++E
Sbjct: 111  RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 170

Query: 147  FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
            +D     L   S+H +E  E       + S    PL      +  DP  RC   L +G +
Sbjct: 171  WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 223

Query: 201  MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
             + +   +   G +G +D                T  + G     +E      S V+ L 
Sbjct: 224  SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 283

Query: 240  DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            +LD  + H     F++ Y EP   IL    +T +  ++ K H    +  ++    K    
Sbjct: 284  NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 341

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
            I S   LP D  +++A+P+P+GG L+VG+N  IH      +  +A+N    S  S     
Sbjct: 342  ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 401

Query: 357  RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNPSV---- 410
            +S   + L+      L  +         D  L T+V+  DGR V  L +    P      
Sbjct: 402  QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 461

Query: 411  ---LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
                 S +T +G +  F GS  GDS++  +          S  K +  D + D       
Sbjct: 462  LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQV---SKRKPKIQDPDLDLDIDYDD 518

Query: 468  RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
                 D   D+    E +   ++++  E+      F V D +V+I P++D ++G      
Sbjct: 519  LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDRMVSIAPIRDVTFGKPPPPT 578

Query: 523  ---RINADASAT----------GISKQSNYELV------------ELPGCKGIWTV---- 553
               R   D +A           G  K S+  ++            E P  +G+WT+    
Sbjct: 579  DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 638

Query: 554  -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
               K  +  N +    AA +     +Y  Y+I++ E      ET+D+        +    
Sbjct: 639  PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 697

Query: 607  -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
                   G TI AG +    ++IQV +   R  DG   +TQ +     + E+G       
Sbjct: 698  TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 752

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
              S SIADPY+L+   D S  +   +  +    ++    I SS K  + C LY D     
Sbjct: 753  ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 806

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
                       S G   A       P     I   +   +GAL I+ +P+ +        
Sbjct: 807  -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 847

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                      Y+ E L      +++      G  R E I  + V +L    + + H    
Sbjct: 848  -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 895

Query: 841  LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
               IL  ++  +  Y+ Y          ++D  S ++ L +      +L N   ++ P +
Sbjct: 896  --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRLR-----KLPNPAVAKAP-E 939

Query: 899  AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
            A   E+ P  +    +    NI+G+   F+ G  P + +   +  +  P+   L    + 
Sbjct: 940  ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 996

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKN 1014
            A +  H   C  GFIY  S G+ ++ Q+P  +++      V+K+PL      I Y +   
Sbjct: 997  AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGIAYHSPTG 1056

Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1074
            +Y L  S        +   L  D +   +    N+S   +     VE   ++++ P    
Sbjct: 1057 VYVLTCS------YWEPFELPKDDDYHCEWAKENISFKPM-----VERSVLKVINPIN-- 1103

Query: 1075 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1132
              W    T   +  E A+ +R + L  + +T E   L+ +GTA  +GED+  RG + +F 
Sbjct: 1104 --WSDIWTEEFEQHEVAMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVFD 1161

Query: 1133 ------STGRNADNPQNLVTEVYSKEL-KGAISALASL--QGHLLIASGPKIILHKWT-G 1182
                    GR   + +  + +V  +E+ +GA+++L+ +  QG +++A G K ++      
Sbjct: 1162 LASVVPQKGRPETDKK--LKQVAKEEIPRGAVTSLSEIGTQGLMMVAQGQKTLVRGLQED 1219

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
             +L  +AF D    YV  +  +      ++ D  K ++F  + E   ++ L  K   +L+
Sbjct: 1220 GKLPPVAFMDMN-CYVTCVKELAGTGLCVMADAFKGVWFCGYTEGPYKMMLFGKSSTNLE 1278

Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1300
            C   + L DG  L +V +D   N+ +  + P+  +S +G  LL+R  F  GAH  +  + 
Sbjct: 1279 CMNVDLLPDGKDLLIVAADSDGNLHVLQFDPEHPKSLQGHLLLNRTTFSTGAHHPQ--KS 1336

Query: 1301 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
             +L T+  R      S    R  +L  +  G +  + PL + T+ RL +L   L+ SVPH
Sbjct: 1337 LLLPTTDPRPSTNQPSSDAERQHILMASPTGVLAAVQPLSQSTYTRLSALASNLMASVPH 1396

Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
             A LNP+++R   ++ +      D     ++VD  LL+ +  L    + E+A + G + +
Sbjct: 1397 HAALNPKAYRLPPTSTRNQVAAVDISVGRAVVDGSLLARWAELASGRRAEVAGRAGFSGA 1456

Query: 1416 QILSNLNDLALGTSFL 1431
              + +  +  LG S +
Sbjct: 1457 AEVRSELEAVLGWSIM 1472


>gi|389641257|ref|XP_003718261.1| cft-1 [Magnaporthe oryzae 70-15]
 gi|351640814|gb|EHA48677.1| cft-1 [Magnaporthe oryzae 70-15]
          Length = 1452

 Score =  248 bits (633), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 342/1516 (22%), Positives = 607/1516 (40%), Gaps = 235/1516 (15%)

Query: 57   NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
            NLVV  +++++I+  R+   + +G+ +S  +       L D  G+ A+            
Sbjct: 28   NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 87

Query: 100  ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
                 L LV  + L G +  LA +       S            D +++AF+DAK+S++E
Sbjct: 88   RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 147

Query: 147  FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
            +D     L   S+H +E  E       + S    PL      +  DP  RC   L +G +
Sbjct: 148  WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 200

Query: 201  MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
             + +   +   G +G +D                T  + G     +E      S V+ L 
Sbjct: 201  SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 260

Query: 240  DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            +LD  + H     F++ Y EP   IL    +T +  ++ K H    +  ++    K    
Sbjct: 261  NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 318

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
            I S   LP D  +++A+P+P+GG L+VG+N  IH      +  +A+N    S  S     
Sbjct: 319  ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 378

Query: 357  RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNPSV---- 410
            +S   + L+      L  +         D  L T+V+  DGR V  L +    P      
Sbjct: 379  QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 438

Query: 411  ---LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
                 S +T +G +  F GS  GDS++  +          S  K +  D + D       
Sbjct: 439  LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQV---SKRKPKIQDPDLDLDIDYDD 495

Query: 468  RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
                 D   D+    E +   ++++  E+      F V D +V+I P++D ++G      
Sbjct: 496  LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDLMVSIAPIRDVTFGKPPPPT 555

Query: 523  ---RINADASAT----------GISKQSNYELV------------ELPGCKGIWTV---- 553
               R   D +A           G  K S+  ++            E P  +G+WT+    
Sbjct: 556  DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 615

Query: 554  -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
               K  +  N +    AA +     +Y  Y+I++ E      ET+D+        +    
Sbjct: 616  PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 674

Query: 607  -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
                   G TI AG +    ++IQV +   R  DG   +TQ +     + E+G       
Sbjct: 675  TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 729

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
              S SIADPY+L+   D S  +   +  +    ++    I SS K  + C LY D     
Sbjct: 730  ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 783

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
                       S G   A       P     I   +   +GAL I+ +P+ +        
Sbjct: 784  -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 824

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                      Y+ E L      +++      G  R E I  + V +L    + + H    
Sbjct: 825  -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 872

Query: 841  LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
               IL  ++  +  Y+ Y          ++D  S ++ L +      +L N   ++ P +
Sbjct: 873  --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRLR-----KLPNPAVAKAP-E 916

Query: 899  AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
            A   E+ P  +    +    NI+G+   F+ G  P + +   +  +  P+   L    + 
Sbjct: 917  ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 973

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKN 1014
            A +  H   C  GFIY  S G+ ++ Q+P  +++      V+K+PL      I Y +   
Sbjct: 974  AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGIAYHSPTG 1033

Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1074
            +Y L  S        +   L  D +   +    N+S   +     VE   ++++ P    
Sbjct: 1034 VYVLTCS------YWEPFELPKDDDYHCEWAKENISFKPM-----VERSVLKVINPIN-- 1080

Query: 1075 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1132
              W    T   +  E A+ +R + L  + +T E   L+ +GTA  +GED+  RG + +F 
Sbjct: 1081 --WSDIWTEEFEQHEVAMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVFD 1138

Query: 1133 ------STGRNADNPQNLVTEVYSKEL-KGAISALASL--QGHLLIASGPKIILHKWT-G 1182
                    GR   + +  + +V  +E+ +GA+++L+ +  QG +++A G K ++      
Sbjct: 1139 LASVVPQKGRPETDKK--LKQVAKEEIPRGAVTSLSEIGTQGLMMVAQGQKTLVRGLQED 1196

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
             +L  +AF D    YV  +  +      ++ D  K ++F  + E   ++ L  K   +L+
Sbjct: 1197 GKLPPVAFMDMN-CYVTCVKELAGTGLCVMADAFKGVWFCGYTEGPYKMMLFGKSSTNLE 1255

Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1300
            C   + L DG  L +V +D   N+ +  + P+  +S +G  LL+R  F  GAH  +  + 
Sbjct: 1256 CMNVDLLPDGKDLLIVAADSDGNLHVLQFDPEHPKSLQGHLLLNRTTFSTGAHHPQ--KS 1313

Query: 1301 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
             +L T+  R      S    R  +L  +  G +  + PL + T+ RL +L   L+ SVPH
Sbjct: 1314 LLLPTTDPRPSTNQPSSDAERQHILMASPTGVLAAVQPLSQSTYTRLSALASNLMASVPH 1373

Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
             A LNP+++R   ++ +      D     ++VD  LL+ +  L    + E+A + G + +
Sbjct: 1374 HAALNPKAYRLPPTSTRNQVAAVDISVGRAVVDGSLLARWAELASGRRAEVAGRAGFSGA 1433

Query: 1416 QILSNLNDLALGTSFL 1431
              + +  +  LG S +
Sbjct: 1434 AEVRSELEAVLGWSIM 1449


>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
          Length = 2290

 Score =  248 bits (633), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 168/543 (30%), Positives = 263/543 (48%), Gaps = 68/543 (12%)

Query: 913  RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD--GSIVAFTVLHNVNCNHGFI 970
            R+       G +G  ++  +P   +  R   R+HP   D    + +    +N+ C  G +
Sbjct: 1068 RLMPLGGAGGLEGVLIAARQPAVVLFGRGLPRIHPWKLDRGEGVRSAARFNNLQCKDGIV 1127

Query: 971  YVT------SQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPV 1024
             +       ++G+LKIC +P G + D  WP++   +  T H + + A    + L+VS   
Sbjct: 1128 CIADKGRDRAKGVLKICNIPEGISGDTPWPLRTKHVGMTVHHVAFHAATGCHVLVVSSQ- 1186

Query: 1025 LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT---YTVEEYEVRILEPDRAGGPWQTRA 1081
                              +I++       L       T E+YEV++  P           
Sbjct: 1187 -----------------QEIEDERKPEGTLEGAIPPLTEEKYEVQLRAP--YSMELLDSY 1227

Query: 1082 TIPMQSSENALTVRVVTLFNTTTKENE-TLLAIGTAYVQGEDVAAR--GRVLLFST---- 1134
                 + E AL ++VV L NT  K++    +A+GT +  GE   +R  GR+ +F      
Sbjct: 1228 EFDFANGEKALCLQVVHLKNTRVKDSLLPFVAVGTGFQNGESETSRATGRIYVFEVTTVV 1287

Query: 1135 ------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP--------KIILHKW 1180
                  GR +   + + T    +++K  +SAL  L+G+LL+A GP        K+ +++W
Sbjct: 1288 GEEGYEGRTSFKIKKIFTSADIQDIKAPVSALCQLEGYLLVAQGPNPGMIGGSKLYVYEW 1347

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
               +L G AF+DA  LY+ +L  VK FI+ GDI  S++ L W+E    L LLAKD   L 
Sbjct: 1348 VDEKLVGRAFFDAH-LYITTLKTVKFFIVFGDIRHSVHLLRWREDIRMLQLLAKDALPLS 1406

Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR- 1299
             +A EF++ GS   L+ SDEQKN+Q+F + P   E ++ Q+L+ RA+ HVG+H+ KF+R 
Sbjct: 1407 VYAAEFVVMGSNFGLLASDEQKNVQVFVFNPNSPE-YRRQQLICRADLHVGSHINKFIRW 1465

Query: 1300 -LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1358
             L    T   RT A             + TLDG IG I P+ E ++RRL +LQ  LV ++
Sbjct: 1466 PLPFRPTLGVRTAAH------------YTTLDGGIGAIIPIPEQSYRRLLALQNLLVTAM 1513

Query: 1359 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
            PH AGLNPRS+R +       R    + +D  LL  Y  L L  Q++++     TR  IL
Sbjct: 1514 PHYAGLNPRSWRLYKPAMCMKRRYAKNFLDGNLLGRYLHLDLALQMQLSSALNQTREAIL 1573

Query: 1419 SNL 1421
             +L
Sbjct: 1574 GDL 1576



 Score =  239 bits (609), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 142/412 (34%), Positives = 222/412 (53%), Gaps = 38/412 (9%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---GISAASLELVCHYRLHGNV 113
           NL V     +E+YV++ +E+   ++ N  +  ++   D   G   A+L+ V  Y L+GNV
Sbjct: 31  NLAVVKGTQLELYVLKEEEKKHSKTCNGKQNGQKAAGDSGHGHGGATLQCVGRYDLNGNV 90

Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
           ES+A +   G    R RD + L F DAK+S+LE+D+SI  +   S+H FE  E   +++G
Sbjct: 91  ESMAFVRLPG----RNRDHLFLVFRDAKLSILEYDNSIDDIVNVSLHLFEDDE---IRKG 143

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF------------- 220
           R SF R PL++VDP  RC  +LVY  +M+++     GS L  D++               
Sbjct: 144 RVSFGRAPLLRVDPLQRCAALLVYESKMVVIPFKHKGSDLEEDDEILTQPNKKFKSESAS 203

Query: 221 -------GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                  G+       I  ++V++L +  +KHV DF F+ GY EP +  LHE   TWAGR
Sbjct: 204 SNTVTRLGAPSDNKLGILPTYVVDLDEAGIKHVVDFTFLDGYYEPTISFLHENSRTWAGR 263

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           ++  + T MI+ +S++ + ++ P+IWSA  LPH++  ++A+P+P GGV+VV +N + Y +
Sbjct: 264 LAVSNFTGMITTVSLNISQRRQPIIWSASKLPHNSRHIVALPAPAGGVVVVSSNALIYRN 323

Query: 334 QSASCALALNNYAVSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
               CAL LN YA++  D       +   +  D  H   L+    L S  TG+  ++ V 
Sbjct: 324 HEQKCALKLNEYAIAAGDGGNRFDTAGDIICFDTVHPVRLEGYQMLFSLVTGESYIMGVQ 383

Query: 393 Y--DGRVVQRLDLS----KTNPS-VLTSDITTIGNSLFFLGSRLGDSLLVQF 437
              DG  ++ L L     K +PS    S +  +G+S  FLGSRLGDS LV+ 
Sbjct: 384 LDTDGNTIKALTLDLVDVKLSPSGGFASIMCRVGDSYLFLGSRLGDSSLVKM 435



 Score = 76.6 bits (187), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 135/314 (42%), Gaps = 77/314 (24%)

Query: 501 FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------------ 542
           + F + D+L NIGP+     G R++A     G  K+ + ELV                  
Sbjct: 587 YRFELCDTLTNIGPI-----GSRLDA-----GAVKKDSVELVTASGGLQYGKLGVLQRSL 636

Query: 543 --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE----YHAYLIISL--EARTM 588
                    LP  + +WTV+  +++  + D       ++E     HAY++IS   +  T+
Sbjct: 637 NPVVMTAVPLPDAQAVWTVFGPTAKAADEDMEEDGNEEEEQSAGMHAYMVISQGNDKGTI 696

Query: 589 VLETADLLT-EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
           VL+  +L   +  E VD+ V  +T+  GN+FG +R++QV      +L+G    Q+L    
Sbjct: 697 VLKGRELEEFDEDEQVDFEVDAKTVCVGNIFGNQRIVQVTPWNVYVLNGPRKEQELPV-- 754

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
               +G+G +   +++  I DPY+ L + DG + LLVGD S+  V+      +      +
Sbjct: 755 ---VAGNGLQ---IVAAYIRDPYIALILQDGRLNLLVGDASSMQVNY-----VSHEIHNI 803

Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
           ++   + D  P+                GEA D        Q D+       +G  +++ 
Sbjct: 804 TAACFFLDPIPD----------------GEANDDP-----QQRDVMLAAAPRNGHFQLYT 842

Query: 768 VPNFNCVFTVDKFV 781
           +P+   V+    FV
Sbjct: 843 LPSLELVYDAADFV 856


>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
            suum]
          Length = 1434

 Score =  244 bits (622), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 201/699 (28%), Positives = 341/699 (48%), Gaps = 63/699 (9%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREA--LKDSE---TEINSSSEE 809
            V+  E+G L I+ +P    V+ V K     +H+ D  + E   L D     ++I S++  
Sbjct: 773  VMARENGNLYIYSIPEMQLVYMVKKL----SHLPDVAIDEMNYLGDESVVASDIASNTLN 828

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                 + E I    +VE+ +     +  RP LF ++ D  +  Y+ ++++   N      
Sbjct: 829  EALVAKPEEI----IVEVLLTGMGMNQGRPMLFVVV-DDMVSVYEMFMYD---NGVVEHL 880

Query: 870  PVSTSRSLSVSNVSASRLRNLRFS----RTPLDAYTREETPHGA---PCQRITIFKNISG 922
             V   R L  + V+    R+ RF     R P++A  R+   +     P +RI    N   
Sbjct: 881  AVRFKR-LPYTTVT----RSCRFQGNDGRAPVEA-ARDTVRYRTALHPFERIGNILN--- 931

Query: 923  HQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKIC 981
              G F+  S PC  ++    LR+HP   +G I++FT  +NV C +GFIY+T  +  ++I 
Sbjct: 932  --GVFICSSYPCVFLMDSGILRMHPLNLEGPILSFTAFNNVLCPNGFIYLTEREWAMRIA 989

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
            +LP+    D+  PV+KI    T H I Y  + N Y ++ S    K  N  L +L++++  
Sbjct: 990  KLPTDVELDSSLPVRKIRTGRTIHNIVYLLQSNTYAVVGSE---KKPNNRLCVLVNED-- 1044

Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL 1099
               D H     D      +E Y+V++  P+     W+    A I M+  E       V L
Sbjct: 1045 KSFDEH--EKADSFVLPELEVYDVKLYSPED----WKPVPNAEIKMEDFEVLTCCEEVVL 1098

Query: 1100 FNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKEL 1153
             +  T    +  LA+GTA   GE+V  RGR+++         P     ++ +  +Y KE 
Sbjct: 1099 RSEGTVSGVQNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQ 1158

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
            KG +++L S  G+LL   G K+ +  +    L GI+F D    Y+  L  V+N  L  DI
Sbjct: 1159 KGPVTSLCSCNGYLLAGMGQKVFIWLFRDNNLQGISFLDMH-FYIHQLVGVRNLALACDI 1217

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSL--DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            ++S+  L ++E+   L+L ++D  ++     A +FLID   ++ ++SDE  NI +F Y P
Sbjct: 1218 YRSVALLRYQEEYKALSLASRDMRAVVQPPMAAQFLIDNRQMAFIMSDEAANIAVFNYLP 1277

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-NRFALLFGTLD 1330
            +  ES  G++L+ R+E ++G +V  F+R++   +S    G       + NR ++LF +LD
Sbjct: 1278 EALESSGGERLILRSEINIGTNVNSFMRVKGHISS----GFVENEHYSLNRQSVLFCSLD 1333

Query: 1331 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
            GS G + PL E  FRRL  LQ+ +   V   AGLN +  R        H     ++VD +
Sbjct: 1334 GSFGFVRPLSEKVFRRLHMLQQLMSSLVAQAAGLNVKGSRAARPQRPNHYLNTRNMVDGD 1393

Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            ++  Y  L L ++ ++A + GT+R  I+ +L +++  T+
Sbjct: 1394 VVFQYLHLSLADKNDLARKLGTSRYHIIDDLTEISRLTT 1432



 Score =  178 bits (451), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 172/656 (26%), Positives = 281/656 (42%), Gaps = 114/656 (17%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE + H RL   V+SLA+        +    S++L F+ AK+SV+ F  +   L+  S+H
Sbjct: 107 LECIIHVRLLAPVKSLAV---ARIPQNPSCSSLLLGFDTAKLSVVGFSAAERSLKTISLH 163

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G  +    P+++VDP  RC  +L+YG  + +L        L       
Sbjct: 164 CFEEE---MLKDGYVTDLPSPVIRVDPAQRCAVMLIYGRYLAVLPFDDTSPHL------- 213

Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       ++ + L  +D  + ++ D  F+ GY EP ++ L+E   T AGR   ++
Sbjct: 214 -----------HTYTVALSSIDPRLVNIIDIAFLDGYYEPTLLFLYEPAQTTAGRACVRY 262

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS- 337
            T  +  +S++T  + H  +W   NLP D  ++L +P PIGG L++GAN + Y +QS   
Sbjct: 263 DTVCMLGVSLNTKEQVHASVWQLNNLPMDCNQVLMIPRPIGGALIIGANELIYLNQSVPP 322

Query: 338 CALALNNYAVSLDSSQELPRSS---FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD 394
           C   LN+    +D   + P  S    ++ LD   A  +  +  ++  ++G L +LT+V D
Sbjct: 323 CGSLLNS---CMDGFTKFPLKSEKEMALTLDGCAACVISTNKVVVCARSGALFILTLVVD 379

Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
               V+ ++        +   +T       F+GSR+GDSL +++          S L   
Sbjct: 380 STNSVKSIEFKHEFDVSIPHTVTACSPGYLFVGSRVGDSLFIEYV---------SEL--- 427

Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ---KTFSFAVRDSLV 510
              +  D P  K+L+    +  QD +  E+L LYG A  +  S     +   F V D ++
Sbjct: 428 ---VPVDDPIEKKLK---VEVPQDDLEDEDLELYGKALPSVISQDVSVEKMRFRVLDRML 481

Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS---- 566
           N+ P K  +           +G S+  N  L E P    ++      + GH  DSS    
Sbjct: 482 NVAPCKKMT-----------SGCSEGLNSYLQEQPRLDPVFD--RVCACGHGKDSSICIF 528

Query: 567 ---------------------RMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
                                 +   +D+ H Y+I S E  ++ LET + L E+   V +
Sbjct: 529 QQSIRPDIITSSSIEGVIQYWAVGRREDDTHMYIIASKELGSLALETDNDLVELEAPV-F 587

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ----DLSFGPSNSESGSGSENSTV 661
                TIAAG L      +QV      ++      Q     L+F               V
Sbjct: 588 ITSESTIAAGELADGGLSVQVTTSSIVVVAEGQQIQLIPLQLTF--------------PV 633

Query: 662 LSVSIADPYVLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
           LS SI DP+V +   +G + L  L   P     +V  P  I  +K P+++  +Y D
Sbjct: 634 LSASIVDPFVAICTQNGRLLLYELDNTPHVHLKAVDLPGNIIHNKSPITALCIYRD 689


>gi|346971831|gb|EGY15283.1| cft-1 [Verticillium dahliae VdLs.17]
          Length = 1445

 Score =  244 bits (622), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 351/1504 (23%), Positives = 611/1504 (40%), Gaps = 258/1504 (17%)

Query: 57   NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
            NL+V+  ++++I+ V+         +  +K + N+GET  R + D  G+ +A        
Sbjct: 28   NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSTSNAGETYDRRINDDDGLESAFLGGDGML 87

Query: 100  ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
                      L LV  Y +HG +  LA +      +SR   +++++    A++S+L++D 
Sbjct: 88   MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144

Query: 150  SIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMII 203
              HG+   S+H +E  EW      + S   GPL      ++ DPQ RC   L +GL+ I 
Sbjct: 145  EKHGVEDISIHFYEKEEW------QGSPMDGPLRQHATILQADPQSRCAA-LKFGLRKIA 197

Query: 204  L-------------KASQGGSGLVGDEDTFGSGGGFSARIES----------SHVINLRD 240
                             +   G    E+   +     +   S          S V+ L  
Sbjct: 198  FLPFRQIDGDIDMDDWDEEVDGPRPQEEPPAAAAVHGSSSNSSSLAPVPYTPSFVLALPQ 257

Query: 241  LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
            LD  + H   F F+H Y EP + I+              H T  +  + +        L 
Sbjct: 258  LDPEILHPVHFAFLHEYREPTLGIISSTNRRLKMEPQKDHFTFKVFTVDL--------LQ 309

Query: 299  WSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPR 357
             +++N      K++A+P P+GG L++G N  IH      +  +A+N YA  +       +
Sbjct: 310  KASLN------KVIALPKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKMTKFPLADQ 363

Query: 358  SSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVL 411
            S   + L+      +  +N   LL T+ G++ ++T   DGR V  + +    ++    VL
Sbjct: 364  SELKLRLEHCEVELMSPENGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVATENGGDVL 423

Query: 412  ---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
                + +T +  +  F G+  GDS ++    G     + +  K+     E+         
Sbjct: 424  PFRAACLTKVTKNSMFYGTIGGDSKVI----GWSRQHVQTARKKARLLDESLDYDLDDDE 479

Query: 469  RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
                D   D + GE       ++    +      F V DSL+++ P+ D +YG       
Sbjct: 480  ADDDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMTYGKTAFFPG 539

Query: 522  ---------LRINAD-ASATGISKQSNYELV------------ELPGCKGIWTV-----Y 554
                     +R   D   A G  +  +  L+            + P  +G WT       
Sbjct: 540  SEDAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFDFPEARGFWTTRVQKTI 599

Query: 555  HKSSRGHNADSSRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTESVDYF--- 606
             KS +G     + +A  +D     +Y  ++I++ +      ET+D+        +     
Sbjct: 600  AKSLQGDKG--ANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGAGFEALSGT 656

Query: 607  ----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTV 661
                  G TI AG +    R+IQV     R  DG   ++Q L     + E+G+      V
Sbjct: 657  EFDPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETGA---EPRV 711

Query: 662  LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK----G 717
            +S SI DPY+LL   D SI +           +        S K +S C LY D      
Sbjct: 712  ISASIVDPYLLLLREDSSILVAQITNHNELEELDKEDETIVSTKWLSGC-LYKDSRGLFA 770

Query: 718  PEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV 777
            P    + TST   +   +  AI          G+++  +C       ++ +PN +    V
Sbjct: 771  PVQTDKGTSTSESVFLFLLNAI----------GELHVRIC-------VYALPNLSKSIYV 813

Query: 778  DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
                +G ++I           S    + ++  GT     E +  + V +L     ++ H 
Sbjct: 814  ---AAGLSYI----------PSLLSADYTARRGTS---PETLTEILVADLGDSTSASAH- 856

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
               L     +  +  Y+ +   G E   K D     + SL    VS S L     +++P+
Sbjct: 857  ---LILRHANDDMTIYEPFRIGGQEE--KED----LANSLFFKKVSNSHL-----AKSPV 902

Query: 898  DAYTREETPHGAPCQRITIFK---NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
            +A   E         R+   +   NI G+   FL G+ P + +   +       L    +
Sbjct: 903  EAAEDEAVQE----NRVIPLRACDNIGGYSTVFLPGASPSFILKSSKSTPKVIGLQGLGV 958

Query: 955  VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFAEK 1013
               +  H   C  GFIY  S+G  ++ Q P  +        V+K+P+      + +    
Sbjct: 959  NGMSSFHTEGCERGFIYADSKGCARVTQFPDAANVAELGVSVRKVPIDTAVSHVAWHPNM 1018

Query: 1014 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY-EVRILEPDR 1072
             +Y   V+   L+P           E+    D H   + +      ++E+  +++  P  
Sbjct: 1019 EVY--AVASSKLEPF----------ELPKDDDYHKEWAKEECPMPPMKEHGSIKLYSPIT 1066

Query: 1073 AGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1131
                W       ++  E A+ ++ + L  +  TKE   L A+GTA ++GED+  RGR+L+
Sbjct: 1067 ----WNVIDEFELEQYEVAMCMKTLLLEVSEETKERRMLFAVGTAILRGEDLPVRGRILV 1122

Query: 1132 FSTGRNADNPQNLVTE-----VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWT 1181
            F        P    T+     +  +E+ +GA+++L  +  QG +L+A G K ++   K  
Sbjct: 1123 FDVVHVIPQPDRPETDRKLKLIAKEEIPRGAVTSLCEVGTQGLMLVAQGQKCMVRGLKED 1182

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1239
            GT L  +AF D    YVV+++ ++N  + L+ D +  ++F+ + E+  ++ L  K    L
Sbjct: 1183 GTLLP-VAFLDMS-TYVVAVHELRNTGYCLMADANMGVWFVGYSEEPYRMTLFGKSGTQL 1240

Query: 1240 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFL 1298
             C   +FL+ G+ LS+V SDE   + I  + P+   S +G  LL+RA F V   H    L
Sbjct: 1241 KCLTADFLVAGNDLSIVASDEDGVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWATL 1300

Query: 1299 RL------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
             L        L  S   TGAA   ++T    LL  +  G+I  + P+ E  +RRL SL  
Sbjct: 1301 VLPRTTTRPYLPQSEPATGAAGSQNRTQ--TLLLASASGAIASLNPITEHAYRRLTSLTT 1358

Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIA 1407
             L +++PH AG+NP++ R    +G A  P  D     +IVD  LL+ +  L   ++ E A
Sbjct: 1359 SLANALPHAAGMNPKAHRLPPQDGAARPPAVDVSAGRTIVDGALLARWNELGARQRAEAA 1418

Query: 1408 HQTG 1411
             + G
Sbjct: 1419 GKGG 1422


>gi|449299306|gb|EMC95320.1| hypothetical protein BAUCODRAFT_25380 [Baudoinia compniacensis UAMH
            10762]
          Length = 1437

 Score =  243 bits (620), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 343/1491 (23%), Positives = 585/1491 (39%), Gaps = 263/1491 (17%)

Query: 52   IGP-VPNLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
            IGP   NLVV   ++++++ V R+ +       +  + + R          L L+  Y L
Sbjct: 22   IGPQADNLVVAKTSLLQVFEVKRISQAKDNGHHDHADAQSR----------LSLIGEYTL 71

Query: 110  HGNVESLAILS-----QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
             G V +L+ ++      GGA       +++ AF+DAK+S++E+D   + +   S+H +E 
Sbjct: 72   SGTVTALSPITLPSSRTGGA-------ALVCAFKDAKLSLIEWDPEHYRISTISIHYYEG 124

Query: 165  PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------- 211
               L    G        ++ VDP  RC  +     Q+ IL   Q G              
Sbjct: 125  DNVLLPPFGAALSECESILTVDPGSRCAALKFGERQLAILPFRQQGDELADEAAEDADMA 184

Query: 212  --------GLVGDEDTFGSGGGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIE 257
                    G V  + T  +    S       +SS V+ L  LD  + H     F+H Y E
Sbjct: 185  EAESEEQPGNVTLKRTSTTQALDSKDDITPYKSSFVLPLITLDPSLTHPVHLAFLHEYRE 244

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            P   IL   +      +  +      +  ++    +    + S   LP D +K++A+P P
Sbjct: 245  PTFGILSAPQQPSLALLDERKDCLSYTVFTLDLEQRASTNLMSVSKLPSDLWKVIALPPP 304

Query: 318  IGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
            +GG L+VG N  IH      + A+A+N +A    +      S  +++L+      L +  
Sbjct: 305  VGGALLVGTNELIHIDQSGKTTAVAVNEFAKVASNFSMADHSDLNMKLEGCEIEMLDSST 364

Query: 377  --ALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSD----ITTIGNSLFFLGS 427
              AL+    G    L+    GR V  L +S+   TN   + +     + ++     F+GS
Sbjct: 365  GNALIVLNDGSFATLSFKMLGRTVGGLTVSRVADTNGGNVNASAPSCVASMQQQKLFVGS 424

Query: 428  RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD---------------APSTKRLRRSSS 472
              G S LV++   + T        +  G                    APS   ++R++S
Sbjct: 425  EDGSSSLVRWAKDTPTLSRKRSHAQMLGQDAPMDDADDAEELDEDDLYAPSAVAVKRAAS 484

Query: 473  DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RINAD--- 527
                             A+     A  T++F + DSL ++ P+ +   G   R   +   
Sbjct: 485  ----------------VANAAAVDASTTYTFELEDSLNSLAPMNNVCLGRSPRTGKEKLE 528

Query: 528  -ASATGISKQS-----NYELV-------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
              +  G  K S     N E++       ++ G K IW+V  +S  G    S+      D 
Sbjct: 529  LVAGIGRGKASSLAFMNREIIPNEIRSRDVAGAKDIWSVCARSREGDKVSSA------DT 582

Query: 575  YHAYLIISLEARTMVLETADL----LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
            Y   L +     T   + AD     + E+ E+ D+   G T+  G L     ++Q     
Sbjct: 583  YDNLLFVFDGESTKTYKYADSAEGSIIELDET-DFEGDGETVCVGTLANGSCIVQCRRTE 641

Query: 631  ARILDGSYMTQDLSFGPSNSESGSGSENST---VLSVSIADPYVLLGMSDGSIRLLVGDP 687
             R       T D   G S     S  E      +++ S  DPY+L+   D S+++L  D 
Sbjct: 642  IR-------TYDHQLGLSQIIPMSDDETDAELKIVATSFCDPYLLVIQDDSSVQILQVDK 694

Query: 688  STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
                         +   +P+ +         E  LR+     WL+  +         G L
Sbjct: 695  -------------QGDVEPLDAA--------ESDLREGK---WLTGSL-------YAGEL 723

Query: 748  DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
              G   + +  + G L++F +P    V++              ++   L         S+
Sbjct: 724  SDGQSAAFLLGQEGGLQVFSLPETKLVYSAPTL---------PFLPPVL---------SA 765

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
            +    +G K  +  + VV+L  +  +    RP+L        ++ Y+ + +         
Sbjct: 766  DAPQRRGGKVTLTEVLVVDLGAEGVT----RPYLIVRTAMDDLILYEPFHY--------- 812

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREE----TPHGAPCQRITIFKNISGH 923
                    S +  +  A+   +LRF + P     + +    T  G P Q       I G 
Sbjct: 813  --------SATTLDARATGFTDLRFRKVPFTYLPKYDEGLDTADGRPAQLQPAV--IGGR 862

Query: 924  QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
               +L G  P + +     L     L    + +F+ LH   C  GF  V   G LK  QL
Sbjct: 863  NALYLPGGTPSFLVKEATSLPKVLGLRARGVRSFSPLHRAGCQQGFALVDGDGKLKEYQL 922

Query: 984  PSGSTYDNYWPVQKIPLKATPH---QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
            P   ++   W V+ + L   P    Q+ Y  ++ +Y       V+     V   L D + 
Sbjct: 923  PGHVSFATGWSVRTLTLGEPPQEVRQVAYHEQRGIY-------VVATCRDVDFTLHDLDE 975

Query: 1041 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1099
              + D  NL          V +Y + +L    +    +   ++ M  +E   +++++ L 
Sbjct: 976  RQRDDEPNLKP-------QVPQYTLHLL----SATSHKVIQSLEMPYAEIVTSLKIMPLE 1024

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS---TGRNADNPQNLVTE--VYSKELK 1154
             +  T E + +L +G A  +GED  A+G + +F         D+P++ +       +E K
Sbjct: 1025 VSEHTHEQKLMLVVGAAAQRGEDAPAKGLLTVFDIIDVVPEPDDPESGIRLHIAAREETK 1084

Query: 1155 GAISALASLQGHLL-IASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFIL 1209
            GAI+AL S  G L+  A G KI++   K  GT L  +AF DA   Y+VSL  +      L
Sbjct: 1085 GAITALESFSGGLVGTAQGQKIMVRGLKEDGTCLP-VAFLDAQ-TYMVSLKTMGRSGLSL 1142

Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
             GD  K ++F  W E+  +L LL K    ++  + EFL     L L+V D + ++ +  Y
Sbjct: 1143 AGDAWKGLWFGGWTEEPYRLTLLGKSRTKMEVVSAEFLPFDGQLYLLVVDGKMDLHVLQY 1202

Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRL---------QMLATSSDRTGAAPGSDKTN 1320
             P+  ++  GQ+LL ++ FH+G      L L         Q   T+ D  G   G++ + 
Sbjct: 1203 DPENPKTVSGQRLLHKSTFHLGHWPVDMLLLPSDLAPFAQQAPLTNGDSNGHTNGTESSA 1262

Query: 1321 R---------FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
                      F +L     G++G I P+DE T+RRL +LQ +L   + H AGLNPR++R 
Sbjct: 1263 ANAPAPAPSLFHVLTTFQSGAVGLITPVDEATYRRLGALQTQLTSVLEHAAGLNPRAYRA 1322

Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
              S     R     +VD  L+     L    + E+  + G     + S+L 
Sbjct: 1323 VESESLGGR----GVVDGMLVQRIGELGAARRAEVLGRAGADAWGLRSDLE 1369


>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1628

 Score =  242 bits (618), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 156/546 (28%), Positives = 276/546 (50%), Gaps = 79/546 (14%)

Query: 912  QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG----------------SIV 955
            +RI  F +ISG +G F+ G +P W    +  LR+H                      ++ 
Sbjct: 1122 KRIFEFSSISGKRGLFIGGKKPIWAFCEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVE 1181

Query: 956  AFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKN 1014
             FT  +N++C  GFIY + +  ++KIC L +   ++N   +++IP K + H+I Y +E  
Sbjct: 1182 TFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIRRIPTKNSCHKIAYHSEAK 1241

Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1074
             Y +IVS P               +V  ++   +   +      T +++++++++P    
Sbjct: 1242 CYVVIVSFP---------------QVTQELQEDSKKPI-----LTDDKFQIKLIDP-TID 1280

Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTT---TKENETLLAIGTAYVQGEDVAARGRVLL 1131
              W+   +  +Q  E  L +++V+L  T           L IGTA+  GED   +GRVL+
Sbjct: 1281 WNWKFIDSFSLQDRETVLAMKIVSLKFTEPDGITRARPFLVIGTAFTFGEDTQCKGRVLV 1340

Query: 1132 F------STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1185
            F      +   + +  +  +  +Y KE KG ++AL+S+ G LL+  GPK+ ++++    L
Sbjct: 1341 FEIVSHKTQFESEELGEKRLNLLYEKEQKGPVTALSSVNGLLLMTIGPKLTVNQFYTGSL 1400

Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
              ++FYDA  +Y+ S+  +KN+I++GD++KS+YFL WK+    LNLL+KD+ +L+ F+TE
Sbjct: 1401 VTLSFYDAQ-IYICSICTIKNYIVIGDMYKSVYFLQWKDNKT-LNLLSKDYQALNIFSTE 1458

Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
            F+++  TLS++VSD  KNI +F + P+   S  GQ                    Q +  
Sbjct: 1459 FIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSGQ------------------INQEING 1500

Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
            ++      P  ++     ++FGTLDG +  + PLDE  +     +Q KL   +P  AGLN
Sbjct: 1501 NNKNDNRLPKKEQ----LVIFGTLDGGLNVLRPLDEKIYLLFYHIQSKLY-YLPQTAGLN 1555

Query: 1366 PRSFRQFHSNGKAHRPGPDS-------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
            P+ +R F S  +     P +       I+D +L+S +  L   E+  I++   +T  +I+
Sbjct: 1556 PKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEKRLISNSINSTSDEII 1615

Query: 1419 SNLNDL 1424
             +L D+
Sbjct: 1616 ESLKDV 1621



 Score =  199 bits (505), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 156/585 (26%), Positives = 266/585 (45%), Gaps = 128/585 (21%)

Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
           +++++++VKDF F+HGY EP ++ LHE   TW  R++ K  TC ++A+S++   K    I
Sbjct: 281 KNIEIENVKDFCFLHGYYEPTILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFI 340

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W+  N P++   L++VP P+GG LV+ AN + Y +Q++   LA+N YA S+D+S  +   
Sbjct: 341 WNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYA-SIDTSTIIGSQ 399

Query: 359 SFS----------VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
            F             LD ++  +L++D  + S K G+L++  ++ DGR VQR+ +SK   
Sbjct: 400 PFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQRIHVSKAGG 459

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT------SMLSSGLKEE-------FG 455
           SVLTS I  + N+L FLGSRLGDSLL+Q+T  S T         S+  K++         
Sbjct: 460 SVLTSCICVLSNNLIFLGSRLGDSLLLQYTEKSITDDQLEHENFSNPYKKQKTSEVFDLF 519

Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELS-------------LYGSASNNTESAQKTFS 502
           D  ++  +      +++   Q+  +   ++             L+    N  +S Q    
Sbjct: 520 DENSETNNNNNSNNNNNKENQEKSSSSSIASKLLEEIEDEEDQLFKEKKNQLKSYQ---- 575

Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY-----ELV--------------- 542
             + D ++NIGP+ D   G  I+     T    Q  Y     ELV               
Sbjct: 576 LGICDQIINIGPIGDIVVGQSIDPTYDETIQPNQPEYVPKTLELVTCSGYGKNGSISVLQ 635

Query: 543 -----------ELPGCKGIWTVY------------------HKSSRGHNADSSRMAAY-- 571
                      ELPG   +WTVY                   K SR  N ++ +      
Sbjct: 636 NNIKPELVMAFELPGILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQE 695

Query: 572 ----------------DDEYHAYLIISL-EARTMVLETADLLTEVTESVDYFVQGRTIAA 614
                           D  +H YL +SL +  T++ ET   L EV +        +++  
Sbjct: 696 DNEDNEEEEEEEKMQKDKNWHDYLYLSLKDGTTLIFETGRDLKEVGK-----FNFKSLDI 750

Query: 615 GNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           GNLFGR+R++ +++ G ++++G   + Q++              N  + S  I DP++LL
Sbjct: 751 GNLFGRKRIVVIYQGGIKLINGFDRVIQEIQI------------NEPIKSSYICDPFILL 798

Query: 674 GMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
              +G+I++  G D     +     +   +  + + S +L+ D+ 
Sbjct: 799 QFHNGTIQIFKGIDEENQLIQFSINSISNNLNQSIFSSSLFFDRN 843



 Score = 91.3 bits (225), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/168 (34%), Positives = 89/168 (52%), Gaps = 18/168 (10%)

Query: 57  NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGISAA-------SLELVC 105
           NLV+   NV++IY +R ++    E   +S+   + ++      I+         SLEL+ 
Sbjct: 32  NLVLAKTNVLQIYKIRYEKIEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELII 91

Query: 106 HYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
             +L GN+ES+A +    ++    RDS+IL F DAKISVL++D  +    I S+H FE  
Sbjct: 92  EKKLFGNIESMASVRYPNSE----RDSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKD 147

Query: 166 EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           E+   K GR  F   PL+KVD Q RC  +L+Y   + +L   +  S L
Sbjct: 148 EF---KGGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSIL 192


>gi|347838999|emb|CCD53571.1| similar to Cleavage and polyadenylation specificity factor subunit 1
            [Botryotinia fuckeliana]
          Length = 1447

 Score =  241 bits (615), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 338/1519 (22%), Positives = 608/1519 (40%), Gaps = 243/1519 (15%)

Query: 57   NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
            NLVV  +++++I+  +        + E+ S  +K+      RV  D G+           
Sbjct: 28   NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87

Query: 98   -------AASLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
                      L LV  Y L G V SL     I S+ G +      +I++ F+DAK+S++E
Sbjct: 88   QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141

Query: 147  FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
            +D    G+   S+H +E  E                + VDP  RC  +      + IL  
Sbjct: 142  WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201

Query: 207  SQGGSGLVGDEDTFGSG--------------GGFSARIESSHVINLRDLDMK-----HVK 247
             Q     + D D    G              G       SS V+ L  LD       H++
Sbjct: 202  KQDEDVNMDDWDEELDGPRPAKISQKAAAEDGQLDTPYGSSFVLRLSSLDPSIIFPIHLE 261

Query: 248  DFIFVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLP 305
               F++ Y EP   IL       +  +  +  H T M+  L +    K    I S   LP
Sbjct: 262  ---FLYEYREPTFGILSSTMAPSSALLQERRDHLTYMVFTLDMHQ--KASTTILSVGGLP 316

Query: 306  HDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
            +D ++++ +  P+GG L+VG N  IH      +  +A+N +A        L ++   + L
Sbjct: 317  YDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTGFSLLDQADLDLRL 376

Query: 365  DAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT---SDI 415
            +      L  +N   L+   +GD+ +L+   DGR V  L + + +     ++LT   S +
Sbjct: 377  EGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGAILTGAASCV 436

Query: 416  TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
            +++G    F+GS + DS+++ +   SG +       +     E D            +  
Sbjct: 437  SSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVDE---AMFDEEDLEDD 493

Query: 476  QDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSYG---LRINADASA 530
             D + G+  ++  + +N T S  KT  ++F + DS+VNI P+ + ++G   L +  D   
Sbjct: 494  DDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAFGEAALSLGKDEEL 553

Query: 531  TGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGHN 562
                 QS  +LV                          +LP  +GIWT+  K  + +G  
Sbjct: 554  KSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIWTMSAKRPAPKGLQ 613

Query: 563  ADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLTEVTESVDYF-VQG 609
             +  +         D +Y   +I+S   +A   + E+A     D   E     ++    G
Sbjct: 614  VNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGFEALTGTEFEPAAG 673

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
             TI AG L    RV+Q+ +   R  DG   + Q L     + E+G+      ++S S AD
Sbjct: 674  STIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA---EPKIISASFAD 728

Query: 669  PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
            P++LL   D SI +   D       ++    I  S K ++ C LY D           +D
Sbjct: 729  PFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD------YSGAFSD 781

Query: 729  AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK---FVSGRT 785
            +  S   GE             ++   +    GAL I+ +P+ +    V +   FV    
Sbjct: 782  SK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPVYVAEGICFVPPVL 827

Query: 786  HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
                   + A +++ TEI                       L      +    P+L    
Sbjct: 828  SADYAARKSAARETLTEI-----------------------LVANLGDSVSQSPYLILRP 864

Query: 846  TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET 905
            ++  +  Y+ +  +            S S  L  S +   +++N   ++ P    + EE 
Sbjct: 865  SNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHLTQAP--DVSAEEQ 910

Query: 906  PHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV 959
              GA      P + I+   N+ G+   F+ G  P + +   +       L    + + + 
Sbjct: 911  VDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKVLSLQGTGVRSLSS 967

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFAEKNLYPL 1018
             H   C+ GFIY +++GI ++ Q P  +T+ D    ++KI +    H + Y      Y +
Sbjct: 968  FHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDVHAVAYHPPLQTYVI 1027

Query: 1019 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL-SSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
              S               D E+    D+       ++    ++E+  ++++ P      W
Sbjct: 1028 GTST------------FTDFELPKDDDHRKTWQEENIALKPSIEKSFLKLVSPVN----W 1071

Query: 1078 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF---S 1133
                 I ++  E    ++ + L  +  T E + L+ +GTA  +GED+A  GR+ ++   +
Sbjct: 1072 SVIDAIELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDLATTGRLYVYDVVT 1131

Query: 1134 TGRNADNPQN------LVTEVYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGT 1183
                 D P+       + +E+ ++   G ++ L+ +  QG +L+A G K ++   K  GT
Sbjct: 1132 VVPEPDRPETNKKLKLISSEIITRGAGGPVTGLSEIGTQGFMLVAQGQKCMVRGLKEDGT 1191

Query: 1184 ELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
             L  +AF D    YV S+  +      ++ D  K ++F  + E+  ++ L  K    ++ 
Sbjct: 1192 NLP-VAFMDMN-CYVTSVKELPGTGLCVMADALKGVWFAGYTEEPYRMLLFGKSAAKMEV 1249

Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH-------V 1294
               + L DG  L +V +D   N+ I  Y P+  +S +G  LL R  F +GAH       +
Sbjct: 1250 LCADLLPDGKDLFIVAADANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTLL 1309

Query: 1295 TKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
                 L  L T+   +      + T   +  LL  +  G++  ++PL E  +RR  +L  
Sbjct: 1310 PTTRPLPQLTTAPSPSPDPSPQEDTPSPSQPLLLTSRTGTLALLSPLTESQYRRFGTLVS 1369

Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
             L +++ H  GLNPR++R    +      G  +I+D  +L  +  L  + + E+A + G 
Sbjct: 1370 HLTNTLYHPCGLNPRAYR-IDRDANEGIVGGRTIIDGGVLGRWMELGSQRRGEVAGRVGV 1428

Query: 1413 TRSQILSNLNDLALGTSFL 1431
               ++   L+ L  G  F+
Sbjct: 1429 DVLELRDELSGLRGGLGFI 1447


>gi|401889164|gb|EJT53104.1| cleavage and polyadenylation specific protein [Trichosporon asahii
            var. asahii CBS 2479]
          Length = 1358

 Score =  240 bits (613), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 359/1466 (24%), Positives = 599/1466 (40%), Gaps = 236/1466 (16%)

Query: 45   ELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---------G 95
            E+P  + +G   NLVV     + ++ +R  EE +    +     ++  MD          
Sbjct: 39   EVPDVKVVG---NLVVAGGQDLRVFEIR--EESTPLPDDESAVPKQEDMDVGDSFFDSAP 93

Query: 96   ISAAS--------LELVCHYRLHGNVESLAILSQGGADNS-RRRDSIILAFEDAKISVLE 146
            I  A         L L+  + LHG V  LA L     D+S    D ++++FE AK S  +
Sbjct: 94   IERAPVRYKTTRRLHLLTRHTLHGVVTGLAGLRT--IDSSVDGLDRLLVSFEHAKWSRGD 151

Query: 147  FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
                   +   S+H +E  + + +    + +   P+++ DP  R   + +    + +L  
Sbjct: 152  -------IATVSLHTYERCQQM-INGNFQGYV--PMLRSDPLSRLAILTLPEDALAVLPI 201

Query: 207  SQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
             Q  S L   +D+  S                   ++K++KDF+F+ G+  P + +L   
Sbjct: 202  VQEQSELDAMQDSVSSP------------------EIKNIKDFLFLPGFHSPTIALLFAP 243

Query: 267  ELTWAGRVSWKHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
              TWAGR      T  +   +I T+    +PLI S   LP D+  L+A PS +GGV+VV 
Sbjct: 244  MNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLITSVTGLPSDSQYLVACPSEVGGVVVVT 303

Query: 326  ANTIHYHSQSAS-CALALN---NYAVSL--DSSQELPRSSFSVELDAAHATWLQNDVALL 379
            A+ I +  QS    + ++N   NY  ++  DSS E    S  + LD +HA ++  +  LL
Sbjct: 304  ASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSYE----SQKLALDNSHAQFVTENDMLL 359

Query: 380  STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT-SDITTIGNSLFFLGSRLGDSLLVQFT 438
              +TG++  +    DGR V  + + + + +V   S +   G+   F+GS  GDSLL    
Sbjct: 360  VLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPPSTLVPAGSDGIFVGSVEGDSLLAMVE 419

Query: 439  CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE---ELSLYGSASNNTE 495
                 S      +EE        P TK+      D  +++  G     +      S    
Sbjct: 420  KARDQSA-----QEE--------PETKQQEMDVDDWDEEVATGPVTVSVKAQDVLSGIGR 466

Query: 496  SAQKTFSFAVRD-------SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK 548
             A   F  AV D        LV IG     S G  +N       I+K+  +E  +L    
Sbjct: 467  IADMEFGIAVTDLGTRTYPQLVCIG---GGSQGSTMNVFRRGIPITKRRLFE--QLRTAV 521

Query: 549  GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
              W +  + +   NA   +     ++    +  + E  T +   +    +V E +  F +
Sbjct: 522  ATWFLPVERA---NAPKFKDIPESEQSTIAIAATQEGSTQIFALS--TRKVQERIAEFPE 576

Query: 609  GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES-GSGSENS---TVLSV 664
               IA G    R R++ V      +LD            SN+   G+  E S    +++ 
Sbjct: 577  P-AIATGTWLRRTRIVLVLPSQVLLLD------------SNANPVGTICEMSDAPPIVAA 623

Query: 665  SIADPYVLLGMSDGSIRLLVGD-----------PSTCTVSVQTPAAI------------- 700
            SIADPYVL+  +DGS+ + VGD           P    + V   A +             
Sbjct: 624  SIADPYVLIRRADGSVSVFVGDTVEGKWSEAPMPEGLALPVCQAAEVFTDTTGIYRTFEA 683

Query: 701  -----ESSKKPVSSC----TLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
                 E   KPV +        H  G E   R   +   +S  V       +     +G 
Sbjct: 684  TQGVKEEPVKPVPTKQGQKAKIHLTG-EQLKRLQDSKPAISADVATTESAFNAA---RGT 739

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
             +  +  +SG L+I  +P+F+ V   +        + D+    +  D +T      EEG 
Sbjct: 740  QWIALLAQSGELQIRSLPDFDLVLQSNG-------VYDS--EPSFTDDQTGELPELEEG- 789

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
                 + +  M    +  +       RP +  +   G +  Y+A     P  T  + D  
Sbjct: 790  -----DEVSQMLFCPIGTRTL-----RPHVIVLHRSGRLNIYEAQ----PRFTVDARD-- 833

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLS 929
             + RSL+V      R R +    T L + T   T  P   P      F +I G  G F++
Sbjct: 834  QSRRSLAV------RFRKVH---TQLLSVTPSSTVKPAAIP------FTDIEGLTGAFIT 878

Query: 930  GSRPCWCMVFRERLRVHPQLCDG-SIVAFTVLHNVN-CNHGFIYVTSQGILKICQLPSGS 987
            G RP W +        HP    G    A+      +   HG  ++  +    IC +P   
Sbjct: 879  GERPHWIISSDS----HPIRAFGLKQAAYAFCKTTHQGGHGEYFLRIEDGSFICYMPPTL 934

Query: 988  TYDNYWPVQKIPLKATPHQITYFAEKNLY--PLIVSVPVLKPLNQVLSLLIDQEVGHQID 1045
              D   P  +  ++ T   + +      Y     +SVP  +  ++   +L+  E    + 
Sbjct: 935  NTDFAMPCDRYKMERTYTHVAFDPPSCHYVAAAAMSVP-FQAYDEEGEILLGPEGPDLLP 993

Query: 1046 NHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1104
              N  SS++L   ++      R+L+                  +E  L V  VTL ++++
Sbjct: 994  PKNERSSIEL---FSAGSEPFRVLD------------GYDFDQNEEVLCVESVTLESSSS 1038

Query: 1105 KEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP---QNLVTEVYSKE-LKGAISA 1159
                   +A+GT    GED A  G V +F             N   +   K+  +  +SA
Sbjct: 1039 PTGFRDFIAVGTGKNFGEDRATSGAVYVFEVVEVVGTKPGVSNWRLKYRCKDPTRNPVSA 1098

Query: 1160 LASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            +A++ G+++ ++GPKI+         L G+AF D   +YV S+ + KN IL+GD  KS+ 
Sbjct: 1099 IANINGYIVHSNGPKILAKGLDYDDRLMGLAFLDVS-MYVTSIRVFKNLILVGDFVKSLI 1157

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
            F S +E   +   + +D   L   A +FL+    ++ + +D+  N+++  + P   +S  
Sbjct: 1158 FASLQENPYKFVTIGRDLADLSLTAADFLVHEGQVTFITNDQHGNMRLVDFDPANPDSLN 1217

Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
            G+KLL++ EF  G  VT    +    T+ +    AP S       L++ T DG+I  +  
Sbjct: 1218 GEKLLTQTEFGTGCPVTASCMIARRKTAEEE--FAPQSQ------LIYATADGAITSVVA 1269

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E  F+RLQ +Q +LV +  HVAGLNPR+FR    N    RP    ++D  LL+H+ + 
Sbjct: 1270 VKEARFKRLQLVQDQLVRNAQHVAGLNPRAFRTVR-NDLVPRPLARGVLDGGLLAHFALQ 1328

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDL 1424
            PL  Q E+  Q GT    + S+L  L
Sbjct: 1329 PLRRQREMMRQIGTDAVTVGSDLYTL 1354


>gi|116182170|ref|XP_001220934.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
 gi|88186010|gb|EAQ93478.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
          Length = 1394

 Score =  238 bits (607), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 339/1379 (24%), Positives = 549/1379 (39%), Gaps = 242/1379 (17%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
            NL V  +++++I+  +V       S+N+G   R             DG+ A+        
Sbjct: 41   NLAVAKSSLLQIFRTKVIATELDTSQNNGHRTRNANRYESRLANDDDGLEASFLGGDSLA 100

Query: 100  ---------SLELVCHYRLHGNVESLAILSQGGADNSRR-RDSIILAFEDAKISVLEFDD 149
                      L LV  + L G V  L  +      N+R   DS++LAF+DAK+S++E+D 
Sbjct: 101  QRTDRANYTKLVLVAEFPLAGTVTGLVRIK---TPNARLGLDSLLLAFKDAKLSLVEWDT 157

Query: 150  SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
              H L   S+H +E  E                +  DP  RC  +      + IL   Q 
Sbjct: 158  EHHTLSTVSIHYYEQEELQGSPWAAPLSHYANFLAADPGSRCAALKFGARNLAILPFKQA 217

Query: 210  GSGL-VGDEDTFGSGGGFSARIES----------------SHVINLRDLD--MKHVKDFI 250
               + +GD D    G   +  + S                S V+ L +LD  + H     
Sbjct: 218  DEDIDMGDWDEELDGPRPAKDLSSAVINGASNIEDTPYSPSFVLRLSNLDPSLLHPVHLA 277

Query: 251  FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
            F+H Y EP   IL              H   M+  L +    K    I S   LP D ++
Sbjct: 278  FLHEYREPTFGILASTAAASNSLGRKDHFVYMVFTLDLQQ--KASTTILSVTGLPQDLFR 335

Query: 311  LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
            ++ +P+P+GG L+VG+N  IH         +A+N       S   + +S  ++ L+    
Sbjct: 336  VVPLPAPVGGALLVGSNELIHIDQSGKPNGVAVNPMTKHCTSFGLVDQSDLNLRLEGCVI 395

Query: 370  TWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGN 420
              L  D+   L+    G + ++T+  DGR V  L+L    + +  S++   ++T   IG 
Sbjct: 396  DVLAADLGELLIILNDGQMAVMTLRIDGRTVSGLELKILPASSGGSIVPGRVSTLSRIGR 455

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA------ 474
            +  F G   GDS+L                   FG  +      +R  R+  +A      
Sbjct: 456  NAMFAGLEEGDSVL-------------------FGWAKKQTQVGRRKPRTKDNAGDVDVE 496

Query: 475  ---LQDMVNGEELSLYGSASNNTESAQKTFS--------FAVRDSLVNIGPLKDFSYGLR 523
                 +    +E  LYG AS          S          V D L+N+GP++  +Y   
Sbjct: 497  EDEDIEEEEEDEDDLYGEASAPQHQPVSAVSGLLSGEASLRVHDRLINLGPIQAMTYSQP 556

Query: 524  INADAS-----------------ATGISKQS-----NYEL-------VELPGCKGIWTVY 554
            +    S                 A G  K +     N E+        E P  +G WT+ 
Sbjct: 557  VWLPGSEEERNSAGVHSDLQLVCAVGREKSASLVTMNLEIQPKVIGRFEFPEARGFWTMC 616

Query: 555  HKSSRGHNADSSRMAAY-------DDEYHAYLIIS------LEARTMVLETADLLTEVTE 601
             K        S +   +         +Y  ++I++       E   +   TA     +  
Sbjct: 617  AKKPIPKTLQSDKGGNFLGKDYDVSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLGG 676

Query: 602  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
            +      G TI AG +    R+IQV +   R  DG + ++Q +     + E+G+      
Sbjct: 677  TEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIVPM--LDEETGA---EPR 731

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
             +S SIADP +L+   D S+ +   D S     ++      ++ K ++ C LY D     
Sbjct: 732  AISASIADPLLLIIRDDSSVFVAQMDSSNELEELEKEDQTLATTKWLTGC-LYAD----- 785

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
                 +T A+      E + G  G P     I   +   SG+L I+ +P+ +    V + 
Sbjct: 786  -----TTGAF-----AEEVAGKGGKPAQA--ILVFLLSASGSLYIYRLPDLSKPVYVAEG 833

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
            +S        Y+   L       + S+ +GT    KE +  + V +LA  R    H+   
Sbjct: 834  LS--------YIPPGLS-----ADYSARKGTA---KETVAEILVADLA-NRSQLRHAN-- 874

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
                  D TI  YQ + +    +TS   D    S++L        +L N  F+++P +A 
Sbjct: 875  -----DDLTI--YQPFRY----STSAGAD---FSKTLFFQ-----KLPNAAFAKSPEEAD 915

Query: 901  TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTV 959
              E T H      +    NI+G+   FL G+ P + +   +   RV P L    ++A + 
Sbjct: 916  EDEAT-HQPRMLSMRRCSNIAGYSTVFLPGASPSFIIKSSKSAPRVLP-LQGAGVIAMSP 973

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFAEKNLYPL 1018
             H   C +GFIY  SQ + ++ QLP    Y +    V+KIP+      + Y      Y  
Sbjct: 974  FHTEGCENGFIYADSQHMARVTQLPQDWNYAETGLAVRKIPIGEDIAAVAYHPPMQSY-- 1031

Query: 1019 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1078
            +V    L+P      L  D +   +    NLS        TV+   ++++ P      W 
Sbjct: 1032 VVGCNTLEPFE----LPKDDDYHKEWARENLSF-----KPTVDRGILKLVSPIT----WT 1078

Query: 1079 TRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1137
               ++ M+  E  L V  ++L  +  T E + L+A+GTA ++GED+  RGRV ++     
Sbjct: 1079 VVDSVQMEPCETVLCVATLSLEVSEFTNERKQLIAVGTALIKGEDLPTRGRVYVYDITEV 1138

Query: 1138 ADNPQNLVTEVYSKELK---------GAISALASL--QGHLLIASGPKIILH--KWTGTE 1184
               P    T   SK+LK         GA++AL+ +  QG +L+A G K ++   K  GT 
Sbjct: 1139 IPEPGRPET---SKKLKLIAKEEIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGTL 1195

Query: 1185 LNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
            L  +AF D    YV +   +      LL D  K ++F  + E+  ++ L  K    L+  
Sbjct: 1196 LP-VAFMDMN-CYVTNAKELPGTGLCLLADAFKGVWFTGYTEEPYKMMLFGKSSTKLEVL 1253

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRL 1300
              +FL DG  L +V  D   NI I  + P+  +S +G  LL R  F+ GA H TK L L
Sbjct: 1254 NADFLPDGKDLFIVACDADGNIHILEFDPEHPKSLQGHLLLHRTTFNTGANHPTKSLLL 1312


>gi|320591495|gb|EFX03934.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
            clavigera kw1407]
          Length = 1461

 Score =  238 bits (606), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 341/1432 (23%), Positives = 568/1432 (39%), Gaps = 210/1432 (14%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
            S + L LV  + L G V  LA +   G  +    +++++A +DA++S+LE+D   + L  
Sbjct: 100  SISKLVLVAEFPLAGTVTGLARIKIPGTKSGG--EAVLVALKDARLSLLEWDPDQNDLTT 157

Query: 157  TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
             S+H +E  E                +  DP  RC  +      + IL   Q     V  
Sbjct: 158  ISIHYYEQEELQGAPWAAPLSDYANFLVADPGSRCAALKFGARNLAILPFRQADEEDVDM 217

Query: 217  ED-------------------TFGSGGGFS-ARIESSHVINLRDLD--MKHVKDFIFVHG 254
            +D                     G G G        S V+ L +LD  + H     F+H 
Sbjct: 218  DDWDEELDGPRPAKDPSSAAVVSGPGDGIEDTPFAPSFVLRLSNLDTTLLHPVHLAFLHE 277

Query: 255  YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
            Y EP   IL     T A  V  +         ++    K    I S  NLP D ++++ +
Sbjct: 278  YREPTFGILSSSVSTSA--VIGRRDKLSYLVFTLDLQQKASTTILSVANLPQDLFRVVPI 335

Query: 315  PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
            PSPIGG ++VGAN  IH      +  +A+N +     S     +S  ++ L+      L 
Sbjct: 336  PSPIGGAILVGANELIHIDQSGRANGVAVNPFTKQSTSFGLADQSDLALRLEGCTVDVLS 395

Query: 374  NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLS----KTNPSVLTSDITT---IGNSLFF 424
             +    L+    G L +LT+  DGR V  L +     +    V+ S IT    IG  + F
Sbjct: 396  AEAGELLIVLHDGQLAVLTIRVDGRTVSGLSVKMVRREAGGDVIQSGITCLSRIGRQMLF 455

Query: 425  LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVNGEE 483
             GS   DS+++ ++   G +          G D+ AD       R    +   D  + + 
Sbjct: 456  AGSDQADSVVLGWSRKQGQTARRKPRANRAGLDLGADEEYFDDEREEGEELDDDEDDDDL 515

Query: 484  LSLYGSAS------NNTESAQKTFSFAVRDSLVNIGPLKDFSYG-------LRINADAS- 529
                 SA+      N T       SF + D L++I P++D   G       L   +D + 
Sbjct: 516  YGDGPSAAQTLGIDNTTGRGGDDLSFRIHDRLLSIAPIRDMVIGKPALVGELAKRSDQAT 575

Query: 530  ---------ATGISKQSNYELV------------ELPGCKGIWTV-----YHKSSRGHNA 563
                     A G  +     L+            E    + +WTV       ++ +G   
Sbjct: 576  IHSELNLVCAVGSGRAGALALLSREINPDPLGAFEFAEAQALWTVSSSKPIPRTIQGEKG 635

Query: 564  DSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQGRTIAA 614
             ++    Y+    +  Y+I++ E      ET+D+        +           G T+ A
Sbjct: 636  GATVGEDYESPAMHDKYMIVAKEDDDG-FETSDVYAVTASGFETLKGTEFEPAAGFTVQA 694

Query: 615  GNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
            G +   RR+IQV +   R  DG   ++Q L       +  +G+E   VL  SIADPY+LL
Sbjct: 695  GTMGRNRRIIQVLKSEVRCYDGDLGLSQILPM----VDEDTGAE-PRVLFASIADPYLLL 749

Query: 674  GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
               D S+ +   +       ++      +S K V+ C LYHD        KTS  A+L +
Sbjct: 750  IRDDASVLVAEMNKDFELEELERDDGSLASTKWVAGC-LYHDTAS--VFSKTSILAFLLS 806

Query: 734  GVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS--GRTHIVDTY 791
                                      SG   I+ +P+      V + ++   R  + D  
Sbjct: 807  A-------------------------SGTFYIYALPDLKQPVYVAEGLNYVPRLFLPDHT 841

Query: 792  MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTIL 851
            +R  +                   KE +  + V +L      A    P+L     +  + 
Sbjct: 842  VRRGMA------------------KEPLTEILVADLG----DAVSKAPYLIVRHANDDLT 879

Query: 852  CYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGA 909
             YQ               P+ T  SL   + S   L+  N  F+++P+ + + ++     
Sbjct: 880  IYQ---------------PLRTPSSLGSLSESLRFLKVPNPVFAKSPV-SISSDDASSQL 923

Query: 910  PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGF 969
                + + +NI G+   FL GS   + +   +       L   ++ + +  H  +    F
Sbjct: 924  RAMPLRVCENIGGYSTVFLPGSSASFVLKSAKSQPRVVSLQGTAVRSLSPFHTESSERSF 983

Query: 970  IYVTSQGILKICQLPSGSTYDNYWP-VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
            IYV  +G  ++C +P+G          +K+ L    + + Y      Y +  S       
Sbjct: 984  IYVDVEGSGRVCSMPAGWNLTELGVCARKVALDTDANALAYHPPTGTYAVGTSA------ 1037

Query: 1029 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1088
              + +  + ++  H+ D +  S+    R    E   + ++ P    G W T  T+ M+  
Sbjct: 1038 --LEAFELPKDDPHRADWNKESTA--FRPL-AERGRLLLMSP----GSWSTIDTVEMEPY 1088

Query: 1089 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QN 1143
            E  + V+ + L  +  T E + L+A+GTA  +GED+A RGRV +F        P     N
Sbjct: 1089 EVVMCVKTLNLEVSEATNERKQLVAVGTAISRGEDLAIRGRVYVFDVVSVIPEPGRPETN 1148

Query: 1144 LVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLY 1197
               ++ +KE   +GA++A++ +  QG +L+A G K ++   K  GT L  +AF D    Y
Sbjct: 1149 RKLKLIAKEDIPRGAVTAVSEIGTQGLMLVAQGQKCLVRGLKEDGTLLP-VAFMDMN-CY 1206

Query: 1198 VVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            V S   +      ++ D  K ++F  + E+  ++ L  K    L     + L DG  L +
Sbjct: 1207 VTSAKELPGTGLCVMSDAFKGVWFTGYTEEPYKMILFGKSNTRLHALNVDLLPDGKELFI 1266

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF-LRLQMLATSSDRTGA-- 1312
            VV+D   N+ +  + P+  +S +G  LL RA F  GAH +   L L    T +DR  A  
Sbjct: 1267 VVTDADGNLHVMQFDPEHPKSLQGHILLHRATFCTGAHFSTLSLLLPSTFTPADRPTANG 1326

Query: 1313 -------APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
                    P + +  +  LL G+  G +  + PL E  +RRL SL  +L  S+   AGLN
Sbjct: 1327 ETNGASSQPEAQQHQQHQLLLGSPTGLLASLVPLSESEYRRLSSLAGQLATSLTQTAGLN 1386

Query: 1366 PRSFRQFHSNGKAH-RPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
            P+ +R    +  A   PG D     S+VD  LL+ +  L    + EIA + G
Sbjct: 1387 PKGYRMTAGSAAATLAPGVDAAVGRSVVDGALLARWTELGSGRKGEIAGRVG 1438


>gi|9794904|gb|AAF98386.1| cleavage and polyadenylation specificity factor [Drosophila
           melanogaster]
          Length = 507

 Score =  237 bits (604), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 162/497 (32%), Positives = 258/497 (51%), Gaps = 49/497 (9%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29  NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
           L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83  LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
           I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
           ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
           SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
             L++E  ++E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508 SLVNIGPLKDFSYGLRI 524
           SL+N+ P+     G R+
Sbjct: 489 SLMNVAPINYMCAGERV 505


>gi|406699110|gb|EKD02327.1| cleavage and polyadenylation specific protein [Trichosporon asahii
            var. asahii CBS 8904]
          Length = 1339

 Score =  234 bits (596), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 315/1251 (25%), Positives = 519/1251 (41%), Gaps = 183/1251 (14%)

Query: 242  DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK-QHPLIWS 300
            ++K++KDF+F+ G+  P + +L     TWAGR      T  +   +I T+    +PLI S
Sbjct: 200  EIKNIKDFLFLPGFHSPTIALLFAPMNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLITS 259

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALN---NYAVSL--DSSQE 354
               LP D+  L+A PS +GGV+VV A+ I +  QS    + ++N   NY  ++  DSS E
Sbjct: 260  VTGLPSDSQYLVACPSEVGGVVVVTASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSYE 319

Query: 355  LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT-S 413
                S  + LD +HA ++  +  LL  +TG++  +    DGR V  + + + + +V   S
Sbjct: 320  ----SQKLALDNSHAQFVTENDMLLVLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPPS 375

Query: 414  DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
             +   G+   F+GS  GDSLL         S      +EE        P TK+      D
Sbjct: 376  TLVPAGSDGIFVGSVEGDSLLAMVEKARDQSA-----QEE--------PETKQQEMDVDD 422

Query: 474  ALQDMVNGE---ELSLYGSASNNTESAQKTFSFAVRD-------SLVNIGPLKDFSYGLR 523
              +++  G     +      S     A   F  AV D        LV IG     S G  
Sbjct: 423  WDEEVATGPVTVSVKAQDVLSGIGRIADMEFGIAVTDLGTRTYPQLVCIG---GGSQGST 479

Query: 524  INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
            +N       I+K+  +E  +L      W +  + +   NA   +     ++    +  + 
Sbjct: 480  MNVFRRGIPITKRRLFE--QLRTAVATWFLPVERA---NAPKFKDIPESEQSTIAIAATQ 534

Query: 584  EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
            E  T +   +    +V E +  F +   IA G    R R++ V      +LD        
Sbjct: 535  EGSTQIFALS--TRKVQERIAEFPEP-AIATGTWLRRTRIVLVLPSQVLLLD-------- 583

Query: 644  SFGPSNSES-GSGSENS---TVLSVSIADPYVLLGMSDGSIRLLVGD-----------PS 688
                SN+   G+  E S    +++ SIADPYVL+  +DGS+ + VGD           P 
Sbjct: 584  ----SNANPVGTICEMSDAPPIVAASIADPYVLIRRADGSVSVFVGDTVEGKWSEAPMPE 639

Query: 689  TCTVSVQTPAAI------------------ESSKKPVSSC----TLYHDKGPEPWLRKTS 726
               + V   A +                  E   KPV +        H  G E   R   
Sbjct: 640  GLALPVCQAAEVFTDTTGIYRTFEATQGVKEEPVKPVPTKQGQKAKIHLTG-EQLKRLQD 698

Query: 727  TDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTH 786
            +   +S  V       +     +G  +  +  +SG L+I  +P+F+ V   +        
Sbjct: 699  SKPAISADVATTESAFNAA---RGTQWIALLAQSGELQIRSLPDFDLVLQSNG------- 748

Query: 787  IVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILT 846
            + D+    +  D +T      EEG      + +  M    +  +       RP +  +  
Sbjct: 749  VYDS--EPSFTDDQTGELPELEEG------DEVSQMLFCPIGTRTL-----RPHVIVLHR 795

Query: 847  DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET- 905
             G +  Y+A     P  T  + D   + RSL+V      R R +    T L + T   T 
Sbjct: 796  SGRLNIYEAQ----PRFTVDARD--QSRRSLAV------RFRKVH---TQLLSVTPSSTV 840

Query: 906  -PHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG-SIVAFTVLHNV 963
             P   P      F +I G  G F++G RP W +        HP    G    A+      
Sbjct: 841  KPAAIP------FTDIEGLTGAFITGERPHWIISSDS----HPIRAFGLKQAAYAFCKTT 890

Query: 964  N-CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLY--PLIV 1020
            +   HG  ++  +    IC +P     D   P  +  ++ T   + +      Y     +
Sbjct: 891  HQGGHGEYFLRIEDGSFICYMPPTLNTDFAMPCDRYKMERTYTHVAFDPPSCHYVAAAAM 950

Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT 1079
            SVP  +  ++   +L+  E    +   N  SS++L   ++      R+L+          
Sbjct: 951  SVP-FQAYDEEGEILLGPEGPDLLPPKNERSSIEL---FSAGSEPFRVLD---------- 996

Query: 1080 RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNA 1138
                    +E  L V  VTL ++++       +A+GT    GED A  G V +F      
Sbjct: 997  --GYDFDQNEEVLCVESVTLESSSSPTGFRDFIAVGTGKNFGEDRATSGAVYVFEVVEVV 1054

Query: 1139 DNP---QNLVTEVYSKE-LKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDA 1193
                   N   +   K+  +  +SA+A++ G+++ ++GPKI+         L G+AF D 
Sbjct: 1055 GTKPGVSNWRLKYRCKDPTRNPVSAIANINGYIVHSNGPKILAKGLDYDDRLMGLAFLDV 1114

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
              +YV S+ + KN IL+GD  KS+ F S +E   +   + +D   L   A +FL+    +
Sbjct: 1115 S-MYVTSIRVFKNLILVGDFVKSLIFASLQENPYKFVTIGRDLADLSLTAADFLVHEGQV 1173

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
            + + +D+  N+++  + P   +S  G+KLL++ EF  G  VT    +    T+ +    A
Sbjct: 1174 TFITNDQHGNMRLVDFDPANPDSLNGEKLLTQTEFGTGCPVTASCMIARRKTAEEE--FA 1231

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
            P S       L++ T DG+I  +  + E  F+RLQ +Q +LV +  HVAGLNPR+FR   
Sbjct: 1232 PQSQ------LIYATADGAITSVVAVKEARFKRLQLVQDQLVRNAQHVAGLNPRAFRTVR 1285

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             N    RP    ++D  LL+H+ + PL  Q E+  Q GT    + S+L  L
Sbjct: 1286 -NDLVPRPLARGVLDGGLLAHFALQPLRRQREMMRQIGTDAVTVGSDLYTL 1335


>gi|348679545|gb|EGZ19361.1| putative cleavage and polyadenylation specificity factor CPSF
            [Phytophthora sojae]
          Length = 1752

 Score =  233 bits (594), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 171/600 (28%), Positives = 289/600 (48%), Gaps = 93/600 (15%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNH 967
            +T F N++   G F  G+ P W +  R +    P +C  +      +++FT  H+ NC +
Sbjct: 1159 LTTFYNVNNMSGAFFRGAHPMWILGDRGQPTFIP-MCSAAPKVSVPVLSFTPFHHWNCPN 1217

Query: 968  GFIYVTSQGILKICQLPSGSTY-----DNYWPVQKIPLKATPHQITYFA----------- 1011
            GFIY  S+G L++C+LPS  T         + +QK    AT H + Y             
Sbjct: 1218 GFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGNHGPGGVSEAL 1277

Query: 1012 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS-------------------- 1051
            E   Y ++ SV  +KP +   +  + ++   + +  NL +                    
Sbjct: 1278 EAPTYAVVCSVK-MKPTDAERATEV-EDADEEKEPENLDANGNPVGSNVMAPTAEMFPDF 1335

Query: 1052 -VDLHRTYTVEEYEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT----- 1103
             +D       E YE+R+++ +  G  W  R    +  +  E  L+V+++ L++++     
Sbjct: 1336 EIDQMAHTEEEVYELRLVQTNEFG-EWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKEE 1394

Query: 1104 --------TKENETLLAIGTAYV--QGEDVAARGRVLLFS---------TGRNADNPQNL 1144
                     K+    L IGT +V   GED + RGR+LL+           G +  +    
Sbjct: 1395 VASTSAEWNKKKRPYLVIGTGWVGPHGEDESGRGRLLLYELDYAQYVDEEGGSTSSKLPK 1454

Query: 1145 VTEVYSKELK-GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNI 1203
            +  V+ KE + GAIS++  L  ++L A G K+I++++   +L G AFYDA  +++V+LN+
Sbjct: 1455 LRLVFIKEHRQGAISSVVQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQ-MFIVTLNV 1513

Query: 1204 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
            VK+F++ GD++KS++FL W+E   QL LLAKD+  L   ATEF +    L+L+  D  +N
Sbjct: 1514 VKDFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEPLAVSATEFSVFEKKLALLAVDMDEN 1573

Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-----SDRTGAAPGSDK 1318
            + +  +AP+  ES  GQ+LL  ++FH+G  V    R ++         + R   AP S  
Sbjct: 1574 LHVMQFAPQDIESRGGQRLLRVSDFHLGVQVASMFRKRVDGPGGHVAVNGRGPRAPPSYY 1633

Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
             N    + G  +G +G + P+ E  FRRL +LQ  +V+++P    LNPR FR   +N + 
Sbjct: 1634 VN----VMGNSEGGVGALIPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQR 1689

Query: 1379 HRPGPDS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
                PD+          +D  +L  +  L    Q E+A   GTT   ++ NL ++   T+
Sbjct: 1690 RCGRPDAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVIHNLLEVQHATA 1749



 Score =  159 bits (403), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 159/628 (25%), Positives = 262/628 (41%), Gaps = 143/628 (22%)

Query: 235 VINLRDLD-MKHVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+L+ M  V D  F+ GY+EP +++LHE   + +  GR++    T  I+ +SI+  
Sbjct: 277 LLRLRELEIMGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 336

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
            + HP IW+  NLP D +KL    +P+GGV+V+ AN   Y +Q+    LA N +A     
Sbjct: 337 TRLHPKIWTVKNLPSDCFKLFPCRAPLGGVVVLSANAFLYFNQTQFHGLATNVFASKTVN 396

Query: 350 -------DSSQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
                  D+  E P    +   + L      +L     LL+   GD  +L++ Y+    +
Sbjct: 397 QSVFPLSDAVYETPDHEMAQLHIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDTSSR 456

Query: 400 RL----DLSKTNPSVLTSDITTIG-----------NSLFFLGSRLGDSLLVQFTCGSGTS 444
            L      S +  + L+  +   G               F+GSR GDS+L        TS
Sbjct: 457 GLYGFGGASSSRNASLSLRMLRSGIQAHCLCVNEEKKTLFVGSRSGDSVLYALDQKKLTS 516

Query: 445 MLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA--------SN 492
                 K    EE    E            +  A ++  + ++L LYG+A        S 
Sbjct: 517 AGGEASKQQEDEEMLIKEEVVKEEVTAEVKAEPAEEEEEDEDDLFLYGAAPTKEEPTTSG 576

Query: 493 NTESAQKTFSFAVR----------------------DSLVNIGPLKDFSYGLRINADAS- 529
           +TE+   T   AV+                      D L +IG +     G+  NAD++ 
Sbjct: 577 STEAVNGTNGSAVKKEENGHAVEEESGPYDYVLHQIDVLPSIGQITSIELGIENNADSNE 636

Query: 530 -------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAA 570
                  + G  +     ++            EL GC+ +WTV         +   R   
Sbjct: 637 KREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSSDGR--- 693

Query: 571 YDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
               Y+AYLI+S+  RTMVL T + +  + +   ++  G T+AA NLF ++R++Q+F++G
Sbjct: 694 ---SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQRIVQIFKQG 750

Query: 631 ARIL------------DGS----------------------YMTQDLSFGPSNSESGSGS 656
           AR++            DG+                        TQ+++        G   
Sbjct: 751 ARVMMEVPDEETSNGNDGAEKTAKPEDEEVDDEDDGPKVKLVCTQEITLEGDVECGGMNV 810

Query: 657 ENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP------------AAIES 702
           + +TV  +SV + DPY+LL ++DGS+RLL+GD     ++V  P               ++
Sbjct: 811 DTATVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDMELTVIDPEIDYLDGVTESNGTADA 870

Query: 703 SKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
           SK   SS  L++D     W      +AW
Sbjct: 871 SKHGSSSACLFYD-----WAGMFRENAW 893


>gi|353234640|emb|CCA66663.1| related to cleavage and polyadenylation specificity factor, 160 kDa
            subunit [Piriformospora indica DSM 11827]
          Length = 1324

 Score =  233 bits (593), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 328/1432 (22%), Positives = 596/1432 (41%), Gaps = 210/1432 (14%)

Query: 55   VPNLVVTAANVIEIYVVR---VQEEGSKES--KNSGETKRRVLMDGISAASLELVCHYRL 109
            V NLVV   N + IY VR     EE   ES  K+SG ++         +  L LV  + L
Sbjct: 36   VTNLVVGRNNRLRIYDVRRTIYTEETHVESDLKSSGPSRH--------SHRLCLVREHLL 87

Query: 110  HGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
            HG +  LA +           D ++++F+D+K++++E+ ++++ +   S+H +E    L 
Sbjct: 88   HGIIIGLAAVRTANPGLGSP-DRLLVSFQDSKLALMEWSNTLYDISTVSIHSYERSPLLL 146

Query: 170  LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL-VGDEDTFGSGGGFSA 228
                 E  A    ++ DP  RC  +++    + +L   Q  + L V D        G  +
Sbjct: 147  NSDFTECRA---YLRTDPANRCAALVMPRDNIALLPWYQPQTELDVQD--------GIQS 195

Query: 229  RIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTC 281
              E      S+V N+  +D  ++++ D +F+ G+  P + IL + + TW GR+       
Sbjct: 196  IAEELPYSPSYVTNVSAMDERIRNILDLVFLPGFNVPTIAILFQEQRTWTGRLKENKDNT 255

Query: 282  MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCAL 340
             +  +S+    + + +I +   LP+D+  +    + +GGVLVV AN+I H    S    L
Sbjct: 256  SLFFISLDLVSRSYQVIATIEKLPYDSLYMSPCHAKLGGVLVVTANSILHVDQASKITTL 315

Query: 341  ALNNYAVSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
             ++ +A  + D+S     +   + L+ +   ++ +   +LS   G  + + + ++GR V 
Sbjct: 316  PMSGWAARVSDTSHGFQDAVDDIHLEGSRMGYISDSQVILSLSNGKCLHIRIDHEGRTVW 375

Query: 400  RLDLSKT-----NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF 454
             L    T      PSVL +      + L FLGS  GDS+L ++                 
Sbjct: 376  GLTAVHTFGISSPPSVLIAK-----DGLVFLGSTAGDSVLFEYA---------------- 414

Query: 455  GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
                                 QD+ +  +  L     N +E+   +FS    D+L + G 
Sbjct: 415  ---------------------QDLSSHRDFML----PNASETIPTSFSLLPVDNLQDSGS 449

Query: 515  LKDFSY-GLRINADAS---ATGISKQSNYELV----------ELP---GCKGIWTVYHKS 557
                S+ GLR + + +   A G+     +  V          +LP   G +GIW     S
Sbjct: 450  YTAASFFGLRGSEEPALIAANGLDDLGGFSTVHKTMPLRLRKKLPAIAGRQGIW-----S 504

Query: 558  SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TIA 613
             R H  +   +    +      ++S +A T     + + T+    +D  +  R    TIA
Sbjct: 505  MRVHQGNGIELPLGHNT-----LLSTDA-TPTPGASRIATKSQARLDINITTRIPMLTIA 558

Query: 614  AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
                F    ++QV     R+L     T D S      +  + +  + +   +I DPYVL+
Sbjct: 559  VAPFFDGTHLLQVTSNSLRLL-----TTDGSEKQVIPDRDNSTARARIRHAAICDPYVLI 613

Query: 674  GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-----KGPEPWLRKTSTD 728
               D ++ L VG+P+   +  +  + +   K    + T Y D     K  E  +R+T   
Sbjct: 614  LREDDTLGLFVGEPTRGKLRRKDMSPLGDKKLCYWAATFYDDLTGRLKIDEDLMRETK-- 671

Query: 729  AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
                  VG           ++G+ +  +C  +G LEI+ +P    VF V     G +   
Sbjct: 672  -----AVG-----------NRGEKWLALCRSTGTLEIWSLPKLALVF-VSSISLGPS--- 711

Query: 789  DTYMREALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
                   LK D + E++S+++     G    +  + + +L     S H     L  +   
Sbjct: 712  ------VLKHDQKKEVDSATKTELPVG-ATTLQQVIITDLGEIEPSPH-----LIVLYDS 759

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVS-ASRLRNLRFSRTPLDAYTREETP 906
              ++ YQ      P    K+  P    RS+    +S   R+ +   + TP +  T   + 
Sbjct: 760  NLLIVYQMV----PLEPDKAGLPQLDRRSVPSLRISFVKRMVHHLANPTPDENQTSGGSN 815

Query: 907  HGAPCQRITIFKNISGH----QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHN 962
                 + I  F  +        G F++G  P W +       +H      ++ +FT    
Sbjct: 816  EKRLPKTIVPFSVLDWEGNSIYGAFVTGDNPAWILSKNHSGLLHLPCGYEAVHSFTPCSM 875

Query: 963  VNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
             + +  F+  T +G   + Q   G T+   +P  K     T   I Y    N   L+V+ 
Sbjct: 876  WDFSPTFLMSTEEGSC-LVQWTPGITFHGQYPCSKTRKGRTQTNIAY---SNTTGLLVAA 931

Query: 1023 PVLKPLNQVLSLLIDQEVGH--QIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1080
                  N    LL D+E  +  + D  N+S   L  +       + +L+P+     W T 
Sbjct: 932  SS----NDRDFLLFDEEGTNSWEPDGVNVSLPKLGAS------ALELLDPET----WVTI 977

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1139
                  ++E    V  V L   +T+  N+  +A+GT+  +GED+A RG   +F       
Sbjct: 978  DGYEFAANEVVNIVESVKLETLSTQTGNKEFIAVGTSIHRGEDLAVRGGTYIFEIAEVIQ 1037

Query: 1140 NPQ------NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYD 1192
            + +      + +  +   E KG ++A+  + G+L+ + G KI +  +   E L G+AF D
Sbjct: 1038 DTEERGRRRHRLKLLCKDEAKGPVTAVCGMNGYLVSSMGQKIFVRAFDLDERLVGVAFLD 1097

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
            A  +YV S+  +KN +++ D  K ++F++++E   +L +L+K+         +F    + 
Sbjct: 1098 AG-VYVTSIRCLKNLLVITDAIKGVWFVAFQEDPFKLVILSKEVRPTSIPQGDFFFAHND 1156

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
            + L+  D +  +++  Y P   ++ +G +LL   EF         +R+ M   SS     
Sbjct: 1157 MELLTIDLRGVLRLHSYDPTHVDTEEGARLLCSVEFQTHVEPVTIVRVAMEQPSS----- 1211

Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
                  ++   LL   +DGS+  ++PLD   F+RL  LQ +LV    H+A LNP+++R  
Sbjct: 1212 ---DSASDASRLLIPRVDGSLASLSPLDMDIFKRLYLLQAQLVRHTHHIAALNPKAYRAV 1268

Query: 1373 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              +    R     ++D  LL  ++ L  + Q  IA+Q G T   ++ +   L
Sbjct: 1269 QGSSTT-RTMSRRMLDFGLLVGFKKLSFDRQQGIANQIGETWETLIRDCTQL 1319


>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
          Length = 1377

 Score =  232 bits (592), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 326/1429 (22%), Positives = 585/1429 (40%), Gaps = 223/1429 (15%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV  Y L G +  L  +     D+    +++++A  +AK+S++E+D   H +  TS+H
Sbjct: 65   LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122

Query: 161  CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
             +E  + +++     + A  P  + VDP  RC  VL +G + + IL   Q G  LV    
Sbjct: 123  YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180

Query: 215  -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                              DE    +G  F     SS V+ +  L+  M H     F++ Y
Sbjct: 181  DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240

Query: 256  IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
             EP   IL+ +  T +  +  +      S  ++    +    + S   LP+D +K++ +P
Sbjct: 241  REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300

Query: 316  SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
             P+GG L++G+N  +H      + A+ +N +A    S     +S   + L+ +    L  
Sbjct: 301  PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360

Query: 373  QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
            +N   LL    G + +L+   DGR V  + L     +   S+L +  +    +     F 
Sbjct: 361  ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
            GS  GDS+L+ ++  S  +      +   G+I   +           D            
Sbjct: 421  GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480

Query: 478  MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
            M  G    + +S+ G+  N+       + F + D L N+GP++D + G      + D   
Sbjct: 481  MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533

Query: 531  TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
               S  +N ELV   G                            G  +VY K  +  +  
Sbjct: 534  PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593

Query: 565  SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
             S        Y  YL++S      + +++V   +    E T++ ++   + RTI  G L 
Sbjct: 594  GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653

Query: 619  GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
               RV+QV +   R  D G  + Q       +      SE  +V+  S ADPYVL+   D
Sbjct: 654  SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708

Query: 678  GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
             SI LL  D S      +T   I S+     S +LY DK                     
Sbjct: 709  QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDKY-------------------R 747

Query: 738  AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
            + +  +G P + Q D +   +      L +F +PN    VFT +                
Sbjct: 748  SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793

Query: 795  ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
               D   +I S+         +E I  + V +L      +    P+L    ++  +  Y+
Sbjct: 794  ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
             Y +     TS ++   S  R + ++N    +      S + ++ +    T    P +  
Sbjct: 847  PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895

Query: 915  TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             +  ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +
Sbjct: 896  -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954

Query: 975  QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
              ++++C+ P  + +D  W  +KI L      + Y +    Y +  +  V        +L
Sbjct: 955  DNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEYSSSSETYVIGTNQKV------DFNL 1008

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
              D E+  +  N  +S +       +++  V++L P      W    +  ++++E  + V
Sbjct: 1009 PEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMCV 1059

Query: 1095 RVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEV 1148
            + + L  +  T E +  + +GTA  +GED+AARG + +F   +     D P+    +  +
Sbjct: 1060 KCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEVIKVVPEVDRPETNRKLKLI 1119

Query: 1149 YSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV 1204
              +E+KGA+++L+ +  QG L+ A G K I+   K  G+ L  +AF D    YV  L  +
Sbjct: 1120 AKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKEL 1177

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
            K     G I   ++        A++ + + D         +FL D + L ++V+D+    
Sbjct: 1178 KG----GAITNCLF-------SARMTVPSSD-------DADFLPDENRLYILVADDD--- 1216

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA- 1323
                       S KG +LL R+ F  G   +    L   ATSS + G     D  +  + 
Sbjct: 1217 --------YPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQ-GPDADPDMMDLDSS 1267

Query: 1324 -----LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
                 +L  +  GSI  I P+ E ++RRL +LQ +L +++ H  GLNPR+FR   S+G  
Sbjct: 1268 GPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLANTLEHPCGLNPRAFRAVESDGIG 1327

Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
             R     +VD +L+  +  L  + + EIA++ G    +I ++L  +  G
Sbjct: 1328 GR----GMVDGDLVKRWLDLGTQRKAEIANRVGADVWEIRADLEAIGKG 1372


>gi|367018592|ref|XP_003658581.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
            42464]
 gi|347005848|gb|AEO53336.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
            42464]
          Length = 1547

 Score =  232 bits (592), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 325/1339 (24%), Positives = 529/1339 (39%), Gaps = 209/1339 (15%)

Query: 94   DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR------------DSIILAFEDAK 141
            D  +   L LV  + L G V  LA +    A+ +               DS+++AF DA+
Sbjct: 93   DRANTTKLVLVAEFPLAGTVTGLARIRTPKANRNHDGGAGHAGHAGHGCDSLLIAFRDAR 152

Query: 142  ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVL 195
            +S++E+D   H L   S+H +E  E       + S    PL      +  DP  RC  + 
Sbjct: 153  LSLVEWDAEQHTLSTISIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALK 206

Query: 196  VYGLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN----------------- 237
                 + IL   Q    + +GD D    G   +    S+ V+N                 
Sbjct: 207  FGARNLAILPFRQADEDIDMGDWDEELDGPRPAKDPSSNAVVNGASNIEDTPYSPSFVLR 266

Query: 238  LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
            L +LD  + H     F+H Y EP   IL              H   M+  L +    K  
Sbjct: 267  LSNLDPSLLHPVHLAFLHEYREPTFGILASATAPSNALGRKDHLVYMVFTLDLQQ--KAS 324

Query: 296  PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQE 354
              I S   LP D ++++ +P+P+GG L+VG+N  IH         +A+N       +   
Sbjct: 325  TTILSVSGLPQDLFRVVPLPAPVGGALLVGSNELIHVDQSGKPNGVAVNPMTRQCTNFGL 384

Query: 355  LPRSSFSVELDAAHATWLQNDVALLST--KTGDLVLLTVVYDGRVVQRLDLSKTNPSV-- 410
            + +S  ++ L+      L  D+  L      G   ++T   DGR V  L++     S   
Sbjct: 385  VDQSDLNLRLEGCAIDVLTPDLGELFVVLNDGRAAVVTFRIDGRTVSGLEIKMLPESAGG 444

Query: 411  -----LTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGLKEEFGDIEADAP 462
                   S ++ IG +  F G   GDSLL+ +      +G   L +      GD++A+  
Sbjct: 445  SLIPGRVSTLSRIGRNAVFAGREEGDSLLLGWAKRQAQTGRRRLRARDAAGSGDVDAEG- 503

Query: 463  STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-------------FSFAVRDSL 509
                L     D + +  + +E           ESA +               SF V D L
Sbjct: 504  --AELAEGDEDVVAEGEDEDEDEEDEDDLYGEESAPRQQPVSAASSFLSGDVSFRVHDRL 561

Query: 510  VNIGPLKDFSY----------------GLRINADASAT-GISKQSNYELV---------- 542
            +++ P++  +Y                G+R + +   T G  K +    V          
Sbjct: 562  LSVAPIQALTYSQPVYLAGSEEERNSAGVRSDLNLVCTVGRDKSAALATVNLAIQPRVIG 621

Query: 543  --ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYD--DEYHAYLIIS------LEART 587
              E P  +G WTV       KS +G  A +S    YD   +Y  ++I++       E   
Sbjct: 622  RFEFPEARGFWTVCAKKPVPKSLQGDKAGNSLSKDYDTAGQYDRFMIVAKVDLDGYEKSD 681

Query: 588  MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
            +   TA     +  +      G TI AG +    R+IQ+ +   R  DG +    +   P
Sbjct: 682  VYALTAAGFEGLGGTEFDPAAGITIEAGTMGKGSRIIQILKSEVRCYDGDFGLSQIV--P 739

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
               E  +G+E   V S SI DP++L+   D S  +   D S     +       +S K +
Sbjct: 740  MLDEE-TGAEPRAV-SASIVDPFLLIIRDDSSAFIAQVDSSNELEELDKEDPTLASTKWL 797

Query: 708  SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
            + C LY D          +T A+     G+      GG L Q  +   +   SGAL I+ 
Sbjct: 798  TGC-LYAD----------TTGAFAEEAPGK------GGKLSQ-SVLMFLLSASGALHIYR 839

Query: 768  VPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVEL 827
            +P+ +    V + +S        Y+   L       + S+ +GT    KE I  + V +L
Sbjct: 840  LPDLSKPVYVAEGLS--------YIPPGLS-----ADYSARKGTA---KETIAEILVADL 883

Query: 828  AMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRL 887
                    H  P L    T+  +  YQ + +    NT      +  S++L        +L
Sbjct: 884  G----DMTHKSPHLILRHTNDDLTLYQPFRY----NTGAG---LEFSKTLFF-----QKL 927

Query: 888  RNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
             N  F+++P +A   E T H      +    N+ G+   FL G+ P + +   + +    
Sbjct: 928  PNTVFAKSPEEADDDEAT-HQPRFLSMRRCANVGGYSTVFLPGASPSFIIKSSKSVPKVL 986

Query: 948  QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQ 1006
             L    ++A +  H   C HGFIY  S+ + ++ QLP   +Y      V+KIP+      
Sbjct: 987  PLQGTGVIAMSPFHTEGCEHGFIYADSRDMARVAQLPQDWSYAELGLAVRKIPIGEDIAA 1046

Query: 1007 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVR 1066
              Y      Y +  + P      +   L  D +   +    NL+        TV+   ++
Sbjct: 1047 AAYHPPMQSYVVGCNTP------EPFELPKDDDYHKEWARENLAF-----KPTVDRGNLK 1095

Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1125
            ++ P      W    +I M+  E  L V  + L  +  T E + L+A+GTA  +GED+  
Sbjct: 1096 LVSPIT----WTVVDSIQMEPCETVLCVECLGLEVSEFTNERKQLIAVGTAITKGEDLPT 1151

Query: 1126 RGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISALASL--QGHLLIASGPK 1174
            RGRV ++        P    T   SK+LK         GA++AL+ +  QG +L+A G K
Sbjct: 1152 RGRVYVYDIADVIPQPGRPET---SKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGQK 1208

Query: 1175 IILH--KWTGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLN 1230
             ++   K  G+ L  +AF D    YV +   +      L+ D  K ++F  + E+  ++ 
Sbjct: 1209 CMVRGLKEDGSLLP-VAFMDM-SCYVTAAKELPGTGLCLMADAFKGVWFTGYTEEPYKMM 1266

Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1290
            L  K    L+    +FL DG  L +VVSD   +I I  + P+  +S +G  LL R  F+ 
Sbjct: 1267 LFGKSATRLEVLNADFLPDGKELFIVVSDADGHIHILQFDPEHPKSLQGHLLLHRTTFNT 1326

Query: 1291 GAHVTKFLRLQMLATSSDR 1309
            GAH      L  + T +D+
Sbjct: 1327 GAHQPTKSLLLPVTTPADQ 1345



 Score = 48.1 bits (113), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 49/106 (46%), Gaps = 18/106 (16%)

Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR------------- 1370
            L+     G +  +  L E  +RRL SL  +L  S+PH AGLNPR +R             
Sbjct: 1418 LVLAAPTGVLAALRALPESAYRRLSSLAAQLAGSLPHAAGLNPRGYRLPDGVASSSSPWS 1477

Query: 1371 QFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
               S+  A  PG D     +IVD  LL  +  L +  ++E+A + G
Sbjct: 1478 SSSSSFSAVVPGVDAGVGRTIVDGALLQRFTELGMARRVELAGRAG 1523


>gi|429851266|gb|ELA26469.1| protein cft1 [Colletotrichum gloeosporioides Nara gc5]
          Length = 1411

 Score =  231 bits (588), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 323/1345 (24%), Positives = 545/1345 (40%), Gaps = 193/1345 (14%)

Query: 69   YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSR 128
            Y  R+ ++   ES   G     V  D  +   L LV  Y + G V  LA +      NS+
Sbjct: 66   YDRRLNDDDGLESSFLGGDGMLVRADRTNNTKLVLVAEYPIFGVVAGLARIK---IQNSK 122

Query: 129  RR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL----- 182
               +++++A   A++S++++D   H L   S+H +E  E         S   GPL     
Sbjct: 123  SGGEALLIATRVARLSLVQWDPEKHALEDVSIHFYEKEEL------EGSPFDGPLSNYPT 176

Query: 183  -VKVDPQGRCGGV---------LVYGLQMIILKASQGGSGLVGDED---------TFGSG 223
             +  DP  RC  +         L + L    +        + G            T G+ 
Sbjct: 177  HLAADPGSRCAALRFGSRYIAFLPFKLNDEDIDMDDWDEDVDGPRPAKEPSATAATNGTS 236

Query: 224  GGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTC 281
                    +S+V+ L  LD  + H     F+H Y EP   I+   +          H + 
Sbjct: 237  NLADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSMQRRSNTLPRKDHFSY 296

Query: 282  MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
             +  L +    +    I S  NLP D +K++A+P PIGG L+VG N  IH         +
Sbjct: 297  KVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPIGGALLVGTNELIHIDQSGKPNGV 354

Query: 341  ALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVV 398
            A+N +     S     +S   + L+  +   +  +N   L+    G L ++    DGR V
Sbjct: 355  AVNAFTKETTSFPLADQSELDLRLEHCYIEQMSPENGELLMVLSDGRLAIIAFKIDGRTV 414

Query: 399  QRLDL----SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
              L +    ++   +V+    S I+ +  + FF+GS   DSL+V      G +   +   
Sbjct: 415  SGLSVRIVPAEAGGNVVQCGASSISRLSKNAFFIGSTGSDSLVV------GVTRKQTQNA 468

Query: 452  EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS-LYGSASNNTESAQKTFSFAVRDSLV 510
             +   +  D+ +         D   D + GE  + +  S + N        SF V DSL+
Sbjct: 469  RKKTRLVDDSFADDLEDEDIDDDDDDDLYGETTTTVQSSTAANGVPKGGEISFRVHDSLL 528

Query: 511  NIGPLKDFSYG--------------------LRINA-----DASATGISKQSNYELV--- 542
            ++ P+KD + G                    L++ A     +A+A  I  Q+    V   
Sbjct: 529  SLAPVKDMTTGKQAFIPESEDEKNSVGVVADLQLAAAVGKGNAAAIAIMNQNIQPKVIGK 588

Query: 543  -ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETA 593
             E P  +G WT  V     +    D    AA   E+ A      ++I+S +      ET+
Sbjct: 589  FEFPEARGFWTMCVQKPIPKSLQGDKGANAAVGSEFDASSIYDKFMIVS-KVDLDGYETS 647

Query: 594  DLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSF 645
            D+        + F         G T+ AG +    R+IQV +   R  DG   ++Q L  
Sbjct: 648  DVYALTGAGFEAFTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQILPM 707

Query: 646  GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
               + E+G+      V+S SIADPY+LL   D SI +   D +     ++       S +
Sbjct: 708  --LDEETGA---EPRVVSASIADPYLLLVRDDASIMVAQIDNNNELEEMEKQDDTILSTQ 762

Query: 706  PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
             ++ C LY D                +TGV   I    G P  Q  I+  +    GAL I
Sbjct: 763  WLAGC-LYTD----------------TTGVFAPIQTDKGTPESQ-SIFMFLLSAVGALYI 804

Query: 766  FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
            + +P+ +    V    +G T+ V  ++           + +   GT Q   E +  + V 
Sbjct: 805  YALPDLSKPVYV---AAGMTY-VPPFL---------SADYAVRRGTVQ---ETLTEVLVA 848

Query: 826  ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
            +L      A  S P+L     +  I  Y+    E        D     +++L    ++  
Sbjct: 849  KLG----DATESSPYLILRHANDDITIYEPIRLE------SQDKSEGLAKTLHFQKIT-- 896

Query: 886  RLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
               N   +++P++    +  E P   P +      NI+G+   FL G+ P + +   +  
Sbjct: 897  ---NPALAKSPVEVADDDANEQPRFVPLRPCA---NINGYSTVFLPGASPSFIIKSAKSA 950

Query: 944  RVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKAT 1003
                 L    +   +  H   C  GFIY  S+G  ++ QLP+ ++++    ++KIP+   
Sbjct: 951  PKVLGLQGIGVRGMSSFHTEGCERGFIYADSEGHTRVTQLPADTSFELGVSIRKIPVGDA 1010

Query: 1004 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY-TVEE 1062
               I Y      Y +  SV   +P           E+    D H   + +   T+  +E 
Sbjct: 1011 IGLIAYHPPMETYAVACSVS--EPF----------ELPKDDDYHKEWAKETITTFPQMER 1058

Query: 1063 YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGE 1121
              +++L P      W    T+ +   E A+ ++ + L  +  TKE   L+AIGTA  +GE
Sbjct: 1059 GIIKLLSP----ATWSVIDTVELDPHEVAMCMKTLHLEVSEETKERRMLIAIGTAINRGE 1114

Query: 1122 DVAARGRVLLFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGP 1173
            D+  RGR+L++        P     N   ++ +KE   +GA++AL  +  QG +L+A G 
Sbjct: 1115 DLPIRGRILVYDVVPVVPQPGRPETNKKLKLVAKEEIPRGAVTALCEVGSQGLMLVAQGQ 1174

Query: 1174 KIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQL 1229
            K ++   K  GT L  +AF D    YV S+  V+   + L+ D  K ++F+ + E+  ++
Sbjct: 1175 KCMVRGLKEDGTLLP-VAFMDMS-CYVTSVREVRGTGYCLMADAFKGVWFVGYAEEPYKI 1232

Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
             L  K  G  +    +FL+DG  L +VV D+   I +  + P+  +S +G  LL+RA F 
Sbjct: 1233 MLFGKSTGKFEVLTADFLVDGDELHIVVCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFS 1292

Query: 1290 VGA-HVTKFLRLQMLATSSDRTGAA 1313
                H T  L L    T++    A+
Sbjct: 1293 AAPNHPTATLSLPRTTTTAQSASAS 1317


>gi|378734083|gb|EHY60542.1| histone H2A [Exophiala dermatitidis NIH/UT8656]
          Length = 1361

 Score =  231 bits (588), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 314/1393 (22%), Positives = 553/1393 (39%), Gaps = 249/1393 (17%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLR 155
            S   L LV  Y L G + SL  +      NS+   D++++AF DAK+S++E+D ++H + 
Sbjct: 49   SETKLVLVAEYNLAGTITSLGRVK---IPNSKSGGDAVLVAFRDAKLSLIEWDPALHSIS 105

Query: 156  ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
              S+H +E  +   +    +       + VDP  RC         + I+   Q    L  
Sbjct: 106  TLSIHYYEHHDLQSIPWQPDLSKCVSHLTVDPSSRCAAFNFGVSNLAIIPLHQVRDELAM 165

Query: 216  DE---------DTFGSGGGFSARIES-------SHVINLRDLD--MKHVKDFIFVHGYIE 257
            DE         +     G    + +S       S V+ L  LD  + H  D  F+H Y +
Sbjct: 166  DEFDEVDGEVKERLSPDGQNENKHDSPDTPFKPSFVLPLTALDPGLLHPVDMAFLHEYRD 225

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            P + IL+      +     +    + +  ++    K    + S   LP+D Y+++A+P P
Sbjct: 226  PTVGILYSTAARSSNMNHERRDVTIYAVYALDIGQKASTALQSVQKLPNDLYRVMALPPP 285

Query: 318  IGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
            +GG L++G N  IH      + A+A+N  A    S      +++ ++L+      L N  
Sbjct: 286  VGGALLIGGNELIHIDQSGKTIAIAVNELAKEASSFPMADHANYRLKLEGCQIEHLGNPS 345

Query: 377  A--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV-------LTSDITTIGNSLFFLGS 427
               L+  KTG+L LL+   DGR+V  + L +             ++  T +G++  F+GS
Sbjct: 346  GDMLVILKTGELALLSFRMDGRMVSSMALRRVGEGQSQGLALGASTCSTNLGSNRLFIGS 405

Query: 428  RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-----LQDMVNGE 482
               DS+L+    G  T+ L                +  R++  + DA      ++    +
Sbjct: 406  EESDSILL--ATGRKTTQLRR--------------TNSRIQSQADDAGLFDDNEEDGIED 449

Query: 483  ELSLYGSASN----NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ-- 536
            E  LY   ++    N  +     +F + D L +I P+ D +        A  + +++Q  
Sbjct: 450  EDDLYAELADELNGNASTDVSGHNFRLLDRLPSIAPINDVALANVGKRRAEESEVTRQEL 509

Query: 537  ---------------------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
                                 S    ++     G+W  +   +RG         A ++ +
Sbjct: 510  AVAYGRGHAGGLAFLSRKLEPSVTRQIKFERPIGVW-CFSSGNRGQQG------AEEENF 562

Query: 576  HAYLIISL-----EARTMVLETAD-LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
               ++IS        RT +L   D  L  + ES      G  I    L      IQV   
Sbjct: 563  DDLVMISQTTDDGAGRTKLLRLIDGDLNSMGESEFDESAGAAIGVFKLEATNHTIQVLPT 622

Query: 630  GARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPST 689
              R+ D  +    + F   + E G   + +  + VS  DPY+++   DGS+ LL  D + 
Sbjct: 623  ELRVYDAGFALSQI-FPIVDEEEG---QTARAVKVSFVDPYLVVVKDDGSMSLLKADKAG 678

Query: 690  CTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ 749
                V+ P  + +    + S TLY D           TD    T           G    
Sbjct: 679  ELDEVELPENLRAWS--ILSATLYQD-----------TDDMFQTSRFY------NGTATP 719

Query: 750  GDIYSVVCYESGALEIFDVPNFNC-VFTVDKFVSGRTHIV-DTYMREALKDSETEINSSS 807
            G I +++  + G   +  +PN +  VF  D      TH++ D  + +  ++         
Sbjct: 720  GPILTILT-QDGHFCLLSLPNVSIQVFQCDSLPFLPTHLMQDLQLPKHWRN--------- 769

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
                    K+++  + + +L     ++   +P+L      G ++ Y+++           
Sbjct: 770  --------KDDLGEVLLADLG----NSTDRQPYLVVRNLVGDVIIYESFAMP-------- 809

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
             D + + R   V   +A  L +             EE    +  Q +    N++GH   F
Sbjct: 810  -DVLGSFRFKKVFTKAAGELED------------GEEVGQPSTLQPMQAVTNVAGHASVF 856

Query: 928  LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGS 987
            + G +P   M     +    +L    + +   +H   C  G + V +   +K C +P  +
Sbjct: 857  IPGRQPLLIMREASTMPRVYELNPTKLKSMNSVHTGTCRQGLVLVDADDEIKFCNIPDST 916

Query: 988  TYD-NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1046
                + W ++++PL      + YFA  + Y L                           N
Sbjct: 917  VLGLSDWVIRRVPLGQDITSVAYFAPTDSYILAT-------------------------N 951

Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT--IP--MQSSENALTV-------- 1094
            H                E ++ + D     WQ  AT  +P  +QSS   L+         
Sbjct: 952  HT--------------TEFQLPQDDEWHPEWQGEATKFLPSSIQSSLKLLSAKTHSIISQ 997

Query: 1095 -------RVVTL------FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NA 1138
                   RV+ L       +  T E + L+ +GTA V+GE+V  RG + +F         
Sbjct: 998  YSFDACERVLCLESLNLEVSEETHERKDLIVVGTAIVKGENVTTRGNLYIFDVVDVVPEP 1057

Query: 1139 DNPQ-NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILHKWT-GTELNGIAFYDA 1193
            D P+ +L  ++ +KE ++GA+SAL  +  QG LL A G K ++        +  +AF D 
Sbjct: 1058 DRPESDLKIKLITKEDVRGAVSALCDIGSQGFLLAAQGQKSMVRGLKEDMSILPVAFLDM 1117

Query: 1194 PPLYVVSLNIV-KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
                 V+  +      +LGD    ++ + + E+  +L +L +D       A EFL DG  
Sbjct: 1118 RYYVHVARELPGTGLCILGDAFSGLWLVGYSEEPYKLQILGRDLEDPPVLAAEFLPDGKQ 1177

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDR-- 1309
            L ++ SD+   +++  Y P+  ++ +G KLL R+ FH GA  TK + L   +A+   R  
Sbjct: 1178 LYIISSDDDGLLRVLQYDPENPKAERGTKLLLRSTFHSGAAPTKMILLPPQVASGRGRDP 1237

Query: 1310 -------TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP-HV 1361
                   +GA P +    R  +L  T +GS+  + PL E T+RRL +LQ  L+ ++  H 
Sbjct: 1238 EIDMDVDSGAGPAA---GRHRILVTTQEGSLCMLTPLSEATYRRLSALQTTLLTTLDFHP 1294

Query: 1362 AGLNPRSFRQFHS 1374
              LNPR++RQ  +
Sbjct: 1295 CSLNPRAYRQVET 1307


>gi|58702050|gb|AAH90169.1| LOC564406 protein, partial [Danio rerio]
          Length = 416

 Score =  229 bits (584), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 129/342 (37%), Positives = 203/342 (59%), Gaps = 14/342 (4%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 66  LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L   +     + DE   
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT---LADEQEG 175

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
             G G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ + 
Sbjct: 176 IVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
            TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS   
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPP 295

Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
             ++LN+      +    P+    + LD + A+++ +D  ++S K G++ +LT++ DG R
Sbjct: 296 FGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMR 355

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            V+     K   SVLT+ + T+     FLGSRLG+SLL+++T
Sbjct: 356 SVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT 397


>gi|322694449|gb|EFY86278.1| Cleavage factor two protein 1 [Metarhizium acridum CQMa 102]
          Length = 1431

 Score =  229 bits (583), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 329/1387 (23%), Positives = 552/1387 (39%), Gaps = 182/1387 (13%)

Query: 72   RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
            R  ++   ES   G     V  D      L L+    L G V  LA +     +     +
Sbjct: 70   RANDDDGLESSFLGVESLIVRADPSHNTKLVLISEIPLAGTVIGLARVKI--KNTPSGGE 127

Query: 132  SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
            +++LA++ AK+ + E+D   H L  TS+H +E  E   L+        G  V   + DP 
Sbjct: 128  ALLLAYKAAKMCLTEWDPQRHTLETTSIHYYEKDE---LQGAPWEMPFGDYVNYLEADPG 184

Query: 189  GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
             RC         + IL  +Q    L  D   ED             T G G G      +
Sbjct: 185  SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPCPVKEEPPTTNGDGPGDHDLVKS 244

Query: 229  RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
            R   S V+ L  LD  + H     F+H Y EP   IL   +          H T  +  L
Sbjct: 245  RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSPALGIKDHLTYKVFTL 304

Query: 287  SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
             +    +    I S   LP D ++++A+P+P+GG L+VG N  IH         +A+N+ 
Sbjct: 305  DLQQ--RASTTILSVTGLPQDLFRVIALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362

Query: 346  AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
            A  + S   + +S   + L+      L ND+   LL    G L ++    DGR V ++ +
Sbjct: 363  AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIICFHIDGRTVSKISI 422

Query: 404  ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
                ++   +++ S ++ I   G++  FLGS   DS+++ ++   G        K +   
Sbjct: 423  RLVSAECGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476

Query: 457  IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
            +     +         D   D + G + SL   S + N  S     SF V+D+L++I P+
Sbjct: 477  LLDPDLALDVDDLDLDDDEDDDLYGNDSSLAKPSQTINGSSKPGEVSFRVQDTLLSIAPI 536

Query: 516  KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
            +D + G      D+    +SK    EL                             + P 
Sbjct: 537  RDVACGAPAFVPDSEEATLSKGVTAELELACAVGRGFSGSVAILNREIQPKVIGRFDFPE 596

Query: 547  CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
             +G WT+  K      A  +       +Y  Y+I++       E   +   TA     + 
Sbjct: 597  ARGFWTMCVKKPLSKGAAVASDYDTTAQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656

Query: 601  ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
            ++      G T+ AG +  + R+IQV +   R  DG   ++Q L   P   E       +
Sbjct: 657  DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713

Query: 660  TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
            T  S SI DPY+LL   D SI +     +     V  P     S K  S C LY+D    
Sbjct: 714  T--SASIVDPYLLLNRDDSSIFIAQIHSNNELEEVFKPDGTLKSTKWASGC-LYND---- 766

Query: 720  PWLRKTSTDAWLSTGVG-EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD 778
                   T     + V  +  D AD        I   +   +GAL ++ +P+      V 
Sbjct: 767  -------TQGIFQSNVNKQKADAAD-------RIMMFLLSSAGALHVYALPD------VS 806

Query: 779  KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR 838
            K +         ++ EAL      ++++     G   KE+I  + V +L      A    
Sbjct: 807  KPI---------FVAEALTSIPPFLSAAFVARKG-ASKESITEILVADLG----DAISQT 852

Query: 839  PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
            P+L        +  Y+      P    +  D   ++  L    V+ S  +       P  
Sbjct: 853  PYLIVRHASDDLTIYE------PVRCQEEGDAELSASLLFKKCVNTSLAKT-----APEV 901

Query: 899  AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
            +    E P   P +R     N++G+   FL G+ P + +           L    +   +
Sbjct: 902  SEDDAEPPRFVPLRRCA---NVNGYGAVFLPGASPSFVLKSSHSEPRVIGLQGLGVRGMS 958

Query: 959  VLHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFAEKNLYP 1017
              H   C+ GFIYV  +GI ++ QLPS +++ D    V+KI L      I+Y      Y 
Sbjct: 959  TFHTEGCDRGFIYVDVEGIARVTQLPSNASFTDLGVSVKKIALDGDVGMISYHHPTGTY- 1017

Query: 1018 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGP 1076
             +V+   L+P           E+    D H   +   +    T     ++++ P      
Sbjct: 1018 -VVACTKLEPF----------ELPRDDDYHKEWAKETIKFPPTTARGILKLINPVT---- 1062

Query: 1077 WQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1135
            W     + ++  E+  +++ + L  +  TKE + L+A+GTA  +GED+  RGRV +F   
Sbjct: 1063 WTVIHELELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQVFDIV 1122

Query: 1136 RNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTEL 1185
                 P     N   ++ +KE   +G ++AL+ +  QG +L+A G K ++   K  G+ L
Sbjct: 1123 TVIPEPGRPETNKRLKLIAKEEIPRGGVTALSEVGAQGLMLVAQGQKCMVRGLKEDGSLL 1182

Query: 1186 NGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
              +AF D    +V S+  +      ++ D+ K ++F  + E+     +L K  G L    
Sbjct: 1183 P-VAFLDM-NCHVASVKELPGTGLCVMADVFKGLWFAGYTEEPYTFKILGKSSGKLPLLV 1240

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
             +FL DG  LS+V  D + ++ I  + P+  +S +G  LL R  F     VT       L
Sbjct: 1241 ADFLPDGEDLSMVAVDAEGDMHILEFNPEHPKSLQGHLLLHRTSF----AVTPNTPTSTL 1296

Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
                  + + P +  ++   LL     G +  ++PL E T+RRL S+  +L  ++    G
Sbjct: 1297 LLPRTHSPSYPHASSSSHM-LLLACPSGQVAALSPLAESTYRRLLSVTNQLHPAIVAHCG 1355

Query: 1364 LNPRSFR 1370
            L+ ++ R
Sbjct: 1356 LHTKAHR 1362


>gi|147799623|emb|CAN68460.1| hypothetical protein VITISV_027523 [Vitis vinifera]
          Length = 558

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 119/175 (68%), Positives = 142/175 (81%), Gaps = 5/175 (2%)

Query: 424 FLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
           F GS+LGDSLLVQFT     S+ SS +++  GDIE B PS KR RRSSSDALQDMVNG++
Sbjct: 331 FEGSQLGDSLLVQFT-----SIPSSSVEKRVGDIEGBVPSAKRSRRSSSDALQDMVNGDK 385

Query: 484 LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE 543
           L LYGSA N+TE++QKTFSF+V DSL+++GPLKDF+YGLRINAD  ATGI KQ     VE
Sbjct: 386 LPLYGSAPNSTETSQKTFSFSVNDSLIDVGPLKDFAYGLRINADLKATGIVKQKMITEVE 445

Query: 544 LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
           LPGC+ IWTVYHK++RGHNADS++M   DDEY AYLIIS E+RTMVLET +LL E
Sbjct: 446 LPGCERIWTVYHKNTRGHNADSTKMITKDDEYCAYLIISPESRTMVLETVELLGE 500


>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Hydra magnipapillata]
          Length = 1122

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 193/664 (29%), Positives = 305/664 (45%), Gaps = 124/664 (18%)

Query: 57  NLVVTAANVIEIYVV----RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           NLV      + +Y +     V  +G + SK         ++D +    LEL+  + L GN
Sbjct: 29  NLVTAGGQRLNVYRLCDADMVVSDGDQSSK---------IVDSVGKRRLELLASFTLFGN 79

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           + ++ ++  G    S  RDS++LAF+ AK+S++EFD   H L+  SMH FE+ E+   K 
Sbjct: 80  IINMQVVRLG----SNVRDSLLLAFKHAKLSIVEFDPLSHDLKTDSMHYFENDEF---KG 132

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
           G       PLV+VDP+ RC  +L+Y   +++L        +  DE    S G     +  
Sbjct: 133 GLSHNIYLPLVRVDPEQRCACMLIYNRHLVVLPFKHD---IKLDESEELSDGEHIKSVLP 189

Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S++I+L  L+  + ++ +  F+HGY +P ++ L E   T  GRV+ +  T  +SA+S++ 
Sbjct: 190 SYMIDLHSLEQPLLNITELQFLHGYHQPTLMFLFEPVQTSTGRVAVRQDTFCVSAISLNM 249

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
           T K HP+IWS  NLP D + L  +  PIGGVLV  +N++ Y +QS      +  Y VSL+
Sbjct: 250 TEKVHPVIWSVTNLPFDCHMLRPIEKPIGGVLVFASNSLIYLNQS------IPPYGVSLN 303

Query: 351 SSQE----LP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
           S  E     P   +    + L  +    +  D  +LS K G++ +L+++ DG R V+   
Sbjct: 304 SITEGSTMFPLKIQEDVVITLAESSCDAIATDQFILSLKGGEIYVLSLLSDGLRTVRSFH 363

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
             K   SVL S +  I +   FLGSRLG+SLL+++T                   E D+ 
Sbjct: 364 FEKAAGSVLASCVCWIEHGFVFLGSRLGNSLLLRYT-------------------EKDSA 404

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
           S     +S    ++ M  G                         DSL+NIGP+   + G 
Sbjct: 405 SIA--EKSKEAKVEKMYGGGVGGGIIVC----------------DSLLNIGPITKAALGE 446

Query: 523 RINADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHK 556
                    G S+Q + E+V                          ELPGC  +WTV  K
Sbjct: 447 PAFLSEEFFG-SRQIDLEMVCCSGYGKNGTLTVLQRSIRPQVVTTFELPGCVNMWTVCGK 505

Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
           SS+             + YH+YLI+S +  TMVL+T   +TE+  S  + VQ  TI A N
Sbjct: 506 SSKESV----------ENYHSYLILSRDDSTMVLKTGAEITELDNS-GFNVQQPTIFACN 554

Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
               + ++QV  +   +L+ +     +S            +   +   SI+DPYV++  S
Sbjct: 555 HLSNKYILQVCPQSIHLLEDTVQINSISL----------QDTIKITQCSISDPYVVMVDS 604

Query: 677 DGSI 680
            G +
Sbjct: 605 TGQL 608



 Score =  224 bits (570), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 128/380 (33%), Positives = 215/380 (56%), Gaps = 18/380 (4%)

Query: 1060 VEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTA 1116
            +E + V ++ P      W+T   +   +Q  E+   ++V+ L +       +  L +GT 
Sbjct: 753  IERFVVSLISP----TSWETVPNSRTVLQEFEHVTCMKVLLLHSELVDIGLKQYLVVGTT 808

Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIAS 1171
            +  GED+A +GR+L+F        P   +T+     VY KE KG ++A+ +  G+++ A 
Sbjct: 809  FNYGEDLACKGRILIFDVLEVVPEPGQPLTKTKCKCVYDKEQKGPVTAICATSGYIIAAV 868

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
            G KI   K+   +L G+AF D+  ++ V+L  ++N I+  DI +SI  + ++ +   L L
Sbjct: 869  GQKIYAFKYKDNDLVGVAFVDSQ-VFTVNLMAIRNVIVAADISRSISLVRFQVEHKSLAL 927

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
            +++D  +L+ + +EF IDGS +  VVSD ++NI IF Y P+  ES+ G +LL +A+ ++G
Sbjct: 928  VSRDTKTLEAYTSEFFIDGSQVGFVVSDAERNIVIFSYQPEALESFGGHRLLQKADINIG 987

Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
            +HV   +R++++    D    +  S++  R  ++  TLDGSIG + PL E  FRRL  LQ
Sbjct: 988  SHVNTMMRIKLI---QDEQSLSKSSEQ--RQLIILPTLDGSIGILFPLSEKPFRRLTMLQ 1042

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             KLVD +PH AGLNPR+FR      +       +I+D +LL  Y  L  +E+ +IA + G
Sbjct: 1043 NKLVDCLPHKAGLNPRAFRALDVPLRTLTNPHRNILDGQLLDKYAQLSFQERFDIAKKMG 1102

Query: 1412 TTRSQILSNLNDLALGTSFL 1431
            TT  QIL ++ D+   ++ L
Sbjct: 1103 TTSGQILDDMMDIERASNHL 1122


>gi|46120520|ref|XP_385083.1| hypothetical protein FG04907.1 [Gibberella zeae PH-1]
          Length = 1436

 Score =  228 bits (580), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 338/1441 (23%), Positives = 565/1441 (39%), Gaps = 205/1441 (14%)

Query: 72   RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
            R  ++   ES   G     V  D  +   L LV    L G V  LA +       GG   
Sbjct: 68   RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124

Query: 127  SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
                +++++A++ AK+ +  +D     L   S+H +E  E LH      SF      ++ 
Sbjct: 125  ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEKEE-LHGAPWEVSFDEYANYLEA 179

Query: 186  DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIES- 232
            DP  RC         + IL   Q    L  D+            +T     G S  +E  
Sbjct: 180  DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETAAVANGDSDTVEPP 239

Query: 233  ---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
               S V+ L  LD  + H   F F+H Y EP   IL   +          H T  +  L 
Sbjct: 240  YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLD 299

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
            +    +    I S  +LP D +K+LA+P+P+GG L++G N  IH      +  +A+N+ A
Sbjct: 300  LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357

Query: 347  VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
              + S     ++  ++ L+        ++N   LL    G + +++ + DGR V  L + 
Sbjct: 358  RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSIK 417

Query: 405  ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
                +   +VL S  +T   +G + FF+GS +GDS+++ +T   G        K    D 
Sbjct: 418  MVTDENGGNVLKSRASTASKLGKNTFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474

Query: 458  EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
            +      +       D   D+   E  +   + + N        SF + D+L++I P+KD
Sbjct: 475  DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534

Query: 518  FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
             + G                + +D   A   G  K  +  ++            E P  +
Sbjct: 535  LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594

Query: 549  GIWTVYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETADLLT----- 597
            G WT+  K         S  A   DEY A      Y+I++ +      ET+D+       
Sbjct: 595  GFWTMSVKKPLPKALGGS--AGVGDEYEAFGQHDKYMIVA-KVDLDGYETSDVYALTGAG 651

Query: 598  -EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGS 654
             E  +  ++    G T+ AG +  + R+IQV +   R  DG   +TQ L     + E+G+
Sbjct: 652  FETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA 709

Query: 655  GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
                  V S SI DPY+LL   D S+ L   D +     V+   A   + K  + C LY 
Sbjct: 710  ---EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYA 765

Query: 715  DKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-C 773
            D                 T      +  D G  ++  I   +   +GAL ++ +P+ +  
Sbjct: 766  D-----------------TEGAFQFNANDKGETEK--IMMFLLSSTGALHVYALPDLSKP 806

Query: 774  VFTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
            V+  +       H+  D  +R  L                   KE +  + V +L     
Sbjct: 807  VYVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG---- 844

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLR 888
                  P+L        +  Y+               P+   R    SN+SA+    ++ 
Sbjct: 845  DTISQSPYLILRNQTDDLTIYE---------------PIHHVRPGGESNLSAALSFKKMS 889

Query: 889  NLRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
            N+  + TP      + E P   P +R     NI+G+   FL GS P + +   + +    
Sbjct: 890  NVTLATTPAQTEDDDVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVI 946

Query: 948  QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQ 1006
             L    I   +  H   C+ GFIY   +GI ++ Q PS + +      V+K+PL +    
Sbjct: 947  GLQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRG 1006

Query: 1007 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEV 1065
            I Y      Y  I      +P           E+    D H   +   L    T+    +
Sbjct: 1007 IAYHQPTGAY--IAGCMTSEPF----------ELPKDDDYHKEWAKETLSFPPTMPRGVL 1054

Query: 1066 RILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVA 1124
            +++ P      W     I ++S E+   ++ + L  +  TKE   L+A+GTA  +GED+ 
Sbjct: 1055 KLISPIT----WTVIHDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLP 1110

Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKII 1176
             RGRV ++        P    T    K +      +G ++A++ +  QG +L+A G K +
Sbjct: 1111 IRGRVHVYDIVTVIPEPGKPETNRRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCM 1170

Query: 1177 LH--KWTGTELNGIAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLA 1233
            +   K  G+ L  +AF D       +  + +    L+ D  K ++F  + E+     +L 
Sbjct: 1171 VRGLKEDGSLLP-VAFLDMSCHVSSARELSRTGLCLMADAFKGVWFAGYTEEPYTFKVLG 1229

Query: 1234 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
            K  G L     +FL DG  L++V +D   ++ I  + P+  +S +G  LL R  F V  +
Sbjct: 1230 KSHGRLPVVVADFLPDGDDLAIVAADVDGDLHILEFNPEHPKSLQGHLLLHRTSFSVSPN 1289

Query: 1294 -VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
              +  L L      S  T   P         LL  +  G +  + PL E  +RRL S+  
Sbjct: 1290 PPSTTLLLPRTTPPSHPTPQDP------PHVLLLASSSGHLSSLIPLPETAYRRLLSVTN 1343

Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
            +L+ ++    GLN ++ R             G  +IVD  +L+ +  L   ++ EIA + 
Sbjct: 1344 QLLPALTPHGGLNAKAHRLPVGTRTVGVEAAGGRAIVDGAVLARWAELSAAKRAEIAGKG 1403

Query: 1411 G 1411
            G
Sbjct: 1404 G 1404


>gi|358390357|gb|EHK39763.1| hypothetical protein TRIATDRAFT_48211 [Trichoderma atroviride IMI
            206040]
          Length = 1441

 Score =  227 bits (579), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 334/1473 (22%), Positives = 578/1473 (39%), Gaps = 200/1473 (13%)

Query: 57   NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
            NLVV   ++++I+ V+                         V ++   ES   G     +
Sbjct: 28   NLVVAKGSLLQIFTVKAISTELDPEFQPSQPTETETRFDRQVNDDDGLESSFLGGESMFM 87

Query: 92   LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
              D  +   L L+    L G V  LA +     + +   ++++LA++ AK+ + E+D   
Sbjct: 88   RTDRTNNTKLVLIAEIPLAGTVIGLARVKT--KNTASGGEALLLAYKAAKMCLAEWDPKK 145

Query: 152  HGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKVDPQGRCGGVLVYGLQMIILKASQGG 210
            + L   S+H +E  E +      E F      ++ DP  RC         + IL  ++  
Sbjct: 146  NELETISIHYYEKEE-MQGSPWEEVFGEYVNYLEADPGSRCAAFKFGTRNLAILPFTRSE 204

Query: 211  SGLVG---DED-------------TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFV 252
              L     DED               G G    A    S V+ L  LD  + H     F+
Sbjct: 205  EDLEMEDWDEDLDGPRPVKEHTAAANGDGNNVEAAYTPSFVLRLPLLDPSLLHPVHLTFL 264

Query: 253  HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
            H Y EP   +L   +   +   S  H +  +  L +    +    I S   LPHD YK++
Sbjct: 265  HEYREPTFGVLSSSQAPASSLGSKDHLSYKVFTLDLQQ--RASTTILSVTGLPHDLYKVI 322

Query: 313  AVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
            A+P+P+GG L+VG N  IH         +A+N  A    S     +S  ++ L++     
Sbjct: 323  ALPAPVGGALLVGQNELIHVDQSGKPNGVAINPMAKLATSFNLTDQSDLNLRLESCAIEL 382

Query: 372  L--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI---GNSL 422
            L  +N   LL    G L +++   DGR V  L +    +    +++ S +T I   G + 
Sbjct: 383  LAIENGELLLILNDGRLGIISFKIDGRTVSGLGVKLVGADCGGNIIKSRVTCISRLGKNA 442

Query: 423  FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
            FFLGS   DS+++ +   S         K    D +      +       +      +  
Sbjct: 443  FFLGSETSDSVVLGW---SRKQTQEKRRKSRLIDTDLALDVDELDLEDDEEDDDLYGDDS 499

Query: 483  ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------INAD- 527
              +     +N         SF + D+L++I P++D + G                ++AD 
Sbjct: 500  ATTKPNQTANGGTVKSGDISFRIHDTLLSIAPIQDITCGQSAFLPDSEEATLNKGVSADL 559

Query: 528  --ASATGISKQSNYELV------------ELPGCKGIWTVYHK----SSRGHNADSSRMA 569
              A A G  +  +  ++            E P  +G WT+  K     S G NA ++   
Sbjct: 560  QLACAVGRGEAGSIAVINREIQPKVIGRFEFPEARGFWTMCVKKPVPKSLGTNAGAAGDY 619

Query: 570  AYDDEYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
                ++  ++I++       E   +   TA     + E+      G T+ AG +  +  V
Sbjct: 620  DAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGNQMVV 679

Query: 624  IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
            IQV +   R  +G   + Q L       +  +G+E   V S SI DPY+L+   D S+ L
Sbjct: 680  IQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDASVFL 734

Query: 683  LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGA 742
               D +     ++   +  +S K  + C LY D                + GV +A  G 
Sbjct: 735  AQIDSNNEIEEIEKTDSGLTSTKWAAGC-LYKD----------------TKGVFQANQG- 776

Query: 743  DGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSET 801
            D       ++   +   +GAL I+ +P+ +  V+  +   S   H+   ++ + +     
Sbjct: 777  DQAKKSGEEVMMFLLNTAGALHIYALPDLSKPVYVAEGLSSIPPHLSADFVAKKV----- 831

Query: 802  EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
                          +E +  + V +L        H  P+L    +   +  Y+       
Sbjct: 832  ------------ASREALTELVVADLG----DTVHYSPYLILRHSTDDLTIYEPIRL--- 872

Query: 862  ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
                 +D P            SA+ +        PL+   ++  P   P   + I  N+ 
Sbjct: 873  ----PTDSPTRNLSDTLFFKKSANSILAKSTVEDPLEDTAQQ--PRYVP---LRICANVG 923

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            G+   FL G  P + +   + +     +    +   +  +   C+ GFIY  S+GI ++ 
Sbjct: 924  GYSTVFLPGPSPAFILKSSKSVPRVVGVQGLGVRGMSTFNTEGCDRGFIYSDSEGIARVT 983

Query: 982  QLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
            QLPS + +      V+K+PL      + Y      Y    +V       +   L  D + 
Sbjct: 984  QLPSKTNFTELGVSVKKVPLGNDVRHVAYHHPTETYIAGCAV------TEGFELPKDDDY 1037

Query: 1041 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1099
              +    +LS    H +       ++++ P      W    +I M+  E+   ++ + L 
Sbjct: 1038 HKEWAKESLS---FHPSTV--RGSLKLISPVT----WTVIHSIDMEPGESIECMKTLHLE 1088

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE--L 1153
             +  TKE   LLA+GTA  +GED+  RGRV ++        P     N   ++ +KE   
Sbjct: 1089 VSEETKERRMLLAVGTALTRGEDLPTRGRVQVYDIVTVIPEPGKPETNKKLKLLAKEEIP 1148

Query: 1154 KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVK--NF 1207
            +G ++AL+ +  QG +L+A G K ++   K  G+ L  +AF D    +V S   +     
Sbjct: 1149 RGGVTALSEIGTQGLMLMAQGQKCMVRGLKEDGSLLP-VAFLDM-SCHVASARELPGTGL 1206

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
             L+ D  K ++F  + E+     +L K  GSL     +FL DG  LS+V  D   +I + 
Sbjct: 1207 CLIADAFKGLWFAGYTEEPYTFKVLGKSSGSLPLLVADFLPDGEDLSMVAVDADGDIHVL 1266

Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGAH-VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLF 1326
             + P+  +S +G  LL R  F V  +  T  L L     +S    A+  S       LL 
Sbjct: 1267 EFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSATASQDSSTPQPHLLLL 1326

Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDS-VPHVAGLNPRSFRQFHSNGKAHR----- 1380
             +  GS+  + PL E  +RRL S+  +L+ + VPH  GL+ R+ R     G   R     
Sbjct: 1327 ASPSGSLAALTPLPESAYRRLLSVTNQLLPALVPH-GGLHARAHRAPEGGGGMSRMVGVE 1385

Query: 1381 --PGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
                  +IVD  +L+ +  L   ++ E+A + G
Sbjct: 1386 TAASGRAIVDGAILTRWNELGAAKRAEVASRGG 1418


>gi|358387835|gb|EHK25429.1| hypothetical protein TRIVIDRAFT_32877 [Trichoderma virens Gv29-8]
          Length = 1440

 Score =  226 bits (577), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 339/1477 (22%), Positives = 591/1477 (40%), Gaps = 209/1477 (14%)

Query: 57   NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
            NLVV   ++++I+ V+                         V ++   ES   G     +
Sbjct: 28   NLVVAKGSLLQIFTVKSISTELDPEFQPNQPAEVDTRFDRQVNDDDGLESSFLGGETMFM 87

Query: 92   LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
              D  +   L L+    L G V  LA L       +   + ++LA++ AK+ + ++D   
Sbjct: 88   RTDRTNNTKLVLIAEIPLAGTVIGLARLKTN--KTASGGEVLLLAYKAAKMCLAQWDPKK 145

Query: 152  HGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMIILKASQ 208
            + L   S+H +E  E L      E F  G  V   + DP  RC         + IL   +
Sbjct: 146  NELETISIHYYEKEE-LQGSPWEEVF--GEYVNHLEADPGSRCAAFKFGTRNLAILPFRR 202

Query: 209  GGSGLVG---DEDTFG---------SGGGFSARIESSHV------INLRDLDMKHVKDFI 250
                L     DED  G         +  G S  +E+++       + L D  + H     
Sbjct: 203  SEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTPSFVLRLPLLDPSLLHPVHLT 262

Query: 251  FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
            F+H Y EP   +L   +   A  +  K H       ++    +    I S   LPHD YK
Sbjct: 263  FLHEYREPTFGVLSSSQAP-AASLGLKDHLSY-KVFTLDLQQRASTTILSVTGLPHDLYK 320

Query: 311  LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD--AA 367
            ++A+P+P+GG L+VG N  IH         +A+N  A  + S     ++  ++ L+  A 
Sbjct: 321  VIALPAPVGGALLVGQNELIHVDQSGKPNGVAVNPMAKLVTSFSLTDQADLNLRLENCAI 380

Query: 368  HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSD---ITTIGN 420
                ++N   LL    G L +++   DGR V     RL  +    +V+ S    I+ +G 
Sbjct: 381  ELLAVENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVGADCGGNVIKSRAACISRLGK 440

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD-APSTKRLRRSSSDALQDMV 479
            + FF+GS  GDS+++ ++        +   + +   I+ D A     L     +   D+ 
Sbjct: 441  NTFFVGSETGDSVVLGWS-----RRQTQEKRRKSRLIDPDLALEVDELDLEDDEEDDDLY 495

Query: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------IN 525
              +  +     +N   +     SF + D L++I P++D + G                ++
Sbjct: 496  GDDSAATKPQTTNGGAAKSGDLSFRIHDVLLSIAPIQDITCGQAACLPDSEEATLIKGVS 555

Query: 526  AD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAA 570
            +D   A A G  +  +  ++            E P  +G WT+  K     +  S+   A
Sbjct: 556  SDLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFWTMCVKKPVPKSLGSNVGVA 615

Query: 571  YDDE---YHAYLIISLEARTMVLETADLLTEVTESVDYFVQ-------GRTIAAGNLFGR 620
             D +    H   +I  +      ET+D+        +   +       G T+ AG +  +
Sbjct: 616  GDYDAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGKQ 675

Query: 621  RRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
              VIQV +   R  +G   + Q L       +  +G+E   V S SI DPY+L+   DGS
Sbjct: 676  MMVIQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDGS 730

Query: 680  IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
            + L   D +     ++      +S K V+ C LY D                + GV ++ 
Sbjct: 731  VFLAQIDSNNEIEEMEKADGGLTSTKWVAGC-LYKD----------------TKGVFQSN 773

Query: 740  DGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--REAL 796
              +  G  D+G +   +   +GAL I+ +P+ +  V+  +   S   H+   ++  R A 
Sbjct: 774  LNSAAGKADEG-VMMFLLNSAGALHIYSLPDLSKAVYIAEGLSSIPPHLSAGFVARRGAT 832

Query: 797  KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
            +++ TEI                    V +L      + HS P+L    +   +  Y+  
Sbjct: 833  RETLTEI-------------------VVADLG----DSVHSSPYLILRHSTDDLTIYEPI 869

Query: 857  LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
                   T    D +   +S + S+++ S + +      P D     + P   P +    
Sbjct: 870  RLPTASATHALSDTLFFKKSAN-SSLAKSAVED------PSD--DTAQPPRYVPLRTCA- 919

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
              N+ G+   FL G  P + +   + +     L    +   +  H   C+ GFIY  S+G
Sbjct: 920  --NVGGYSAVFLPGPSPAFIIKSSKSIPRVVGLQGLGVRGMSTFHTEGCDRGFIYADSEG 977

Query: 977  ILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1035
            I ++ QLPS +        V+K+PL      + Y      Y    ++       +   L 
Sbjct: 978  IARVTQLPSKTNLTELGVSVKKVPLGHDIRHVAYHHPTETYIAGCTI------TENFELP 1031

Query: 1036 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1095
             D +   +    +LS +      ++    ++++ P      W    +I M+  E+   ++
Sbjct: 1032 KDDDYHKEWARESLSFLP-----SMARGALKLINPIT----WTVIHSIDMEPGESIECMK 1082

Query: 1096 VVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYS 1150
             + L  +  TKE   LLA+GTA  +GED+  RGRV ++        P     N   ++ +
Sbjct: 1083 TLHLEVSEETKERRMLLAVGTALTRGEDLPTRGRVQVYDIVTVIPEPGKPETNKRLKLLA 1142

Query: 1151 KE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV 1204
            KE   +G ++AL+ +  QG +L+A G K ++   K  G+ L  +AF D       +  + 
Sbjct: 1143 KEEIPRGGVTALSEIGTQGLMLVAQGQKCMVRGLKEDGSLLP-VAFLDMSCHVSTARELP 1201

Query: 1205 -KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
                 L+ D  K ++F  + E+     +L K  GSL     +FL DG  LS+V  D   +
Sbjct: 1202 GTGLCLIADAFKGLWFAGYTEEPYTFKVLGKSSGSLPLLVADFLPDGEDLSMVAVDADGD 1261

Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAH-VTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
            I +  + P+  +S +G  LL R  F V  +  T  L L     +S     +P S  +   
Sbjct: 1262 IHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSATTSPDSSSSQPH 1321

Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS-VPHVAGLNPRSFRQFHSNGKAHR- 1380
             LL  +  G +  + PL E  +RRL S+  +L+ + VPH  GL+ R+ R     G   R 
Sbjct: 1322 LLLLASPSGCLASLTPLPESAYRRLLSVTNQLLPALVPH-GGLHARAHRTPEGGGGMSRT 1380

Query: 1381 ------PGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
                      +IVD  +L+ +  L   ++ E+A + G
Sbjct: 1381 VGVETAASGRAIVDGAILARWNELGAAKRAEVATRGG 1417


>gi|408396642|gb|EKJ75797.1| hypothetical protein FPSE_03977 [Fusarium pseudograminearum CS3096]
          Length = 1427

 Score =  226 bits (576), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 335/1440 (23%), Positives = 560/1440 (38%), Gaps = 203/1440 (14%)

Query: 72   RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
            R  ++   ES   G     V  D  +   L LV    L G V  LA +       GG   
Sbjct: 68   RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124

Query: 127  SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
                +++++A++ AK+ +  +D     L   S+H +E  E LH      SF      ++ 
Sbjct: 125  ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEKEE-LHGAPWEVSFDEYANYLEA 179

Query: 186  DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIESS 233
            DP  RC         + IL   Q    L  D+            +T     G S  +E  
Sbjct: 180  DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETATVANGDSDTVEPP 239

Query: 234  HV------INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            +       + L D  + H   F F+H Y EP   IL   +          H T  +  L 
Sbjct: 240  YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQEPAHSLGQKDHLTYKVFTLD 299

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
            +    +    I S  +LP D +K+LA+P+P+GG L++G N  IH      +  +A+N+ A
Sbjct: 300  LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357

Query: 347  VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
              + S     ++  ++ L+        ++N   LL    G + +++ + DGR V  L + 
Sbjct: 358  RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSVK 417

Query: 405  ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
                +   +VL S  +T   +G + FF+GS +GDS+++ +T   G        K    D 
Sbjct: 418  MVTDENGGNVLKSRASTASKLGKNAFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474

Query: 458  EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
            +      +       D   D+   E  +   + + N        SF + D+L++I P+KD
Sbjct: 475  DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534

Query: 518  FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
             + G                + +D   A   G  K  +  ++            E P  +
Sbjct: 535  LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594

Query: 549  GIWTVYHKSSRGHNADSSRMAAYDDEY-----HAYLIISLEARTMVLETADLLT------ 597
            G WT+  K         S  A   DEY     H   +I  +      ET+D+        
Sbjct: 595  GFWTMSVKKPLPKALGGS--AGVGDEYETFGQHDKYMIVAKVDLDGYETSDVYALTGAGF 652

Query: 598  EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
            E  +  ++    G T+ AG +  + R+IQV +   R  DG   +TQ L     + E+G+ 
Sbjct: 653  ETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA- 709

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
                 V S SI DPY+LL   D S+ L   D +     V+   A   + K  + C LY D
Sbjct: 710  --EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYAD 766

Query: 716  KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CV 774
                             T     +  +D G  ++  I   +   +GAL ++ +P+ +  V
Sbjct: 767  -----------------TKGAFQLSASDKGETEK--IMMFLLSSTGALHVYALPDLSKPV 807

Query: 775  FTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
            +  +       H+  D  +R  L                   KE +  + V +L      
Sbjct: 808  YVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG----D 845

Query: 834  AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLRN 889
                 P+L        +  Y+               P+   R    SN+SA+    +  N
Sbjct: 846  TISQSPYLILRNQTDDLTIYE---------------PIRHVRPGGESNLSAALSFKKTSN 890

Query: 890  LRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
            +  + TP      E E P   P +R     NI+G+   FL GS P + +   + +     
Sbjct: 891  VTLATTPAQTEDDEVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVIG 947

Query: 949  LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQI 1007
            L    I   +  H   C+ GFIY   +GI ++ Q PS + +      V+K+PL +    I
Sbjct: 948  LQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRGI 1007

Query: 1008 TYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVR 1066
             Y      Y  I      +P           E+    D H   +   L    T+    ++
Sbjct: 1008 AYHQPTGAY--IAGCMTSEPF----------ELPKDDDYHKEWAKETLSFPPTMPRGILK 1055

Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1125
            ++ P      W     I ++S E+   ++ + L  +  TKE   L+A+GTA  +GED+  
Sbjct: 1056 LISPIT----WTVIHDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLPI 1111

Query: 1126 RGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKIIL 1177
            RGRV ++        P    T    K +      +G ++A++ +  QG +L+A G K ++
Sbjct: 1112 RGRVHVYDIVTVIPEPGKPETNRRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCMV 1171

Query: 1178 H--KWTGTELNGIAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
               K  G+ L  +AF D       +  + +    L+ D  K ++F  + E+     +L K
Sbjct: 1172 RGLKEDGSLLP-VAFLDMSCHVSSARELSRTGLCLMADAFKGVWFAGYTEEPYTFKVLGK 1230

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH- 1293
              G L     +FL DG  L++V +D   ++ I  + P+  +S +G  LL R  F V  + 
Sbjct: 1231 SHGRLPVVVADFLPDGDDLAIVAADVDGDLHILEFNPEHPKSLQGHLLLHRTSFSVSPNP 1290

Query: 1294 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1353
             +  L L      S  T   P         LL  +  G +  + PL E  +RRL S+  +
Sbjct: 1291 PSTTLLLPRTTPPSHPTPQDP------PHVLLLASSSGHLSSLIPLPETAYRRLLSVTNQ 1344

Query: 1354 LVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
            L+ ++    GLN ++ R             G  +IVD  +L+ +  L   ++ EIA + G
Sbjct: 1345 LLPALTPHGGLNAKAHRLPVGTRTVGVEAAGGRAIVDGAVLARWAELSAAKRAEIAGKGG 1404


>gi|149512998|ref|XP_001514888.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Ornithorhynchus anatinus]
          Length = 831

 Score =  225 bits (573), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 215/738 (29%), Positives = 340/738 (46%), Gaps = 141/738 (19%)

Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S++I++R LD K  ++ D  F+HGY EP ++IL+E   TW GRV+ +  TC I A+S++ 
Sbjct: 8   SYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILYEPNQTWPGRVAVRQDTCSIVAISLNI 67

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
             K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS      +  Y VSL+
Sbjct: 68  LQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQS------VPPYGVSLN 121

Query: 351 S----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
           S    +   P   R    + LD A A ++  D  ++S K G++ +LT++ DG R V+   
Sbjct: 122 SLTAGTTAFPLRLREGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFH 181

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA- 461
             K   SVLT+ + T+     FLGSRLG+SLL+++T         S  +E   D  AD  
Sbjct: 182 FDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPAGSA-REPARDSGADKQ 240

Query: 462 -PSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
            P  K+ R   +        A QD V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 241 EPPVKKKRVEQALSWAGGKSAAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 296

Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
           GP  + + G                L I        + + + + K    ++V   ELPGC
Sbjct: 297 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 356

Query: 548 KGIWTVYHK-------SSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLT 597
             +WTV          S +G  A+S         D + H +LI+S E  TM+L+T   + 
Sbjct: 357 YDMWTVIAPVRKEEGDSPKGEGAESEPTPPEPEDDGKRHGFLILSREDSTMILQTGQEIM 416

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
           E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +        
Sbjct: 417 ELDTS-GFATQGPTVYAGNIGDDRYIVQVSPLGLRLLEG---VNQLHFIPVDL------- 465

Query: 658 NSTVLSVSIADPYVLLGMSDGSIR--LLVGDP---STCTVSVQTPAAIESSKKPVSSCTL 712
            S ++  ++ADPYV++  ++G +   LL  D     T  +++  P  + S  K ++ C +
Sbjct: 466 GSPIVQCAVADPYVVIMSAEGHVTMFLLKSDSYGGRTHRLALHKP-PLHSQSKVIALC-V 523

Query: 713 YHD-----------KGP--EPWLRKTSTDAWLSTGVGEAIDGADG--------------- 744
           Y D            GP  +P LR  S    L   +   +D  +                
Sbjct: 524 YRDVSGMFTTESRASGPRDDPSLRGQSEAEPLLQELSHTVDDEEEMLYGDSSSLFSPSRD 583

Query: 745 -------GPLDQGDI--------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
                   P D+           + V+  ++GA+EI+ +P +  VF V  F  G+  +VD
Sbjct: 584 EPRRSSLPPADRDAPQYRAEPTHWCVLVRDNGAMEIYQLPEWRLVFLVKNFPMGQRVLVD 643

Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAI----- 844
           +   +    S  +  +  EE   QG    +  + +V L         +RP+L  +     
Sbjct: 644 SSFGQPAA-SAAQAEAKKEEPARQGELPLVKEVLLVALG-----NRQTRPYLLRLKWAIR 697

Query: 845 ---LTDGTILCYQAYLFE 859
              LT  T +  Q Y+ +
Sbjct: 698 DSELTSITFIDMQLYIHQ 715



 Score =  107 bits (267), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 55/140 (39%), Positives = 85/140 (60%), Gaps = 8/140 (5%)

Query: 1167 LLIASG-----PKIILHKWT--GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            LL+A G     P ++  KW    +EL  I F D   LY+  +  VKNFIL  D+ KSI  
Sbjct: 676  LLVALGNRQTRPYLLRLKWAIRDSELTSITFIDMQ-LYIHQIISVKNFILAADVMKSISL 734

Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
            L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G
Sbjct: 735  LRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGG 794

Query: 1280 QKLLSRAEFHVGAHVTKFLR 1299
             +LL RA+FHVGAHV  F R
Sbjct: 795  MRLLRRADFHVGAHVNAFWR 814


>gi|402591342|gb|EJW85272.1| hypothetical protein WUBG_03818, partial [Wuchereria bancrofti]
          Length = 1025

 Score =  223 bits (569), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 253/998 (25%), Positives = 418/998 (41%), Gaps = 149/998 (14%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            LE +   RL   V+S AI        +   DS +L F+DAK+S++  + +   L+  S+H
Sbjct: 62   LECLLAVRLLAPVQSFAI---ARISQNPDCDSFLLGFDDAKLSIVAVNPADRCLKTISLH 118

Query: 161  CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            CFE      LK G       P+++VDP  RC  +LV+G  + +L  +   + L       
Sbjct: 119  CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSAQL------- 168

Query: 221  GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                        S+ + L  +D +  +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169  -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279  HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
             T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218  DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337  S-CALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
              C ++LN+    +D   + P   F   ++ LD A  T +  +  LL  + G L  L +V
Sbjct: 276  PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332

Query: 393  YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
             D    V+ L+L     +V+   +T+      F+GSRL DS+ +   C    S L     
Sbjct: 333  TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388

Query: 452  EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS---ASNNTESAQKTFSFAVRDS 508
                       +TK+++ S+     +    E+  LYG         +  ++  +  V D 
Sbjct: 389  -----------ATKKIKLSTEPNANE--EDEDFELYGEMLPKVAKPDITEELLNIRVLDK 435

Query: 509  LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
            L+N+GP K  + G    +        K   ++LV   G    G   +  +S R     SS
Sbjct: 436  LLNVGPCKKITGGCPSISAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495

Query: 567  RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
             +            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L
Sbjct: 496  SIEGVVQYWAIGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554

Query: 618  FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSES-----GSGSENSTVLSVS 665
                  +QV          G +I    Y+   L+F   N+          ++N  +L   
Sbjct: 555  ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTFPVRNASIVDPYIAICTQNGRLLMYE 611

Query: 666  IAD-PYVLLGMSDGSIRL------------------LVGDPSTCTVSVQTPAAIESSKKP 706
            + + P+V L   D S RL                  ++   S   +S Q  A   +   P
Sbjct: 612  LTNHPHVHLKEIDISKRLRHETSPITSLSVYRDMSGIIRFCSAANMSQQQQATGANMHIP 671

Query: 707  -------VSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGPLDQGDI----Y 753
                   V    LY D       RK +       G+   E     D   +D   I    +
Sbjct: 672  EQEDFEDVDDLLLYGDSKKS---RKETLSKRRIVGMKLTEQNTHFDTDVIDPNTIVPSHW 728

Query: 754  SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG- 812
              +  E+G + I+ +P  + V+ V K     +H+ D    +   D E     ++ EGT  
Sbjct: 729  IAIARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDE----PATGEGTDA 780

Query: 813  -QGRKENIHSMK----VVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
              G   +  ++K    ++EL +     +  RP LF +L D T+  Y+ + +    N    
Sbjct: 781  MSGTMTDTFAVKPEEVIMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQ 835

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQ 924
                   + L  + V+    R+ RF  T  D     E+   A   +  +  F+ I     
Sbjct: 836  GHLAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLN 889

Query: 925  GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQL 983
            G F+  S PC   +     R+HP   DG I++FT  +N  C +GFIY+T +  ++++ +L
Sbjct: 890  GVFICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERDRLMRVAKL 949

Query: 984  PSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
            PS    D  +PV++I + AT H + Y    N Y ++ S
Sbjct: 950  PSDMILDASYPVKRINVGATVHSVVYLLHSNTYAVLTS 987


>gi|302924728|ref|XP_003053954.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256734895|gb|EEU48241.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1429

 Score =  221 bits (564), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 332/1437 (23%), Positives = 562/1437 (39%), Gaps = 202/1437 (14%)

Query: 75   EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSII 134
            ++G + S   G     V  D  +   L LV    L G V  LA +      +    ++++
Sbjct: 72   DDGLESSFLGGGESMLVRTDRTNNTKLVLVAELPLTGTVIGLAKIKTKYTKSGG--EALL 129

Query: 135  LAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQG 189
            LA++ AK+ + E+D   + L   S+H +E  E     W   +   + +     ++ DP  
Sbjct: 130  LAYKAAKMCLCEWDPKKNTLETLSIHYYEKDELQGAPW---EVAFDEYVN--FLEADPGS 184

Query: 190  RCGGVLVYGLQMIILKASQGGSGLVGD---EDTFGS---------GGGFSARIESSHV-- 235
            RC         + IL   Q    L  D   ED  G            G S  +E+++   
Sbjct: 185  RCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKESTAVANGDSDTLEAAYTPS 244

Query: 236  ----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
                + L D  + H     F+H Y EP   IL   +          H T  +  L +   
Sbjct: 245  FVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ- 303

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
             +    I S  +LP D YK++A+P+P+GG L++G N  IH      +  +A+N+ A  + 
Sbjct: 304  -RASTTILSVTDLPRDLYKMIALPAPVGGALLIGENEFIHIDQSGKANGVAVNSMARQMT 362

Query: 351  SSQELPRSSFSVELDAA--HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS---K 405
            S     ++  ++ L+       +++N   LL    G L +++   DGR V  + +    +
Sbjct: 363  SFSLSDQADLNLRLEGCIIEQLYIENGELLLILNDGRLGIVSFRIDGRTVSGISIKMIPE 422

Query: 406  TNPSVL----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
             N   L     S  + +G + FF+GS  GDS+++    G    M     ++         
Sbjct: 423  ENGGRLIKSRASTASKLGKNTFFIGSETGDSVVL----GWSRKMSQEKRRKTRLVDADLG 478

Query: 462  PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
                 L     D   D + G E +   + + N        SF + D+L++I P++D + G
Sbjct: 479  LDVDDLDLEDDDDEDDDLYGTETAAKPTQALNGAGKSGELSFRIHDTLLSIAPIRDLTSG 538

Query: 522  -LRINADASATGISKQ--SNYELV--------------------------ELPGCKGIWT 552
                  D+    +SK   S+ +L                           E P  +G WT
Sbjct: 539  KAAFLPDSEEATLSKGVVSDLQLACVVGRGNSGSLAILNRHIQPKIIGRFEFPEARGFWT 598

Query: 553  VYHK----SSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVTES 602
            +  K     S G N           ++  Y+I++       E   +   TA     + E+
Sbjct: 599  MCVKKPVPKSLGGNVTVGNDYETFGQHDKYMIVAKVDLDGYETSDVYALTAAGFETLKET 658

Query: 603  VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTV 661
                  G T+ AG +  + RVIQV +   R  DG   +TQ L     + E+G+      V
Sbjct: 659  EFDPAAGFTVEAGTMGKQMRVIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRV 713

Query: 662  LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 721
            +S SIADPY+LL   D S+ +   D +     V+   +   S K  + C LY D      
Sbjct: 714  ISASIADPYLLLIRDDSSVLIAQIDSNNELEEVEKTDSTLQSTKWHAGC-LYTD------ 766

Query: 722  LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKF 780
                      + GV +   G  G   D   I   +   +GAL ++ +P+ +  V+  +  
Sbjct: 767  ----------TKGVFQPSVGDKGA--DTSKIMMFLLSSTGALHVYALPDLSKPVYVAEGL 814

Query: 781  VSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
                 H+  D  +R  L                   KEN+  + V +L           P
Sbjct: 815  CYVPPHLSADYTLRRGLA------------------KENLRELLVADLG----DTVSQSP 852

Query: 840  FLFAILTDGTILCYQA--YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
            +L        +  Y+   Y  EG E T         S +L+    S + L       +  
Sbjct: 853  YLILRNQTDDLTIYEPLRYQPEGAEPT--------LSATLTFKKTSNAALATSPVETSQE 904

Query: 898  DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
            DA    + P   P +      N++G+   FL G  P + +   + +     L    I   
Sbjct: 905  DAV---QQPRFVPLRTCA---NVNGYSTVFLPGPSPSFILKSSKSIPRVIGLQGLGIRGM 958

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFAEKNLY 1016
            +  H   C+ GFIY   +GI ++ QLPS + + D    V+K+PL +    I Y      Y
Sbjct: 959  STFHTEGCDRGFIYADDEGIARVTQLPSETNFTDLGISVKKVPLDSDVCGIAYHQPTGTY 1018

Query: 1017 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1076
                +       N+   L  D +   +     L+        T+    ++++ P      
Sbjct: 1019 IAGCTT------NEPFELPRDDDYHKEWAKETLTFAP-----TMPRGVLKLISP------ 1061

Query: 1077 WQTRATI----PMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1131
                 T+     ++S E+   ++ + L  +  TKE   LL +GTA  +GED+  RGRV +
Sbjct: 1062 --VSLTVIHDQELESCESIECMKTLQLEVSEETKERRFLLTVGTALSKGEDLPIRGRVHV 1119

Query: 1132 FSTGRNADNPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKIILH--KWT 1181
            F        P    T    K +      +G ++A++ +  QG +L+A G K ++   K  
Sbjct: 1120 FDIVTVIPEPGKPETNKRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLKED 1179

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
            G+ L  +AF D       +  + +    ++ D  K ++F  + E+     +L K  G L 
Sbjct: 1180 GSLLP-VAFLDMSCHVSSARELPRTGLCVMADAFKGVWFAGYTEEPYTFKILGKSHGRLP 1238

Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1300
                +FL DG  L++V +D   ++ I  + P+  +S +G  LL R  F V  +       
Sbjct: 1239 LLVADFLPDGEDLAIVAADADGDLHILEFNPEHPKSLQGHLLLHRTTFSVSPNPPT---- 1294

Query: 1301 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
             ML        A P     ++  LL  +  G +  + PL E T+RRL S+  +L+ ++  
Sbjct: 1295 SMLLLPRTTPPAHPSPSDPSQI-LLLASPSGHLSTLVPLPEATYRRLLSVTNQLLPALTP 1353

Query: 1361 VAGLNPRSFRQFHSNGKAHRP-GPD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
              GLN + +R         RP G D     +IVD  +L+ +  L   ++ EIA + G
Sbjct: 1354 YGGLNAKGYRL----PSGTRPVGVDAAAGRTIVDGAILARWAELGAAKRAEIAGKGG 1406


>gi|33411762|emb|CAD58786.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
          Length = 880

 Score =  221 bits (564), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 161/497 (32%), Positives = 252/497 (50%), Gaps = 62/497 (12%)

Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ 
Sbjct: 8   SYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNI 67

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSL 349
           T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+     
Sbjct: 68  TQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGT 127

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNP 408
            +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   
Sbjct: 128 TAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAA 187

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
           SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  D E      KR+ 
Sbjct: 188 SVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRVD 245

Query: 469 RS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
            +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 246 ATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 301

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHKS 557
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 302 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 361

Query: 558 SR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
            +         G   +     A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 362 RKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFAT 420

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++A
Sbjct: 421 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVA 470

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 471 DPYVVIMSAEGHVTMFL 487



 Score =  129 bits (324), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 80/270 (29%), Positives = 128/270 (47%), Gaps = 15/270 (5%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 602  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 657

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 658  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 707

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 708  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 767

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 768  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 827

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
             WPV+KIPL+ T H + Y  E  +Y +  S
Sbjct: 828  PWPVRKIPLRCTAHYVAYHVESKVYAVATS 857


>gi|322704830|gb|EFY96421.1| Cleavage factor two protein 1 [Metarhizium anisopliae ARSEF 23]
          Length = 1433

 Score =  221 bits (564), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 333/1393 (23%), Positives = 550/1393 (39%), Gaps = 192/1393 (13%)

Query: 72   RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
            R  ++   ES   G     V  D  +   L L+    L G V  LA +     +     +
Sbjct: 70   RANDDDGLESSFLGGESLIVRADPSNITKLVLITEIPLAGTVIGLARVKV--KNTPSGGE 127

Query: 132  SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
            +++LA++ AK+ + E+    H L  TS+H +E  E   L+      + G  V   + DP 
Sbjct: 128  ALLLAYKAAKMCLTEWHPQRHTLETTSIHYYEKDE---LQGAPWEMSFGDYVNYLEADPG 184

Query: 189  GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
             RC         + IL  +Q    L  D   ED             T G G G      +
Sbjct: 185  SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPRPVKEELPLTNGDGPGDHDLVKS 244

Query: 229  RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
            R   S V+ L  LD  + H     F+H Y EP   IL   +   A      H T  +  L
Sbjct: 245  RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSAALGIKDHLTYKVFTL 304

Query: 287  SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
             +    +    I S   LP D ++++A+P+P+GG L+VG N  IH         +A+N+ 
Sbjct: 305  DLQQ--RASTTILSVTGLPQDLFRVMALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362

Query: 346  AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
            A  + S   + +S   + L+      L ND+   LL    G L ++    DGR V ++ +
Sbjct: 363  AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIVCFHIDGRTVSKISI 422

Query: 404  ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
                ++   +++ S ++ I   G++  FLGS   DS+++ ++   G        K +   
Sbjct: 423  RLVSAEYGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476

Query: 457  IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
            +     +         D   D + G + SL   S + N  S     SF ++DSL++I P+
Sbjct: 477  LLDPDLALDVDDLDLDDDEDDDLYGNDASLAKPSQTINGGSKPGEVSFRIQDSLLSIAPI 536

Query: 516  KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
            +D + G   +  D+    +SK    EL                             + P 
Sbjct: 537  RDVACGAPALVPDSEEATLSKGVTAELELACAVGRGSSGSVAILNREIQPKVIGRFDFPE 596

Query: 547  CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
             +G WT+  K      A  +       +Y  Y+I++       E   +   TA     + 
Sbjct: 597  ARGFWTMCAKKPLSKGAAVASDFDTTGQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656

Query: 601  ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
            ++      G T+ AG +  + R+IQV +   R  DG   ++Q L   P   E       +
Sbjct: 657  DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713

Query: 660  TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
            T  S SI DPY+LL   D SI +     +     V  P     S K  S C LY+D    
Sbjct: 714  T--SASIVDPYLLLIRDDSSIFIAQIHSNNELEEVLKPDGTLKSTKWASGC-LYND---- 766

Query: 720  PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD-IYSVVCYESGALEIFDVPNFN-CVFTV 777
                   T       V E          D+ D I   +    GAL ++ +P+ +  VF  
Sbjct: 767  -------TQGIFQNNVNEQ-------QADETDRIMMFLLSSVGALHVYALPDVSRPVFVA 812

Query: 778  DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
            +   S     +  ++  A           + +G     KE+I  + V +L      A   
Sbjct: 813  EALTS-----IPPFLSAAF---------VARKGAS---KESITEILVADLG----DAISQ 851

Query: 838  RPFLFAILTDGTILCYQA--YLFEGPENTSKS---DDPVSTSRSLSVSNVSASRLRNLRF 892
             P+L        +  Y+   Y  EG    S S      V+TS + +   VS         
Sbjct: 852  TPYLIVRHASDDLTIYEPVRYQAEGDAELSASLLFKKCVNTSLAKTAPEVSED------- 904

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG 952
                 DA    E P   P +R     N++G+   FL  + P + +           L   
Sbjct: 905  -----DA----EPPRFVPLRRCA---NVNGYGAVFLPNASPSFVLKSSHSEPRVMGLQGL 952

Query: 953  SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFA 1011
             +   +  H   C+ GFIYV  +GI ++ QLPS +        V+KI L      I+Y  
Sbjct: 953  GVRGMSTFHTEGCDRGFIYVDMEGIARVTQLPSNANLTELGVSVKKIALDGDVGMISYHH 1012

Query: 1012 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1071
                Y  +V    L+           +E   +  N            T+    ++++ P 
Sbjct: 1013 PTGTY--VVGCTKLEQFELPRDDDYHKEWAKETSNF---------PPTMARGILKLINPV 1061

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1130
                 W     + ++  E+  +++ + L  +  TKE + L+A+GTA  +GED+  RGRV 
Sbjct: 1062 T----WTVIHELELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQ 1117

Query: 1131 LFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KW 1180
            +F        P     N   ++ +KE   +G ++AL+ +  QG +L+A G K ++   K 
Sbjct: 1118 VFDIVTVIPEPGRPETNKRLKLIAKEEIPRGGVTALSEVGAQGLMLVAQGQKCMVRGLKE 1177

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
             G+ L  +AF D    +V S+  +      ++ D+ K ++F  + E+     +L K  G 
Sbjct: 1178 DGSLLP-VAFLDM-SCHVASVKELPGTGLCVMADVFKGLWFAGYTEEPYTFKILGKSSGK 1235

Query: 1239 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1298
            L   A +FL DG  LS+V  D + ++ I  + P+  +S +G  LL R  F V  +     
Sbjct: 1236 LPLLAADFLPDGEDLSMVAVDAEGDLHILEFNPEHPKSLQGHLLLHRTSFAVTPNTPSS- 1294

Query: 1299 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS- 1357
               +L   +        S  ++   LL     G +  ++PL E T+RRL S+  +L  + 
Sbjct: 1295 --TLLLPRTHSPSYPQASSSSSSHMLLLACPSGQLAALSPLAESTYRRLLSVTNQLHPAI 1352

Query: 1358 VPHVAGLNPRSFR 1370
            VPH  GL+ ++ R
Sbjct: 1353 VPH-GGLHSKAHR 1364


>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
            [Trichinella spiralis]
 gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
            [Trichinella spiralis]
          Length = 1376

 Score =  221 bits (564), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 168/629 (26%), Positives = 295/629 (46%), Gaps = 62/629 (9%)

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSK----------------------------SDD 869
            RPFLFA++ +  +L Y+A+ +  P+   +                            +DD
Sbjct: 755  RPFLFAVVEE-QLLIYEAFHYPYPQQRYRLSVRFKKVRHTAILQRFRRIGRDDFKLLADD 813

Query: 870  -----PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISG 922
                      R  S  + + SR R  R S    +A+  E     + AP ++++ F+N++G
Sbjct: 814  FQFSEQYRRRRKRSKHDSNRSR-RGDRHSGRRQEAHEHEPYRLTYEAPARQLSPFENVAG 872

Query: 923  HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G F+ G  P +C + ++  LR+HP   DG +VAF    +      F Y T+ G++++ 
Sbjct: 873  YAGLFIGGGYPYFCFLSKQGDLRLHPMHIDGPVVAFAPYCSPKQLRAFAYFTADGMMRVS 932

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
             LPS   +D   P  K+ L    H + Y  E + Y L  S  +  P ++V++L+ D +  
Sbjct: 933  SLPSKFDFDRSIPSMKVELGRAAHFVVYLMESHTYALTTSEQM--PCHKVVTLIGDDKQF 990

Query: 1042 HQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTR--ATIPMQSSENALTVRVVT 1098
               D         H  Y T+E++++++   D     W     A +     E+    + V 
Sbjct: 991  ETFDREAP-----HFIYPTMEQFKLQLYSADT----WLPVPGAELDFDEFEHVTACQEVQ 1041

Query: 1099 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKE 1152
            L +  +    ++ LAIGT    GE+V  RGR+L+         P   +T+     VYSKE
Sbjct: 1042 LKSEGSASGLQSYLAIGTVLNYGEEVLIRGRLLIIDVVEVVPEPDRPMTKFKLKVVYSKE 1101

Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
             KG +++L SL+G+LL   G K+ + ++    L GI+F D   +YV  +  ++   L  D
Sbjct: 1102 QKGPVTSLCSLRGYLLTGMGQKVYIWQYKDNALVGISFLDLQ-VYVHQMASIRYLALTAD 1160

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
                +  L ++E+   L+L+++D    +  A EFL+D + LS +++    +I  + Y P+
Sbjct: 1161 AFFGVSLLRYQEEYKALSLVSRDPRPDEVLAVEFLVDRTDLSFLMTSAAGDILTYVYLPE 1220

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
              +S+ GQ+L+ +A++H G+ V  F+R++     +         +   R  L+F + DGS
Sbjct: 1221 SLDSFGGQRLVPQADYHFGSQVNAFVRMR---CHAQEIAGRKRQEVLQRQGLIFASSDGS 1277

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +  + PL E  +R L  LQ  L+D +P  AGLN   +R         R    +I+D  + 
Sbjct: 1278 VNYLLPLPEREYRLLGMLQSLLIDMLPSFAGLNVDDYRTVRFPNSCLREPTKNIIDGNIC 1337

Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
              Y  +   +Q +I  Q G++ SQI+  L
Sbjct: 1338 MLYLYIDALQQEDIVRQIGSSHSQIMLEL 1366



 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 165/616 (26%), Positives = 286/616 (46%), Gaps = 74/616 (12%)

Query: 99  ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
           AS ELV   +++G + S+AI    G     + D I+LA +DAK+SV+ +D   H L   S
Sbjct: 65  ASFELVLSEQVYGRLASVAIARLTGF----QLDVILLAIDDAKLSVVGYDIETHSLVTLS 120

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVG 215
           MH +E   +   K G   F   P++++DP+ RC  + +YG  +++L   + S   S  + 
Sbjct: 121 MHYYEDDLF---KLGFTRFEIPPMLRMDPERRCAAMTIYGAHLVVLPLVRESLYESMNIV 177

Query: 216 DEDTFGSGGGFSARIESSHV-INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
           D      G  FS R+ S  V  N  D  M +V D  F+HG+ EP +++L+E   T AGRV
Sbjct: 178 DPSQ-RPGWPFSLRLTSYTVAFNAIDAKMHNVTDMCFLHGFYEPTVLLLYEPTQTTAGRV 236

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
             +  T  I A+S++   K H +IW+  NLP DA+ LLA+P P+GGVL+   N+I Y +Q
Sbjct: 237 VVRQDTYQILAVSLNPKDKTHAVIWTLGNLPFDAFALLALPKPLGGVLLFSVNSIIYLNQ 296

Query: 335 SA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
           S   C + +N+      +     RS   V LD +HA  + +  A L  ++G + ++++++
Sbjct: 297 SVPCCGILINDNGRGFTNYPLRDRSELMVTLDGSHAALIDSANAALVLRSGLVFVVSLLF 356

Query: 394 DG-RVVQRLDLSKTN-----PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
           D   +V+ + L+ ++     PS +++    + ++  F+GS +G+S L  +       +++
Sbjct: 357 DRLNMVKEILLTASSVRGAAPSTVSA---CVSSNCLFVGSAIGNSALYAYEAIEQVDVVA 413

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN--NTES-AQKTFSFA 504
             L                 R +  + L DM       LYG       TE+  Q  F F 
Sbjct: 414 VTLPA---------------RDTGLNLLDDM------QLYGELIRPCTTETLVQTKFEFR 452

Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
             D L ++GP +  + G    A  +       S++ +   PG  G +TV  +S R     
Sbjct: 453 RLDQLASLGPCRAITVGESSVAMVNNFYEDYVSDWLVAGGPGTDGSFTVMQRSVRPRLLT 512

Query: 565 SSRMAAYDDEYHA-----------------YLIISLEARTMVLETADLLTEVTESVDYFV 607
            +R+    + +                   Y++++ + RT+V   +  +TE+ ++  + +
Sbjct: 513 QTRVEDVLNAWSVGAQLIGSVDRSASPRPQYMLLTTKQRTVVFTLSSGITEIFDT-GFEI 571

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           +  TIA G++     V+QV +    +L      Q ++                V   S+ 
Sbjct: 572 RFETIACGDMMNGAYVVQVTKENLVLLHRGQQVQCINL----------RVFEEVCQASVI 621

Query: 668 DPYVLLGMSDGSIRLL 683
           DPYV L +  G + L 
Sbjct: 622 DPYVALIVRHGHVLLF 637


>gi|119602515|gb|EAW82109.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_b [Homo sapiens]
          Length = 377

 Score =  218 bits (555), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 133/369 (36%), Positives = 205/369 (55%), Gaps = 31/369 (8%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDIT 416
            SVLT+ ++
Sbjct: 367 ASVLTTSVS 375


>gi|50552095|ref|XP_503522.1| YALI0E03982p [Yarrowia lipolytica]
 gi|74634000|sp|Q6C740.1|CFT1_YARLI RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|49649391|emb|CAG79101.1| YALI0E03982p [Yarrowia lipolytica CLIB122]
          Length = 1269

 Score =  218 bits (554), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 311/1370 (22%), Positives = 548/1370 (40%), Gaps = 215/1370 (15%)

Query: 98   AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
            A  LEL+  Y L G V  +  +     DN    DS+ ++ + AK  ++ ++ S   +   
Sbjct: 51   APRLELITEYYLDGTVTGVTRIKT--IDN-YDLDSLYISVKHAKAVIVAWNASSFTIDTK 107

Query: 158  SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
            S+H +E  + L      E       V  +       +L    +M  L   + G   + D+
Sbjct: 108  SLHYYE--KGLVESNFFEPECSSVAVSDEANSFYTCLLFQNDRMAFLPIIEKG---LDDD 162

Query: 218  DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
            +   SG  F    + S ++    LD  +++V D  F+H Y E  M IL + +  W G  +
Sbjct: 163  EMPESGQVF----DPSFIVKASRLDKRIENVMDICFLHEYRETTMGILFQPKRAWVGMKN 218

Query: 276  WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
                T   + +S+    K   +I +   LP DA K++ +P+P+GG L++ ANTI Y   S
Sbjct: 219  ILKDTVSYAIVSVDVHQKNSTVIGTLNGLPVDAQKVIPLPAPLGGSLIICANTILYIDSS 278

Query: 336  ASCALALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQN--DVALLSTKTGDLVLLTV 391
            AS    + N     +S   + R  S+  + L+ A   ++Q   + ALL T+ G    L  
Sbjct: 279  ASYTGVMVNNTHRQNSDLIVSRDQSTLDLRLEGAEVCFIQELGNTALLVTEDGQFFSLLF 338

Query: 392  VYDGRVVQRLDLSKTNPS--VLT--SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
              DGR V  L+L    P   +L+  S +    +   FLGSR GDSLLV++  G   S   
Sbjct: 339  NKDGRRVASLELRPIEPDNFILSQPSSVAAGPDGTIFLGSRAGDSLLVKWYHGEPESQPE 398

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVR 506
              L                          D  N  +  LYG  +  TE +  +     + 
Sbjct: 399  ETL--------------------------DDGNESDDDLYGGDTAQTEDTTNRPLKLRLA 432

Query: 507  DSLVNIGPLKDFSYGLRINAD----ASATGISKQSNYELV--------------ELPGCK 548
            D ++ +GP++  + G    +      + TG+   S   ++              ++PG +
Sbjct: 433  DRMLGMGPMQSLALGKNRGSQGVEFVTTTGVGANSALAILTSALMPYKRKSLYKDMPGGQ 492

Query: 549  GIWTVYHK-SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
              W+V  +    G  A S       D  ++YL     A   V+E   L T+  ++  +FV
Sbjct: 493  -FWSVPVRFEEEGEVAKSRTYVVSSDSENSYLYYVDAAG--VIEDVSLSTKKKKTKKHFV 549

Query: 608  QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
               T    +      ++QV      I D                  S  + +T +   + 
Sbjct: 550  SNVTTIFSSSMLDSALLQVCLETVNIYDAKI---------GQPHKYSLPQGTTAVEARVL 600

Query: 668  DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
              YVL+ +SDG +++L        VS+     +++++  +   +     G        +T
Sbjct: 601  GNYVLVLLSDGQVKILEA------VSINKRPFLKAAQVSIEPASESKAIG------IYAT 648

Query: 728  DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
            D+ L+ G         G P        VVCY  G+L             +    S    I
Sbjct: 649  DSSLTFGAPSKKRTRQGSPAQDSRPVVVVCYADGSL------------LLQGLNSDDRLI 696

Query: 788  VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
            +D           ++++   +E  GQ        +++V++A+      H       +LT 
Sbjct: 697  LDA----------SDLSGFIKEKDGQLYDA---PLELVDIALSPLGDDHILRDYLVLLTP 743

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
              ++ Y+ Y +                               LRF +  L     E TP 
Sbjct: 744  QQLVVYEPYHYND----------------------------KLRFRKIFL-----ERTPT 770

Query: 908  GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD----GSIVAFTVLHNV 963
                +R+T    I+G     ++G       +  + L   P+L +       VAFT     
Sbjct: 771  INSDRRLTQVPLINGKHTLGVTGET---AYILVKTLHTSPRLIEFGETKGAVAFT----- 822

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL-KATPHQITYFAEKNLYPLIVSV 1022
            + +  F Y+T  G +  C+     + +  WPV+ + L   T  ++TY    ++Y      
Sbjct: 823  SWDGKFAYLTQAGEVAECRFDPSFSLETNWPVKHVQLCGETISKVTYHETMDVY------ 876

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSV--DLHRTYTVEEYEVRILEPDRAGGPWQTR 1080
             V+     V  ++ D+      D+  + S+  D+    T +   +RI+ P      W   
Sbjct: 877  -VIATHKTVPHVVRDE------DDEVIESLTPDIMPATTYQG-AIRIVNP----YSWTVI 924

Query: 1081 ATIPMQ-SSENALTVRVVTLFNTTTK-ENETLLAIGTAYVQGEDVAARGRVLLFSTGR-- 1136
             +   +  +E AL    V L  +  K +   ++A+GT+ ++GED+AARG + LF      
Sbjct: 925  DSYEFEMPAEAALCCESVKLSISDRKSQKREVVAVGTSILRGEDLAARGALYLFDVIEIV 984

Query: 1137 -NADNPQN--LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-ELNGIAFYD 1192
               + P+    + ++    ++GA +A+  + G LL   G K+++        L  +AF D
Sbjct: 985  PEKERPETNRRLKKLVQDRVRGAFTAVCEVSGRLLAVQGQKLLVQALQDDLTLVPVAFLD 1044

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
                YV     + + +LLGD  +S+ F+ +     Q+   A+D   +     +F I+G  
Sbjct: 1045 MQ-TYVAVAKSLNSMLLLGDATRSVQFVGFSMDPYQMIPFARDLQRVLVTTCDFAIEGEN 1103

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
            L+ VV+D QK + I  Y P   +S+ G +LL R+ F+ G  +             D +  
Sbjct: 1104 LTFVVADLQKRLHILEYDPDDPQSYSGARLLRRSVFYSGKVI-------------DSSAM 1150

Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
             P ++  +RF ++    DGS+  + P  E  +RRL ++Q ++ D   HV GL+PR++R  
Sbjct: 1151 VPINE--DRFMVIGVCSDGSVTDVVPCPEDAYRRLYAIQTQITDKEAHVCGLHPRAYRYD 1208

Query: 1373 H----SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
                 +    HRP    I+D   L  +  LP  +Q   A++ G    Q++
Sbjct: 1209 PILPGTGNSPHRP----ILDGHTLIRFANLPRNKQNVYANRLGQRYQQLI 1254


>gi|340515387|gb|EGR45642.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1441

 Score =  218 bits (554), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 342/1447 (23%), Positives = 575/1447 (39%), Gaps = 203/1447 (14%)

Query: 72   RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
            +V ++   ES   G     V  +  +   L L+    L G V  LA L    +  +   +
Sbjct: 68   QVNDDDGLESSFLGGETMLVRTERTNNTKLVLITEIPLAGTVIGLARLRT--SRTASGGE 125

Query: 132  SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
             +++A++ AK+ + E+D   + L   S+H +E  E L      E F  G  V   + DP 
Sbjct: 126  VLLIAYKAAKLCMAEWDPRKNELETISIHYYEKEE-LQGAPWEEVF--GEYVNHLEADPG 182

Query: 189  GRCGGVLVYGLQMIILKASQGGSGLVG---DEDTFG---------SGGGFSARIESSHV- 235
             RC  +      + IL   +    L     DED  G         +  G S  +E+++  
Sbjct: 183  SRCAALKFGTRNLAILPFRRSEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTP 242

Query: 236  -----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
                 + L D  + H     F+H Y EP   +L   +   A   +  H +  +  L +  
Sbjct: 243  SFVLRLPLLDPSLLHPVHLTFLHEYREPTFGVLSSSQAPAASLGARDHLSYKVFTLDLQQ 302

Query: 291  TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
              +    I S   LPHD Y+++A+P+P+GG L+VG N  IH      S  +A+N  A   
Sbjct: 303  --RASTTILSVTGLPHDLYRVIALPAPVGGALLVGQNELIHVDQSGKSNGVAVNPMAKLA 360

Query: 350  DSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDL 403
             S     +S   + L+  A     ++N   LL    G L +++   DGR V     RL  
Sbjct: 361  TSFSLTDQSDLKLRLENCAIEVLAIENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVG 420

Query: 404  SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
            +    +VL S  T +   G +  F+GS   DS+++ +   S         K    D +  
Sbjct: 421  ADCGGNVLKSRATCVSRLGKNTLFVGSETSDSVVLGW---SRRQTQEKRKKSRLIDPDLA 477

Query: 461  APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                +       +      +    +      N   +     +F + D L++I P++D + 
Sbjct: 478  LEVDELDLEDDEEDDDLYGDDSVATKPQQLPNGGPAKSGDLTFRIHDVLLSIAPIQDVTC 537

Query: 521  GLR--------------INAD---ASATGISKQSNYELV------------ELPGCKGIW 551
            G                + AD   A A G  +  +  ++            E P  +G W
Sbjct: 538  GQAAFPPDSEEATLNRGVRADLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFW 597

Query: 552  TVYHKSS--RGHNADSSRMAAYDD--EYHAYLIIS------LEARTMVLETADLLTEVTE 601
            T+  K    +   A++     YD   ++  ++I++       E   +   TA     + E
Sbjct: 598  TMCVKKPVPKSLGANAGVAGDYDTPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKE 657

Query: 602  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
            +      G T+ AG +  +  VIQV +   R  +G   + Q L       +  +G+E   
Sbjct: 658  TEFEPAAGFTVEAGTMGKQMVVIQVLKSEVRCYNGDLNLIQILPM----LDEETGAEPRA 713

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
            V S SI DPY+ +   DGS+ L   D +     ++   +  +S K V+ C     KG   
Sbjct: 714  V-SASIVDPYLFIVRDDGSVFLAQIDSNNEIEEMEKTDSSLTSTKWVAGCLYKDTKG--- 769

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDK 779
              + + +D+   T   EA+                +   +GAL IF +P+ +  V+  + 
Sbjct: 770  IFQSSYSDSTKQTS--EAV-------------MMFLLNSTGALHIFALPDLSKAVYVAEG 814

Query: 780  FVSGRTHIVDTYM--REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
              S   H+   Y   R A +++ TEI                    V +L      A H+
Sbjct: 815  LSSIPPHLSAGYAARRGATRETLTEI-------------------VVADLG----DAVHA 851

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTP- 896
             P+L    +   +  Y+      P N         T+ +LS           L F ++P 
Sbjct: 852  SPYLILRHSTNDLTIYEPIRL--PAN--------ETAHTLS---------DTLFFKKSPN 892

Query: 897  -LDAYTREETPHGAPCQR-----ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
             + A +  E P     Q      + I  N+ G+   FL G  P + +     +     L 
Sbjct: 893  AVLAKSAVEDPSDDTAQPPRYVPLRICANVGGYSSVFLPGPSPAFVIKSSRSVPRVVGLQ 952

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITY 1009
               +   +  H   C+ GFIY  S+GI ++ QLPS + +      V+K+PL      + Y
Sbjct: 953  GHGVRGMSTFHTEGCDRGFIYADSEGIARVTQLPSKTNFTELGISVKKVPLGFDVRHVAY 1012

Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN---LSSVDLHRTYTVEEYEVR 1066
                  Y  I    V +          + E+    D H      SV L  T       ++
Sbjct: 1013 HHPTETY--IAGCAVTE----------NFELPKDDDYHKEWARESVPLPPTAV--RGALK 1058

Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1125
            ++ P      W    +I M++ E+   ++ + L  +  TKE   LLA+GTA  +GED+  
Sbjct: 1059 LINPIT----WTVIHSIDMEAGESIECMKTLHLEVSEETKERRMLLAVGTALSRGEDLPT 1114

Query: 1126 RGRVLLFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIIL 1177
            RGRV ++        P     N   ++ +KE   +G ++AL+ +  QG +L+A G K ++
Sbjct: 1115 RGRVQVYDIVTVIPEPGKPETNKRLKLLAKEDIPRGGVTALSEIGTQGLMLVAQGQKCMV 1174

Query: 1178 H--KWTGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLA 1233
               K  G+ L  +AF D    +V S+  +      L+ D  K ++F  + E+     +L 
Sbjct: 1175 RGLKEDGSLLP-VAFLDM-SCHVSSVRELPGTGLCLIADAFKGLWFAGYTEEPYTFKVLG 1232

Query: 1234 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
            K  GSL     +FL DG  LS+V  D   ++ +  + P+  +S +G  LL R  F V  +
Sbjct: 1233 KSSGSLPLLVADFLPDGEDLSMVAVDADGDMHVLEFNPEHPKSLQGHLLLHRTTFSVTPN 1292

Query: 1294 -VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
              T  L L     +S  +  +  S  T    LL  +  GSI  + PL E  +RRL S+  
Sbjct: 1293 PPTSTLLLPRTLPASQSSQDSSSSSSTQPHILLLASPSGSIAALTPLPESAYRRLLSVTN 1352

Query: 1353 KLVDS-VPHVAGLNPRSFRQFHSNGKAHR-------PGPDSIVDCELLSHYEMLPLEEQL 1404
            +L+ + VPH  GL+ R+ R     G   R           +IVD  +L+ +  L   ++ 
Sbjct: 1353 QLLPALVPH-GGLHARAHRTPEGGGGMSRTVGVETAATGRAIVDGTVLTRWNELGAAKRA 1411

Query: 1405 EIAHQTG 1411
            E+A + G
Sbjct: 1412 EVATRGG 1418


>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
          Length = 1451

 Score =  217 bits (553), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 179/678 (26%), Positives = 315/678 (46%), Gaps = 60/678 (8%)

Query: 760  SGALEIFDVPNFNCVFTV-DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
            +G+LEI+ +P+  C+    D+  +    I++T               S  EG+ +GR+ +
Sbjct: 814  NGSLEIYSLPD--CLLRFGDRNFANAPRILET---------------SRFEGS-EGRRVD 855

Query: 819  IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
            +  + V E+ +       S P++  ++ D  ++    Y F    N  +++ PV + R + 
Sbjct: 856  V--LDVQEMNVFNMGPS-SLPYIVVMIGDQLMI----YRFRATLNRFQTESPVLSGRFIK 908

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
            + +    + + LR      D  ++ +  +    ++   F NIS H G FL G+ P W   
Sbjct: 909  LQD----KTKLLRRIPGVHDESSKTKNRNNKIMRQ---FMNISDHNGIFLGGAYPTWIFC 961

Query: 939  FRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQ 996
             +  RL +H    +G + AFT   N  C  GF+Y   S   L +  L     YD  WP +
Sbjct: 962  GQNGRLNIHSMWQEGFVNAFTPFDNEKCADGFLYFRHSTKTLTVANLQPFLKYDADWPFK 1021

Query: 997  KIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL--IDQEVGHQIDNHNLSSVDL 1054
            KI L  TP   +Y  E+ +      + V    ++ + +L  I+ E GH+ +  +L  V  
Sbjct: 1022 KIKLNYTPCFSSYDLEQKV------LTVCGSRSEKIEMLPKINAE-GHK-EYEDLPEVQN 1073

Query: 1055 HRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE-NETLL 1111
              T    ++ V +  P      W+    + I M + E+ L  R V L +  +    +  +
Sbjct: 1074 VETQLFPQFFVEMFSP----ASWEVIPNSRIEMDAHEHILCCRSVYLKSEASMSGRKQYI 1129

Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGH 1166
            AIGT+ + GED  +RGR++L         P   +T      V+    +G +SA+ SL G 
Sbjct: 1130 AIGTSNICGEDFQSRGRLILLEVIDVVPEPGKPLTRYKYKTVFDASQRGPVSAVDSLDGA 1189

Query: 1167 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
            L+ A G K+ +H +    L    F D   LY  + +  KN+ L+GDI + I  L  + + 
Sbjct: 1190 LIAAIGQKVFIHAFQDDNLRATGFVDTQ-LYTHATHCFKNYALVGDIQQGITLLRHQGER 1248

Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
              ++ +++   + +  A   L+DG+ + LV +D Q+N+Q++ Y P   ES  G++L+ +A
Sbjct: 1249 NCISQISRARRAGEVTAVGILLDGNQVGLVSTDMQRNLQVYMYKPDQKESNGGKQLVRQA 1308

Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
            + ++G  V        L   +D       ++   R    +  LDGSIG I P+ E  FRR
Sbjct: 1309 DINLGKRVISIW--NSLGRQNDTFTKVALTENDARHVTFYAGLDGSIGDIVPVSEKVFRR 1366

Query: 1347 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1406
            L+ LQ  +   +PH  GLNPR +R   +  +       +I+D +LL  +  L   EQ ++
Sbjct: 1367 LEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGLSFTEQTDL 1426

Query: 1407 AHQTGTTRSQILSNLNDL 1424
            + + G TR  +L ++ D+
Sbjct: 1427 SRKIGVTREALLDDMMDV 1444



 Score =  188 bits (478), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 182/695 (26%), Positives = 313/695 (45%), Gaps = 77/695 (11%)

Query: 57  NLVVTAANVIEIYVVR--VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
           NL V A N++ +Y +R  V E G+   +                   EL   + L G V 
Sbjct: 45  NLAVAAGNMLSVYRIRSSVDEAGNHFDR------------------FELCDEFELWGIVV 86

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            +  L   G+     RDS++L+ E++K  ++E++     L   SMH F+  +   L+RG 
Sbjct: 87  CMTRLRLAGS----VRDSLLLSIEESKCVIVEYEPDTGSLSTISMHFFQDED---LRRGF 139

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS-GLVGD--EDTFGSGGGFSARIE 231
              +   L +VD   RC  VLVYG  + +L   +     L G   +  F    GF A   
Sbjct: 140 RKLSSMALARVDGFNRCAAVLVYGSYLAVLPFRRSTERDLSGQRHQAVFYENSGFIA--- 196

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
             ++I+L+ L +K   V DF F+ GY +P +++L+E   TW GRV+ +  TC + ALSI+
Sbjct: 197 --NMIDLQSLPVKIASVLDFQFLEGYNDPTILLLYEALPTWTGRVTERQDTCGMVALSIN 254

Query: 290 TTLKQHPLIWSAMNLPH-DAYK--LLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNY 345
              + HP+IW    LP  + Y   L  +P P+GG L+   N++ Y  QS     +ALN+ 
Sbjct: 255 LIDETHPVIWQMAGLPFPNPYSSALFPIPKPLGGSLLFATNSLIYLDQSVPPYGVALNSL 314

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
            +   +     +    + L    A  L +D   +S ++GD+ ++T+  D    V+R  L 
Sbjct: 315 PLGCTNFALKTQDVAPLNLQNCKACMLSDDSICVSLESGDVYIITLKKDSLNNVRRFYLD 374

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
           +   SV+ + ++ + ++L FLGSRLG+SLL+++ C   +   S+ L+    D        
Sbjct: 375 QVASSVIPTTLSKLSDNLIFLGSRLGNSLLLRYKCKENSKKSSTSLENGEKDGVEIENKE 434

Query: 465 KRLRRSSSDALQDMVNG------EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
           +     + +  +   NG      +++  YG    N +    ++ F   D+L NIGP    
Sbjct: 435 EEKNELNFEIEKSSENGSPENKRKKMRYYGDEIFNLD-VNTSYDFETMDNLSNIGPCGPV 493

Query: 519 SYGLRINADASATGI---SKQSNYELVELPGCK----GIWTVYHKSSRGHNA-------- 563
                 N + +   +   ++  N ++  L G      G  TV HKS R   A        
Sbjct: 494 ELIHTANHNDNYDHVGSDARDRNIDVCVLSGKDKTGFGSITVLHKSVRPSIASQFPFPMN 553

Query: 564 --DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV-TESVDYFVQGRTIAAGNLFGR 620
             D   +   ++E H+ L+++ + +TMV +T  +L E+  E        +TI    +   
Sbjct: 554 FSDMWTLRRSEEETHSLLVMTKKDQTMVFQTGAILEELKKEECGLATNAKTIFCATIGNG 613

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS- 679
           + ++QV  R   ++D    TQ+         SG       ++ V+  DPYV++  S G+ 
Sbjct: 614 KYIVQVLPRAVVLVDMD--TQETIQNKPFDLSGQ------IIQVA-CDPYVVILASKGTI 664

Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
           I L++ + S  T  ++T  A E   +      + H
Sbjct: 665 ISLVLFENSDGTAMLKTSTAPECKNQDDPEKKIMH 699


>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1283

 Score =  217 bits (552), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 242/987 (24%), Positives = 419/987 (42%), Gaps = 144/987 (14%)

Query: 501  FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYELVELPG----------- 546
            + F + D L N+GP++D + G      + D      S  +N ELV   G           
Sbjct: 376  YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQPVSSILANLELVTTQGYGKAGGLAILR 435

Query: 547  ---------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                             G  +VY K  +  +   S        Y  YL++S      + +
Sbjct: 436  REIDPFVIDSLMIKDTDGARSVYVKDPKLPSQSGSLPLNPGSNYDHYLLLSKSKGLDKEK 495

Query: 587  TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLS 644
            ++V   +    E T++ ++   + RTI  G L    RV+QV +   R  D G  + Q   
Sbjct: 496  SVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLASGTRVVQVLKGEVRSYDSGLGLAQIFP 555

Query: 645  FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
                +      SE  +V+  S ADPYVL+   D SI LL  D S      +T   I S+ 
Sbjct: 556  VWDEDM-----SEEKSVVHTSFADPYVLIIRDDQSILLLQADDSGDLDEAETDGIINSTT 610

Query: 705  KPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP-LDQGD-IYSVVCYESGA 762
                S +LY DK                     +    +G P + Q D +   +      
Sbjct: 611  --WISGSLYQDKY-------------------RSFKSHEGPPNMKQSDNVLLFLLSSESK 649

Query: 763  LEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
            L +F +PN    VFT +                   D   +I S+         +E I  
Sbjct: 650  LYVFHLPNAREPVFTTESI-----------------DLLPQILSTEPPPRRVTYRETITE 692

Query: 822  MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
            + V +L      +    P+L    ++  ++ Y+ Y +             ST R  S   
Sbjct: 693  LLVADLG----DSVSRSPYLILRSSNSDLILYEPYHYTS-----------STERQFS--G 735

Query: 882  VSASRLRNLRFSRTPLDAYTREETPHGAPCQRIT----IFKNISGHQGFFLSGSRPCWCM 937
            +   ++ N  F ++  ++   +   H A C  I+    +  ++ G++  F+ G+ PC+ +
Sbjct: 736  LRFVKIANHHFPKSHSESNAGK---HPANCTAISKPLRVLGDVCGYRTVFMPGNSPCFII 792

Query: 938  VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
                 +     L   ++ + +  +   C  GF+YV +  ++++C+ P  + +D  W  +K
Sbjct: 793  KSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARK 852

Query: 998  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1057
            I L      + Y +    Y +  +  V        +L  D E+  +  N  +S +     
Sbjct: 853  IGLGEQVDAVEYSSSSETYVIGTNQKV------DFNLPEDDEIHPEWRNEVISFLP---- 902

Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
              +++  V++L P      W    +  ++++E  + V+ + L  +  T E +  + +GTA
Sbjct: 903  -QIDKGSVKLLTPRT----WSIIDSYNLRTAERIMCVKCLNLEVSEITHERKDTIVVGTA 957

Query: 1117 YVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKELKGAISALASL--QGHLLI 1169
              +GED+AARG + +F         D P+    +  +  +E+KGA+++L+ +  QG L+ 
Sbjct: 958  LTKGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGSLIA 1017

Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1225
            A G K I+   K  G+ L  +AF D    YV  L  +K     ++GD  K ++F  + E+
Sbjct: 1018 AQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKELKGTGMCIMGDALKGLWFAGYSEE 1075

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
              +L+L +KD G+L   A +FL DG+ L ++V+D+  NI +  Y P+   S KG +LL R
Sbjct: 1076 PYKLSLFSKDDGTLQVMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHR 1135

Query: 1286 AEFHVGAHVTKFLRLQMLAT-SSDRTGAAPGSDKTNRFALLFGTL----DGSIGCIAPLD 1340
            + F  G   +    L   AT SS R  A P     +    L   L     GSI  I P+ 
Sbjct: 1136 STFQTGHFASTMTLLPRTATSSSQRPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVS 1195

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            E ++RRL +LQ +L +++ H  GLNPR+FR   S+G   R     +VD +L+  +  L  
Sbjct: 1196 ETSYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGIGGR----GMVDGDLVKRWLDLGT 1251

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALG 1427
            + + EIA++ G    +I ++L  +  G
Sbjct: 1252 QRKAEIANRVGADVWEIRADLEAIGKG 1278



 Score = 47.4 bits (111), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 45/180 (25%), Positives = 84/180 (46%), Gaps = 21/180 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V    +++++ +     GS   +   +T+ +          L LV  Y L G +  L
Sbjct: 28  NLIVAKTTLLQVFNLVNVVYGSGPGQPDEKTRSQY-------TKLVLVAEYALSGTITDL 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   H +  TS+H +E  + +++     +
Sbjct: 81  GRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIHYYERDD-VNISPWTPN 137

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIESSH 234
            A  P  + VDP  RC  VL +G + + IL   Q G  LV D+        F + +E  H
Sbjct: 138 LASCPSYLTVDPNSRC-AVLNFGKKNLAILPFHQVGDDLVMDD--------FDSDVEEQH 188


>gi|154320778|ref|XP_001559705.1| hypothetical protein BC1G_01861 [Botryotinia fuckeliana B05.10]
          Length = 1153

 Score =  214 bits (546), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 272/1232 (22%), Positives = 503/1232 (40%), Gaps = 181/1232 (14%)

Query: 293  KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDS 351
            K    I S   LP+D ++++ +  P+GG L+VG N  IH      +  +A+N +A     
Sbjct: 10   KASTTILSVGGLPYDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTG 69

Query: 352  SQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP- 408
               L ++   + L+      L  +N   L+   +GD+ +L+   DGR V  L + + +  
Sbjct: 70   FSLLDQADLDLRLEGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAE 129

Query: 409  ---SVLT---SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
               ++LT   S ++++G    F+GS + DS+++ +   SG +       +     E D  
Sbjct: 130  LGGAILTGAASCVSSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVD-- 187

Query: 463  STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
                      +   D + G+  ++  + +N T S  KT  ++F + DS+VNI P+ + ++
Sbjct: 188  -EAMFDEEDLEDDDDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAF 246

Query: 521  G---LRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
            G   L +  D        QS  +LV                          +LP  +GIW
Sbjct: 247  GEAALSLGKDEELKSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIW 306

Query: 552  TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
            T+  K  + +G   +  +         D +Y   +I+S   +A   + E+A     D   
Sbjct: 307  TMSAKRPAPKGLQVNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGF 366

Query: 598  EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
            E     ++    G TI AG L    RV+Q+ +   R  DG   + Q L     + E+G+ 
Sbjct: 367  EALTGTEFEPAAGSTIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA- 423

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
                 ++S S ADP++LL   D SI +   D       ++    I  S K ++ C LY D
Sbjct: 424  --EPKIISASFADPFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD 480

Query: 716  KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
                       +D+  S   GE             ++   +    GAL I+ +P+ +   
Sbjct: 481  ------YSGAFSDSK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPV 520

Query: 776  TVDK---FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
             V +   FV           + A +++ TEI                       L     
Sbjct: 521  YVAEGICFVPPVLSADYAARKSAARETLTEI-----------------------LVANLG 557

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
             +    P+L    ++  +  Y+ +  +            S S  L  S +   +++N   
Sbjct: 558  DSVSQSPYLILRPSNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHL 605

Query: 893  SRTPLDAYTREETPHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
            ++ P    + EE   GA      P + I+   N+ G+   F+ G  P + +   +     
Sbjct: 606  TQAP--DVSAEEQVDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKV 660

Query: 947  PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPH 1005
              L    + + +  H   C+ GFIY +++GI ++ Q P  +T+ D    ++KI +    H
Sbjct: 661  LSLQGTGVRSLSSFHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDVH 720

Query: 1006 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL-SSVDLHRTYTVEEYE 1064
             + Y      Y +  S               D E+    D+       ++    ++E+  
Sbjct: 721  AVAYHPPLQTYVIGTST------------FTDFELPKDDDHRKTWQEENIALKPSIEKSF 768

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDV 1123
            ++++ P      W     I ++  E    ++ + L  +  T E + L+ +GTA  +GED+
Sbjct: 769  LKLVSPVN----WSVIDAIELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDL 824

Query: 1124 AARGRVLLF---STGRNADNPQN------LVTEVYSKELKGAISALASL--QGHLLIASG 1172
            A  GR+ ++   +     D P+       + +E+ ++   G ++ L+ +  QG +L+A G
Sbjct: 825  ATTGRLYVYDVVTVVPEPDRPETNKKLKLISSEIITRGAGGPVTGLSEIGTQGFMLVAQG 884

Query: 1173 PKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQ 1228
             K ++   K  GT L  +AF D    YV S+  +      ++ D  K ++F  + E+  +
Sbjct: 885  QKCMVRGLKEDGTNLP-VAFMDMN-CYVTSVKELPGTGLCVMADALKGVWFAGYTEEPYR 942

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            + L  K    ++    + L DG  L +V +D   N+ I  Y P+  +S +G  LL R  F
Sbjct: 943  MLLFGKSAAKMEVLCADLLPDGKDLFIVAADANGNLHIMQYDPEHPKSLQGHLLLHRTTF 1002

Query: 1289 HVGAH-------VTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPL 1339
             +GAH       +     L  L T+   +      + T   +  LL  +  G++  ++PL
Sbjct: 1003 SLGAHHPTTMTLLPTTRPLPQLTTAPSPSPDPSPQEDTPSPSQPLLLTSRTGTLALLSPL 1062

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
             E  +RR  +L   L +++ H  GLNPR++R    +      G  +I+D  +L  +  L 
Sbjct: 1063 TESQYRRFGTLVSHLTNTLYHPCGLNPRAYR-IDRDANEGIVGGRTIIDGGVLGRWMELG 1121

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
             + + E+A + G    ++   L+ L  G  F+
Sbjct: 1122 SQRRGEVAGRVGVDVLELRDELSGLRGGLGFI 1153


>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
 gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
          Length = 1454

 Score =  213 bits (542), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 193/748 (25%), Positives = 344/748 (45%), Gaps = 83/748 (11%)

Query: 723  RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
            ++   DA +S+  GE  D      +D    YS      VV +++G + I  +P+   V+ 
Sbjct: 737  KRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLVVAHDNGRITIHSLPDLELVYQ 791

Query: 777  VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT--------GQGRKENIHSM------ 822
            + +F +    +VD  + E  K+ + +  ++ E+           +  ++ ++S       
Sbjct: 792  IGRFSNVPELLVDMTVEEEEKEKKAKQTAAQEKEKETEKKKDDAKNEEDQVNSEMKKLCE 851

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
            KVVE  +     + + P L AI+ D  ++ Y+ +    P+        V+  +   +  +
Sbjct: 852  KVVEAQIVGMGINQAHPVLIAII-DEEVVLYEMFASYNPQPGHLG---VAFRKLPHLIGL 907

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE 941
              S   N+   R P +     E  HG     I  F+ IS  + G  + G+ P   +V+  
Sbjct: 908  RTSPYVNIDGKRAPFEM----EMEHGKRYTLIHPFERISSINNGVMIGGAVPTL-LVYGA 962

Query: 942  --RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKI 998
               ++ H    DGSI AFT  +N N  HGF+Y+T Q   L+I ++     YD  +PV+KI
Sbjct: 963  WGGMQTHQMTIDGSIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYDMPYPVKKI 1022

Query: 999  PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHR 1056
             +  T H + Y    ++Y ++ SVP  KP N++  ++ D  QE  H+ D + +  +    
Sbjct: 1023 EVGKTVHNVRYLMNSDIYAVVSSVP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPP 1078

Query: 1057 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN------ETL 1110
             YT+  +              Q  A +P    E      V  + +   K        +T 
Sbjct: 1079 KYTLNLFSS------------QDWAAVPNTEFEFEDMEAVTAMEDVPLKSESRYGGLDTY 1126

Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQG 1165
            LA+ T    GE+V  RGR++L         P        +  +Y KE KG ++ L ++ G
Sbjct: 1127 LALATVNNYGEEVLVRGRIILCEVIEVVPEPGQPTSNRKIKVLYDKEQKGPVTGLCAING 1186

Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
             LL   G K+ + ++   +L GI+F D    YV  L+ ++   L  D  +S+  + ++E+
Sbjct: 1187 LLLSGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSIRTIALALDARESMSLIRFQEE 1245

Query: 1226 GAQLNLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
               +++ ++D     C     A+EFL+DG  +  ++SDE  NI +F Y+P+  ES  G++
Sbjct: 1246 NKAMSIASRD--DRKCAQAPMASEFLVDGMHIGFLLSDEHGNITLFSYSPEAPESNGGER 1303

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
            L  +A  ++G ++  FLR++   +  D +      +   R   +FG+LDGS G I PL E
Sbjct: 1304 LTVKAAINIGTNINAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGSFGYIRPLTE 1363

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYE 1396
             ++RRL  LQ  +    P +AGL+ +  R     Q   NG+  R    +++D +++  Y 
Sbjct: 1364 KSYRRLHFLQTFIGSVTPQIAGLHIKGARSSKPSQPIVNGRNAR----NLIDGDVVEQYL 1419

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             L + ++ ++A + G  R  IL +L  L
Sbjct: 1420 HLSVYDKTDLARRLGVGRYHILDDLMQL 1447



 Score =  188 bits (478), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 150/576 (26%), Positives = 266/576 (46%), Gaps = 81/576 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++ F+DAK+S++  ++    ++  S+H FE+    +L+ G  ++   P+V+ DP  
Sbjct: 92  QDSILMTFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFTTYFNPPIVRTDPAN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +    ++                  S++I L+ +D  + +V 
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLEGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
              LL++P P+GG +V G+NTI Y +Q+   C + LN+     D   + P        + 
Sbjct: 251 CNSLLSIPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + ++ GDL LL +V    G  V+ L+ SK   + +   +T     
Sbjct: 308 LDCSTSVYMEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG 481
             F+GSRLGDS L+++T                  ++    S K+ R    +  +  ++ 
Sbjct: 368 HLFVGSRLGDSQLLEYTL-----------------LKVTKESAKKQRLEQQNPSEIELDE 410

Query: 482 EELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
           +++ LYG A     +++ E   ++  F   D L+N+GP+K   +G R N  ++    +K+
Sbjct: 411 DDIELYGGAIEMQQNDDDEQISESLQFRELDRLLNVGPVKSMCFG-RPNYMSNDLIDAKR 469

Query: 537 SN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLIISL 583
            +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+S 
Sbjct: 470 KDPVFDLVTASGHGKNGALCVHQRSMRPEIITSSLLEGAEQLWAVGRKENESHKYLIVS- 528

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMTQD 642
             R+ ++          E   +     T+AAG L      +QV     A + DG  M Q+
Sbjct: 529 RVRSTLILELGEELVELEEQLFVTNEPTVAAGELLQGALAVQVTSTCIALVTDGQQM-QE 587

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           +              N  V+  SI DPYV +   +G
Sbjct: 588 VHI----------DSNFPVVQASIVDPYVAVLTQNG 613


>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
 gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
          Length = 1448

 Score =  211 bits (536), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 185/736 (25%), Positives = 339/736 (46%), Gaps = 69/736 (9%)

Query: 723  RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
            R+   DA +S+  GE  D      +D    YS      +V +++G L I  +P+   V+ 
Sbjct: 741  RRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLMVAHDNGRLSIHSLPDMELVYQ 795

Query: 777  VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR----W 832
            + +F +    ++D    E  K+ + +   ++++      +      K+ E  M+      
Sbjct: 796  IGRFSNVPELLMDMTTDEEEKERKAKAQQAAKDTAADEDQLTTEMKKLCERVMEAQIVGM 855

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              + S P L AI+ D  ++ Y+ +    P+        ++  +      +  S   N   
Sbjct: 856  GINQSHPVLMAIV-DEQVVMYEMFSHYNPQAGHLG---IAFRKLPHFICLRTSSHLNSDG 911

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR-ERLRVHPQLC 950
             R P +     E  +G     I  F+ IS  + G  + G+ P   +      ++ H    
Sbjct: 912  KRAPFEM----EVENGKRYTLIHPFERISSINNGVMIGGAVPTLVVYGAWGGMQTHQMTI 967

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
            DG I AFT  +N N  HGF+Y+T Q   L+I ++     Y+  +P++KI +  T H + Y
Sbjct: 968  DGPIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYEMPYPMKKIEVGRTIHNVRY 1027

Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1067
                ++Y ++ S+P  KP N++  ++ D  QE  H+ D + +  +     YT+  +    
Sbjct: 1028 LMNSDVYVVVSSIP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLF---- 1079

Query: 1068 LEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVA 1124
                 +   W+      I  +  E       V+L + +T    ET LA+GT    GE+V 
Sbjct: 1080 -----SSQDWKAVPNTEIEFEDMEAVTACEDVSLKSESTISGVETYLAVGTVNNYGEEVL 1134

Query: 1125 ARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHK 1179
             RGR++L         P    +      ++ KE KG ++ L ++ G LL   G K+ + +
Sbjct: 1135 VRGRIILCEVIEVVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLSGMGQKVFIWQ 1194

Query: 1180 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--FG 1237
            +   +L G++F D    YV  L+ ++   L  D  +S+  + ++E+   +++ ++D    
Sbjct: 1195 FKDNDLMGLSFLDMH-YYVYQLHSLRTIALACDARESMSLIRFQEENKAMSIASRDDRRT 1253

Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
            +    A +F++DG+ L  ++SDE  NI +F Y+P+  ES  G++L  RA  ++G +V  F
Sbjct: 1254 AKPPMAAQFVVDGAHLGFLLSDENGNITLFNYSPEAPESNGGERLTVRAAMNIGTNVNAF 1313

Query: 1298 LRLQ----MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1353
            LR++    +L   SD    +       R + +FG+LDGS G + PL E ++RRL  LQ  
Sbjct: 1314 LRVKGHTSLLNLQSDEEKES----VEQRMSTIFGSLDGSFGFVRPLSEKSYRRLHFLQTF 1369

Query: 1354 LVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1408
            +    P +AGL+ +  R     Q   NG+  R    +++D +++  Y  L L ++ ++A 
Sbjct: 1370 IGSVTPQIAGLHIKGARSARPAQPIVNGRNAR----NLIDGDVVEQYLHLSLYDKTDLAR 1425

Query: 1409 QTGTTRSQILSNLNDL 1424
            + G  R  I+ +L  L
Sbjct: 1426 RLGVGRYHIIDDLMHL 1441



 Score =  181 bits (460), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 155/578 (26%), Positives = 263/578 (45%), Gaps = 86/578 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++AF+DAK+S++  ++    ++  S+H FE+    +L+ G  ++   P+V+ DP  
Sbjct: 92  QDSILMAFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFINYFHPPIVRTDPSN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +    ++                  S++I L+ +D  + +V 
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
              LL +P P+GG LV G+NTI Y +Q+   C + LN+     D   + P        + 
Sbjct: 251 CTSLLPIPKPLGGALVFGSNTIVYLNQAVPPCGVVLNS---CYDGFTKFPLKDMKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD A + ++++    +  + G L LL +V    G  V+ ++ S+   + +   +T     
Sbjct: 308 LDCATSVYMEDGRIAVGGRDGVLYLLRLVTSSGGATVKSMEFSRVWETSIAYCLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
             F+GSRLGDS LV++T    T  S     ++++ G+IE D                   
Sbjct: 368 HLFIGSRLGDSQLVEYTLLKMTKESAKRQKIEKDPGEIELDE------------------ 409

Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
             +++ LYG A     +++ E   ++  F   D L N+GP+K   +G R N  +S     
Sbjct: 410 --DDMELYGGAIEMQLNDDEEQILESLEFRELDRLRNVGPVKSMCFG-RPNYMSSDLAEM 466

Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
           K+ +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+
Sbjct: 467 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSILEGAEQLWAVGRKENESHKYLIV 526

Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
           S   R+ ++          E   +     T+AAG L      +QV     A + DG  M 
Sbjct: 527 S-RVRSTLVLELGEELVELEEQLFVTNEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 584

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           Q++              N  V+  SI DPYV +   +G
Sbjct: 585 QEVHI----------DSNFPVVQASIQDPYVAVLTQNG 612


>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
            Silveira]
          Length = 1387

 Score =  210 bits (535), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 153/533 (28%), Positives = 266/533 (49%), Gaps = 41/533 (7%)

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
            + +I G++  F+SGS PC+ M          +L   ++ + +  H   C  GF YV +  
Sbjct: 878  YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASN 937

Query: 977  ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
            ++++C+LP  + +DN W  +K+ +      + YFA   +Y L  S  V   L +      
Sbjct: 938  MVRMCRLPGNTRFDNSWVTRKVHVGDQIDCVEYFAHSEIYALGSSHKVDFKLPE------ 991

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            D E+  +  +  +S +       +E   +++L P      W    +  +  +E  + ++ 
Sbjct: 992  DDEIHPEWRSEVISFMP-----QLERGCIKLLSPRT----WSVVDSYELGDAERVMCMKT 1042

Query: 1097 VTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ----NLVTEVYSK 1151
            + +  +  T E + +L +GTA V+GED+  RG + +F     A +P     N   ++++K
Sbjct: 1043 INMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEIIEVAPDPDRPETNRKLKIFAK 1102

Query: 1152 E-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
            + +KGA++A++ +  QG L++A G K ++   K  G+ L  +AF D    YV  L  ++ 
Sbjct: 1103 DDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVKVLKELQG 1160

Query: 1207 --FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
                ++GD  K I+F  + E+  +L L  KD   L   A +FL DG  L ++V+D+   I
Sbjct: 1161 TGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAADFLPDGKRLYILVADDDCTI 1220

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN---- 1320
             +  Y P+   S KG +LL R+ FH+G H T  + L +   SS  +   PG D  +    
Sbjct: 1221 HVLEYDPEDPTSSKGDRLLHRSSFHMG-HFTSTMTL-LPQHSSSPSADDPGEDDMDVDYV 1278

Query: 1321 --RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
               + +L  + +GSIG + PL E ++RRL +LQ +LV S+ H  GLNP+++R   S+G  
Sbjct: 1279 PKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVTSMEHPCGLNPKAYRAVESDGFG 1338

Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
             R     IVD  LL  +  + ++ + EIA + G     I  +L  ++ G  FL
Sbjct: 1339 GR----GIVDGNLLLRWLDMGVQRKAEIAGRVGADIESIRVDLEKISGGLDFL 1387



 Score =  116 bits (290), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 175/731 (23%), Positives = 291/731 (39%), Gaps = 94/731 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     G+    N+ +  R   ++      L LV  Y L G +  L
Sbjct: 28  NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   HG+   S+H +E  E +H       
Sbjct: 85  GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
               P L+ VDP  RC  +L +G+  + IL   Q G  LV     GD D    G      
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200

Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                     +     SS V+ L  LD  + H     F++ Y EP   IL+    T +  
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +  +      S  ++    +    + +   LP D +K++ +P PIGG L++G+N  IH  
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L  D    LL    G + +L 
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380

Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           +  DGR V     +L   K   S+L +  +   ++G    F GS   DSLL+ +   S  
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGW---SRP 437

Query: 444 SMLSSGLKEEFGDI---EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
           S L    K E  D    +              D     VN   LS   S +N     +  
Sbjct: 438 SQLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495

Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
           F F   D L N+GP+ D + G               R +AD        + N        
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555

Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                   + +++    G+W++   +      DS+        Y  YL+ S      + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610

Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
           ++V        E  ++ ++   +  T+  G L G  RV+QV +   R  D +     +  
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669

Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
            P   E    S+  +V+S S A+PYVL+   D S+ LL  D S     V     I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725

Query: 706 PVSSCTLYHDK 716
            +S C LY DK
Sbjct: 726 WLSGC-LYLDK 735


>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
 gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
          Length = 1387

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 160/559 (28%), Positives = 275/559 (49%), Gaps = 49/559 (8%)

Query: 891  RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
            RF  +P  AY     PH    + +  + +I G++  F+SGS PC+ M          +L 
Sbjct: 860  RFDPSP-KAYM----PHS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLR 911

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYF 1010
              ++ + +  H   C  GF YV +  ++++C+LPS + +DN W  +K+ +      + YF
Sbjct: 912  GEAVSSLSSFHIPACEKGFAYVDASNMVRMCRLPSNTRFDNSWVTRKVHVGDQIDCVEYF 971

Query: 1011 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP 1070
            A   +Y L  S  V   L +      D E+  +  +  +S +       +E   +++L P
Sbjct: 972  AHSEIYALGSSHKVDFKLPE------DDEIHPEWRSEVISFMP-----QLERGCIKLLSP 1020

Query: 1071 DRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRV 1129
                  W    +  +  +E  + ++ + +  +  T E + +L +GTA V+GED+  RG +
Sbjct: 1021 RT----WSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSI 1076

Query: 1130 LLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KW 1180
             +F     A +P     N   ++++K+ +KGA++A++ +  QG L++A G K ++   K 
Sbjct: 1077 YVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKE 1136

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
             G+ L  +AF D    YV  L  ++     ++GD  K I+F  + E+  +L L  KD   
Sbjct: 1137 DGSLLP-VAFMDMQ-CYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEY 1194

Query: 1239 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1298
            L   A +FL DG  L ++V+D+   I +  Y P+   S KG +LL R+ FH G H T  +
Sbjct: 1195 LQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTG-HFTSTM 1253

Query: 1299 RLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
             L +   SS  +   P  D  +       + +L  + +GSIG + PL E ++RRL +LQ 
Sbjct: 1254 TL-LPEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQS 1312

Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
            +LV S+ H  GLNP+++R   S+G   R     IVD  LL  +  + ++ + EIA + G 
Sbjct: 1313 QLVTSMEHPCGLNPKAYRAVESDGFGGR----GIVDGNLLLRWLDMGVQRKAEIAGRVGA 1368

Query: 1413 TRSQILSNLNDLALGTSFL 1431
                I  +L  ++ G  FL
Sbjct: 1369 DIESIRVDLETISGGLDFL 1387



 Score =  113 bits (282), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 172/731 (23%), Positives = 294/731 (40%), Gaps = 94/731 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     G+    N+ +  R   ++      L LV  Y L G +  L
Sbjct: 28  NLIVAKTSILQVFSLVNVAYGTSAPPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    ++++++  +AK+S++E+D   HG+   S+H +E  E +H       
Sbjct: 85  GRVKI--LDSRSGGEALLVSTRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE-----DTFGSGGG---- 225
               P L+ VDP  RC  +L +G+  + IL   Q G  LV DE     D    G      
Sbjct: 142 LRLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDEDLDEKPEGASNIPA 200

Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                     +     SS V+ L  LD  + H     F++ Y EP   IL+    T +  
Sbjct: 201 QAAVANDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +  +      +  ++    +    + +   LP D +K++ +P PIGG L++G+N  IH  
Sbjct: 261 LHDRKDIVSYAVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L  D    LL    G + +L 
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380

Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           +  DGR V     +L   K   S+L +  +   ++G    F GS   DSLL+ ++  S  
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGWSRPS-Q 439

Query: 444 SMLSSGLK---EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
           SM    ++   + FG  +              D     VN   LS   S +N     +  
Sbjct: 440 SMRKPKVESADDVFG--DHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495

Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
           F F   D L N+GP+ D + G               R +AD        + N        
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555

Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                   + +++    G+W++   +      DS+        Y  YL+ S      + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610

Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
           ++V        E  ++ ++   +  T+  G L G  RV+QV +   R  D +     +  
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669

Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
            P   E    S+  +V+S S A+PYVL+   D S+ LL  D S     V     I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725

Query: 706 PVSSCTLYHDK 716
            +S C LY DK
Sbjct: 726 WLSGC-LYLDK 735


>gi|346319828|gb|EGX89429.1| protein CFT1 [Cordyceps militaris CM01]
          Length = 1452

 Score =  207 bits (527), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 330/1440 (22%), Positives = 574/1440 (39%), Gaps = 223/1440 (15%)

Query: 91   VLMDGISAASLELVCHYRLHGNVESLAIL-----SQGGADNSRRRDSIILAFEDAKISVL 145
            +L D      L LV    + G +  LA L     S GG       ++++LA+  AK+ + 
Sbjct: 94   LLRDRSQHTKLVLVAELPVAGTIIGLARLKLPHTSSGG-------EALLLAYRGAKMCLT 146

Query: 146  EFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMI 202
            E++     L   S+H +E  E   L+        G  V   + DP  RC         + 
Sbjct: 147  EWNPRRAALETVSIHFYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNLA 203

Query: 203  ILKASQGG--------------------SGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
            IL   Q                      + L  + D  G   G  ++   S V+ L  LD
Sbjct: 204  ILPFRQAEEDLEMEDWDEALDGPKPPKEASLATNGDANGDANGTQSQHSPSFVLRLPLLD 263

Query: 243  --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
              + H     F+H Y EP   IL   + T        H T  +  L +    +    I S
Sbjct: 264  PTLLHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHLTYKVFTLDLKQ--RASTTILS 321

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA-----VSLDSSQE 354
               LP D  +++ +P+P+GG L+VGAN  IH      +  +A+N  A      SL+   E
Sbjct: 322  VTGLPQDLSRVIPLPTPVGGALLVGANELIHIDQSGKANGVAVNPMARQMTSFSLNDQSE 381

Query: 355  LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL 411
            L   ++ +E  A     +++   LL      L +++   DGR V  + L   S+ N   L
Sbjct: 382  L---NYRLEGCAIEPVSMESGELLLILNDASLAIVSFKIDGRTVSGISLVPVSQENGGNL 438

Query: 412  ----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
                 S I+ IG S  F+GS  GDS+++ ++        S   +++   ++A+       
Sbjct: 439  LKSHVSCISRIGKSSMFIGSEYGDSVVLGWS-----RKQSQEKRKKSRVLDAELALDVDD 493

Query: 468  RRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                     D + G E  +   S + N  +      F ++DSL+ + P+ D + G     
Sbjct: 494  IDLDDFDEDDDLYGTESTAAKPSLATNGVTKGGELIFRLQDSLLCLAPIHDVAPGKAVFP 553

Query: 522  -------LRINAD-----ASATGISKQS-----NYEL-------VELPGCKGIWTVYHKS 557
                   LR         A A G  K       N E+        E P  +G WT+  K 
Sbjct: 554  LDSEEVVLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWTMCVKK 613

Query: 558  ----SRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
                + G NA  S  + YD  E +   +I  +      ET+D+        +        
Sbjct: 614  PLPKALGSNAVVS--SEYDSMELYDRFMIVAKVDLDGYETSDVYALTDAGFESLKDTEFE 671

Query: 607  -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
               G T+ AG +  + R+IQV +   R  DG   ++Q L       +  +G+E   V+S 
Sbjct: 672  PAAGFTVMAGTMGKQMRIIQVLKSEVRCYDGDLGLSQILPM----MDEDTGAE-PRVVSA 726

Query: 665  SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
            SIADPY+++   D SI +               A I+S+ +      +  DKGP   ++ 
Sbjct: 727  SIADPYLMVIRDDNSIFI---------------AKIDSNDE---LDEVEKDKGPLASIK- 767

Query: 725  TSTDAWLSTGVGEAIDGADGGPL--DQGDIYSVVCY---ESGALEIFDVPNFNCVFTVDK 779
                 W  TG   A       P   D+G    ++ +    +GAL I+D+ N +       
Sbjct: 768  -----W-QTGCLYADHDGHFQPKQPDEGSSPRILMFLMSTTGALHIYDLDNLS------- 814

Query: 780  FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
                       Y+ E L  S     S++  G     KE +  + V +L           P
Sbjct: 815  --------EPVYVAEGLT-STPPFLSANFTGRKAAAKETLTEILVADLG----DVVAKSP 861

Query: 840  FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
            +L        +  Y+   +  P ++S        S +L     + S +     +    D 
Sbjct: 862  YLILRHDTDDLTLYEPVRYHEPNSSS-----APLSDTLFFKKSTNSTIAKSAPASDKEDD 916

Query: 900  YTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV 959
             T+++     P Q   +  N+ G+   FLSG  P + +   + +     L    +   + 
Sbjct: 917  ETQQK--RFVPLQ---LCANVGGYSAVFLSGDSPSFILKSAKSIPRIVGLQGQGVQGMST 971

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPL 1018
             H   C+ GFIY  ++GI ++ QLP+ + Y      V+KIPL    +++++ +    Y  
Sbjct: 972  FHTEGCDRGFIYADTKGIARVSQLPTDTNYAELGISVKKIPLDCDVNRVSFHSHTATY-- 1029

Query: 1019 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
            I +    +P           E+    D H   +   ++   T+    ++++ P      W
Sbjct: 1030 IAACSTREPF----------ELPKDDDYHKEWARETVNFAPTMPRGILKLISP----AAW 1075

Query: 1078 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
                ++ ++S E   ++  + L  +  TKE   ++A+G+A  +GED+  RGRV +F    
Sbjct: 1076 TVIHSLDLESCETIESMMALHLEISEETKERRMVVAVGSAICKGEDLPTRGRVQVFDIVT 1135

Query: 1137 NADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILHKWTGTELNG- 1187
                P     N   ++ +KE   +G +++L+ +   G LLIA G K ++    G   +G 
Sbjct: 1136 VIPEPGRPETNKRLKLLAKEELPRGGVTSLSEIGTSGLLLIAQGQKCMVR---GLREDGG 1192

Query: 1188 ---IAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
               +AF D    +++ +  ++     L+ D  K ++F  + E+     +L K  G +   
Sbjct: 1193 LLPVAFLDMN-CHILGVRELRGTGLCLMADAFKGMWFAGYTEEPYTFKVLGKSGGQIPML 1251

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH--VTKFLRL 1300
              +FL DG  L+++  D   ++ +F + P   +S +G  LL R  F +  +   T  L  
Sbjct: 1252 VADFLPDGEDLNMIGVDADGDLHVFEFNPDHPKSLQGHLLLHRTTFSLSPNEPTTTVLLE 1311

Query: 1301 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
            + +  S  +     G++  +   LL     G +  + PL E  +RRL SL  +L+ +V  
Sbjct: 1312 RTIPASQPQPQGTTGAETPH--TLLLSCPTGQLAALTPLSESAYRRLLSLANQLMPAVVP 1369

Query: 1361 VAGLNPRSFRQFHSNGK---AHRPGPDS------IVDCELLSHYEMLPLEEQLEIAHQTG 1411
              GL+P++ R     G    A   G ++      IVD  +L+ +  L   ++ E+A ++G
Sbjct: 1370 YGGLHPKAHRLPEGRGAQSHARAVGVETAASGRMIVDGAVLARWTELGAAKRAEMATKSG 1429


>gi|400597740|gb|EJP65470.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1444

 Score =  207 bits (527), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 330/1450 (22%), Positives = 575/1450 (39%), Gaps = 236/1450 (16%)

Query: 85   GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
            GET   +L D      L LV    + G V  LA L     ++    ++++LA+  AK+ +
Sbjct: 85   GET--LLLRDRAQNTKLVLVAEIPVAGTVIGLARLKLQNTESGG--EALLLAYRGAKMCL 140

Query: 145  LEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQM 201
             E++     L   S+H +E  E   L+        G  V   + DP  RC         +
Sbjct: 141  TEWNPQKAALDTVSIHYYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNL 197

Query: 202  IILKASQGGSGL-VGDEDTFGSG-----------GGFSARIESSH----VINLRDLD--M 243
             IL   Q    L + D D    G            G     ES H    V+ L  LD  +
Sbjct: 198  AILPFRQAEEDLEMEDWDEALDGPKPAKEAALATNGDDHETESQHSPSFVLRLPLLDPTL 257

Query: 244  KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
             H     F+H Y EP   IL   + T        H T  +  L +    +    I S   
Sbjct: 258  LHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHMTYKVFTLDLKQ--RASTTILSVTG 315

Query: 304  LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            LP D  +++ +P+P+GG L+VG N  IH      +  +A+N  A  + S     +S  + 
Sbjct: 316  LPQDLKRVIPLPTPVGGALLVGENELIHIDQSGKANGVAVNPMARQMTSFSLADQSELNY 375

Query: 363  ELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL----TS 413
             L+  A     +++   LL      L +++   DGR V  + L   S+ N   L     S
Sbjct: 376  RLEGCAIEPISMESGELLLILNDASLAIISFKIDGRTVSGISLAAVSQENGGNLLKSRVS 435

Query: 414  DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
             I+ IG +  F+GS  GDS+++ ++        S   +++   ++ D      L     D
Sbjct: 436  CISRIGKASMFIGSESGDSVVLGWS-----RKQSQEKRKKSRALDTD------LALDVED 484

Query: 474  ALQDMVNGEELSLYGSASNNTESAQKTFS--------FAVRDSLVNIGPLKDFSYGLRIN 525
               D    E+  LYG+ S   + +Q            F ++D+L+ + P+ D + G  + 
Sbjct: 485  IDLDDDFDEDDDLYGTESAAAKPSQAGAGATKGGEPVFRLQDALLCLAPIHDVAPGKAVF 544

Query: 526  AD-----------------ASATGISKQS-----NYEL-------VELPGCKGIWTVYHK 556
                               A A G  K       N E+        E P  +G W +  K
Sbjct: 545  PSDSEEAFLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWAMCVK 604

Query: 557  SSRGHNADSSRM--AAYD--DEYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
                    SS +  + YD  ++Y  ++I++ +      ET+D+        +        
Sbjct: 605  KPVPKALGSSAVISSEYDSTEQYDRFMIVA-KVDLDGYETSDVYALTDAGFESLKDTEFE 663

Query: 607  -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
               G T+ AG +  + R++QV +   R  DG   ++Q L       +  +G+E   V+S 
Sbjct: 664  PAAGFTVMAGTMGKQMRIVQVLKSEVRCYDGDLGLSQILPM----LDEDTGAE-PRVVSA 718

Query: 665  SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
            SIADPY+++   D S+ +           + +   +E  +K         DKGP      
Sbjct: 719  SIADPYLMIIRDDNSVFI---------AKIGSNDELEEVEK---------DKGP------ 754

Query: 725  TSTDAWLSTGVGEAIDGA--DGGPLDQGDIYSVVCYES--GALEIFDVPNFNCVFTVDKF 780
              +  W +  +    DG      P D     +++   S  GAL ++D+ N +        
Sbjct: 755  LVSTKWQTGCLYTDYDGTFQAKKPDDNASPRTMMFLMSTAGALHMYDLDNLS-------- 806

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                      Y+ E L  S     S++  G     KE +  + V +L           PF
Sbjct: 807  -------EPVYVAEGLT-STPPFLSANFTGRKAAAKERLTEILVADLG----DVVSKSPF 854

Query: 841  LFAILTDGTILCYQAYLFEGPENTS---------KSDDPVSTSRSLSVSNVSASRLRNLR 891
            L        +  Y+   ++ P ++S         K     + ++S S  +      +  R
Sbjct: 855  LILRHDTDDLTLYEPVRYQEPNSSSPPLTDTLFFKKSANATIAKSASAFDKEEDETQQRR 914

Query: 892  FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
            F   PL            PC       N+ G+   FLSG  P + +   + +     L  
Sbjct: 915  F--VPLQ-----------PC------GNVGGYSTVFLSGDSPSFVLKSAKSIPRIVGLQG 955

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYF 1010
              +   +  H   C+ GFIY  ++GI ++CQLP+ + Y      V+KIPL    +++++ 
Sbjct: 956  QGVQGMSTFHTAGCDRGFIYADTKGIARVCQLPTDTNYAELGISVKKIPLDCDVNRVSFH 1015

Query: 1011 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL-HRTYTVEEYEVRILE 1069
            +    Y  I +    +P           E+    D H   + ++     T+    ++++ 
Sbjct: 1016 SHTATY--IAACSTREPF----------ELPKDDDYHKEWAREVVSFAPTMPRGMLKLIS 1063

Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
            P      W    ++ ++S E   ++  + L  +  TKE   L+A+G+A  +GED+  RGR
Sbjct: 1064 P----AAWTVIHSLDLESCETIESMMALHLEISEETKERRMLVAVGSAICKGEDLPTRGR 1119

Query: 1129 VLLFST-------GRNADNPQNLVTEVYSKELKGAISALASL--QGHLLIASGPKIILHK 1179
            V +F         GR   N + L  +   +  +G +++L+ +   G LLIA G K ++  
Sbjct: 1120 VQVFDIVTVIPEPGRPETN-KRLKLQAKEELPRGGVTSLSEIGTSGLLLIAQGQKCMVR- 1177

Query: 1180 WTGTELNG----IAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLA 1233
              G   +G    +AF D    +++ +  ++     L+ D  K ++F  + E+     +L 
Sbjct: 1178 --GLREDGGLLPVAFLDMN-CHILGVRELRGTGLCLMADAFKGMWFAGYTEEPYTFKVLG 1234

Query: 1234 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
            K  G +     +FL DG  LS++  D   ++ +F + P   +S +G  L+ R  F +  +
Sbjct: 1235 KSGGQIPMLVADFLPDGEDLSMIGVDADGDLHVFEFDPDHPKSLQGHLLIHRTTFSLSPN 1294

Query: 1294 --VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
               T  L  + +  S  +     G++  +   LL     G +  + PL E  +RRL SL 
Sbjct: 1295 EPTTTVLLERTIPASQPQPKGTTGAETPH--TLLLSCPTGQLAALTPLSESAYRRLLSLT 1352

Query: 1352 KKLVDS-VPHVAGLNPRSFR-------QFHSN--GKAHRPGPDSIVDCELLSHYEMLPLE 1401
             +++ + VPH  GL+P++ R       Q HS   G         IVD  +L+ +  L   
Sbjct: 1353 NQVLPAVVPH-GGLHPKAHRLPEGRGAQSHSRAVGVETAASGRMIVDGAVLARWTELGAA 1411

Query: 1402 EQLEIAHQTG 1411
            ++ E+A ++G
Sbjct: 1412 KRAEMALKSG 1421


>gi|342877552|gb|EGU79002.1| hypothetical protein FOXB_10431 [Fusarium oxysporum Fo5176]
          Length = 1399

 Score =  206 bits (523), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 322/1424 (22%), Positives = 547/1424 (38%), Gaps = 193/1424 (13%)

Query: 69   YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GG 123
            Y  R  ++   ES   G     V  D  +   L LV    L G V  LA +       GG
Sbjct: 65   YDHRANDDDGLESSFLGGESMLVRTDRTNLTKLVLVAELPLSGTVTGLAKVKTKHSKCGG 124

Query: 124  ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPL 182
                   +++++A++ AK+ +  +D     L   S+H +E  E LH      SF      
Sbjct: 125  -------EALLIAYKAAKLCMAVWDPEKSNLETISIHYYEKEE-LHGAPWEVSFDEYTNY 176

Query: 183  VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
            ++ DP  RC         + IL   Q    L  D+      G    + ES+ V N  D D
Sbjct: 177  LEADPGSRCAAFQFGSRNLAILPFRQAEEDLEMDDWDEDLDGPRPVK-ESTTVAN-GDSD 234

Query: 243  MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
                          EP   IL   +          H T  +  L +    +    I S  
Sbjct: 235  TLEPA---------EPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ--RASTTILSVT 283

Query: 303  NLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS 361
            +LP D +K++ +P+P+GG L++G N  IH      S  +A+N+ A  + S     ++  +
Sbjct: 284  DLPRDLFKIIPLPAPVGGSLLIGENELIHVDQSGKSNGVAVNSMARQITSFSLTDQADLN 343

Query: 362  VELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSDI 415
            + L+        ++N   LL    G + ++T   DGR V     R+   +   +++ S  
Sbjct: 344  LRLEHCVIETLSIENGELLLVLNDGRIGIVTFQIDGRTVSGLTVRMVADENGGNLIKSRA 403

Query: 416  TT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
            +T   +G + +F+GS +GDS+++ +T   G        K    D E              
Sbjct: 404  STASKLGKNAYFVGSEVGDSVVLGWTRKMGQEKRR---KPRLIDAEIGLEMDDLDLEDED 460

Query: 473  DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-LRINADASAT 531
            D   D+   E  +   + + N        SF + D+L++I P+KD + G +  + D+   
Sbjct: 461  DEDDDLYGTESAAAKPAQALNGGGKTGELSFRIHDTLLSIAPIKDLTPGKVSFHPDSEEA 520

Query: 532  GISKQSNYEL----------------------------VELPGCKGIWTVYHKS----SR 559
             +S+    +L                             E P  +  WT+  K     + 
Sbjct: 521  TLSQGVVSDLHLACVVGRGKAGSLAILNRNIQPKIIGRFEFPEARDFWTMSVKKPMPKAL 580

Query: 560  GHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT------EVTESVDY-FVQGRTI 612
            G N           ++  Y+I++ +      ET+D+        E  +  ++    G T+
Sbjct: 581  GGNVGMGNEYETFGQHDKYMIVA-KVDLDGYETSDVYALTGAGFETLKDTEFDPAAGFTV 639

Query: 613  AAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
             AG +  + R+IQV +   R  DG   +TQ L     + E+G+      V S SIADPY+
Sbjct: 640  EAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRVTSASIADPYL 694

Query: 672  LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
            LL   D S+ L   D +     V+   A   + K  S C     KG     +  ++D   
Sbjct: 695  LLIRDDSSLMLAQIDSNNELEEVEKMDATLQNTKWHSGCLYADTKGA---FQPNASDKGA 751

Query: 732  STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHI-VD 789
             T                  I   +   +GAL ++ +P+ +  V+  +       H+  D
Sbjct: 752  ETE----------------KIMMFLLSSTGALHVYALPDLSKPVYVAEGLCYVPPHLSAD 795

Query: 790  TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGT 849
              +R  L                   KEN+  + V +L           P+L        
Sbjct: 796  YTLRRGLA------------------KENLREILVADLG----DTTSQSPYLILRNQTDD 833

Query: 850  ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
            +  Y+      P    +     S S +L+    S + L  +       D     E P   
Sbjct: 834  LTIYE------PLRHVRDGGETSLSATLTFKKTSNTTLATIPVETEQDDV----EQPRFV 883

Query: 910  PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGF 969
            P +      NI+G+   FL G  P + +   + +     L    +   +  H   C+ GF
Sbjct: 884  PLRPCA---NINGYSTVFLPGPSPSFVIKSSKSIPRVIGLQGLGVRGMSTFHTEGCDRGF 940

Query: 970  IYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
            IY   +GI ++ QLP  + +      V+K+PL A    I Y      Y  I    + +P 
Sbjct: 941  IYADDKGIARVTQLPPDTNFTELGISVKKVPLGADVRGIAYHQPTGAY--IAGCMISEPF 998

Query: 1029 NQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
                      E+    D H   +   L    T+    ++++ P      W     + ++S
Sbjct: 999  ----------ELPKDDDYHKEWAKETLTFPPTMPRGVLKLISPVS----WTVIHEVELES 1044

Query: 1088 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1146
             E+   ++ + L  +  TKE   L+ +GTA  +GED+  RGRV +F        P    T
Sbjct: 1045 CESIECMKTLHLEVSEDTKERRFLVTVGTAVSKGEDLPIRGRVHVFDIVTVIPEPGRPET 1104

Query: 1147 EVYSKEL------KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPL 1196
                K +      +G ++A++ +  QG +L+A G K ++   K  G+ L  +AF D    
Sbjct: 1105 NKRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLKEDGSLLP-VAFLDMSCH 1163

Query: 1197 YVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
               +  + +    L+ D  K ++F  + E+     +L K  G L     +FL DG  L++
Sbjct: 1164 VSTARELPRTGLCLMADAFKGVWFAGYTEEPYTFKVLGKSHGRLPVLVADFLPDGEDLAI 1223

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH--VTKFLRLQMLATSSDRTGAA 1313
            V +D   ++ I  + P+  +S +G  LL R  F V  +   T  L  + L  S       
Sbjct: 1224 VAADADGDLHILDFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLLLPRTLPPSHPPP--- 1280

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
                +     LL  +  G +  + PL E T+RRL S+  +L+ ++    GLN ++    H
Sbjct: 1281 ----QDPPHILLLASSSGHLATLVPLPETTYRRLLSVTNQLLPALTPHGGLNAKA----H 1332

Query: 1374 SNGKAHRP------GPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
                  RP      G  +IVD  +L+ +  L   ++ EIA + G
Sbjct: 1333 RLPDGIRPVGVEAAGGRTIVDGAILARWAELGAAKRAEIAGKGG 1376


>gi|452979579|gb|EME79341.1| hypothetical protein MYCFIDRAFT_104419, partial [Pseudocercospora
            fijiensis CIRAD86]
          Length = 1342

 Score =  204 bits (519), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 331/1438 (23%), Positives = 568/1438 (39%), Gaps = 253/1438 (17%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV  Y L G V SLA       DN+   D+II+AF DAK+S++E+D   H +   S+H
Sbjct: 46   LSLVAEYPLAGTVISLA--RTKPRDNASGGDAIIIAFRDAKLSLVEWDPENHRISTISLH 103

Query: 161  CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
             +E    +    G        ++ VDP  RC  +     Q+ IL     G  L G+E+  
Sbjct: 104  YYEGDNVITPPFGPTLAESESILTVDPSSRCAALKFGARQLAILPFRHFGDELAGEEEED 163

Query: 221  GSG----GGFSARIESSH--------------VINLRDLD--MKHVKDFIFVHGYIEPVM 260
            G         S R ES+H              V+ L  LD  + H     F+H Y EP  
Sbjct: 164  GFENEPMSAVSKRRESTHLNGEEEQTPYKASFVLPLTALDPTLSHTVHLAFLHEYREPTF 223

Query: 261  VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
             IL          +  +      +  ++    +    + +   LP   +K+  +P PIGG
Sbjct: 224  GILSAPMEPSNALLEERKDVLTYTVYTLDLEQRASTNLITVPKLPSTLWKVKPLPLPIGG 283

Query: 321  VLVVGANTIHYHSQSASC-ALALNNYA-------VSLDSSQELPRSSFSVELDAAHATWL 372
             L+VG N + +  QS    A A+N +A       +S  S   L     S+E     +  L
Sbjct: 284  ALLVGTNELVHVDQSGKANATAVNEFAKLESDFGMSDQSHLNLKLEDCSIETIDPKSGQL 343

Query: 373  QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-------TNPSVLTSDITTIGNSLFFL 425
                 LL T  G L ++     GR +  ++++        T+ S   S I  + N   F+
Sbjct: 344  -----LLVTSDGALAIIEFKLLGRSISAINVTPVTEDNGVTSLSAAPSCIANLANGSVFI 398

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD------IEADAPSTKRLRRSSSDALQDMV 479
            GS  G S L+ ++  +          +  G        +A       L  ++ +A +  V
Sbjct: 399  GSEDGASSLMGWSQPTAPLTRKRSHAQMLGKDGDEEDEDAIEEDDDDLYDAAPEAKKRAV 458

Query: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD-----ASATGIS 534
            +  EL           S+   + F +RD L ++GP+     G +  +      A+ATG  
Sbjct: 459  SDTELG----------SSNAAYQFEIRDHLQSLGPIHRMCVGRQGKSSDKLQLAAATG-R 507

Query: 535  KQS------NYELVELPGCKGIWTVYHKSSRGHNADSS---RMAAYDDEYHAYLIISLEA 585
            KQS      N ++V  PG         ++SR  NA S+   R     DE       +L+ 
Sbjct: 508  KQSGRLTLLNRDVVPTPG---------RASRFENAKSAWAVRAHQAGDES------TLDN 552

Query: 586  RTMVLETADLLT-EVTESVDYFV--------------QGRTIAAGNLFGRRRVIQVFERG 630
            +  V E A+    E++ + ++FV              +G T+    L   + ++Q  ++ 
Sbjct: 553  KLFVFEGANTKAYEISSADEHFVEDRYPEHAKSEWESEGETLEVVALADGKIIVQFRKQE 612

Query: 631  ARILDGSY-MTQDLSFGPSNSESGSGSENS-TVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
             R  D +  M Q L   P   E    +EN   ++ +++ DPYVL+   D SI++L     
Sbjct: 613  VRTYDANLAMNQIL---PMEDE----AENELNIVHIAVCDPYVLVIRDDSSIQIL----- 660

Query: 689  TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
                SVQ      +  +P+ +     +K             WL+  +         G L 
Sbjct: 661  ----SVQG-----NELEPLEAEGSVAEK------------KWLTGSLY-------AGTLT 692

Query: 749  QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI-VDTYMREALKDSETEINSSS 807
            QG     +    G L  F +P+   +F +         I VD   R A            
Sbjct: 693  QGSAAVFLLNADGGLHAFALPDLQPLFAIPTLPHLPPVIAVDAAQRRA------------ 740

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
                  G +E +  + V +L         ++P+L        ++ Y+ + +  P+ + + 
Sbjct: 741  ------GTRETLTEVLVSDLGQHGV----TQPYLVLRTAMDDVVLYEPFHY--PQTSGRK 788

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
                S  + L        R R + FS  P  + +  E+    P    ++   I  +    
Sbjct: 789  ----SWHQDL--------RFRKVPFSHIPKYSESIAESQSARPPPLKSV--KIDTYSAIA 834

Query: 928  LSGSRPCWCM----VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
            + G+ PC  +       + L +        +     ++ V C +GF  + +   L+  QL
Sbjct: 835  IPGAPPCLLLKEPSTLPKVLEIRQSAELNRLSMLCPINRVGCENGFFMINADEELEEQQL 894

Query: 984  PSGSTYDNYWPVQKIPLKATPHQ------ITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1037
            P  + Y   W V ++P+   P+Q      I Y  E+ LY       V+    +V     +
Sbjct: 895  PLNTWYGTGWSVHQVPI-GHPNQIEDVRRIAYHEERGLY-------VVATCREVDFYFAE 946

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP------DRAGGPWQTRAT----IPMQS 1087
            ++  H   +      D+     V +Y V ++        D    P+    T    + +++
Sbjct: 947  EDGRHPEQD------DITLRPKVPQYNVHLISAISHHIIDTVHMPYLAAITDLQVMMLEA 1000

Query: 1088 SENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL--- 1144
            SEN             T E + L+ +  A  +GED+ A+G + ++       +P      
Sbjct: 1001 SEN-------------THEQKPLVVVSAAAQRGEDMPAKGTLYVYDIIDVVPDPDIAESG 1047

Query: 1145 --VTEVYSKELKGAISALAS--LQGHLLIASGPKIILH--KWTGTELNGIAFYDAPPL-Y 1197
              + ++  +E +GAI+ALA     G +  A G K+++   K  G+ L  +AF DA    +
Sbjct: 1048 VKLHQLAREENRGAITALAGPFPGGFIGTAQGLKVMIRGMKEDGSCLP-VAFLDAQSYTH 1106

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG-SLDCFATEFLIDGSTLSLV 1256
            V+     +   L GD  K ++F  + E+  ++ +L K     ++  + EFL     L +V
Sbjct: 1107 VLKTLPGRGMWLAGDAWKGLWFGGFTEEPYRVTVLGKAPKMHMEVMSAEFLPFDGALYIV 1166

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDR-----T 1310
            V D   ++ +  Y P+  +S  G +LL R+ FH+G   T  + L   LA+ + +      
Sbjct: 1167 VLDADCDMHVLQYDPENPKSLNGMRLLHRSTFHIGHFTTNSMLLPSTLASFAAQQHEMMN 1226

Query: 1311 GAAPGSDKTNRFA-LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
            G +    K +    +L  +  G+IG I PLDE  +RRL +LQ  L   + H AGLNPR++
Sbjct: 1227 GGSKAEVKPDPLQHVLTSSTSGAIGLITPLDEQAYRRLSALQTHLTSILEHAAGLNPRAY 1286

Query: 1370 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
            R   S       G   +VD  L+     L    + ++  + G +   + S+L  +  G
Sbjct: 1287 RSIESESFG---GARGVVDGLLVRRIHELGAARRADVLGRAGVSAWGLRSDLEIIGGG 1341


>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1408

 Score =  202 bits (515), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 149/529 (28%), Positives = 265/529 (50%), Gaps = 39/529 (7%)

Query: 919  NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
            ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++
Sbjct: 900  DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 959

Query: 979  KICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1038
            ++C+ P  + +D  W  +KI L      + Y +    Y L  S  V   L +      D 
Sbjct: 960  RMCRFPRNTHFDGSWAARKIGLGEQVDSVEYSSSSETYVLGTSQKVDFKLPE------DD 1013

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            E+  +  N  +S         +++  V++L P      W    +  +++SE  + V+ + 
Sbjct: 1014 EIHPEWRNEVISFFP-----QIDKGSVKLLNPRT----WSIIDSYQLRTSERVMCVKCLN 1064

Query: 1099 L-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKE 1152
            L  +  T E + ++A+GTA  +GED+AARG + +F   +     D P+    +  +  +E
Sbjct: 1065 LEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIKVVPEVDRPETNRKLKLIAKEE 1124

Query: 1153 LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN-- 1206
            +KGAI++L+ +  QG L+ A G K I+   K  G+ L  +AF D    YV  L  +K   
Sbjct: 1125 VKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVSVLKELKGTG 1182

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
              ++GD  K ++F  + E+  +L+L +KD GSL   A +FL DG  L ++V+D+  NI +
Sbjct: 1183 MCIMGDALKGLWFAGYSEEPYKLSLFSKDDGSLQVMAADFLPDGKRLYIMVADDDCNIHV 1242

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKTNRF- 1322
              Y P+   S KG +LL R+ FH G   + +T   R  +L+   +    A   D +    
Sbjct: 1243 LQYDPEDPGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLSQGPETEANAMDLDLSGPLH 1302

Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
             +L  +  GSI  I P+ E+ +RRL +LQ ++++++ H  GLNPR+FR   S+G   R  
Sbjct: 1303 QVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGGR-- 1360

Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
               +VD +L+  +  L  + + EIA + G    +I ++L   A+G + L
Sbjct: 1361 --GMVDGDLVQKWLDLGTQRKAEIASRVGADVWEIRADLE--AIGKAGL 1405



 Score =  125 bits (314), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 184/750 (24%), Positives = 304/750 (40%), Gaps = 121/750 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V    ++++Y +     GS   ++  +T+ +        + L LV  Y L G +  L
Sbjct: 28  NLIVAKTTLLQVYNLVNVVYGSSPGQSDEKTRSQY-------SKLVLVAEYALSGTITDL 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    ++I++A  +AK+S++E+D   H +  TS+H +E  + +H+     +
Sbjct: 81  GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137

Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-------------------- 214
            A  P  + VDP  RC  VL +G + + IL   Q G  LV                    
Sbjct: 138 LAACPSHLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMDDFDSDHDDERQIDTNHTA 196

Query: 215 --GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
              DE     G  +     SS V+ +  L+  M H     F++ Y EP   IL+ +    
Sbjct: 197 EERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQVAAS 256

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-I 329
           +  +  +      S  ++    +    + S   LP+D +K++ +P P+GG L+VG+N  +
Sbjct: 257 SALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSNELV 316

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLV 387
           H      + A+ +N +A    S     +S   + L+      L  +N   LL    G + 
Sbjct: 317 HVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNGVMA 376

Query: 388 LLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
           +++   DGR V  + L            +T PS        +G    F GS  GDS+L+ 
Sbjct: 377 VVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SAPVGRGKIFFGSEEGDSILI- 431

Query: 437 FTCGSGTSMLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY----- 487
                G S LS+G K     E G+   D  +         D   D  +  E  LY     
Sbjct: 432 -----GWSRLSAGAKVSPAPETGE---DNVAELSEDEEDDDDDDDEEDAYEDDLYATPVT 483

Query: 488 -GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNY 539
            G    NT S   T    + F + D L N+GP++D + G      + D   +  S  +  
Sbjct: 484 PGINPRNTASMNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYL 543

Query: 540 ELVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDD 573
           ELV   G                            G+ +V+ K  +  +   S  A    
Sbjct: 544 ELVTTQGYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPSQSGSLPANAGS 603

Query: 574 EYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVF 627
            Y  YL++S      + +++V + +    E T + ++   + RTI  G L G  RV+QV 
Sbjct: 604 NYDHYLLLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVL 663

Query: 628 ERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
           +   R  D G  + Q       ++     SE  +V+  S A+PYVL+   D SI LL  D
Sbjct: 664 KGEVRSYDSGLGLAQIYPVWDEDT-----SEERSVMHASFAEPYVLIIRDDSSILLLQAD 718

Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
            S     ++T   I+S+     S +LY DK
Sbjct: 719 ESGDLDEIETDGIIKSTT--WISGSLYQDK 746


>gi|147827332|emb|CAN62175.1| hypothetical protein VITISV_001516 [Vitis vinifera]
          Length = 1989

 Score =  201 bits (511), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 121/228 (53%), Positives = 148/228 (64%), Gaps = 49/228 (21%)

Query: 383  TGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
            +G+L+LLT+V DGRVV +L LSK+  SV TS I  IG+SL F GS+LGDSLLVQF     
Sbjct: 1657 SGELLLLTLVCDGRVVYKLGLSKSRASVFTSGIAAIGSSLSFPGSQLGDSLLVQF----- 1711

Query: 443  TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
            T++ SS ++++ GD E D PSTKR RRSSSDALQDM NG++L LY               
Sbjct: 1712 TAIPSSSVEKKVGDSEGDVPSTKRSRRSSSDALQDMDNGDKLPLY--------------- 1756

Query: 503  FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
              V DSL+N+GPLKDF+YGLRIN D  ATGI KQSNYEL                     
Sbjct: 1757 --VSDSLINVGPLKDFAYGLRINTDLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRP 1814

Query: 542  -----VELPGCKGIWTVYHKSSRGHNADSSRMA-AYDDEYHAYLIISL 583
                 VELPGCKGIWTVYHK++RGHNADS +M+  +D E+ A++  SL
Sbjct: 1815 ERITEVELPGCKGIWTVYHKNTRGHNADSIKMSHVFDLEFRAFIFFSL 1862


>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
 gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
          Length = 1394

 Score =  201 bits (510), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 154/552 (27%), Positives = 267/552 (48%), Gaps = 60/552 (10%)

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-- 974
            + +I G++  F+SGS PC+ M          +L   ++ + +  H   C  GF YV +  
Sbjct: 866  YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASV 925

Query: 975  -----------------QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYP 1017
                             Q ++++C+LP  + +DN W  +K+ +      + YFA   +Y 
Sbjct: 926  CVPKQYFVPWNKLILVIQNMVRMCRLPGNTRFDNSWVTRKVHVGDQIDCVEYFAHSEIYA 985

Query: 1018 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
            L  S  V   L +      D E+  +  +  +S +       +E   +++L P      W
Sbjct: 986  LGSSHKVDFKLPE------DDEIHPEWRSEVISFMP-----QLERGCIKLLSPRT----W 1030

Query: 1078 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
                +  +  +E  + ++ + +  +  T E + +L +GTA V+GED+  RG + +F    
Sbjct: 1031 SVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEIIE 1090

Query: 1137 NADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNG 1187
             A +P     N   ++++K+ +KGA++A++ +  QG L++A G K ++   K  G+ L  
Sbjct: 1091 VAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLP- 1149

Query: 1188 IAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
            +AF D    YV  L  ++     ++GD  K I+F  + E+  +L L  KD   L   A +
Sbjct: 1150 VAFMDMQ-CYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAAD 1208

Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
            FL DG  L ++V+D+   I +  Y P+   S KG +LL R+ FH+G H T  + L +   
Sbjct: 1209 FLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMG-HFTSTMTL-LPQH 1266

Query: 1306 SSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP 1359
            SS  +   PG D  +       + +L  + +GSIG + PL E ++RRL +LQ +LV S+ 
Sbjct: 1267 SSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVTSME 1326

Query: 1360 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
            H  GLNP+++R   S+G   R     IVD  LL  +  + ++ + EIA + G     I  
Sbjct: 1327 HPCGLNPKAYRAVESDGFGGR----GIVDGNLLLRWLDMGVQRKAEIAGRVGADIESIRV 1382

Query: 1420 NLNDLALGTSFL 1431
            +L  ++ G  FL
Sbjct: 1383 DLEKISGGLDFL 1394



 Score =  116 bits (290), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 175/731 (23%), Positives = 291/731 (39%), Gaps = 94/731 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     G+    N+ +  R   ++      L LV  Y L G +  L
Sbjct: 28  NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   HG+   S+H +E  E +H       
Sbjct: 85  GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
               P L+ VDP  RC  +L +G+  + IL   Q G  LV     GD D    G      
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200

Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                     +     SS V+ L  LD  + H     F++ Y EP   IL+    T +  
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +  +      S  ++    +    + +   LP D +K++ +P PIGG L++G+N  IH  
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L  D    LL    G + +L 
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380

Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           +  DGR V     +L   K   S+L +  +   ++G    F GS   DSLL+ +   S  
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGW---SRP 437

Query: 444 SMLSSGLKEEFGDI---EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
           S L    K E  D    +              D     VN   LS   S +N     +  
Sbjct: 438 SQLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495

Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
           F F   D L N+GP+ D + G               R +AD        + N        
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555

Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                   + +++    G+W++   +      DS+        Y  YL+ S      + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610

Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
           ++V        E  ++ ++   +  T+  G L G  RV+QV +   R  D +     +  
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669

Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
            P   E    S+  +V+S S A+PYVL+   D S+ LL  D S     V     I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725

Query: 706 PVSSCTLYHDK 716
            +S C LY DK
Sbjct: 726 WLSGC-LYLDK 735


>gi|38014465|gb|AAH60475.1| LOC398931 protein, partial [Xenopus laevis]
          Length = 363

 Score =  199 bits (507), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 109/303 (35%), Positives = 178/303 (58%), Gaps = 23/303 (7%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LEL+  +   GN+ S+A +   GA    +RD+++L+F++AK+SV+E+D   H L+  S+H
Sbjct: 66  LELMASFSFFGNIMSMASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLH 121

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVG 215
            FE PE   L+ G       P V+VDP GRC  +L+YG Q+++L       ++   GLVG
Sbjct: 122 YFEEPE---LRDGFVQNVHIPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHEGLVG 178

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
           +        G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW GR
Sbjct: 179 E--------GQKSSFLPSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGR 230

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           V+ +  TC I A+S++   K HP+IWS  +LP+D  + LAVP P+GGV++   N++ Y +
Sbjct: 231 VAVRQDTCSIVAISLNIMQKVHPIIWSLNSLPYDCTQALAVPKPVGGVVIFAVNSLLYLN 290

Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
           QS     ++LN+      S    P+    + LD + AT++  D  ++S K G++ ++T++
Sbjct: 291 QSVPPYGVSLNSLTNGTTSFPLKPQEEVRITLDCSQATFISYDKMVISLKGGEIYVVTLI 350

Query: 393 YDG 395
            DG
Sbjct: 351 TDG 353


>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus flavus NRRL3357]
 gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus flavus NRRL3357]
          Length = 1204

 Score =  199 bits (507), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 144/547 (26%), Positives = 262/547 (47%), Gaps = 46/547 (8%)

Query: 898  DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
            D  + EE     P   + I  NISG    F  G  P + +           L  G   + 
Sbjct: 678  DQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKGGYAQSL 734

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYP 1017
            +      C  GFI + S+G++ +CQ+P G   D  W +Q+IP+      + Y +   +Y 
Sbjct: 735  SKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQIPIGEQVDHLAYSSSSGMYV 794

Query: 1018 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
            +  S            L  D E+  +  N   S         V+   ++++ P      W
Sbjct: 795  IGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRSSLKVVSPKT----W 839

Query: 1078 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
                +  +  +E+ + V+ ++L  +  T E + ++ +GTA+ +GED+A+RG V +F   +
Sbjct: 840  TVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCVYVFEVIK 899

Query: 1137 NADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNG 1187
               +P+    +     V  + +KGA++AL+ +  QG L++A G K I+   K  G+ L  
Sbjct: 900  VVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKEDGSLLP- 958

Query: 1188 IAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
            +AF D        +++VK        ++ D  K ++F  + E+  +++L AKD   L+  
Sbjct: 959  VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDLDYLEVL 1014

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
            A +FL DG+ L ++V+D   N+ +  Y P+  +S  G +LLSR++FH G  ++    L  
Sbjct: 1015 AADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLLPR 1074

Query: 1303 LATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1358
             + SS++      A     K  R  +L  + +GS+G +  + E ++RRL +LQ +L +++
Sbjct: 1075 TSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTNTI 1134

Query: 1359 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
             H  GLNPR+FR   S+G A R     ++D +LL  +  +  + ++EIA + G    +I 
Sbjct: 1135 EHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRKVEIASRVGANEWEIK 1190

Query: 1419 SNLNDLA 1425
            ++   ++
Sbjct: 1191 ADFEAIS 1197



 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 92/355 (25%), Positives = 159/355 (44%), Gaps = 40/355 (11%)

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
           ++I+LAF +AK++++E+D   +G+   S+H +E  +        +  + G ++ VDP  R
Sbjct: 88  EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147

Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
           C  V  +G++ + IL   Q G  LV D+      +  GS G                 A 
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206

Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
             SS V+ L  LD  + H     F++ Y EP   IL+ +  T    +  +      +  +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
           +    +    + S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326

Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
             + S     +S  ++ L+      L   N   LL   TG++VL+    DGR V  + + 
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386

Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
              P           S    +G+   FLGS   DS+L+      G S+ SSG K+
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKK 435


>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 1377

 Score =  198 bits (503), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 146/529 (27%), Positives = 263/529 (49%), Gaps = 49/529 (9%)

Query: 919  NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
            ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++
Sbjct: 877  DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 936

Query: 979  KICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1038
            ++C+ P  + +D  W  +KI L      + Y +    Y L  S                Q
Sbjct: 937  RMCRFPRNTHFDGSWAARKIGLGEQVDSVEYSSSSETYVLGTS----------------Q 980

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            +V  ++   +    ++H  +  EE  V++L P      W    +  ++++E  + V+ + 
Sbjct: 981  KVDFKLPEDD----EIHPEWRNEE-SVKLLNPRT----WSIIDSYQLRTAERVMCVKCLN 1031

Query: 1099 L-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKE 1152
            L  +  T E + ++A+GTA  +GED+AARG + +F   +     D P+    +  +  +E
Sbjct: 1032 LEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIKVVPEVDRPETNRKLKLIAKEE 1091

Query: 1153 LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN-- 1206
            +KGAI++L+ +  QG L+ A G K I+   K  G+ L  +AF D    YV  L  +K   
Sbjct: 1092 VKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVSVLKELKGTG 1149

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
              ++GD  K ++F  + E+  +L+L +KD GSL   A +FL  G  L ++V+D+  NI +
Sbjct: 1150 MCIMGDALKGLWFAGYSEEPYKLSLFSKDDGSLQVMAADFLPHGKRLFIMVADDDCNIHV 1209

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKTNRF- 1322
              Y P+   S KG +LL R+ FH G   + +T   R  +L+   +    A   D +    
Sbjct: 1210 LQYDPEDPGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLSQGPEAEANAMDLDSSGPLH 1269

Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
             +L  +  GSI  I P+ E+ +RRL +LQ ++++++ H  GLNPR+FR   S+G   R  
Sbjct: 1270 QVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGGR-- 1327

Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
               +VD +L+  +  L  + + EIA + G    +I ++L   A+G + L
Sbjct: 1328 --GMVDGDLVQKWLDLGTQRKAEIASRVGADVWEIRADLE--AIGKAGL 1372



 Score =  122 bits (305), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 181/747 (24%), Positives = 302/747 (40%), Gaps = 110/747 (14%)

Query: 53  GPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           G V    V    ++++Y +     GS   ++  +T+ +        + L LV  Y L G 
Sbjct: 4   GAVAAFRVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGT 56

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           V  L  +     D+    ++I++A  +AK+S++E+D   H +  TS+H +E  + +H+  
Sbjct: 57  VTDLGRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISP 113

Query: 173 GRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GDEDTFGSGGGFSAR 229
              + A  P  + VDP  RC  VL +G + + IL   Q G  LV GD   F S      +
Sbjct: 114 WTPNLAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGD---FDSDHDEERQ 169

Query: 230 IESSHVINLRD--------------------------LDMKHVKDFIFVHGYIEPVMVIL 263
           I+++H    RD                            M H     F++ Y EP   IL
Sbjct: 170 IDTNHTAEERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGIL 229

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
           + +    +  +  +      S  ++    +    + S   LP+D +K++ +P P+GG L+
Sbjct: 230 YSQVAASSALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALL 289

Query: 324 VGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLS 380
           VG+N  +H      + A+ +N +A    S     +S   + L+      L  +N   LL 
Sbjct: 290 VGSNELVHVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLV 349

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRL 429
              G + +++   DGR V  + L            +T PS        +G    F GS  
Sbjct: 350 LLNGVMAVVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEE 405

Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
           GDS+L+ ++  S  + +     E   D  A+    +       DA +D +    ++  G 
Sbjct: 406 GDSMLIGWSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGI 463

Query: 490 ASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELV 542
            S NT S   T    + F + D L N+GP++D + G      + D   +  S  +  ELV
Sbjct: 464 NSRNTASVNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELV 523

Query: 543 ELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYH 576
              G                            G+ +V+ K  +      S        Y 
Sbjct: 524 TTQGYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYD 583

Query: 577 AYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERG 630
            YL++S      + +++V + +    E T + ++   + RTI  G L G  RV+QV +  
Sbjct: 584 HYLLLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGE 643

Query: 631 ARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPST 689
            R  D G  + Q       ++     SE  +V+  S ADPYVL+   D SI LL  D S 
Sbjct: 644 VRSYDSGLGLAQIYPVWDEDT-----SEERSVVHASFADPYVLIIRDDSSILLLQADESG 698

Query: 690 CTVSVQTPAAIESSKKPVSSCTLYHDK 716
               ++T   IES+     S +LY DK
Sbjct: 699 DLDEIETDGIIESTT--WISGSLYQDK 723


>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
          Length = 1065

 Score =  197 bits (500), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 186/696 (26%), Positives = 307/696 (44%), Gaps = 99/696 (14%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
            +  E+G + I+ +P  + V+ V K     +H+ D    +   D E       +  S++  
Sbjct: 445  IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMT 500

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
             T   + E +    ++EL M     +  RP LF +L D T+  Y+ + +    N      
Sbjct: 501  DTFAAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGH 551

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
                 + L  + V+    R+ RF    LD     E+   A   +  +  F+ I     G 
Sbjct: 552  LAVRFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 605

Query: 927  FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPS 985
            F+  S PC   +     R+HP   DG I++FT  +N  C +GFIY+T  + ++++ +L  
Sbjct: 606  FICSSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKL-- 663

Query: 986  GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1045
                    PV K+ +     +   F E             KP         D  V  ++D
Sbjct: 664  --------PVTKMCVLINDDKT--FEEHE-----------KP---------DTFVYPEMD 693

Query: 1046 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1105
             + L      + Y+ E+++           P Q    +  +         VV     T  
Sbjct: 694  QYKL------QLYSPEDWK-----------PVQNVEVLFEEFEVVTCCEEVVLRSEGTVS 736

Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISAL 1160
              +  LA+GTA   GE+V  RGR+++         P     ++ +  +Y KE KG +++L
Sbjct: 737  GVQNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSL 796

Query: 1161 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
             S  G+LL   G K+ +  +    L GI+F D    YV  L  V+N  L  D+++S+  L
Sbjct: 797  CSCNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYVHQLIGVRNLALACDMYRSVALL 855

Query: 1221 SWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             ++E+   L+L ++D  S      A +F+ID   +  V+SDE  NI IF Y P+  ES  
Sbjct: 856  RYQEEYKALSLASRDMRSDVQPPMAAQFIIDNKQMGFVMSDEAANIAIFNYLPETLESLG 915

Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL-----LFGTLDGSI 1333
            G+KL  RAE ++G  V  F+R++   +S        G  +   F+L     LF +LDGS 
Sbjct: 916  GEKLTLRAEINIGTVVNSFIRVKGHISS--------GFVENELFSLERQSVLFASLDGSF 967

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
            G + PL E  FRRL  LQ+ +   VP  AGLN +  R        H     ++VD +++ 
Sbjct: 968  GFLRPLTEKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVM 1027

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
             Y  L L E+ ++A + GT+R  I+ +L ++   T+
Sbjct: 1028 QYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTA 1063



 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 90/387 (23%), Positives = 150/387 (38%), Gaps = 49/387 (12%)

Query: 349 LDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
           +D   + P   F    + LD    T +  +  LL  + G L  L +V D    V+ L+L 
Sbjct: 1   MDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLVTDATNSVKSLELK 60

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
               +V+   +T+      F+GSRL DS+ +       T             ++  AP  
Sbjct: 61  FQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------------LDESAPKK 107

Query: 465 KRLRRSSSDALQDMVNGEELSLYGSA---SNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
            +L  +  +A +D    E+  LYG         +SA++  +  V D L+N+GP K  + G
Sbjct: 108 IKLN-TELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDKLLNVGPCKKITGG 162

Query: 522 LRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSSRMAAY-------- 571
               +        K   ++LV   G    G   ++ +S R     SS +           
Sbjct: 163 CPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSSSIEGVVQYWAVGR 222

Query: 572 -DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L      +QV    
Sbjct: 223 REDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGELADGGLAVQVTTSS 281

Query: 631 ARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPS 688
             ++      Q +    +            V S SI DPY+ +   +G + +  L   P 
Sbjct: 282 LVMVAEGQQIQHIPLQLT----------FPVRSASIVDPYIAICTQNGRLLMYELTSHPH 331

Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHD 715
                +     +     P++S ++Y D
Sbjct: 332 VHLKEIDISKRLRHETSPITSLSIYRD 358


>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1392

 Score =  196 bits (497), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 154/533 (28%), Positives = 265/533 (49%), Gaps = 45/533 (8%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLCDGSIVAFTVLHNVNC--NHG 968
            + I  NISG    F+ G+   +  VFR   +  P   +L  G     +   +V+   ++G
Sbjct: 877  LRILPNISGFSTIFMPGASSSF--VFRTA-KSSPHIIRLRGGFTRWLSSFDSVDTGRDNG 933

Query: 969  FIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
            FIYV SQ  ++ CQLPS + +D  W ++K+P++     + Y      Y L  S       
Sbjct: 934  FIYVDSQNCVRACQLPSQTQFDYPWTLRKVPIEEQVDFLAYSTSSETYVLGTS------R 987

Query: 1029 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1088
                 L    ++  +  N  LS         + E  ++++ P      W    + P+   
Sbjct: 988  EGDFKLPEGDDLHPEWRNEELSFCP-----KIPESSIKVVSPKT----WTIIDSYPLDPD 1038

Query: 1089 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT- 1146
            E    V+ V +  +  T E   L+ +GTA V+GED+ ARG + +F   + A +P+   T 
Sbjct: 1039 EQVTAVKNVNIEVSENTHERRDLIVVGTAIVKGEDMPARGTIYVFDVIKVAPDPEKPETG 1098

Query: 1147 ---EVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYV 1198
               ++  KE +KGA++AL+ +  QG +++A G K ++   K  G+ L  +AF D    YV
Sbjct: 1099 HKLKLIGKESVKGAVTALSGIGGQGFVIVAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYV 1156

Query: 1199 VSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
                 +K    ++LGD  K ++F  + E+  ++ L  KD   L+  A +FL DG+ L ++
Sbjct: 1157 TVAKELKGTGLVILGDAVKGLWFAGYSEEPYRMTLFGKDPEYLEVVAADFLPDGNKLYML 1216

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAA 1313
            V+D   N+ +  Y P+  +S  G +LLSR++F+ G   + VT   R  + +  ++ +   
Sbjct: 1217 VADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLLPRTAVSSERTESSEEG 1276

Query: 1314 PGSDKT-NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
               D+T  R  +L  + +GS+  +  + E ++RRL +LQ +L+++V H AGLNPR+FR  
Sbjct: 1277 MDLDETFARHQVLIASQNGSLALVTSVAEESYRRLSALQSQLINTVDHPAGLNPRAFRAI 1336

Query: 1373 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
             S+G A R     +VD  LL  +  +  + Q EIA + G T  +I ++L  + 
Sbjct: 1337 ESDGAAGR----GMVDGNLLRLWLNMGKQRQTEIAGRVGATEWEIKADLETIG 1385



 Score =  114 bits (286), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 170/731 (23%), Positives = 296/731 (40%), Gaps = 151/731 (20%)

Query: 57  NLVVTAANVIEIY--VVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRL 109
           NLVV   ++++++  V  V  +  KE     S  S + + +++++            Y L
Sbjct: 28  NLVVVRTSLLQVFSLVKIVSSQPQKEVPEPLSSQSSQPETKLVLEK----------EYPL 77

Query: 110 HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
            G V  L   S+    N+R   ++I++A  +AK+S++E+D    G+   S+H +E  +  
Sbjct: 78  SGTVTDL---SRVKILNTRSGGEAILIAVRNAKLSLIEWDPERRGISTISIHYYERDDLT 134

Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDED 218
                 +    G ++ VDP  RC  V  +G++ + IL   Q G  LV         G+  
Sbjct: 135 RSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELDGERP 193

Query: 219 TFGSGGGFSARIE-------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVIL 263
           +  SGGG  A+IE             SS V+ L  LD  + H     F++ Y EP   IL
Sbjct: 194 SQNSGGG--AQIEKRKEEPDHQTPYSSSFVLPLTALDPSLLHPISLAFLYEYREPTFGIL 251

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
           + +  T    +  +      +  ++    +    + S   LP D +K++A+P P+GG L+
Sbjct: 252 YSQVATSTALLHERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALL 311

Query: 324 VGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLS 380
           +G+N I H      + A+ +N ++  + S     +S  +  L+      L  D    LL+
Sbjct: 312 LGSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLA 371

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDS 432
             +G++ L+    DGR V  + +    P+    DI        T +G+   F+GS   DS
Sbjct: 372 LASGNMALIKFKLDGRSVSGITVHSL-PAYAGGDILKSAASCSTCLGDGNVFIGSEDADS 430

Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR---RSSSDALQDM----VNGEELS 485
           +L++++  S                     ST++ R   + ++D L D+       E+  
Sbjct: 431 VLLEWSHTSA--------------------STRKARLESKQTADGLDDLSDEDDQMEDDD 470

Query: 486 LYGSASNNTE---------SAQKTFSFAVRDSLVNIGPLKDFSYGL---RINADASATGI 533
           LY SA    +         S  + ++F + D L +IGPL+D + G      N  + AT  
Sbjct: 471 LYSSAPGPIQVDNRMGTDSSTPEFYNFRLNDKLSSIGPLRDITLGKAFSNTNRKSQATTG 530

Query: 534 SKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADSSR 567
           +  +  ELV   G                             +W+      RG       
Sbjct: 531 TVAAELELVASQGSDRGGGLVVIKREIDPLTTMSLKVDDADAVWSASVTKRRG------- 583

Query: 568 MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGR 620
            ++ D+    Y++IS    +   E  ++     +S+  F        +  T+  G+L G 
Sbjct: 584 ASSTDNPSCQYVVISRSTDSE-QEVNEVFIVEEQSLKPFKAPEFNPNEDCTVDIGSLAGN 642

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSE---SGSGSENSTVLSVSIADPYVLLGMSD 677
            R++QV     R    SY   D+  G S          S+     S S  DPY+++   D
Sbjct: 643 TRLVQVLRNEVR----SY---DIDLGLSQIYPVWDEDTSDERVAASASFIDPYLVIIRDD 695

Query: 678 GSIRLLVGDPS 688
            S+ LL  D S
Sbjct: 696 SSVLLLQADES 706


>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 1348

 Score =  194 bits (494), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 151/558 (27%), Positives = 269/558 (48%), Gaps = 59/558 (10%)

Query: 904  ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM--------VFRERLRVHPQLCDGSIV 955
            E  H  P + +    +I G++  F+ G  PC+ +        V R R +        ++ 
Sbjct: 820  EGKHPFPRKPLRALSDICGYKTVFMPGQNPCFILKSAITQPHVLRLRGK--------AVQ 871

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNL 1015
            + +  H   C  GF YV    I+++ +LPS + +D+ W  +KIPL      I Y +    
Sbjct: 872  SLSGFHIAACERGFAYVDEDNIIRMSRLPSNTRFDSTWATRKIPLGEQVDCIVYSSASES 931

Query: 1016 YPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG 1075
            Y +  SV       +   L  D E   +  N  ++ +       +E   V++L+P     
Sbjct: 932  YVIGTSV------KEDFKLPEDDESHTEWQNEFITFLP-----QLERGTVKLLDPKN--- 977

Query: 1076 PWQTRATIP----MQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1130
             W      P    ++ +E    + V+ L  +  T E + ++ +G+A V+GED+  +G + 
Sbjct: 978  -WSIADIAPSSHELEPAERITCIEVIRLEISEITHERKDMVVVGSAIVKGEDIVPKGCIR 1036

Query: 1131 LFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWT 1181
            +F       +P     N   +++++E +KGA++AL+ +  QG L++A G K ++   K  
Sbjct: 1037 VFEIIDVVPDPDHSEMNKRLKLFAREEVKGAVTALSGIGSQGFLIVAQGQKCMVRGLKED 1096

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1239
            G+ L  +AF DA   YV  L  +K     ++GD  K ++F  + E+  +L+L  K+  ++
Sbjct: 1097 GSLLP-VAFKDAQ-CYVSVLKELKGTGMCIVGDAIKGLWFTGYSEEPYKLDLFGKENENI 1154

Query: 1240 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1299
               A +FL DG+ L ++V+D+  N+ +  Y P+   S KG +LL R  FHVG H    + 
Sbjct: 1155 AVIAADFLPDGNRLYVLVADDDCNLHVLQYDPEDPSSSKGDRLLHRNVFHVG-HFASTMT 1213

Query: 1300 LQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1353
            L    + +  + A   +  T+      ++ +L     GS+G I PL+E ++RRL +LQ +
Sbjct: 1214 LLPQGSHTPHSPADRDAMDTDAPLPPSKYQILMTFQTGSVGIITPLNEDSYRRLLALQSQ 1273

Query: 1354 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
            LV+++ H  GLNPR +R   S+G   + G   ++D  LL  +  +  + + EIA + G  
Sbjct: 1274 LVNALEHPCGLNPRGYRAVESDGIGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGAD 1330

Query: 1414 RSQILSNLNDLALGTSFL 1431
               I  +L  L  G ++L
Sbjct: 1331 VGAIRMDLEKLHGGLAYL 1348



 Score =  104 bits (260), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 160/744 (21%), Positives = 297/744 (39%), Gaps = 114/744 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  + +  +  R    D    A L LV  Y++ G +  L
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSSLANHPDQKSRH---DRSQHAKLVLVAEYQVSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +    + +    D+I+++  +AK+S++E+D   HG+   S+H +E  E        + 
Sbjct: 85  ERVKISNSKSGG--DAILVSSRNAKLSLIEWDPRNHGISTISIHYYEGEESHMSPWVPDL 142

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE------------------ 217
            +    + VDP G C  +  +G+  + IL   Q G  LV D+                  
Sbjct: 143 GSCASNLTVDPNGNCA-IFNFGIHSLAILPFHQTGDDLVMDDYDSVLNGDSAADTINDTQ 201

Query: 218 -DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
             T G     S   E S V+ L  LD  + H     F+H Y EP   IL+ +        
Sbjct: 202 KPTAGDSTVHSKPYEPSFVLPLAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLS 261

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
             +      +  ++    +    + +   LP D +K++++P P+GG L++G N  +H   
Sbjct: 262 IDRKDVVSYAIFTLDLQQRASTSLLTVSRLPSDMFKVVSLPPPVGGALLIGTNELVHVDQ 321

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
              + A+ +N +A    +   + +S   + L+      L +D    LL    G + +LT 
Sbjct: 322 AGKTNAVGVNEFARQASAFSMVDQSDLEMRLEDCVVEQLGSDAGEVLLILTDGRMAILTF 381

Query: 392 VYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTS 444
             DGR V  + L     ++  S++ +  +    +G S  F GS  GDS+L+ ++  S  +
Sbjct: 382 KVDGRSVSGISLHYVAEQSGGSIIKARPSCSAGLGRSKLFCGSEEGDSILLGWSKPSSNT 441

Query: 445 MLSSGLKE---EFGDIEADAPSTKRLRR-----------SSSDALQD--MVNGEELSLYG 488
              +   E   E G  E      +               + +  LQ+  +VNG++ +   
Sbjct: 442 KKPTKANEDTNEDGTTEFSGEDEQDDDDDDIYEDDLYSANPAPTLQEKRVVNGDDTA--- 498

Query: 489 SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------LRINAD-----------ASAT 531
                       F F + D L ++GP +D + G      L+   D            +A 
Sbjct: 499 -----------DFVFKIHDRLWSLGPFRDITLGRPPKSKLKDKRDNVPSISASLELVAAR 547

Query: 532 GISKQSNYEL------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYL 579
           G  K     +            +++    G+W++     +   A ++       +Y  YL
Sbjct: 548 GFGKSGGLAVLKREIDPFTIDSLKMDNVYGVWSIRVTDPKSKEASAT---GNSRDYDKYL 604

Query: 580 II-----SLEARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARI 633
           ++     S +  ++V    +   +  ++ ++   +  TI  G L    RV+QV     R 
Sbjct: 605 LLAKAKCSDKEESVVYSVGNSGLDSIDAPEFNPNEDCTIDIGTLAAGSRVVQVLRTEIRS 664

Query: 634 LDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
            D +  +TQ       ++     SE  TV+  S A+PY+L    D S+ +L  D +    
Sbjct: 665 YDYNLGLTQIYPVWDEDT-----SEERTVVQASFAEPYLLAIRDDHSLLVLQADKTGDLD 719

Query: 693 SVQTPAAIESSKKPVSSCTLYHDK 716
            V+    + +S   VS C LY D+
Sbjct: 720 EVEI-QGLATSADWVSGC-LYEDR 741


>gi|357611296|gb|EHJ67409.1| putative cleavage and polyadenylation specific factor 1 [Danaus
            plexippus]
          Length = 328

 Score =  194 bits (494), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 108/320 (33%), Positives = 183/320 (57%), Gaps = 20/320 (6%)

Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQG 1165
            +AIGT Y  GED+ +RGR+L++        P     +N   E+Y+KE KG ++AL  + G
Sbjct: 16   IAIGTNYNYGEDITSRGRILIYDIIDVVPEPGQPLTKNRFKEIYAKEQKGPVTALTQVLG 75

Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
             L+ A G KI L +    +L G+AF D   +YV  +  VKN IL+ D++KSI  L ++ Q
Sbjct: 76   FLISAVGQKIYLWQLKDNDLVGVAFIDTQ-IYVHRMLAVKNLILVADVYKSISLLRYQHQ 134

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
               L+L+++D  +   +  +F+ID ++L  +VS+ + N  ++ + P+  ES+ GQ+L+ +
Sbjct: 135  HRTLSLVSRDLRTAQIYDMQFMIDNTSLGFLVSESEGNFAMYMHQPQARESYGGQRLIRK 194

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1345
             ++H+G  V    RL           AA G  +T+    +F TLDG +G + P+ E  +R
Sbjct: 195  CDYHLGQRVHAMFRL-----------AARGERQTH--VTMFTTLDGGVGYVLPVSEKVYR 241

Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG-PDSIVDCELLSHYEMLPLEEQL 1404
            RL  LQ  + +   H+AGLNP+++R +  + +A   G    ++D +L+S Y  +P  EQ 
Sbjct: 242  RLLMLQNVINNYCCHLAGLNPKAYRTYKVSRRALCGGAARGVLDGDLVSLYTSMPRTEQQ 301

Query: 1405 EIAHQTGTTRSQILSNLNDL 1424
            +IA + GT   +I+S+L ++
Sbjct: 302  DIARKIGTKVEEIMSDLYEI 321


>gi|452841862|gb|EME43798.1| hypothetical protein DOTSEDRAFT_79774 [Dothistroma septosporum NZE10]
          Length = 1347

 Score =  194 bits (493), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 314/1410 (22%), Positives = 560/1410 (39%), Gaps = 213/1410 (15%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV  Y L G V +LA +     D     D+++LAF+DAK++++E+D   H +   S+H
Sbjct: 51   LVLVGEYSLSGTVTNLAQVKL--PDTKTAGDALLLAFKDAKLTLIEWDPENHRISTISIH 108

Query: 161  CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
             +E    +    G        ++ VDP  RC  +     Q+ +L   Q    L  +ED  
Sbjct: 109  YYEGDNVVSQPFGPGLGECENILTVDPNWRCAALKFGTRQLAVLPFRQLDDELGVEEDGD 168

Query: 221  GSGGGFSAR-----------------IESSHVINLRDL--DMKHVKDFIFVHGYIEPVMV 261
                  + +                  ++S V+ L  L  D+++  D  F++GY E  + 
Sbjct: 169  AEPASTTLKRSESILQNVNGEVQQTPYKASFVLALSTLLEDIRYTVDLGFLYGYRESTLG 228

Query: 262  ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
            IL       +  +  +          +     +   +     LP+  +K++ +P+P+GG 
Sbjct: 229  ILSSSLQPSSSLLDIRKDELEYRMFKLELEQGESTELQVVKQLPNSLWKVVPLPAPVGGA 288

Query: 322  LVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP-RSSFSVELDAAHATWL--QNDVA 377
            L+VG N+ +H    +   ++A+N +A +L+S + +  +S  +++L+      L  ++   
Sbjct: 289  LLVGTNSFVHVDLNAKVNSVAVNEFA-ALESDRGMEDQSDLNLKLEGCSVEILDAESRQV 347

Query: 378  LLSTKTGDLVLLTVVYDGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLG 426
            L+  + G L  +     GR +Q L           DL KT PS     +  + ++  F+G
Sbjct: 348  LVVLRDGSLATIYFEQSGRSIQGLKVSRVREEHGGDLVKTAPSC----VARLDHNKVFVG 403

Query: 427  SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
            S  G S LV+++     S LS   K   G +               D         E   
Sbjct: 404  SEDGASSLVRWS--RSISTLSR--KRTHGQMLGQHGDEDDEEALEDDDDDLYDAAPETK- 458

Query: 487  YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ATGISKQS--- 537
               A++ T++ +   SF ++D L ++GP+ D   G    A          TG  + S   
Sbjct: 459  -KRATSTTDAFETPPSFQIQDVLHSLGPINDVCLGKSDGAQVDKLQMMLGTGRGRSSRIS 517

Query: 538  --NYELVELPG-------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
              N ++V +          K  W V+ K +             DD++H  L+ + + +  
Sbjct: 518  CLNRDIVPVSARKSTIGRAKSAWAVHAKRND-----------RDDDFHDNLLFAYDGQET 566

Query: 589  VLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFG 646
             +   D +  +  +   F  +G TI    L     V+Q  +   R  D    ++Q +   
Sbjct: 567  KIYDVDEVGYMERTAQEFEHEGETIDVQMLAKDTIVVQCRKSEIRTYDADLALSQIIPMV 626

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
               ++     E   ++ +S  DPY+L+  +D                             
Sbjct: 627  DEETD-----EEYEIVYLSFCDPYLLVVRND----------------------------- 652

Query: 707  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIF 766
             SS  + H +G E    +   D      +G +I     G L +      +    G + +F
Sbjct: 653  -SSIQVLHVRGKEIEPLEGEGDIAEKKWLGGSIHT---GSLTKDVPALFLLSAQGTMHVF 708

Query: 767  DVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVE 826
             +P+   V                Y   AL      ++S + +    G KE +  + V E
Sbjct: 709  SLPSLEPV----------------YHAPALPHLPPVLSSDAPQRRA-GPKEALTELLVAE 751

Query: 827  LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
            L     ++    P+L A      ++ Y+   F  PE       P  + +  +        
Sbjct: 752  LG----ASGVDTPYLVARTALDDLVLYEP--FRHPE-------PAPSDQWYT-------- 790

Query: 887  LRNLRFSRTPL-------DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
              NLRF + P+       +A  +EE+    P + I    ++  +    + GS P   ++ 
Sbjct: 791  --NLRFRKVPVTYIPKYNEAIAQEESTRPLPLRSI----HVGDYDAVTIPGSPPL--LLV 842

Query: 940  RER------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
            +E       L V        +     +H  +C  GF  V + G+L+   LP  + Y   W
Sbjct: 843  KEASSLPRVLEVRISNESNRVATLLPIHLDHCKKGFAAVNADGLLEEYHLPLSAWYGTGW 902

Query: 994  PVQKIPLKATPHQITYFA---EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
             VQ++ L +   ++ + A    + +Y +     V     +     + Q  G Q D     
Sbjct: 903  SVQQVDLGSEDLEVRHLAYHETRGVYVVATCKDVDFYFAEDDHRHLGQSGGGQDDITLRP 962

Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1110
             V  +  + V     R+++         +RA +P   +  AL V  + + +  T E +  
Sbjct: 963  QVKQYSIHLVSSKTHRVID---------SRA-MPYLEAITALQVMPLEV-SELTHEQDLR 1011

Query: 1111 LAIGTAYVQGEDVAARGRVLLFS---TGRNADNPQNLVT-EVYSKE-LKGAISALASLQG 1165
            + + TA ++GED+ ARG +++F+        D P++ +   V ++E  KGAI+ALA   G
Sbjct: 1012 ILVSTAAMRGEDMPARGAIIVFNIIDVVPAPDVPESGIKLHVNAREETKGAITALAPFPG 1071

Query: 1166 HLL-IASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNI-VKNFILLGDIHKSIYFLS 1221
              +    G KI++   K  G+ L  +AF DA     V   +      L GD  K ++F  
Sbjct: 1072 GFVGSGQGQKIMIRGLKEDGSCLP-VAFLDAQCHTTVIKTLGTSGMWLAGDAWKGLWFGG 1130

Query: 1222 WKEQGAQLNLLAK-DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
            + E+  +L +L K     ++  A EFL     L +++ D   ++ +  Y P+  +S  G 
Sbjct: 1131 FTEEPYKLTVLGKAPERQMEVMAAEFLPFDGALYILIIDADMDLHVLQYDPENPKSQNGM 1190

Query: 1281 KLLSRAEFHVGAHVTKFLRL---------QMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1331
            +LL R+ FH+G   T  L L             T+ D  G +P  + +  F +L  +L G
Sbjct: 1191 RLLHRSTFHLGHFATNMLLLPSSLNPFGENQPFTNGDTNGESP-EESSPLFHVLTTSLTG 1249

Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
            SIG I PLDE ++RRL +LQ  L   + H A LNPR++R   S       G   +VD  +
Sbjct: 1250 SIGMITPLDESSYRRLSALQTHLTTILEHPASLNPRAYRAIESESFG---GARGVVDGNI 1306

Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +     L    + ++  + G     I S+L
Sbjct: 1307 VRRINELGAARRADVLARAGADAWSIRSDL 1336


>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
 gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
          Length = 1398

 Score =  192 bits (488), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 147/538 (27%), Positives = 260/538 (48%), Gaps = 38/538 (7%)

Query: 911  CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
            C+R+    ++ G++  F+SG  PC+ ++     R H     G  V + +  H   C  GF
Sbjct: 882  CKRLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 940

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
             YV    ++++ +LPS + +D+ W  +KI        I Y +    Y +  S        
Sbjct: 941  AYVDEDNVIRMSRLPSNTRFDSGWATRKIAFGEQVDSIVYSSASECYVIGTSA------K 994

Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
            +   L  D E   +  N  ++ +       +E   V++LEP      W T  +  ++ +E
Sbjct: 995  EDFKLPEDDESHTEWRNEFITFLP-----QLERGTVKLLEPRN----WSTIDSHELEPAE 1045

Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNL 1144
              + + V+ L  +  T E + ++ +G++ V+GED+  +G + +F        P    ++ 
Sbjct: 1046 RIMCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKSK 1105

Query: 1145 VTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1199
              ++++KE +KGA++AL+ +  QG L++A G K ++   K  G+ L  +AF D    YV 
Sbjct: 1106 KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYVN 1163

Query: 1200 SLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
             L  +K     ++GD  K ++F  + E+  +L+L  K+  +L     +FL DG+ L ++V
Sbjct: 1164 VLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILV 1223

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG--AHVTKFLRLQMLATSS--DRTGAA 1313
            +D+  N+ +  Y P+   S KG +LL R+ FH G  A     L      TSS  D     
Sbjct: 1224 ADDDCNLHVLQYDPEDPSSSKGDRLLRRSVFHTGHFASTVTLLPHGAHTTSSPVDEDAMD 1283

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
              S   +++ +L     GSI  I PL E ++RRL +LQ +LV+++ H   LNPR +R   
Sbjct: 1284 TDSPPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVE 1343

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            S+G   + G   ++D  LL  +  +  + + EIA + G     I  +L  L  G ++L
Sbjct: 1344 SDGMGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRIDLEKLHGGLAYL 1398



 Score =  106 bits (265), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 176/734 (23%), Positives = 287/734 (39%), Gaps = 96/734 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  +    +  R    +    A L L   Y + G + SL
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSTTAAQPDQKGRN---ERSQHAKLVLAAEYEVPGTITSL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +    + +    D+II++  +AK+S++E+D   HG+   S+H +E  E  H+      
Sbjct: 85  QRVKISNSKSGG--DAIIVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141

Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIES-- 232
               P  +  DP G C  +  +G+  + IL   Q G  LV D+      G  SA + S  
Sbjct: 142 LGSCPSSLTADPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGNDSADVVSDP 200

Query: 233 ----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
                           S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201 QKSAPENTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
             +      S  ++    K    + +   LP D +K++ +P P+GG L++G N  +H   
Sbjct: 261 IDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
              + A+ +N +A    +     +S   + L+      L +     LL    G + +L+ 
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIIEQLGSGTGDILLILADGRMSILSF 380

Query: 392 VYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
             DGR V  + L     ++N S+  +  T   ++G +  F GS  GDS+L+ ++  S T 
Sbjct: 381 KVDGRSVSGISLHFVAEQSNGSITIARPTCSASLGRNKLFCGSEEGDSILLGWSRPSSTI 440

Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES------AQ 498
              S  K   G  E  A           D   D +  ++L     AS   E       + 
Sbjct: 441 KRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSANLASTRQEKQVVNGDSP 498

Query: 499 KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTV 553
             F F   D L ++GP +D + G    + +     S       +EL   +G        V
Sbjct: 499 ADFIFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPEIAAPLELVAARGFGKSGGLAV 558

Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-------------MVLETADLLTEVT 600
             +    +  DS +M   DD Y  + I  ++ ++             ++ +T D   +  
Sbjct: 559 LKREIDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAKTKD--DDKE 613

Query: 601 ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
           ESV Y V                +  TI  G L    RV+QV     R  D  Y      
Sbjct: 614 ESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLAAGTRVVQVLRTEIRSYD--YNLGLAQ 671

Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAIES 702
             P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ  AA   
Sbjct: 672 IYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA--- 726

Query: 703 SKKPVSSCTLYHDK 716
           S K +S C LY DK
Sbjct: 727 SGKWISGC-LYEDK 739


>gi|194374339|dbj|BAG57065.1| unnamed protein product [Homo sapiens]
          Length = 330

 Score =  191 bits (486), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 114/302 (37%), Positives = 170/302 (56%), Gaps = 35/302 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS         Y V+L
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVAL 300

Query: 350 DS 351
           +S
Sbjct: 301 NS 302


>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
          Length = 1440

 Score =  191 bits (485), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 178/623 (28%), Positives = 300/623 (48%), Gaps = 93/623 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++AF+DAK+S++  ++    ++  S+H FE+    +L+ G   +   P+V+ DP+ 
Sbjct: 91  QDSILMAFDDAKLSIITINEKERNMQTISLHAFENE---YLRDGFVKYFHPPIVRTDPEN 147

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +                  S RI S ++I L+ +D  + +V 
Sbjct: 148 RCAASLVYGKHIAILPFHEN-----------------SKRIHS-YIIPLKQIDPRLDNVA 189

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 190 DIVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 249

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
              LL +P P+GG +V G+NTI Y +Q+   C + LN+     D   + P     S  + 
Sbjct: 250 CATLLPIPKPLGGAIVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKSMKMT 306

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + T+ G+L LL +V    G  V+ L+ SK   + +   +T     
Sbjct: 307 LDCSTSVYMEDGRIAVGTRDGELFLLRLVTSSGGATVKSLEFSKVWDTSIAYTLTVCAPG 366

Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
             FLGSRLGDS L++++    T  S+    +++E   +EA+                  +
Sbjct: 367 HLFLGSRLGDSQLLEYSLIKTTRESVKRHKMEQEQNHVEAE------------------L 408

Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + ++L LYG A     +++ E   ++  F+  D L NIGP+K    G R N  ++    +
Sbjct: 409 DEDDLELYGGAIEEQQNDDEEQITESLQFSELDRLRNIGPVKSMCVG-RPNYMSNDLVDA 467

Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
           K+ +  ++++     G  G   V+ +S R     SS +            ++E H YLI+
Sbjct: 468 KRRDPVFDVITASGHGKNGSLCVHQRSLRPEIVTSSLLEGAEQLWAVGRKENESHKYLIV 527

Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQVFERG-ARILDGSYM 639
           S    T+VLE  + L E+ E +  FV G+ T+AAG L      +QV     A + DG   
Sbjct: 528 SRIRSTLVLELGEELIELEEPL--FVTGQPTVAAGELSQGAFAVQVTSTSIALVTDG--- 582

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPSTCTVSV--- 694
            Q L+    +S       N  V+  SI DPYV +   +G + L  LV +P      +   
Sbjct: 583 -QQLAEVKIDS-------NFPVVQASIVDPYVAVLTQNGRLLLYTLVSNPYMQLQEIDLA 634

Query: 695 QTPAA--IESSKKPVSSCTLYHD 715
           QTP +  I  S   ++S ++Y D
Sbjct: 635 QTPFSTFIAQSASQITSISMYAD 657



 Score =  184 bits (467), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 187/740 (25%), Positives = 334/740 (45%), Gaps = 80/740 (10%)

Query: 723  RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS 782
            ++   DA  S   GE  D  D         + V+ +E+G L +  +P    V+ + +F +
Sbjct: 736  KRLGHDAIQSGRGGEQSDAIDPSSYTSISHWLVLAHENGRLSVHSLPEMELVYQIGRFPN 795

Query: 783  GRTHIVD-----TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
                +VD           +K       +S EE      K+N    +++E  +     + S
Sbjct: 796  VPELLVDLTPEEEEKERRIKAQLAAKEASDEEQLNAEMKKNCE--RIMEAQIVGMGINQS 853

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
             P L AI+ D  ++ Y+ +            +P S    L ++         LR S    
Sbjct: 854  HPILMAIV-DEQVIMYEMFA-----------NPNSQPGHLGIAFRKLPHFICLRSSPYLK 901

Query: 898  DAYTR------EETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQ 948
                R      EE     P   I  F+ +S  + G  + G+ P   +V+     ++ HP 
Sbjct: 902  SDGKRAAFQIVEEDGKRYPL--IHSFERVSTVNNGVIIGGAVPT-LLVYGAWGGMQTHPM 958

Query: 949  LCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQKIPLKATPHQI 1007
              DGSI AFT  +  N  +GF+Y+T  +  L+I ++ +   Y+  +PV+KI +  T H +
Sbjct: 959  TIDGSIKAFTPFNIDNVPYGFVYMTQKKSELRIAKMHADFDYEMPYPVKKIEVGRTIHSV 1018

Query: 1008 TYFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYEV 1065
             Y    ++Y ++ SVP  KP N++  ++ D  QE  H+ D + +  +     YT+  +  
Sbjct: 1019 RYLMNSDVYVVVSSVP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLF-- 1072

Query: 1066 RILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGED 1122
                   +   W+      I  +  E       V L + +T    ET LAIGT    GE+
Sbjct: 1073 -------SSQDWKAVPNTEISFEDMEAVTACEDVALKSESTHTGFETYLAIGTVNNYGEE 1125

Query: 1123 VAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIIL 1177
            V  RGR++L         P        +  ++ KE KG ++ L +++G LL   G K+ +
Sbjct: 1126 VLVRGRIILAEVIEVVPEPGQPTSNRKIKVLFDKEQKGPVTGLCAMEGLLLSGMGQKVFI 1185

Query: 1178 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
             ++   +L G++F D    YV  L+ +++  L  D  +S+  + ++E+   +++ ++D  
Sbjct: 1186 WQFKDNDLMGLSFLDMH-YYVYQLHSLRSIALACDARESMSLIRFQEENKAMSVASRD-- 1242

Query: 1238 SLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
               C     A +F++DG+ +  ++SDE  NI +F YAP+  ES  G++L  RA  ++G +
Sbjct: 1243 DRKCAQAPMAAQFMVDGAHIGFLLSDENGNITLFNYAPEAPESNGGERLTVRAAINIGTN 1302

Query: 1294 VTKFLRLQ----MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
            +  FLR++    +L        AA       R + +F +LDGS G I PL E ++RRL  
Sbjct: 1303 INAFLRVKGHTALLNLHEFEKEAA-----EQRMSTIFASLDGSFGFIRPLTEKSYRRLHF 1357

Query: 1350 LQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
            LQ  +      +AGL+ +  R     Q   NG+  R    +++D +++  Y  L   ++ 
Sbjct: 1358 LQTFIGSVSQQIAGLHIKGARSAKPPQPIVNGRNAR----NLIDGDVVEQYLNLSTYDKT 1413

Query: 1405 EIAHQTGTTRSQILSNLNDL 1424
            ++A + G  +  I+ +L +L
Sbjct: 1414 DLARRLGVGKYHIIDDLMEL 1433


>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum CBS
            127.97]
          Length = 1267

 Score =  190 bits (483), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 144/538 (26%), Positives = 260/538 (48%), Gaps = 38/538 (7%)

Query: 911  CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
            C+ +    ++ G++  F+SG  PC+ ++     R H     G  V + +  H   C  GF
Sbjct: 751  CKLLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 809

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
             YV    ++++ +LPS + +D+ W  +KI L      I Y +    Y +  S        
Sbjct: 810  AYVDEDNVIRMSRLPSNTRFDSGWATRKIALGEQVDSIVYSSASECYVIGTSA------K 863

Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
            +   L  D E   +  N  ++ +       +E   V++LEP      W T  +  ++ +E
Sbjct: 864  EDFKLPEDDESHTEWRNEFITFLP-----QLERGTVKLLEPKN----WSTIDSHELKPAE 914

Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNL 1144
                + V+ L  +  T E + ++ +G++ V+GED+  +G + +F        P    ++ 
Sbjct: 915  RITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKSK 974

Query: 1145 VTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1199
              ++++KE +KGA++AL+ +  QG L++A G K ++   K  G+ L  +AF D    YV 
Sbjct: 975  KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYVN 1032

Query: 1200 SLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
             L  +K     ++GD  K ++F+ + E+  +L+L  K+  +L     +FL DG+ L ++V
Sbjct: 1033 VLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILV 1092

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS----DRTGAA 1313
            +D+  N+ +  Y P+   S KG +LL R+ FH G   +    L   A +     D     
Sbjct: 1093 ADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGAYTPSAPVDEDAMD 1152

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
              S   +++ +L     GSI  I PL E ++RRL +LQ +LV+++ H   LNPR +R   
Sbjct: 1153 TDSLPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVE 1212

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            S+G   + G   ++D  LL  +  +  + + EIA + G     I ++L  L  G ++L
Sbjct: 1213 SDGMGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRTDLEKLHGGLAYL 1267



 Score = 94.7 bits (234), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 133/579 (22%), Positives = 232/579 (40%), Gaps = 65/579 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  +    +  R    D    A L L   Y + G +  L
Sbjct: 28  NLIVVKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
             +      NS+   D+I+++  +AK+S++E+D   HG+   S+H +E  E  H+     
Sbjct: 85  QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVP 140

Query: 176 SFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------D 218
                P  + VDP G C  +  +G+  + IL   Q G  LV D+               D
Sbjct: 141 DLGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSD 199

Query: 219 TFGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
              S  G +A  +    S V+ +  LD  + H     F+H Y EP   IL+ +       
Sbjct: 200 AQKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSL 259

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
              +      S  ++    +    + +   LP D +K++ +P P+GG L++G N  +H  
Sbjct: 260 TIDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVD 319

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +     +S   + L+      L +     LL    G + +L+
Sbjct: 320 QAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILS 379

Query: 391 VVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
              DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+ ++ 
Sbjct: 380 FKVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSR 435

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES--- 496
            S T+   S  K   G  E+ A           D   D +  ++L     AS   E    
Sbjct: 436 PSSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASIRQEKQVV 493

Query: 497 ---AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI--- 550
              +   F+F   D L ++GP +D + G    + +     S  +    +EL   +G    
Sbjct: 494 NGDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKS 553

Query: 551 --WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
              TV  +    +  DS +M   DD Y  + I  ++ ++
Sbjct: 554 GGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKS 589


>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
 gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
 gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
          Length = 1454

 Score =  190 bits (482), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 176/705 (24%), Positives = 323/705 (45%), Gaps = 69/705 (9%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD-TYMREALKDSETEINSSSEEGTGQ 813
            +V +E+G L I  +P    V+ + +F +    +VD T   E  +       ++ E     
Sbjct: 777  IVSHENGRLSIHSLPEMEVVYQIGRFSNVPELLVDLTVEEEEKERKAKAQQAAKEASVPT 836

Query: 814  GRKENIHSM------KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
               E +++       +V+E  +     + + P L AI+ D  ++ Y+ +          S
Sbjct: 837  DEAEQLNTEMKQLCERVLEAQIVGMGINQAHPILMAIV-DEQVVLYEMF---------SS 886

Query: 868  DDPVSTSRSLSVSNV-------SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
             +P+     +S   +       ++S L N    R P +     +  +G     I  F+ +
Sbjct: 887  SNPIPGHLGISFRKLPHFICLRTSSHL-NSDGKRAPFEM----KINNGKRFSLIHPFERV 941

Query: 921  SG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QG 976
            S  + G  + G+ P   +V+     ++ H    DG I AFT  +N N  HG +Y+T  + 
Sbjct: 942  SSVNNGVMIVGAVPTL-LVYGAWGGMQTHQMTVDGPIKAFTPFNNENVLHGIVYMTQHKS 1000

Query: 977  ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
             L+I ++     Y+  +PV+KI +  T H + Y    ++Y ++ S+P  KP N++  ++ 
Sbjct: 1001 ELRIARMHPDFDYEMPYPVKKIEVGRTIHHVRYLMNSDVYAVVSSIP--KPSNKIWVVMN 1058

Query: 1037 D--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            D  QE  H+ D + +  +     YT+  +  +    D A  P      I  +  E     
Sbjct: 1059 DDKQEEIHEKDENFV--LPAPPKYTLNLFSSQ----DWAAVP---NTEISFEDMEAVTAC 1109

Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1148
              V L + +T    ETLLA+GT    GE+V  RGR++L         P    +      +
Sbjct: 1110 EDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIEVVPEPDQPTSNRKIKVL 1169

Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
            + KE KG ++ L ++ G LL   G K+ + ++   +L GI+F D    YV  L+ ++   
Sbjct: 1170 FDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSLRTIA 1228

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNI 1264
            +  D  +S+  + ++E    +++ ++D     C     A++ ++DG+ +  ++SDE  NI
Sbjct: 1229 IACDARESMSLIRFQEDNKAMSIASRD--DRKCAQPPMASQLVVDGAHVGFLLSDETGNI 1286

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
             +F YAP+  ES  G++L  RA  ++G ++  F+RL+   +               R   
Sbjct: 1287 TMFNYAPEAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNEDEKEAIEQRMTT 1346

Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAH 1379
            +F +LDGS G + PL E ++RRL  LQ  +    P +AGL+ +  R     Q   NG+  
Sbjct: 1347 VFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRNA 1406

Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            R    +++D +++  Y  L L ++ ++A + G  R  I+ +L  L
Sbjct: 1407 R----NLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQL 1447



 Score =  186 bits (473), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 162/589 (27%), Positives = 273/589 (46%), Gaps = 84/589 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++ F+DAK+S++  ++    ++  S+H FE+    +L+ G  +  + PLV+ DP  
Sbjct: 92  QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +                  S RI S +VI L+ +D  + ++ 
Sbjct: 149 RCAACLVYGKHIAILPFHEN-----------------SKRIHS-YVIPLKQIDPRLDNIA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSS---FSVE 363
             +LL +P P+GG LV G+NT+ Y +Q+   C L LN+     D   + P        + 
Sbjct: 251 CSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS---CYDGFTKFPLKDLKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + ++ GDL LL ++    G  V+ L+ SK   + +   +T     
Sbjct: 308 LDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD--ALQDMV 479
             F+GSRLGDS L+++T    T                   + KRL+  + D  A +  +
Sbjct: 368 HLFVGSRLGDSQLLEYTLLKTTRDC----------------AVKRLKIDNKDPAAAEIEL 411

Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + +++ LYG A     +++ E   ++  F   D L N+GP+K    G R N  ++    +
Sbjct: 412 DEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVGPVKSMCVG-RPNYMSNDLVDA 470

Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
           K+ +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+
Sbjct: 471 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLEGAEQLWAVGRKENESHKYLIV 530

Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
           S   R+ ++          E   +     T+AAG L      +QV     A + DG  M 
Sbjct: 531 S-RVRSTLILELGEELVELEEQLFVTGEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 588

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDP 687
           Q++              N  V+  SI DPYV L   +G + L  LV +P
Sbjct: 589 QEVHI----------DSNFPVIQASIVDPYVALLTQNGRLLLYELVMEP 627


>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus A1163]
          Length = 1401

 Score =  189 bits (480), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 152/538 (28%), Positives = 260/538 (48%), Gaps = 40/538 (7%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
            + I  NIS     F+ G RP   ++   +   H     G  V   +   L + + + GFI
Sbjct: 884  LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942

Query: 971  YVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1030
            YV S+ +L+IC+ PS + +D  W ++KI +      + Y      Y L  S       + 
Sbjct: 943  YVDSKDVLRICRFPSDTLFDYTWALRKISIGEQVDHLAYATSSETYVLGTS------HSA 996

Query: 1031 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1090
               L  D E+     N  L    L     + +  ++++ P      W    +  +   E 
Sbjct: 997  DFKLPDDDELHPDWRNEGLVISFLPE---LRQCSLKVVSPRT----WTVIDSYSLGPDEY 1049

Query: 1091 ALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
             + V+ + L  +  T E   ++ +GTA+ +GED+ +RG + +F   +   +P+   T+  
Sbjct: 1050 VMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIYVFEVIKVVPDPEKPETDRK 1109

Query: 1148 --VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVS 1200
              +  KEL KGA++AL+ +  QG L+ A G K ++   K  G+ L  +AF D    YV  
Sbjct: 1110 LKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVNV 1167

Query: 1201 LNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            L  +K     ++GD  K ++F  + E+  +++L  KD G L+  A EFL DG  L ++V+
Sbjct: 1168 LKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLFILVA 1227

Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS-- 1316
            D   N+ +  Y P+  +S  G +LL+R++FH+G   T    L     SS++  A P S  
Sbjct: 1228 DSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSME 1287

Query: 1317 --DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
               +T    +L  +  GS+G +  + E ++RRL +LQ +L +S+ H  GLNPR++R   S
Sbjct: 1288 IDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVES 1347

Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTSFL 1431
            +G A R     ++D  LL  +  +    ++EIA + G    +I ++L  + A G  +L
Sbjct: 1348 DGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLGYL 1401



 Score =  136 bits (343), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)

Query: 57  NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV   +V++I+ +++VQ     E+  +   +     D +    L L   Y L G V  
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83

Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
           +     + S+ G +      +++LAF +AK+S++E+D   HG+   S+H +E  +     
Sbjct: 84  ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
              +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D+  F          
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196

Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
               G G  ++          SS V+ L  LD  + H     F++ Y EP   IL+ +  
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
           T    +S +  +   +  ++    +    + S   LP D +K++A+P P+GG L++G+N 
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316

Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
            +H      + A+ +N +A  + +   + +S  ++ L+      + +     LL   +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376

Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
           +VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
                       +    ++  D        +S  DA + D+   E E    G   +   +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492

Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
               + F   D L NIGPL+D + G   +   +   + K +  EL EL   +G       
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551

Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
                                 +WT       G     D  ++   + EY  Y+I+S + 
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607

Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
            T   ET+++    T+ +  F           TI  G L  ++RV+QV     R    SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663

Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
              D+  G     P   E    S+    +S S+ADPY+ +   D ++ +L  D S     
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718

Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
           V+   A  + K    SC LY DK 
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740


>gi|134025022|gb|AAI35011.1| LOC564406 protein [Danio rerio]
          Length = 348

 Score =  189 bits (479), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 104/283 (36%), Positives = 165/283 (58%), Gaps = 13/283 (4%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 66  LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L   +     + DE   
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT---LADEQEG 175

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
             G G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ + 
Sbjct: 176 IVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
            TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS   
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPP 295

Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
             ++LN+      +    P+    + LD + A+++ +D  ++S
Sbjct: 296 FGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVIS 338


>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus Af293]
 gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus Af293]
          Length = 1401

 Score =  189 bits (479), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 152/538 (28%), Positives = 260/538 (48%), Gaps = 40/538 (7%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
            + I  NIS     F+ G RP   ++   +   H     G  V   +   L + + + GFI
Sbjct: 884  LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942

Query: 971  YVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1030
            YV S+ +L+IC+ PS + +D  W ++KI +      + Y      Y L  S       + 
Sbjct: 943  YVDSKDVLRICRFPSETLFDYTWALRKISIGEQVDHLAYATSSETYVLGTS------HSA 996

Query: 1031 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1090
               L  D E+     N  L    L     + +  ++++ P      W    +  +   E 
Sbjct: 997  DFKLPDDDELHPDWRNEGLVISFLPE---LRQCSLKVVSPRT----WTVIDSYSLGPDEY 1049

Query: 1091 ALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
             + V+ + L  +  T E   ++ +GTA+ +GED+ +RG + +F   +   +P+   T+  
Sbjct: 1050 VMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIYVFEVIKVVPDPEKPETDRK 1109

Query: 1148 --VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVS 1200
              +  KEL KGA++AL+ +  QG L+ A G K ++   K  G+ L  +AF D    YV  
Sbjct: 1110 LKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVNV 1167

Query: 1201 LNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            L  +K     ++GD  K ++F  + E+  +++L  KD G L+  A EFL DG  L ++V+
Sbjct: 1168 LKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLFILVA 1227

Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS-- 1316
            D   N+ +  Y P+  +S  G +LL+R++FH+G   T    L     SS++  A P S  
Sbjct: 1228 DSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSME 1287

Query: 1317 --DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
               +T    +L  +  GS+G +  + E ++RRL +LQ +L +S+ H  GLNPR++R   S
Sbjct: 1288 IDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVES 1347

Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTSFL 1431
            +G A R     ++D  LL  +  +    ++EIA + G    +I ++L  + A G  +L
Sbjct: 1348 DGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLGYL 1401



 Score =  136 bits (342), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)

Query: 57  NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV   +V++I+ +++VQ     E+  +   +     D +    L L   Y L G V  
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83

Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
           +     + S+ G +      +++LAF +AK+S++E+D   HG+   S+H +E  +     
Sbjct: 84  ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
              +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D+  F          
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196

Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
               G G  ++          SS V+ L  LD  + H     F++ Y EP   IL+ +  
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
           T    +S +  +   +  ++    +    + S   LP D +K++A+P P+GG L++G+N 
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316

Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
            +H      + A+ +N +A  + +   + +S  ++ L+      + +     LL   +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376

Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
           +VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
                       +    ++  D        +S  DA + D+   E E    G   +   +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492

Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
               + F   D L NIGPL+D + G   +   +   + K +  EL EL   +G       
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551

Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
                                 +WT       G     D  ++   + EY  Y+I+S + 
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607

Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
            T   ET+++    T+ +  F           TI  G L  ++RV+QV     R    SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663

Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
              D+  G     P   E    S+    +S S+ADPY+ +   D ++ +L  D S     
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718

Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
           V+   A  + K    SC LY DK 
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740


>gi|350297359|gb|EGZ78336.1| protein cft-1 [Neurospora tetrasperma FGSC 2509]
          Length = 1437

 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 169/624 (27%), Positives = 283/624 (45%), Gaps = 68/624 (10%)

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
            KE++  + V +L        H  P+L     +  +  YQ Y  +     + +  P S S 
Sbjct: 791  KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
                  +   ++ N  F++ P +    ++ PH A    P +R +   NISG+   FL GS
Sbjct: 841  ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
             P + +   +       L    + A +  H   C HGFIY  + GI ++ Q+P+ S+Y  
Sbjct: 892  SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951

Query: 992  Y-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
                V+KIP+      + Y      Y  +V    ++P      L  D +   +    N++
Sbjct: 952  LGLSVKKIPVGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005

Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1109
               +     V+   +++L    +G  W    T+ M+  E  L V  + L  + +T E + 
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISAL 1160
            L+A+GTA ++GED+  RGRV +F        P    T   SK+LK         GA++AL
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKLVAKEDIPRGAVTAL 1113

Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
            + +  QG +L+A G K ++   K  GT L  +AF D    YV S+  +      L+ D  
Sbjct: 1114 SEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVKELPGTGLCLMADAF 1171

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            K ++F  + E+  ++ L  K    ++    +FL DG  L +V SD   +I I  + P+  
Sbjct: 1172 KGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDADGHIHILQFDPEHP 1231

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            +S +G  LL R  F+ GAH      L + A   + +  +  S++ +   LL  +  G + 
Sbjct: 1232 KSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENSPHILLLASPTGVLA 1290

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA--HRPGPDS-----IV 1387
             + PL E  +RRL SL  +L + +PH AGLNP+ +R    +  A    PG D+     IV
Sbjct: 1291 TLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGIGRNIV 1350

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTG 1411
            D ++L  +  L   ++ E+A + G
Sbjct: 1351 DGKILERFLELGTGKRQEMAGRAG 1374



 Score =  127 bits (318), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96  DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + +S   + L+      L 
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      ++    I+              D   D + GEE 
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
           +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEKERNSEGVRSDLQ 567

Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
              A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740


>gi|336463425|gb|EGO51665.1| hypothetical protein NEUTE1DRAFT_89273 [Neurospora tetrasperma FGSC
            2508]
          Length = 1437

 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 169/624 (27%), Positives = 283/624 (45%), Gaps = 68/624 (10%)

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
            KE++  + V +L        H  P+L     +  +  YQ Y  +     + +  P S S 
Sbjct: 791  KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
                  +   ++ N  F++ P +    ++ PH A    P +R +   NISG+   FL GS
Sbjct: 841  ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
             P + +   +       L    + A +  H   C HGFIY  + GI ++ Q+P+ S+Y  
Sbjct: 892  SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951

Query: 992  Y-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
                V+KIP+      + Y      Y  +V    ++P      L  D +   +    N++
Sbjct: 952  LGLSVKKIPVGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005

Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1109
               +     V+   +++L    +G  W    T+ M+  E  L V  + L  + +T E + 
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISAL 1160
            L+A+GTA ++GED+  RGRV +F        P    T   SK+LK         GA++AL
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKLVAKEDIPRGAVTAL 1113

Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
            + +  QG +L+A G K ++   K  GT L  +AF D    YV S+  +      L+ D  
Sbjct: 1114 SEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVKELPGTGLCLMADAF 1171

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            K ++F  + E+  ++ L  K    ++    +FL DG  L +V SD   +I I  + P+  
Sbjct: 1172 KGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDADGHIHILQFDPEHP 1231

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            +S +G  LL R  F+ GAH      L + A   + +  +  S++ +   LL  +  G + 
Sbjct: 1232 KSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENSPHILLLASPTGVLA 1290

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA--HRPGPDS-----IV 1387
             + PL E  +RRL SL  +L + +PH AGLNP+ +R    +  A    PG D+     IV
Sbjct: 1291 TLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGIGRNIV 1350

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTG 1411
            D ++L  +  L   ++ E+A + G
Sbjct: 1351 DGKILERFLELGTGKRQEMAGRAG 1374



 Score =  127 bits (318), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96  DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + +S   + L+      L 
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      ++    I+              D   D + GEE 
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
           +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567

Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
              A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740


>gi|398397855|ref|XP_003852385.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
 gi|339472266|gb|EGP87361.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
          Length = 1333

 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 297/1401 (21%), Positives = 533/1401 (38%), Gaps = 199/1401 (14%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
            + + L L+  Y L G V S+A +     D     ++I+LAF++AK+S++E+D   H +  
Sbjct: 45   AQSKLVLIGGYPLAGTVTSIARVKT--LDTRTGGEAILLAFKNAKLSLIEWDPENHRIST 102

Query: 157  TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL------------ 204
             S+H +E    +    G        ++ VDP  RC  +     Q+ IL            
Sbjct: 103  VSIHYYEGENVIAQPYGPSLGEYESILTVDPGSRCAALKFGARQLAILPFRQFGDELLGE 162

Query: 205  ------KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
                   A+ G +    D    G         + S V+ L  LD  + H  D  F+H Y 
Sbjct: 163  EEGEFENANDGTTSKKHDAMQNGEDEAEQTPYKQSFVLPLTTLDPALSHTIDLAFLHEYR 222

Query: 257  EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
            EP   I+             +      +  ++    K    + +  NLP   +K++ +PS
Sbjct: 223  EPTFGIISSAIEPSYALFDERKDILSYTVFTLDLEQKASTNLITVPNLPSTLWKVVPLPS 282

Query: 317  PIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND 375
            PIGG L++G N  IH      + A A+N +A+         +S  +++L+      L   
Sbjct: 283  PIGGALLIGTNEFIHVDQSGKANATAVNEFAMKESDFGMADQSGLNLKLEGCSVEILNAS 342

Query: 376  VA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLG 426
                L+  + G L  +     GR +  + ++  +        S   S ++ +  +  F+G
Sbjct: 343  TGEMLVVLRDGSLATVDFKMLGRSISAVIVTIISSENGGKVFSTAPSCVSRLDQNNLFIG 402

Query: 427  SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
            S  G S L+   C +  S LS   K     +       +       D         +   
Sbjct: 403  SEDGSSSLL--GCTNSQSGLSR--KRSHAQMLGQNSGDEEEDVLDEDDDDLYDAEPDSKK 458

Query: 487  YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA-------SATGISKQSNY 539
              ++     +   +  F ++D L +IGP+ +   G R NA         + TG S+ S  
Sbjct: 459  RATSVAEQSAGDSSTHFVIKDDLHSIGPINNTCVG-RSNAAGEDKLQLLAGTGKSRSSRL 517

Query: 540  ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
              +               +R       R   ++   HA+ + + E      E  +     
Sbjct: 518  ACI---------------NRDIIPQHIRKNQFEGARHAWAVCAREKNANDDEAGEGGYTE 562

Query: 600  TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSEN 658
              + ++   G T+    L     V+QV     R  D    ++Q +      +++      
Sbjct: 563  RSATEFEHDGETLEVLTLGHGTAVVQVRRMEIRTYDSHLALSQIIPMIDDETDA-----E 617

Query: 659  STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP 718
             +++  S  DPY+L+   D SI++                 ++  +K +    +  D   
Sbjct: 618  FSIVHTSACDPYLLVIRDDSSIQV-----------------LQHERKDIEPLDITPDAAD 660

Query: 719  EPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD 778
            + W+             GE  DG         +    +    G+L +  +P+   V    
Sbjct: 661  KQWMSGCVYS-------GEFTDG---------NAALFLLSAEGSLHVLTLPDLQLV---- 700

Query: 779  KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR 838
                        YM  AL      + S+       G KE +  + V +L          +
Sbjct: 701  ------------YMTPALPHLPP-VLSADVSHRRMGVKETLTELLVADLGNDGVL----Q 743

Query: 839  PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
            P+L        ++ Y+               P  +S S S     +    NLRF + P+ 
Sbjct: 744  PYLTVRTAMDDVVLYE---------------PFHSSPSASTGPWHS----NLRFRKVPVP 784

Query: 899  AYTR-EETPHGAPCQRITIFK--NISGHQGFFLSGSRPCWCMVFRER------LRVHPQL 949
               +  ++P   P  R    +   I G+    + G+  C  ++ +E       L V+   
Sbjct: 785  YIPKYNDSPLEDPNARPPALRRMQIGGYNTVSIPGAPSC--LLLKEASGPPKILEVNEPK 842

Query: 950  CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
               +    T L+ + C +GF  V   G L  CQLP  + +   W +++I L     ++ +
Sbjct: 843  RSNATTILTPLNRIGCENGFATVDVNGALHECQLPPDAWFSTGWSIRQIDLGDDAREVRH 902

Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS---SVDLHRTYTVEEYEVR 1066
             A      + V+          +     +E G   +  ++S    V  +  + +     +
Sbjct: 903  LAYHEARGIFVAATC-----TTVDFYFAEEDGRHPEQDDISIRPQVPQYSVHLISAKTHK 957

Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR 1126
            I+   +          +P   +  AL V    + +  + E + ++ + T   +GED+ A+
Sbjct: 958  IIHTHK----------LPYLETVTALKVMPAEV-SELSHEVKPVVVVSTGAQRGEDMPAK 1006

Query: 1127 GRVLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASLQGHLL-IASGPKIILHKW 1180
            G +++F       +P      L   V ++E  +GAI+ALAS  G ++  A G K+++   
Sbjct: 1007 GALIVFDVIDVVPDPDVEESGLHLHVLAREESRGAITALASFPGGMIGTAQGLKLMIR-- 1064

Query: 1181 TGTELNG----IAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
             G   +G    +AF DA   Y   L  +  +   L GD  K ++F  + ++  +L LL K
Sbjct: 1065 -GMREDGSCLPVAFLDAQ-CYTSLLKTLDSRGLWLAGDAWKGLWFGGFTQEPYKLTLLGK 1122

Query: 1235 DFGS-LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
               + ++    +FL     L L+V D   ++ +  Y P+  +S  GQ+LL R+ FH+G  
Sbjct: 1123 SPRTEMEVIEADFLPFDGALFLLVLDADADLHVLQYDPENPKSLNGQRLLHRSTFHIGHF 1182

Query: 1294 VTKFLRL-QMLATSSDRTGAAPGSDKTNR---------FALLFGTLDGSIGCIAPLDELT 1343
             T  + L   LA  +++    P  D  +          F +L  T  GSIG I PLDE T
Sbjct: 1183 PTGSMLLPSTLAPFTEQARDLPNGDSEDTKQEEVNSPLFHVLTTTSSGSIGLITPLDEST 1242

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA---HRPGPDSIVDCELLSHYEMLPL 1400
            +RRL +LQ  L + + H AGLNPR +R   +  KA      G   +VD  L+     L  
Sbjct: 1243 YRRLSALQGHLTNILEHAAGLNPRMYRT-DTEMKATDSEMGGAKGVVDGSLIRRISELGA 1301

Query: 1401 EEQLEIAHQTGTTRSQILSNL 1421
              + ++  + G    Q+ S+L
Sbjct: 1302 ARRADVLSRVGGDVWQLRSDL 1322


>gi|164429683|ref|XP_964609.2| hypothetical protein NCU02082 [Neurospora crassa OR74A]
 gi|157073577|gb|EAA35373.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 1437

 Score =  187 bits (476), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 169/624 (27%), Positives = 283/624 (45%), Gaps = 68/624 (10%)

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
            KE++  + V +L        H  P+L     +  +  YQ Y  +     + +  P S S 
Sbjct: 791  KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
                  +   ++ N  F++ P +    ++ PH A    P +R +   NISG+   FL GS
Sbjct: 841  ------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
             P + +   +       L    + A +  H   C HGFIY  + GI ++ Q+P+ S+Y  
Sbjct: 892  SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951

Query: 992  Y-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
                V+KIP+      + Y      Y  +V    ++P      L  D +   +    N++
Sbjct: 952  LGLSVKKIPIGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005

Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1109
               +     V+   +++L    +G  W    T+ M+  E  L V  + L  + +T E + 
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISAL 1160
            L+A+GTA ++GED+  RGRV +F        P    T   SK+LK         GA++AL
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKLVAKEDIPRGAVTAL 1113

Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
            + +  QG +L+A G K ++   K  GT L  +AF D    YV S+  +      L+ D  
Sbjct: 1114 SEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVKELPGTGLCLMADAF 1171

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            K ++F  + E+  ++ L  K    ++    +FL DG  L +V SD   +I I  + P+  
Sbjct: 1172 KGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDADGHIHILQFDPEHP 1231

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            +S +G  LL R  F+ GAH      L + A   + +  +  S++ +   LL  +  G + 
Sbjct: 1232 KSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENSPHILLLASPTGVLA 1290

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA--HRPGPDS-----IV 1387
             + PL E  +RRL SL  +L + +PH AGLNP+ +R    +  A    PG D+     IV
Sbjct: 1291 TLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGIGRNIV 1350

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTG 1411
            D ++L  +  L   ++ E+A + G
Sbjct: 1351 DGKILERFLELGTGKRQEMAGRAG 1374



 Score =  124 bits (311), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 154/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96  DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + ++   + L+      L 
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      ++    ++              D   D + GEE 
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
           +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567

Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
              A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740


>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Neosartorya fischeri NRRL 181]
 gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Neosartorya fischeri NRRL 181]
          Length = 1400

 Score =  187 bits (474), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 150/542 (27%), Positives = 260/542 (47%), Gaps = 50/542 (9%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRER----LRVHPQLCDGSIVAFTVLHNVNCNHGF 969
            + I  NIS     F+ G    + +   +      R+  +   G  ++   L + + + GF
Sbjct: 885  LRILPNISDLSAVFMPGPSASFILKTAKSCPHVFRLRGEFVRG--LSIFDLASPSLDKGF 942

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
            IYV S+ +L+IC+ PS + +D  W ++KI +      + Y      Y L  S       +
Sbjct: 943  IYVDSKDVLRICRFPSETLFDYTWALRKIGIGEQVDHLAYATSSETYVLGTS------HS 996

Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
                L  D E+     N  +S +   R  +++    R          W    +  +  +E
Sbjct: 997  ADFKLPDDDELHPDWRNEVISFLPELRQCSLKVVSPRT---------WTVIDSYSLGPAE 1047

Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE- 1147
              + V+ + L  +  T E   ++ +GTA+  GED+ +RG + +F   +   +P+   T+ 
Sbjct: 1048 YVMAVKNMDLEVSENTHERRNMIVVGTAFAWGEDIPSRGCIYVFEVIKVVPDPEKPETDR 1107

Query: 1148 ---VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1199
               +  KEL KGA++AL+ +  QG L+ A G K ++   K  G+ L  +AF D    YV 
Sbjct: 1108 KLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYV- 1164

Query: 1200 SLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
              N+VK        ++GD  K ++F  + E+  +++L  KD G L+  A EFL DG  L 
Sbjct: 1165 --NVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLF 1222

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1314
            ++V+D   N+ +  Y P+  +S  G +LL+R++FH+G   T    L     SS++  A P
Sbjct: 1223 ILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMADP 1282

Query: 1315 GS----DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
             S     +T    +L  +  GS+G +  + E ++RRL +LQ +L +S+ H  GLNPR++R
Sbjct: 1283 DSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLTNSLEHPCGLNPRAYR 1342

Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTS 1429
               S+G A R     ++D  LL  +  +    ++EIA + G    +I ++L  + A G  
Sbjct: 1343 AVESDGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLG 1398

Query: 1430 FL 1431
            +L
Sbjct: 1399 YL 1400



 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 181/751 (24%), Positives = 310/751 (41%), Gaps = 127/751 (16%)

Query: 57  NLVVTAANVIEIY-VVRVQEE---GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           NLVV   +V++I+ +++VQ     G+ E K++         D +    L L   Y L G 
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHLRGGTIEGKSARP-------DRVETTKLVLEREYPLSGT 80

Query: 113 VESLA---ILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
           V  +    IL+   GG       ++++LAF +AK+S++E+D   HG+   S+H +E  + 
Sbjct: 81  VVDICRVKILNPKSGG-------EALLLAFRNAKLSLVEWDPERHGISTLSIHYYERDDL 133

Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGD-------EDT 219
                  +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D       +D 
Sbjct: 134 TRSPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDD 192

Query: 220 FGS-----GGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
           F       G    ++          SS V+ L  LD  + H     F++ Y EP   +L+
Sbjct: 193 FNQVSDHVGNDLKSKDRTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGVLY 252

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
            +  T    +  +  +   +  ++    +    + S   LP D +K++A+P P+GG L++
Sbjct: 253 SQIATSHALLPERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLI 312

Query: 325 GANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
           G+N  +H      + A+ +N +A  + +   + +S  ++ L+      L +     LL  
Sbjct: 313 GSNELVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHLSDSTGDLLLVL 372

Query: 382 KTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLL 434
            +G++VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L
Sbjct: 373 SSGNMVLVHFQLDGRSVSGISLRPLPAQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVL 432

Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASN 492
           + ++  S         +    ++  D        +S  D  + D+   E E    G   +
Sbjct: 433 LSWSSMSSN---PKKPRPRMSNVAEDREEASVDSQSEEDVYEDDLYTAEPETPALGRRPS 489

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG-----------LRINADASATGISKQS---N 538
              S    + F + D L NIGPL+D + G           L  NA +    I+ Q    N
Sbjct: 490 AETSGVGVYIFQILDRLPNIGPLRDITLGKPASTVENTGRLIENACSELELIAAQGSGRN 549

Query: 539 YELV--------------ELPGCKGIWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIIS 582
             LV              +    +G+WT       G     D  R+   + EY  Y+I+S
Sbjct: 550 GGLVLMKREIEPDVAASFDAQSVQGVWTAVVALGSGAPLVPDEQRI---NQEYRQYVILS 606

Query: 583 L-------EARTMVLETADL----LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
                   ++   + +  DL      E   + D      TI  G L  +RRV+QV     
Sbjct: 607 KPEAPDKEQSEVFIADKQDLKPFKAPEFNPNNDV-----TIEIGTLSCKRRVVQVLRNEV 661

Query: 632 RILDGSYMTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
           R    SY   D+  G     P   E    S+    +S S+ADPY+ +   D ++ LL  D
Sbjct: 662 R----SY---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMLLQAD 712

Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
            S     V+   +  + K    SC LY DK 
Sbjct: 713 DSGDLDEVELDDSTRAGK--WRSCCLYWDKA 741


>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
          Length = 1506

 Score =  186 bits (473), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 180/690 (26%), Positives = 297/690 (43%), Gaps = 128/690 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLV    NV+ +Y + VQ +G+ + +   E      +DG+     + V   R  GN    
Sbjct: 29  NLVTVQGNVLSVYNL-VQAQGAADKRCHLEADISFTLDGVP----QDVATVRPRGN---- 79

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLK 171
                        RD +I  F+DA+++++ FD  +  L   S+H FE  +     W   +
Sbjct: 80  ------------SRDLLIFTFKDARVAIVRFDPKMRDLETVSLHAFEDTDTKLGGWHSEQ 127

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF-GSGGGFSARI 230
           R R        V VDP  RC  ++VYG ++I++  S G +    + DT   +   F++R 
Sbjct: 128 RLR--------VCVDPLHRCAALMVYGCKLIVISFSSGTATAAPEADTQEDTEQSFTSR- 178

Query: 231 ESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
               VI+L  L   +  V D  F+ GY  P + ILH+    W G ++    T  ++ALS+
Sbjct: 179 ----VIDLLSLPSTIGRVDDMAFLDGYDVPCLAILHQPRPAWVGHMAKTKDTAHVTALSL 234

Query: 289 S------------TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
           +                  P++W   NLP D + L  VP+P+GGV+V+G N + Y +QS 
Sbjct: 235 ALDEMTARRAPTAPPPPPPPVVWHQENLPSDTFALQPVPAPLGGVVVIGVNVLFYVTQSL 294

Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
             +LALN Y+ +  ++    ++  S++LD AH   L     L +  +GD+ LLT+V    
Sbjct: 295 VRSLALNGYSRASTNAPIQEQTGISLDLDGAHHALLTPTQILFALPSGDIHLLTIVCTDV 354

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--K 451
            V  L + K   SV+ SDI T+G    F+ SR   SLL+++      + T +  SG+  +
Sbjct: 355 TVDGLRMDKLATSVIGSDICTLGRRHIFIASRHATSLLLEWAPIPLSATTHIDVSGVSGR 414

Query: 452 EEFGDIEADAPSTKRLRRSSS------------DALQDMVNGEELSLYGSASNNTESAQK 499
           ++ G     + ST  L  S+S            D   D+V+G     +G  S        
Sbjct: 415 DDAGLYGTSSDSTAALNTSASRDGSSTGGDDLDDVYGDVVDGGTTGAHGIGSGGR---VM 471

Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE---------------- 543
           T     RD+L  + P+K  + G   +A         +S YELV                 
Sbjct: 472 TVKLMARDALPTVAPIKSTAVG--TSAQGVVPHADPRSQYELVSCIGHDKNGALANISYS 529

Query: 544 -----------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
                      L   K  W V+  +S+               +H +++ S   +TMV   
Sbjct: 530 LKPQVLLTEDALSSVKDCWAVHSNNSK---------------HHTHVVFSKPKKTMVFRV 574

Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
           A    ++     +  +  T+ AGN+ GR+ V+QV  +   +LD     +D  F       
Sbjct: 575 AGDFEQLRHPRGFDTEASTVFAGNVMGRQLVLQVTAKHVMLLDD----RDCVFDERM--- 627

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
               +   +  VS+ADPY+ L ++D + ++
Sbjct: 628 ---KKGVRITKVSVADPYIALLLNDATTKV 654



 Score =  127 bits (319), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 163/723 (22%), Positives = 288/723 (39%), Gaps = 91/723 (12%)

Query: 757  CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN------------ 804
            C ++G L IF VP+   VF    F        D+  R+ +   E E              
Sbjct: 807  CDKNGVLSIFQVPDMREVFCCTVFSVLPNVAWDSVYRKEIGPVELEPEMPLKRAKTMDEK 866

Query: 805  -----------SSSEEGTGQGRKENIHSMKVVELAMQRWSA----HHSRPFLFAILTDGT 849
                       +  E G+ Q  ++    ++  E+ +    A      SRP LF       
Sbjct: 867  GQSVFVEADEEADDESGSAQAEEDEQDRLQRKEMTIVELLAIGLGRGSRPHLFLRNETQH 926

Query: 850  ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
            ++ Y+ +       TS      S  R          RLR      T +D    + +    
Sbjct: 927  VIVYEIF-------TS------SYKRHEKYEGRLQIRLRKRHQHPTWIDERLAQSS--SI 971

Query: 910  PCQRITIFKNISGHQGFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
            P      F +ISG  G F+   RP W M     + +R H    DG++  FT L +     
Sbjct: 972  PPAAFRPFADISGCDGVFVCARRPSWFMCDHTHKVVRHHAMRFDGAVQCFTQLKHAMHTS 1031

Query: 968  GFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1027
             F+Y T +G++++    +G       P ++ P+KA+   + +  E  +Y  +V +   +P
Sbjct: 1032 CFLYFTGKGVMRMATTAAGQVLSTPLPSRRTPIKASACYVDFDPESGVY--VVVLKHKEP 1089

Query: 1028 LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
               +       E    +D    S   L +    E Y + +   +     WQ     P++ 
Sbjct: 1090 CAHLPKFGPPMEEAPAVDMKFASDEPLPQR---ERYSICLFSCED----WQLVPNSPVEI 1142

Query: 1088 SENALTVRVVTLFNTTTKENET----LLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-- 1141
              +   V    + N +++ + T     +A+GT  V GE    RG + L+        P  
Sbjct: 1143 PADH-HVTAFKVINISSERHLTGKKPCVAVGTTPVLGERNLERGLLQLYDVLEVVPEPGK 1201

Query: 1142 ---QNLVTEVYSKELKGAISALASLQGHLLIA----SGPKIILHKWTGTE-LNGIAFYDA 1193
               +N +  + S +  GA++AL S++G+++ A     GPKI + +    E L  IAF + 
Sbjct: 1202 PTTKNRLKLMLSSDETGAVTALNSIEGYVIGALARRDGPKIFVWRVEDDEKLQPIAFLEG 1261

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYF--LSWKEQGAQLNL-----------LAKDFGSLD 1240
              ++ V+L +  NF+++GD    +    L   E    LNL           + +D     
Sbjct: 1262 -SMFTVTLKVALNFVIIGDYMGRVMLARLIKDETLKILNLSKGTTSQALLQVGRDVAPTS 1320

Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH--VTKFL 1298
             +A +F++ G+ L ++  D+  N+ I  +      + +G ++L R   +   H  +    
Sbjct: 1321 VYAADFIVRGAELHVLFLDQHANMTILAFDSD-DPTTRGGRILKRHSVYNTGHQRIVALT 1379

Query: 1299 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1358
            RLQ +   + R          +   L + TL+G  G I  + E  FRRL  LQ +L+  +
Sbjct: 1380 RLQNVPPRNSRNAT------VDAHFLTYQTLEGGAGYITSIPEDIFRRLMLLQLRLLPHL 1433

Query: 1359 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
               AGL+P +F+++ S            +  ++ +   ML L+ Q E+A Q GTT  Q+ 
Sbjct: 1434 KFRAGLHPSAFKKYKSASLHMVHQEVRTICADVYTRLFMLDLDAQKEVARQVGTTTKQLC 1493

Query: 1419 SNL 1421
             + 
Sbjct: 1494 DDF 1496


>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus clavatus NRRL 1]
 gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus clavatus NRRL 1]
          Length = 1401

 Score =  185 bits (469), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 158/559 (28%), Positives = 262/559 (46%), Gaps = 55/559 (9%)

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG---------SRPCWCMVF 939
            N    R P D+ T       +  + + I  +ISG+   F+ G         SR C   + 
Sbjct: 861  NHVLPRIPPDSDTNISDKEPSNHRPLCILPDISGYSAVFMPGTSASFIFKTSRSC-PHIL 919

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
            R R  V   L D     FT   + +   GFIYV S+ +++ICQLP  + YD  W ++K+ 
Sbjct: 920  RLRGGVVRSLSD---FDFT---DPSLGRGFIYVDSKDVVRICQLPPETIYDYSWTLKKVA 973

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT 1059
            +      + Y      Y L  S       +    L  D E+  +  N  +S +   R   
Sbjct: 974  IGEHVDHLAYSISSETYVLGTS------HSADFKLPEDDELHPEWRNEAISFLPELRQCC 1027

Query: 1060 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYV 1118
            +     +++ P      W    +  +   E  + V+ + L  +  T E + ++ +GTA  
Sbjct: 1028 L-----KVVHPKT----WTVIDSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTALA 1078

Query: 1119 QGEDVAARGRVLLFSTGRNADNPQNLVTE----VYSKEL-KGAISALASL--QGHLLIAS 1171
            +GED+ ARG + +F   +   +P+   T+    +  KEL KGA++AL+ +  QG L+ A 
Sbjct: 1079 RGEDIPARGCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAAQ 1138

Query: 1172 GPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGA 1227
            G K ++   K  G+ L  +AF D    YV  L  +K     ++GD  K I+F  + E+  
Sbjct: 1139 GQKCMVRGLKEDGSLLP-VAFMDVQ-CYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEPY 1196

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
            +++L  KD    +  A +FL DG  L ++V+D   N+ +  Y P+   S  G KLL R++
Sbjct: 1197 KMSLFGKDLEYPEVVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRSK 1256

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSD-----KTNRFALLFGTLDGSIGCIAPLDEL 1342
            FH+G H T  L L    T+S    +A         +     +L  +  GSIG +  + E 
Sbjct: 1257 FHMG-HFTSTLTLLPRTTASYEIPSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPEE 1315

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
            ++RRL +LQ +L ++V H  GLNPR++R   S+G A R     ++D  LL  +  +  + 
Sbjct: 1316 SYRRLSALQSQLANTVEHPCGLNPRAYRAIESDGTAGR----GMLDGNLLYQWLSMSKQR 1371

Query: 1403 QLEIAHQTGTTRSQILSNL 1421
            ++EIA + G    +I ++L
Sbjct: 1372 RMEIAARVGAHEWEIKADL 1390



 Score =  119 bits (297), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 167/723 (23%), Positives = 294/723 (40%), Gaps = 127/723 (17%)

Query: 57  NLVVTAANVIEIYV---VRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
           NLVV   +V++I+    V    EG   +  S         D + +  L L   Y L G V
Sbjct: 28  NLVVVKTSVLQIFSLLNVSCSAEGEIIAAKSARP------DQLQSTKLILEREYSLSGTV 81

Query: 114 ESLA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
             L     + ++ G D      +I+LAF +AK+S++E+D   +G+   S+H +E  +   
Sbjct: 82  SDLCRVKLLKTKSGGD------AILLAFRNAKLSLVEWDPERYGISTISIHYYERDDITR 135

Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GD----------- 216
                +  + G ++ VDP  RC  V  +G++ + IL   Q G  LV GD           
Sbjct: 136 SPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLVMGDYESDSQKQSHE 194

Query: 217 ---EDTFGS-----GGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
              +D+ G+     G        SS V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 195 HEMDDSAGNSKSKEGAVHQTPYASSFVLPLTALDSAILHPVSLAFLYEYREPTFGILYSQ 254

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
             T    +  +      +  ++    +   ++ S   LP D +K++A+P P+GG L++G 
Sbjct: 255 IATSNSLLHERKDAIFYTVFTLDLEQRASTMLLSVTRLPSDLFKVVALPPPVGGALLIGY 314

Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
           N  +H      + A+ +N ++  + +     +S  ++ L+      L N     LL+  +
Sbjct: 315 NELVHVDQAGKTNAVGVNEFSRQVSTFSMADQSELALRLEGCVVELLGNSSGDLLLALSS 374

Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLV 435
           G +VL+    DGR V  + + +  P     +I         ++G+   F GS   +S+L+
Sbjct: 375 GTMVLVHFKLDGRSVSGISI-RPLPGHAGGNILKAAASASASLGSDKVFFGSEDAESVLL 433

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN-- 493
            ++  S  +  S   + E   IE D         S  D  +D        LY +A +   
Sbjct: 434 GWSLSSSNARKS---RSESKRIEKDHEEGSDDSESEEDVYED-------DLYSAAPDTPA 483

Query: 494 -------TESAQKTFSFAVRDSLVNIGPLKDFSYG-------------------LRINAD 527
                    S   ++ F V D L N  PL+D + G                   L + A 
Sbjct: 484 LGHRLSVAPSTFASYKFKVHDVLPNTAPLRDIALGQPAMPVEDTGSHLDNICSELELVAA 543

Query: 528 ASATG-----ISKQSNYELVE----LPGCKGIWT---VYHKSSRGHNADSSRMAAYDDEY 575
             + G     + K+    +V+    +    G+WT       +++  + D + +    +E+
Sbjct: 544 YGSNGNGGLVVMKRELEPVVKASLNVGPIHGVWTASIALGSAAKPMSGDQTNI----EEW 599

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFE 628
             Y+I++ + +T+  E +++      ++  F           +I  G L  R+RV+QV  
Sbjct: 600 RQYVILT-KPQTIDKEESEVFIVDGLNLKPFKAPEFNPNNDISIQVGTLSNRKRVVQVLR 658

Query: 629 RGARILDGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
              R  D      DL      P   E    S+    LS S+ADPY+ +   D ++ LL  
Sbjct: 659 NEVRSYD-----SDLELAQIYPVWDE--DTSDERMALSASLADPYIAILRDDSTLLLLQA 711

Query: 686 DPS 688
           D S
Sbjct: 712 DDS 714


>gi|260835073|ref|XP_002612534.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
 gi|229297911|gb|EEN68543.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
          Length = 318

 Score =  184 bits (467), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 107/297 (36%), Positives = 168/297 (56%), Gaps = 38/297 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           +LVV     + +Y ++   E S++ K                  +ELV  + ++GN+ S+
Sbjct: 29  SLVVAGTTQLHVYRLKGDMEKSRKQK------------------MELVASFSMYGNIMSV 70

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G+D    RD+++L+F DAK+S++E+D   H L+  SMH FE  E   +K G  S
Sbjct: 71  ESVQLAGSD----RDALLLSFMDAKLSIVEYDPGTHDLKTASMHYFEEEE---VKDGYVS 123

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P+V+VDP+GRC  +L+YG ++++L   + G+    DE    +G   S  I  +++I
Sbjct: 124 NYHAPMVRVDPEGRCAVMLIYGKRLVVLPFRKEGAV---DEAEMSAGSKSS--ILPTYMI 178

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            L+DLD +  +V D  F+HGY +P ++IL+E   TW GRV+ +  TC I A+S++   + 
Sbjct: 179 KLQDLDERLINVVDLQFLHGYFDPTLLILYEPLQTWPGRVAVRQDTCCIVAVSLNIAQRV 238

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
           HP+IWS  NLP D  + +AVP PIGGVLV   N++ Y +QS         Y VSL+S
Sbjct: 239 HPIIWSVGNLPFDCKQAVAVPKPIGGVLVFAVNSLLYLNQSVP------PYGVSLNS 289


>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1 [Clonorchis
            sinensis]
          Length = 1741

 Score =  182 bits (463), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 187/754 (24%), Positives = 321/754 (42%), Gaps = 103/754 (13%)

Query: 748  DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
            D+   ++ + + +G LEI+ +P+F  ++ V  F      +VD     A + ++ E+N  +
Sbjct: 1007 DKSRYFAFIVFTNGVLEIYSLPDFTLLYEVHHFSDLPAMLVDC---RAGQGNKVEVNLEN 1063

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
                    ++NI    V+E+ +     +  RP L  + T   I  ++A         S +
Sbjct: 1064 IPNCPAAEEDNIPP-TVLEITVFPIGRNRDRPVLL-VRTSQEIAFFEALC------PSHN 1115

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET-PHGAPCQRITI--------FK 918
            +     S S S   +   R R L     PL A  R  T P  A  Q   +        F+
Sbjct: 1116 EAHPFASESWSQEGL---RWRRLPIP-CPLVAPRRVRTDPKIADVQSTMLTRKNLLRPFE 1171

Query: 919  NISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
            +I GH G F+ G+ P W        +RV     DG + +F  L+   C  GF+Y T    
Sbjct: 1172 DIDGHCGVFVCGATPIWLFSSDTGHIRVFNHSIDGIMGSFAPLNTDICPSGFVYFTYSNE 1231

Query: 978  LKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1037
            +++  L  G ++  +  ++ +PL+ TP+ + Y  E   Y L+ +   +K  + V  L  +
Sbjct: 1232 MRLATLLPGYSFKEHLGMRWVPLELTPYFLQYHIESKTYALVGTR--VKSCSSVYHLNAE 1289

Query: 1038 ----QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA---GGPWQT--RATIPMQSS 1088
                +EV  +     L S+D         Y +++  P  +     PWQ    A I  +  
Sbjct: 1290 GNKEEEVLLRPPTCVLPSLDY--------YVLQMYAPSTSLAEATPWQAIPHACIDFEPW 1341

Query: 1089 ENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1147
            E    +    L +  T    +  LA+G     GE++  RGR+++         P   +T 
Sbjct: 1342 EVVTCMITAQLSSEQTFHGTKDYLALGANLSYGEEIPVRGRIIILDVIDVVPEPGQPLTR 1401

Query: 1148 -----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1202
                 +Y  E KG ++AL+S QGHL+ A G K+ +      +L G+AF D+  LY+ SL 
Sbjct: 1402 HKLKTIYDGEQKGPVTALSSCQGHLVSAIGQKVYIWTLKNADLVGVAFVDSE-LYIHSLL 1460

Query: 1203 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
             VKN IL  D+ KSI  L ++     L+++++D    + + + F +DG  L  +V+DE+ 
Sbjct: 1461 CVKNLILAADVLKSIQLLRFQSDLRVLSVVSRDAIPREVYTSNFFVDGRRLGFLVTDERG 1520

Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGA------HVTKFLRLQMLATSS--------- 1307
            N+ I+ Y P    S  G++L+ RA+  +         V   LR  +L+  S         
Sbjct: 1521 NVVIYSYDPLEPSSRSGRRLVRRADMCLPTRAISSLRVANRLRHALLSVKSAGTGTQTTV 1580

Query: 1308 ------------DRTG-------AAPG------------SDKTN------RFALLFGTLD 1330
                        +RTG        APG               TN      + ++  GT  
Sbjct: 1581 PSAAGVGGSEVLERTGKTGVSSFVAPGRANSASAMTLSTPSATNIDPEKLKHSVYLGTQT 1640

Query: 1331 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
            G++  I PL +  + RL+  +K L+       GL P+    +  +          + D +
Sbjct: 1641 GAVFLIGPLRDKMYSRLRITEKNLIHHFGPTCGLLPKLCWNYRPSAPELVNPSGQVADAD 1700

Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            LL  Y  LP  ++LEIA ++G +   I+ ++ +L
Sbjct: 1701 LLWRYLTLPHSQRLEIAKKSGQSLEGIMDDIAEL 1734



 Score =  172 bits (436), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 141/482 (29%), Positives = 225/482 (46%), Gaps = 67/482 (13%)

Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
           R DS++L+F +AK++V+ FD   + L+  S+H +E   + +LK GR  F+  P+++VDP 
Sbjct: 27  RLDSLLLSFTEAKVAVMGFDPVQYELKTLSLHNYE---FENLKSGRTHFSHLPILRVDPL 83

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDE--------DTFGSGGGFS------ARIESSH 234
            RC  VLVY   + +L   +  +   GD+        +T    G  S      A + ++ 
Sbjct: 84  QRCAVVLVYDRHLAVLPFRRSEALAAGDKYLAKPVTNNTARGAGSLSWERRATAPLLATF 143

Query: 235 VINLRDL---DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
              L       + +V D  F++G+ EP +++L+E   TWAGRVS +  TC I ALS +  
Sbjct: 144 TTCLSSSTGEKINNVLDMQFLNGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQ 203

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---V 347
            + +P+IW   +LP+D   + +VP PIGGVL++  N+I Y  Q+  SC L LN YA    
Sbjct: 204 KRTNPVIWFQESLPYDCTYVHSVPEPIGGVLILATNSIIYMKQTLPSCGLPLNCYAQVTT 263

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT--VVYDGRVVQRLDLSK 405
           +    Q++P+    + LD      + +   L+ T+TG + LL+  V +  + V  L L +
Sbjct: 264 NFPMRQDVPQCG-PLTLDGCRIVTMTDSQFLIVTRTGKMCLLSLWVEHTTQTVSSLLLHE 322

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS------------GTSMLSSGLKEE 453
              SV    +  +     F+GSRL DS+L+  T  +              +  +   + +
Sbjct: 323 IGCSVPPYSVALLDKGYVFVGSRLCDSVLLHLTASTMFVNTLGRIVDLDETTTADNFRTD 382

Query: 454 FGDIEADA---------PSTKRLRRSSSDALQD----MVNGE------ELSLYGSASNNT 494
              IE DA         P+ K     SS         +V+G       ++ LYG    N 
Sbjct: 383 IPMIERDAESIPVDKNNPTEKEAENVSSGTPSKPSGSIVHGPYVFDEVDVELYGDTILNP 442

Query: 495 ESAQK---TFSFAVRDSLVNIGPL-----KDFSYGLRINADASATG-ISKQSNYELVELP 545
            S  +   T+ F V D LVN GP+      +  Y    N D +    I+ Q+    VEL 
Sbjct: 443 PSDVRELNTYKFEVADRLVNFGPMGLLTSGEVPYLAPGNTDPTDEALIAAQAEMHHVELL 502

Query: 546 GC 547
            C
Sbjct: 503 AC 504


>gi|393907593|gb|EJD74705.1| CPSF A subunit region family protein [Loa loa]
          Length = 990

 Score =  182 bits (461), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 167/640 (26%), Positives = 276/640 (43%), Gaps = 83/640 (12%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE +   RL   V+S AI        +   DS++L F+DAK+S++  + +   L+  S+H
Sbjct: 62  LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G       P+++VDP  RC  +LV+G  + +L  +  G+ L       
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+ + L  +D +  +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
            T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337 -SCALALNNYAVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
             C ++LN+    +D   + P   F    + LD    T +  +  LL  + G L  L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLV 332

Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            D    V+ L+L     +V+   +T+      F+GSRL DS+ +       T        
Sbjct: 333 TDATNSVKSLELKFQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------- 384

Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA---SNNTESAQKTFSFAVRDS 508
                ++  AP   +L  +  +A +D    E+  LYG         +SA++  +  V D 
Sbjct: 385 -----LDESAPKKIKL-NTELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDK 434

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
           L+N+GP K  + G    +        K   ++LV   G    G   ++ +S R     SS
Sbjct: 435 LLNVGPCKKITGGCPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSS 494

Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
            +            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L
Sbjct: 495 SIEGVVQYWAVGRREDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGEL 553

Query: 618 FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
                 +QV      ++      Q +                 V S SI DPY+ +   +
Sbjct: 554 ADGGLAVQVTTSSLVMVAEGQQIQHIPL----------QLTFPVRSASIVDPYIAICTQN 603

Query: 678 GSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
           G + +  L   P      +     +     P++S ++Y D
Sbjct: 604 GRLLMYELTSHPHVHLKEIDISKRLRHETSPITSLSIYRD 643



 Score = 84.7 bits (208), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 70/273 (25%), Positives = 123/273 (45%), Gaps = 29/273 (10%)

Query: 759  ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEEGTG 812
            E+G + I+ +P  + V+ V K     +H+ D    +   D E       +  S++   T 
Sbjct: 733  ENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMTDTF 788

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
              + E +    ++EL M     +  RP LF +L D T+  Y+ + +    N         
Sbjct: 789  AAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGHLAV 839

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGFFLS 929
              + L  + V+    R+ RF    LD     E+   A   +  +  F+ I     G F+ 
Sbjct: 840  RFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFIC 893

Query: 930  GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGST 988
             S PC   +     R+HP   DG I++FT  +N  C +GFIY+T  + ++++ +LP+   
Sbjct: 894  SSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLPNDMI 953

Query: 989  YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
             D  +PV++I + A+ H +TY    N Y ++ S
Sbjct: 954  LDTSYPVKRIDVGASVHSVTYLLHSNTYAVLTS 986


>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces marneffei ATCC 18224]
 gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces marneffei ATCC 18224]
          Length = 1383

 Score =  179 bits (453), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 133/530 (25%), Positives = 243/530 (45%), Gaps = 55/530 (10%)

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
              ++ G+    +SG+ P   +   + L     +    I   +  +   C  G +YV ++ 
Sbjct: 873  LSDLGGYAAVVMSGASPNLIVRTSKSLPHVYSIQSDFIRGISGFNGAGCKKGLVYVDNER 932

Query: 977  ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
            +++ CQL + +  D  WP+++IPL      + Y      Y +  +               
Sbjct: 933  LVRTCQLYNNAQLDFSWPIRRIPLNEQVDHLAYSTASGTYVVGTT--------------- 977

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYE---------VRILEPDRAGGPWQTRATIPMQS 1087
                 H+ D       +LH  +  EE           ++++ P      W+   +     
Sbjct: 978  -----HEQDFKLPDDDELHPEWATEEISLLPKVAYGSIKLINPKT----WKVIDSYTFSP 1028

Query: 1088 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ---- 1142
            +E    V  + L  +  T + + ++ +GT Y +GED+AARG V +F       +P     
Sbjct: 1029 AERITAVENINLEISEKTGKRKDMIVVGTTYAKGEDIAARGNVYVFDVIDVVPDPDEPGT 1088

Query: 1143 NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLY 1197
            NL  ++  +E ++GA++A++ +  QG +++A G K ++   K  G+ L  +AF D    Y
Sbjct: 1089 NLKLKLIGEESIRGAVTAVSGIGGQGFMIVAQGQKCMVRGLKDDGSLLP-VAFIDVQ-CY 1146

Query: 1198 VVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            V  +  +K     L+GD  K ++F  + E+  ++ L  KD   L+    +FL DG  L +
Sbjct: 1147 VSVIKELKGTGMCLIGDAFKGLWFTGYSEEPYKMTLFGKDLDELEVVTADFLPDGKKLYI 1206

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
            +V+D   N+ +  Y P+  +S  G +LL+R +FH+G   +    L   A SS+    +  
Sbjct: 1207 LVADGDCNLYVLQYDPEDPKSSNGDRLLNRCKFHMGHFASTLTLLPRTAVSSELAVMSSD 1266

Query: 1316 SDKTNRFALLFGTL----DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
            S   + +  L+  L     GS+  I  L E ++RRL +LQ +L +++ H  GLNPR++R 
Sbjct: 1267 SMDIDSYTPLYQALITTQSGSMALITSLSEESYRRLTALQSQLSNTLEHPCGLNPRAYRS 1326

Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
              S+G   R     ++D +LL  +  L    +LEIA + G    +I ++L
Sbjct: 1327 VESDGVVGR----GMIDGKLLMRWLDLSRSRKLEIAGRVGADEWEIRADL 1372



 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 181/751 (24%), Positives = 305/751 (40%), Gaps = 137/751 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++IY +  +   +   +N  +    V  +   A  L L   Y L+G V  +
Sbjct: 28  NLIVVKTSLLQIYTLVAETSTTLILENDQQADDDVKNE---ATKLHLHAEYDLYGTVTDI 84

Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           +    + S+ G D      +++L+F +AK+S++E++    G+   S+H +E         
Sbjct: 85  SPVKILKSRSGGD------ALLLSFRNAKLSLIEWNPETQGISTMSIHYYE--------- 129

Query: 173 GRESFARGPLV----------KVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE---- 217
            +E     P V           VDP  RC  +L +G++ I IL   Q G  LV DE    
Sbjct: 130 -KEDITLSPWVPDLSQCDSHLTVDPSSRCA-LLNFGVRNIAILPFHQAGDDLVMDEYDPD 187

Query: 218 ------------------DTFGSGGGF--SARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                             D+  + G         +S V+ L  LD  + H     F+H Y
Sbjct: 188 LDMDDLTDQEENKKPSHTDSKKAEGDLIHQTPYAASFVLPLTALDPTLIHPIGLTFLHEY 247

Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
            EP   IL+    T A  +  +    + S  ++    +    + S   LP D   ++A+P
Sbjct: 248 REPTFGILYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALP 307

Query: 316 SPIGGVLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
           +P+GG L++G+N  IH      + A+A+N +A  + +   + +S   + L+ +    + N
Sbjct: 308 APVGGTLLIGSNEMIHIDQSGKASAVAVNEFAKQVSAFPMVDQSDLELRLEGSVVEVINN 367

Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT--------IGNSLFF 424
           +    LL+  TG+LVL+    DGR V    +    P+V   D+ +        +G+   F
Sbjct: 368 ESGDILLTLSTGELVLVHFKIDGRSVSGFVVFPI-PAVSGGDVVSAVASCAVALGSGKVF 426

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGE 482
           +GS   +S+L+     S  S  S     +  D E +      +      S A ++ VN  
Sbjct: 427 IGSEDAESVLLDCYLPSAVSKKSRDYDRDHFDEEMNNEEDDDMYEDDLYSSAPKEAVN-- 484

Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
           +    G  S+N       ++F V D L+++GPL+  + G   + D++A     Q + + +
Sbjct: 485 KTVSNGRISDN-------YTFKVIDRLLSLGPLRAVAVGKPASRDSNAE--DAQQSVDDL 535

Query: 543 ELPGCKG-----------------------------IWTVYHKSSR-GHNADSSRMAAYD 572
           EL    G                             +W +   +++ GHN DS       
Sbjct: 536 ELAAAYGSGRGGGVALLQRTLHLDDVFTLGAESADSVWNITTSNTKSGHN-DSG------ 588

Query: 573 DEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQV 626
           +E  +Y+I++         T+V    +   E   + D    G  TI    L G  RV+QV
Sbjct: 589 EENQSYVILTKANSPENEETLVYAVNERNLEPFNAPDVNPNGDPTIDIDVLAGNSRVVQV 648

Query: 627 FERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
                RI D +  M Q     P   E   G E   V S S AD Y+L+   D S+ LL  
Sbjct: 649 LTGEVRIYDTNLGMAQ---IYPVWDED-EGDERFAV-SASFADHYLLIIRDDSSVLLLHS 703

Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
           D S     +  P  +  S +P     LY D+
Sbjct: 704 DESGDLDELTKPETV--SSQPWLCGCLYTDR 732


>gi|49619061|gb|AAT68115.1| cleavage and polyadenylation specificity factor 1 [Danio rerio]
          Length = 312

 Score =  178 bits (452), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 99/253 (39%), Positives = 149/253 (58%), Gaps = 18/253 (7%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 66  LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L      +  + DE   
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFR---NDTLADEQEG 175

Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
             G G       S++I++R+LD  + ++ D  F+HGY EP ++IL E   TW GRV+ + 
Sbjct: 176 IVGEGQKFSFLPSYIIDVRELDETLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
            TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS   
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVP- 294

Query: 339 ALALNNYAVSLDS 351
                 + VSL+S
Sbjct: 295 -----PFGVSLNS 302


>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
 gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
          Length = 1323

 Score =  178 bits (451), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 169/647 (26%), Positives = 278/647 (42%), Gaps = 96/647 (14%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE +   RL   V+S AI       +    DS++L F+DAK+S++  + +   L+  S+H
Sbjct: 62  LECLLAVRLLAPVQSFAIARISQNPDC---DSLLLGFDDAKLSIVAVNPADRCLKTISLH 118

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G       P+++VDP  RC  +LV+G  + +L  +   + L       
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSTQL------- 168

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+ + L  +D +  +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
            T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337 -SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
             C ++LN+    +D   + P   F   ++ LD A  T +  +  LL  + G L  L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332

Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            D    V+ L+L     +V+   +T+      F+GSRL DS+ +   C    S L     
Sbjct: 333 TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388

Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT---ESAQKTFSFAVRDS 508
                      +TK+++ S+     +    E+  LYG         +  ++  +  V D 
Sbjct: 389 -----------ATKKMKLSTEPNANE--EDEDFELYGEVLPKVAKPDVTEELLNIRVLDK 435

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
           L+N+GP K  + G    +        K   ++LV   G    G   +  +S R     SS
Sbjct: 436 LLNVGPCKKITGGCPSVSAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495

Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
            +            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L
Sbjct: 496 SIEGVVQYWAVGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554

Query: 618 FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
                 +QV          G +I    Y+   L+F               V S SI DPY
Sbjct: 555 ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTF--------------PVRSASIVDPY 597

Query: 671 VLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
           + +   +G + +  L   P      +     +     P++S ++Y D
Sbjct: 598 IAICTQNGRLLMYELTNQPHVSLKEIDISKRLRHETSPITSLSIYRD 644



 Score =  147 bits (371), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 96/292 (32%), Positives = 151/292 (51%), Gaps = 16/292 (5%)

Query: 1145 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
            +  +Y KE KG +++L S  G+LL   G K+ +  +    L GI+F D    Y+  L  V
Sbjct: 1039 IKTLYDKEQKGPVTSLCSCNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYIHQLIGV 1097

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQK 1262
            +N  L  D+++S+  L ++E+   L+L ++D  S      A +FLID   +  ++SDE  
Sbjct: 1098 RNLALACDMYRSLALLRYQEEYKALSLASRDMRSDVQPPMAAQFLIDNKQMGFIMSDEAA 1157

Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
            NI IF Y P+  ES  G+KL  RAE ++G  V  F+R++   +S        G  +   F
Sbjct: 1158 NIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISS--------GFVENELF 1209

Query: 1323 AL-----LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            +L     LF +LDGS G + PL E  FRRL  LQ+ +   V   AGLN +  R       
Sbjct: 1210 SLERQSVLFASLDGSFGYLRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRP 1269

Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
             H     ++VD ++   Y  L L E+ ++A + GT+R  I+ +L ++   T+
Sbjct: 1270 NHYLNTRNLVDGDVAMQYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTA 1321



 Score = 81.6 bits (200), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 70/276 (25%), Positives = 121/276 (43%), Gaps = 29/276 (10%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
            +  E+G + I+ +P  + V+ V K     +H+ D    +   D E       +  S +  
Sbjct: 731  IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPVTGEGIDAMSGTMT 786

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
             T   + E +    ++EL +     +  RP LF +L D T+  Y+ + +    N      
Sbjct: 787  DTFAVKPEEV----IMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQGH 837

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
                 + L  + V+    R+ RF  T  D     E+   A   +  +  F+ I     G 
Sbjct: 838  LAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 891

Query: 927  FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQLPS 985
            F+  S PC   +     R+HP   DG I++FT  +N  C +GFIY+T +   +++ +LPS
Sbjct: 892  FICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAVCPNGFIYLTERDRFMRVAKLPS 951

Query: 986  GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
                D  +PV++I + AT H + Y    N Y ++ S
Sbjct: 952  DMILDASYPVKRINVGATVHSVVYLLHSNTYAVLTS 987


>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
            [Schistosoma mansoni]
          Length = 1825

 Score =  176 bits (446), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 181/769 (23%), Positives = 312/769 (40%), Gaps = 125/769 (16%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            ++ + + +G LEI+ +P+F  ++ V  F      ++D      +   +     ++ +   
Sbjct: 1086 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1142

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                ++I    ++E+ +        RP L  + T   I  ++A L   P+          
Sbjct: 1143 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1190

Query: 873  TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
             S  L        RLR  R     PL A  R  T P     Q   +        F+NI  
Sbjct: 1191 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1249

Query: 923  HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            H+G F+ G  P W       +LRV P   DG + +F  L+   C+ GF+Y T    +++ 
Sbjct: 1250 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1309

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
             LP G +++ +  ++ I L   P+ + Y  E   Y  +V +   +P   V  L  +    
Sbjct: 1310 TLPPGYSFNEHLGIKWITLDPVPYYVQYHVESKTY-AVVGIHS-EPCKSVFRLNAEGNKE 1367

Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG------PWQTRATIPMQSSENALTVR 1095
              +     + V      T++ Y +++  P+          PW     IP    E      
Sbjct: 1368 EDVLVRPKTCV----LPTLDYYSLQMYAPNLNANHRNKQPPW---LLIPNTLIEFEPWEV 1420

Query: 1096 VVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
            V  L        ET       LA+G     GE++  RGR+L+         P   +T   
Sbjct: 1421 VTCLITAQLASEETFHGTKDYLALGANLTYGEEIPVRGRILILDVIDVVPEPGQPLTRHK 1480

Query: 1148 ---VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
               ++  E KG ++AL S QGHL+ A G KI +     T+L G+AF D+  LY+ +L  V
Sbjct: 1481 LKIIHDGEQKGPVTALTSCQGHLISAIGQKIYIWTLKNTDLVGVAFVDSE-LYIHNLLCV 1539

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
            KN +L  D+ KS+  L ++     L+++++D  S + + + F +DG  L  +VSDE  N+
Sbjct: 1540 KNLVLAADVLKSVQLLRFQSDLRVLSVVSRDNISREVYTSNFFVDGRRLGFMVSDELGNV 1599

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVT------KFLRLQMLATSSDRTGAAPG--- 1315
             I+ Y P    S  G++L+  A+  + +  T        LR  +L+     T  A     
Sbjct: 1600 TIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRVANRLRHALLSVKPSSTTTASAMTA 1659

Query: 1316 ------SDKTN--------------------------------------------RFALL 1325
                   D TN                                            R ++ 
Sbjct: 1660 GTSATIQDSTNTVLDNLSRVDSVNQMNNLRQSQQQSTAAQQGTTNPNSGVDPEKFRQSIY 1719

Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD- 1384
            FG+ +GSI  I P+ +  + RL+  +K L+  +  + G+ P+S   +      +RP P+ 
Sbjct: 1720 FGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGMPPKSCWSY------NRPQPEL 1773

Query: 1385 -----SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
                  + D +L+  Y  LP  ++LEIA ++G +   I+ ++ +L   T
Sbjct: 1774 ANPCGKVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAELIATT 1822



 Score =  156 bits (395), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 146/586 (24%), Positives = 252/586 (43%), Gaps = 123/586 (20%)

Query: 4   AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
           A +K +  PT + NC    +TH +                           + NLV+T  
Sbjct: 15  AVFKHISPPTAVDNCLYCHLTHPK---------------------------LKNLVITRG 47

Query: 64  NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
             IEIY V+        S  SGET+               V    ++ N+  +  +   G
Sbjct: 48  GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 87

Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
                   S++L+F +AK++V+ F+     LR  S+H +E   + +LK GR +F + P++
Sbjct: 88  DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 140

Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
           ++DP  RC  +LVY   + +L   +    +  + D    +  +   +  R  +  +    
Sbjct: 141 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 200

Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
                     + +V D  F++G+ EP +++L+E   TWAGRVS +  TC I ALS +   
Sbjct: 201 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 260

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
           + +P+IW   +LP D   +++VP PIGGV+V+ AN+I Y  Q+  SC L LN YA    +
Sbjct: 261 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 320

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
               Q++P S   + +D      L     L+ T++G+L LL++  +   + V  L   K 
Sbjct: 321 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 379

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
             +V    +  + +   F+GSR  DS+L++                      + G+  ++
Sbjct: 380 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 439

Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
             S L +    +E D                        + STKR     +D + D    
Sbjct: 440 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 499

Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
            +  ++ LYG +  +  S  +    +SF V D L+N+GP+   + G
Sbjct: 500 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 545


>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
            [Penicillium digitatum Pd1]
 gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
            [Penicillium digitatum PHI26]
          Length = 1271

 Score =  176 bits (446), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 134/467 (28%), Positives = 235/467 (50%), Gaps = 43/467 (9%)

Query: 978  LKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1037
            ++ CQLPS + +D  W ++K+P++   + + Y      Y L  S            L   
Sbjct: 822  IRACQLPSQTQFDYSWTLRKVPIEEQVNFLAYSTSSETYVLGTS------RQGDFKLPEG 875

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV 1097
             E+  +  N  LS         + E  ++++ P      W    + P+   E    V+ V
Sbjct: 876  DELHPEWRNEELSFCP-----KIPESSIKVVSPKT----WTIIDSYPLDPDEQVTAVKNV 926

Query: 1098 TL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT----EVYSKE 1152
             +  +  T E   L+ +GTA  +GED+ ARG + +F   + A +P+   T    ++  KE
Sbjct: 927  NIEVSENTHERMDLIVVGTAIAKGEDMPARGTIYVFDVIKVAPDPERPETGRKLKLIGKE 986

Query: 1153 -LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
             +KGA++AL+ +  QG +++A G K ++   K  G+ L  +AF D    YV   N+VK  
Sbjct: 987  TVKGAVTALSGIGGQGFIIVAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYV---NVVKEL 1041

Query: 1208 -----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
                 ++LGD  K ++F  + E+  ++ L  KD   L+  A +FL DG+ L ++V+D   
Sbjct: 1042 KGTGMVILGDAVKGLWFAGYSEEPYRMTLFGKDPEYLEVVAADFLPDGNKLYMLVADSDC 1101

Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKT 1319
            N+ +  Y P+  +S  G +LLSR++F+ G   + VT   R  + +  ++ +  A   D+T
Sbjct: 1102 NLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLLPRTAVSSELTESSEEAMDVDET 1161

Query: 1320 -NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
              ++ +L  + +GS+  +  + E ++RRL  LQ +L+++V H AGLN R+FR   S+G A
Sbjct: 1162 FAKYQVLIASQNGSLALVTSVAEESYRRLSGLQSQLINTVDHPAGLNARAFRATESDGAA 1221

Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
             R     +VD  LL  +  +  + Q EIA + G T  +I ++L  + 
Sbjct: 1222 GR----GMVDGNLLRLWLNMGKQRQAEIAGRVGATEWEIKADLETIG 1264



 Score =  114 bits (286), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 175/758 (23%), Positives = 301/758 (39%), Gaps = 153/758 (20%)

Query: 57  NLVVTAANVIEIYVV------RVQEEGSK---ESKNSGETKRRVLMDGISAASLELVCHY 107
           NL+V   ++++I+ +      ++Q+EGS+      +  ETK            L L   Y
Sbjct: 28  NLIVIRTSLLQIFSLVKIVSSQLQKEGSEPHGSQFSQPETK------------LVLEKEY 75

Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            L G V  L+ +     +N    ++I++A  +AK+S++E+D   HG+   S+H +E  + 
Sbjct: 76  PLSGTVTDLSRVKI--LNNKSGGEAILIAVRNAKLSLIEWDPERHGISTISIHYYERDDL 133

Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDE 217
                  +    G ++ VDP  RC  V  +G++ + IL   Q G  LV         G+ 
Sbjct: 134 TRSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELEGER 192

Query: 218 DTFGSGGGFSARIE-----------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
               SGGG   +             SS V+ L  LD  + H     F++ Y EP   IL 
Sbjct: 193 PIQNSGGGAEPKKSKEGPAYQTPYCSSFVLPLTALDPSLLHPISLAFLYEYREPTFGILF 252

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
            +  T    +  +      +  ++    +    + S   LP D +K++A+P P+GG L++
Sbjct: 253 SQVATSTALLYERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALLL 312

Query: 325 GANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
           G+N I H      + A+ +N ++  + S     +S  +  L+      L  D    LL+ 
Sbjct: 313 GSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLAL 372

Query: 382 KTGDLVLLTVVYDGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
            +GD+ L+    DGR V  +           D+ K+  S      + +G+   F+GS   
Sbjct: 373 ASGDMALIKFKLDGRSVSGITIHLLPAHAGGDMLKSAASC----SSCLGDGNVFIGSEDA 428

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEE 483
           DS+L++++  S                     STK+ R  S            D    E+
Sbjct: 429 DSVLLEWSRSSA--------------------STKKARLESKQTADGFDDLEDDDDQMED 468

Query: 484 LSLYGSASNNTESAQKT---------FSFAVRDSLVNIGPLKDFSYGLRIN---ADASAT 531
             LY SA  +T+   +          ++F ++D L +IGPL+D + G   +    +  AT
Sbjct: 469 DDLYSSAPGSTQVDNRMGTENLTTEFYNFRLKDCLPSIGPLRDITLGKVFSNTYREKQAT 528

Query: 532 GISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADS 565
             +  +  ELV   G                            G+W+   K  RG     
Sbjct: 529 CEAVSAELELVASQGSDRGGGLVVIKREIDPLTTMSLKIDDADGVWSASVKKRRG----- 583

Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLF 618
              ++ D+    Y+++S    +   E  ++     +++  F        +  T+  G+  
Sbjct: 584 --ASSTDNPSRQYVVVSRSTDSE-QELNEVFVAEEQNLKPFRAPEFNPNEDCTVDIGSFA 640

Query: 619 GRRRVIQVFERGARILDGSYMTQDLS-FGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
           G  R++QV     R  D   M   LS   P   E    S+    +S S  DPY+++   D
Sbjct: 641 GDTRLVQVLRNEVRSYD---MELGLSQIYPVWDE--DTSDERVAVSASFIDPYLMIIRDD 695

Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
            S+ LL  D +     V     I SS+    S  LY+D
Sbjct: 696 SSVLLLQADENGDLDEVPLSTLIISSR--WRSGCLYYD 731


>gi|325187036|emb|CCA21579.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1912

 Score =  175 bits (444), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 109/344 (31%), Positives = 185/344 (53%), Gaps = 24/344 (6%)

Query: 1111 LAIGTAYV--QGEDVAARGRVLLFST---------GRNADNPQNLVTEVYSKELKGAISA 1159
            + IGT YV   GED + +GR+LL+           G  +     L      +  +GAI++
Sbjct: 1570 IVIGTGYVGPNGEDASGKGRLLLYEVDYAQYVDKDGTTSSKLPKLRLTFIKEHHQGAITS 1629

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV-KNFILLGDIHKSIY 1218
            +  L  ++L + G K+I++++   +L G AFYDA  +++ SL+++ K +++  D++KS+ 
Sbjct: 1630 VIQLGMYVLASVGSKMIVYEFKSDQLIGCAFYDAQ-MFITSLSVLRKEYVMYSDVYKSVS 1688

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
            FL W+++  QL LLAKD+  L     EF I  + L+L+ +D ++N+ +  YAP   ES  
Sbjct: 1689 FLRWRQKDRQLILLAKDYEPLAVTTAEFNILDTRLALIAADVEENLHVLQYAPHDIESRG 1748

Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTG-AAPGSDKTNRFALLFGTLDGSIGCIA 1337
            GQ+LL  ++FHVG  ++  LR  +++ +S +    A G    N +  + G+ +G I  + 
Sbjct: 1749 GQRLLRTSDFHVGVQISSILRKLVISNASHQQYIPAKGRCIGNMYLNVLGSSEGGIAALI 1808

Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS---------IVD 1388
            P+ E  FRRL +LQ  ++ ++P    LNPR FR   +NG+      D+          +D
Sbjct: 1809 PVPERVFRRLFTLQNVMISALPQNCALNPREFRVMKANGRVRSGRADAWCKQKWKKGFLD 1868

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT-SFL 1431
             ++L  +  L    Q E+A   GT    I+ NL++L   T SFL
Sbjct: 1869 GQVLCRFLHLDYVAQKELARCIGTNPEVIIQNLSELQRNTMSFL 1912



 Score =  166 bits (419), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 153/569 (26%), Positives = 252/569 (44%), Gaps = 124/569 (21%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+LD+K  + DF F+ GY+EP ++ILHE    +  +GR +  + T  ++ LSI+  
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
            + HP+IW   NLP D ++++    PIGG L++  N I Y +Q+    + LN +A     
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501

Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
                    + +  + LP +S               S+ L   H  +L +   LLS    
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561

Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
            L +L +     RV    + + T   +L    S  +T         N   F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621

Query: 434 LVQF--------TCGSGTSMLS----SGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVN 480
           L              SGT ++S    SG     G D + +A   ++L+   S    D  +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681

Query: 481 GEELSLYG------SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGL 522
            ++  LYG      S       +   FSF+              D +  IG +     G+
Sbjct: 682 EDDAFLYGPTLSQESTGGGKLPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGV 741

Query: 523 RINADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHN 562
           + N+D++        + G SK  +  ++            EL GC+ +WTV   SS    
Sbjct: 742 QSNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPE 801

Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
           +  +R       Y +YLI+S+  RTM+L T + +  + +   ++  G T+ A NLF +RR
Sbjct: 802 SQVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRR 855

Query: 623 VIQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSE 657
           ++QVF++G R++  + +                      TQ++ F   + ESG     + 
Sbjct: 856 IVQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTA 914

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
           N  ++SV   DPY+LL ++DGSIRLL GD
Sbjct: 915 NVGIVSVDTIDPYILLLLTDGSIRLLEGD 943



 Score = 57.8 bits (138), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 68/316 (21%), Positives = 127/316 (40%), Gaps = 64/316 (20%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
            +CY  G+L ++ VP+F  +            +++T   E+ + S    +T  +  S+ G 
Sbjct: 1086 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1145

Query: 811  ----------TGQGRKENIHSMK--VVELAMQRWSA----HHSRPF----LFAILTDGTI 850
                      T +GR    H++   V ++A+ R       H+S+ F    L   L +G +
Sbjct: 1146 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1205

Query: 851  LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
            + Y+      P   S  D    +   + V+    +R    ++  +   +A T +E   G+
Sbjct: 1206 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1261

Query: 910  P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
                                +T F N++ + G F  G+ P W +  + +     L +   
Sbjct: 1262 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1321

Query: 949  LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
              D +        +++FT  H+ NC +GF+Y  S G L++C+LPS          N + +
Sbjct: 1322 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1381

Query: 996  QKIPLKATPHQITYFA 1011
            QK+   AT H + Y  
Sbjct: 1382 QKVRFGATIHHLLYLG 1397


>gi|325189779|emb|CCA24259.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1911

 Score =  175 bits (443), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 109/344 (31%), Positives = 185/344 (53%), Gaps = 24/344 (6%)

Query: 1111 LAIGTAYV--QGEDVAARGRVLLFST---------GRNADNPQNLVTEVYSKELKGAISA 1159
            + IGT YV   GED + +GR+LL+           G  +     L      +  +GAI++
Sbjct: 1569 IVIGTGYVGPNGEDASGKGRLLLYEVDYAQYVDKDGTTSSKLPKLRLTFIKEHHQGAITS 1628

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV-KNFILLGDIHKSIY 1218
            +  L  ++L + G K+I++++   +L G AFYDA  +++ SL+++ K +++  D++KS+ 
Sbjct: 1629 VIQLGMYVLASVGSKMIVYEFKSDQLIGCAFYDAQ-MFITSLSVLRKEYVMYSDVYKSVS 1687

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
            FL W+++  QL LLAKD+  L     EF I  + L+L+ +D ++N+ +  YAP   ES  
Sbjct: 1688 FLRWRQKDRQLILLAKDYEPLAVTTAEFNILDTRLALIAADVEENLHVLQYAPHDIESRG 1747

Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTG-AAPGSDKTNRFALLFGTLDGSIGCIA 1337
            GQ+LL  ++FHVG  ++  LR  +++ +S +    A G    N +  + G+ +G I  + 
Sbjct: 1748 GQRLLRTSDFHVGVQISSILRKLVISNASHQQYIPAKGRCIGNMYLNVLGSSEGGIAALI 1807

Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS---------IVD 1388
            P+ E  FRRL +LQ  ++ ++P    LNPR FR   +NG+      D+          +D
Sbjct: 1808 PVPERVFRRLFTLQNVMISALPQNCALNPREFRVMKANGRVRSGRADAWCKQKWKKGFLD 1867

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT-SFL 1431
             ++L  +  L    Q E+A   GT    I+ NL++L   T SFL
Sbjct: 1868 GQVLCRFLHLDYVAQKELARCIGTNPEVIIQNLSELQRNTMSFL 1911



 Score =  166 bits (421), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 152/568 (26%), Positives = 253/568 (44%), Gaps = 123/568 (21%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+LD+K  + DF F+ GY+EP ++ILHE    +  +GR +  + T  ++ LSI+  
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
            + HP+IW   NLP D ++++    PIGG L++  N I Y +Q+    + LN +A     
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501

Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
                    + +  + LP +S               S+ L   H  +L +   LLS    
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561

Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
            L +L +     RV    + + T   +L    S  +T         N   F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621

Query: 434 LVQF--------TCGSGTSMLS----SGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVN 480
           L              SGT ++S    SG     G D + +A   ++L+   S    D  +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681

Query: 481 GEELSLYG-----SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGLR 523
            ++  LYG      ++     +   FSF+              D +  IG +     G++
Sbjct: 682 EDDAFLYGPTLSQESTGGAMPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGVQ 741

Query: 524 INADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNA 563
            N+D++        + G SK  +  ++            EL GC+ +WTV   SS    +
Sbjct: 742 SNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPES 801

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
             +R       Y +YLI+S+  RTM+L T + +  + +   ++  G T+ A NLF +RR+
Sbjct: 802 QVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRRI 855

Query: 624 IQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSEN 658
           +QVF++G R++  + +                      TQ++ F   + ESG     + N
Sbjct: 856 VQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTAN 914

Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGD 686
             ++SV   DPY+LL ++DGSIRLL GD
Sbjct: 915 VGIVSVDTIDPYILLLLTDGSIRLLEGD 942



 Score = 57.8 bits (138), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 68/316 (21%), Positives = 127/316 (40%), Gaps = 64/316 (20%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
            +CY  G+L ++ VP+F  +            +++T   E+ + S    +T  +  S+ G 
Sbjct: 1085 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1144

Query: 811  ----------TGQGRKENIHSMK--VVELAMQRWSA----HHSRPF----LFAILTDGTI 850
                      T +GR    H++   V ++A+ R       H+S+ F    L   L +G +
Sbjct: 1145 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1204

Query: 851  LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
            + Y+      P   S  D    +   + V+    +R    ++  +   +A T +E   G+
Sbjct: 1205 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1260

Query: 910  P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
                                +T F N++ + G F  G+ P W +  + +     L +   
Sbjct: 1261 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1320

Query: 949  LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
              D +        +++FT  H+ NC +GF+Y  S G L++C+LPS          N + +
Sbjct: 1321 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1380

Query: 996  QKIPLKATPHQITYFA 1011
            QK+   AT H + Y  
Sbjct: 1381 QKVRFGATIHHLLYLG 1396


>gi|422295485|gb|EKU22784.1| cleavage and polyadenylation specificity factor subunit 1
            [Nannochloropsis gaditana CCMP526]
          Length = 395

 Score =  174 bits (442), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 108/334 (32%), Positives = 184/334 (55%), Gaps = 22/334 (6%)

Query: 1108 ETLLAIGTAYV--QGEDVAARGRVLLFSTGRNA-----DNPQNLVTEVYSKELKGAISAL 1160
            +T LA+GT  V  +GEDV ++GR+L++    +        P   + + YS+   G  +A+
Sbjct: 68   DTYLAVGTCTVRAKGEDVPSKGRLLMYRISLDPYAGLTSPPTLTLVDQYSQR-SGPPTAI 126

Query: 1161 ASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            A L  H++IA+GP + ++ ++  E L  IAFYDA   YVVSL +VK  + + D + S++ 
Sbjct: 127  AQLGPHIIIAAGPTLWVYAFSAREKLKPIAFYDAD-FYVVSLRVVKTLVAVTDAYHSVHL 185

Query: 1220 LSWKEQGAQ--LNLLAKDFG---SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            L W E      L L+ KD+    S     + F++D  +L ++V D + N+Q+  Y P   
Sbjct: 186  LRWHEHDPAHTLELMGKDYSPIVSAQPGGSHFVVDPPSLGMLVGDSRGNLQLLQYDPADV 245

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            ES  G +L+ RA+FH+ +H   FL+   +A         PG+ +     ++FG+++G +G
Sbjct: 246  ESRGGNRLVRRADFHL-SHRLSFLQHTRMAEVPR-----PGAYRAGVRVMVFGSVEGGVG 299

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             + P++E  +RRL +LQ  +V+++PHV   NPR FR   + G A        +D ELL  
Sbjct: 300  ALVPVEEKVYRRLYALQAVMVNALPHVGAFNPRGFRLVEARGWAQG-RKKGTLDGELLWR 358

Query: 1395 YEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
            +  L + +Q ++A   GT+R  +L +L ++ + T
Sbjct: 359  FAGLSVGKQEDLASAIGTSREMVLESLLEVDMMT 392


>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
          Length = 1259

 Score =  174 bits (440), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 110/322 (34%), Positives = 176/322 (54%), Gaps = 21/322 (6%)

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGR----NADNPQNL-VTEVYSKELKGAISALASLQ 1164
             + +GT   +GEDVA +G + LF        + D+  N  +  +  +E KGA+SA+ S  
Sbjct: 950  FIGVGTCINRGEDVAVKGAMYLFEIAELIPSSKDSGNNYKLKMLMREETKGAVSAITSCS 1009

Query: 1165 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
            G+ ++A G K+++      E L  +AFYDA   Y+VSL ++KNFIL+GD  KSI FL+++
Sbjct: 1010 GYFVVAVGQKVLIRALEINERLISVAFYDAG-TYIVSLEVLKNFILVGDQVKSITFLAFQ 1068

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E   +L  L++D   ++   + FL     +S V +D Q ++++  Y P    +  G+KL+
Sbjct: 1069 ESPYKLVQLSRDARQIETCVSNFLAHEDQISFVSNDIQGDLRLIDYNPFDPTAEGGEKLI 1128

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
               EFH G+  T  L L        +    P S+      LL G +DGS+ C++P+DE+T
Sbjct: 1129 RTTEFHKGSEATCSLLLP-------KPSVRPSSE------LLLGCVDGSLSCLSPVDEIT 1175

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            F+ L  LQ  LV  +PH+A LNPR+ R    N    R     I+D  LLS Y+ +    Q
Sbjct: 1176 FKALWLLQGALVRQIPHIAALNPRAHRHVR-NDYVSRSLSKGILDGLLLSAYQTIDHATQ 1234

Query: 1404 LEIAHQTGTTRSQILSNLNDLA 1425
            +EIA + G +++++L  L + +
Sbjct: 1235 VEIAKRIGYSKAELLGYLRNFS 1256



 Score =  136 bits (343), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 164/666 (24%), Positives = 278/666 (41%), Gaps = 103/666 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV--- 113
           N+V TA N ++IY + +                        +A L L   Y+LHG +   
Sbjct: 30  NIVTTANNTLKIYEIDIDS-------------------NTPSAKLILRREYQLHGEIIGI 70

Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
           +S+ ILS         +D +++AF DAKI++LE+ D I+ +   S+H +E  + + + + 
Sbjct: 71  QSIKILST----TEDGKDRLLIAFRDAKIALLEWSDEINDIVTVSIHTYERSQQV-ISQD 125

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
              F    +++ DP+ RC  +L+    + IL      + L  D D   S          S
Sbjct: 126 MSRFK--AILRSDPENRCSALLLPDDSLAILPVHSAHAEL-EDLDQDVSNAIKDVPYAPS 182

Query: 234 HVINLR--DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++ L+  D D+ +V D+ F+ G+  P + +L E   TW GR+S    TC +  L++   
Sbjct: 183 FILPLKSIDSDICNVIDYTFLPGFHNPTLAVLCEPRQTWTGRLSDSQDTCQVFFLTLDLV 242

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
            + +P+I +  NLP+D+  L A P  IGGV ++ AN  IH          A N +A +L 
Sbjct: 243 TQVYPIIATVDNLPYDSMSLKAAPKEIGGVAILSANAIIHVDQNGRPVGRATNGWA-TLT 301

Query: 351 SSQEL--PRSSFSVELDAAHATWLQ------NDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
           S++    P     V L+ A   +LQ      +  ALL    G++  +    +GR + R+D
Sbjct: 302 SARNFDAPPKDLFVRLEGASIEFLQPKSKQTHPQALLFLPNGEIHAVQFYREGRTISRID 361

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
           +SK         I +    L   G  L     V      GTS L   + +   D+E    
Sbjct: 362 ISK---PFAKGSIPSGAYRLDIDGQGLSGGQFVFIPSMVGTSFLIR-VGKSLNDLEL--- 414

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS------FAVRDSLVNIGPLK 516
              +  +  + A  DM    +  LYGS+    +  ++         F + D + + GP++
Sbjct: 415 -FPKQEKVGTTAYDDMDVDVDEELYGSSDKKADEKEEEEEISSEPPFTICDYIESYGPIQ 473

Query: 517 DFSYG---------LRINADASA------TGISKQSNYE---LVELPGCKGIWTVYHKSS 558
           D + G         L+I A   A      T   ++  +E    +++ G  G+WT ++ + 
Sbjct: 474 DITIGRYMQTRNSPLQILAATGAGHVGGITAFHQEVPFESKHKLDVQGNHGLWT-FNVTG 532

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
            G+      + A D +    +   L +  + L   D   EV           TIAA    
Sbjct: 533 VGN-----VLVATDSKSKTKISKLLPSNEVALIAED--NEV-----------TIAADTAA 574

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
              R++ +     ++L    + Q               EN  V   SI+DPY+L   S+G
Sbjct: 575 NSTRILMITSNAIKVLKEDGIEQ----------QSLQIENGEVQRASISDPYILTLQSNG 624

Query: 679 SIRLLV 684
           SI L +
Sbjct: 625 SISLFI 630


>gi|291232724|ref|XP_002736306.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
           [Saccoglossus kowalevskii]
          Length = 304

 Score =  173 bits (439), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 118/354 (33%), Positives = 175/354 (49%), Gaps = 62/354 (17%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +A Y+ +H PTGI +C  G                  EE               NL++  
Sbjct: 2   YALYRQIHPPTGIEHCVYGH-------------FFSKEE--------------KNLIIAG 34

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
           A  + +Y + + +  SK+ K+  E  R                 + L GN+ SL      
Sbjct: 35  ATDLHVYRL-LSDVDSKQKKSKLEHLRS----------------FSLFGNIMSLQTTRLA 77

Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
           GA     RD+++L+F+DAK+SV+E+D   H L+  S+H FE      LK G  S    P 
Sbjct: 78  GAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEEA---LKEGYVSNYYIPQ 130

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           V VDP  RC  +L+YG ++++L   + G+    D+D    G   S+ +  S++INL+D+D
Sbjct: 131 VVVDPDNRCAVMLMYGSKLVVLPFRREGAA--EDQDGVLPGSSKSSFL-PSYIINLQDID 187

Query: 243 MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            K  ++ D  F+HGY EP + IL E   TW GRV+ +  TC I A+S++   + HP+IWS
Sbjct: 188 QKLINIIDIKFLHGYYEPTLFILFEPLRTWPGRVAVRKDTCCIVAISLNIEQRVHPVIWS 247

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
             NLP D  K + VP PIGGVLV   +++ Y +QS         Y VSL+   E
Sbjct: 248 LNNLPFDCIKAIPVPKPIGGVLVFAVDSLLYLNQSVP------PYGVSLNGLTE 295


>gi|149066088|gb|EDM15961.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
            isoform CRA_b [Rattus norvegicus]
          Length = 241

 Score =  173 bits (438), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 94/239 (39%), Positives = 139/239 (58%), Gaps = 13/239 (5%)

Query: 1188 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
            +AF D   LY+  +  VKNFIL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F+
Sbjct: 1    MAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFM 59

Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
            +D + L  +VSD  +N+ ++ Y P+  ES+ G +LL RA+FHVGAHV  F R       +
Sbjct: 60   VDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------T 112

Query: 1308 DRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
               GAA G  K      N+    F TLDG IG + P+ E T+RRL  LQ  L   +PH A
Sbjct: 113  PCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHA 172

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            GLNPR+FR  H + +  +    +++D ELL+ Y  L   E+ E+A + GTT   IL +L
Sbjct: 173  GLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 231


>gi|328864890|gb|EGG13276.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
          Length = 1627

 Score =  173 bits (438), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 100/287 (34%), Positives = 166/287 (57%), Gaps = 6/287 (2%)

Query: 1142 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1201
            Q  +  +Y K+ KG ++++A L G L+++ GPK+I++ ++   L G+AFYD   +++VSL
Sbjct: 1336 QKRLNLLYEKDQKGPVTSIAGLNGLLIMSIGPKMIVNNFSSGSLIGLAFYDTQ-IFIVSL 1394

Query: 1202 NIVKNFILLGDIHKSIYFLSWKE---QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            N VKN+IL+GD+ KSI F   K    Q   + LL KD+  +  ++++F++D   LS+V+S
Sbjct: 1395 NTVKNYILVGDMFKSISFFKLKVCIIQKKNIILLGKDYEEVSTYSSDFIVDEKKLSMVLS 1454

Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1318
            D  +NI++F + P   ES  GQ LL+++ FH+G    KF+R+ M  T+ D   ++     
Sbjct: 1455 DANRNIRMFSFDPSDPESRAGQMLLAKSSFHIGELNNKFVRIPMKNTNYDNNSSSSSIIV 1514

Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ-FHSNGK 1377
             ++  L +GTL G I  + P+++     L +L+ KL+      AGLNPR FR   H N  
Sbjct: 1515 NDKHLLFYGTLGGGINLLMPINKRFHEILHALETKLMHR-GQTAGLNPRGFRYGHHVNNT 1573

Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                    +VD +LL+ ++ L  ++  ++A   G+T   IL  LN L
Sbjct: 1574 LGHLHNQYVVDGDLLTKFQSLSPDDAKQLATSIGSTTPIILDLLNQL 1620



 Score =  115 bits (287), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 68/242 (28%), Positives = 114/242 (47%), Gaps = 37/242 (15%)

Query: 912  QRITIFKNISGHQGFFLSG-SRPCWCMVFRERLRVHPQ---------------LCDGSIV 955
            +RI  F NI   +G F+SG S P W    +   R+HP                     I 
Sbjct: 1018 RRIIPFSNIGNKRGIFVSGVSTPIWIFSEKNFPRIHPMKQQQQTTSSSSSSSSSSKRPIT 1077

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNL 1015
             FT  HN+NC HGFIY    G+L IC+LP G+ Y+N WP++K+ ++ T H+I+Y   +  
Sbjct: 1078 TFTTFHNINCKHGFIYFDHTGMLCICRLPDGTNYENEWPIRKLAIRMTCHKISYHPVQKC 1137

Query: 1016 YPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRAG 1074
            Y L++S P     ++      ++E+             L +   +EE Y++++++P    
Sbjct: 1138 YVLVLSYPQAPQSDEDEQEEQEREL-------------LKKPLVLEEKYQLKLIDP---A 1181

Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKEN----ETLLAIGTAYVQGEDVAARGRVL 1130
              W    +  +   E  L  +++ L +    +     +  + +GTAY  GED   +GR+L
Sbjct: 1182 NNWNIIDSFSLAEKETVLCSKIIYLRHADESDIIPKLKPFVIVGTAYTHGEDTVCKGRIL 1241

Query: 1131 LF 1132
            +F
Sbjct: 1242 IF 1243


>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
 gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
          Length = 1187

 Score =  171 bits (434), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 147/491 (29%), Positives = 225/491 (45%), Gaps = 82/491 (16%)

Query: 949  LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQIT 1008
            L DG +  F   +  NC +GF+Y  S+  L+IC L    TYD  WPV K+PL+ T H IT
Sbjct: 767  LVDGYVKCFAPFNIANCPNGFLYFNSEEDLRICVLDQRFTYDCPWPVHKVPLRNTLHFIT 826

Query: 1009 YFAEKNLYPLIVS-VPVLKPLNQVLSL---LIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1064
            +      Y +I S + V + +  + +     I  E G +  +  +    L +  T E +E
Sbjct: 827  HHFVTKTYVIISSTMTVCEKMPHITTEDKEFIPVEKGDRFIHAPVEKFCL-QLITSETWE 885

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDV 1123
            +    PD         A I M   E+   ++ V L +  T    +  +A+GT  V GE+V
Sbjct: 886  II---PD---------AEIQMAEWEHVTCLKSVKLKSEETVSGLKEFIAVGTTNVCGEEV 933

Query: 1124 AARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
            A RGR+++F        P     +N +   Y KE KG ++A+  ++G L+ + G KI + 
Sbjct: 934  ACRGRIVIFDVIEVVPEPGKPLTKNKIKTYYDKEQKGPVTAITCVEGFLVTSIGQKIYIW 993

Query: 1179 KW-TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
            ++    +L G+AF D   +Y+ SL                                 D  
Sbjct: 994  EFRDNKDLIGMAFIDTL-IYIHSL---------------------------------DRH 1019

Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
             L+ F T F ++ + L  V             AP   ES  GQ L+ RAE   G++   F
Sbjct: 1020 QLEIFNTNFYVNKNQLGFV-------------AP---ESHGGQFLVRRAEIQTGSNAHAF 1063

Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
             R ++ A +  +          N+    FGTLDGSIG + P+DE  +RRL SLQ KL   
Sbjct: 1064 FRTKVRALNQRQ--------NENKHITWFGTLDGSIGLLLPVDEKEYRRLFSLQAKLSIY 1115

Query: 1358 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
            +   AGLN ++FR F S+ K  +    +I+D +LL  Y  L   E+ ++A Q  +T  QI
Sbjct: 1116 LEQNAGLNQKAFRTFRSHQKKLQNSMRNILDGDLLKRYFHLGFVERRDLAKQIMSTPEQI 1175

Query: 1418 LSNLNDLALGT 1428
            +++L  L L T
Sbjct: 1176 INDLTKLELST 1186



 Score =  144 bits (364), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 95/299 (31%), Positives = 144/299 (48%), Gaps = 19/299 (6%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+      I +Y +   +E       S      +  D      LE +  Y  +G +  +
Sbjct: 29  NLLTAGPTCIRVYDIIKDQEDIDLDNRSDNADNHLNKDNKLHPELEFLASYSFYGKIYGI 88

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
               +        RDS+ + F DAK+S++E+D     L   S+H FE  E   LK G   
Sbjct: 89  ----ESVRFRHHHRDSLFICFADAKLSLVEYDADNSNLTTLSLHTFEDDE---LKNGFSR 141

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-DTFGSGGGFSARIESSHV 235
               P+++VDP  RC  ++V  + + IL     G      + D   + G +   +  S+V
Sbjct: 142 NLSIPIIRVDPDNRCAAMVVSNVHLAILPFRHRGPAEQQVQIDPKNTSGKYP--LMPSYV 199

Query: 236 INLRDLDMKHVKDFI---FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
           +++RDL  + V   I   F+ GY EP ++IL E   TW+GRV+ +  TC I A+S++T  
Sbjct: 200 VDVRDLGNEKVSRLIDIRFLEGYYEPTILILCEILRTWSGRVAVRQDTCSILAVSLNTID 259

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
           K HP+IWS  NLP D    + VP PIGGVL+  AN + + +QS         YA SL+S
Sbjct: 260 KVHPVIWSLNNLPFDCLGAITVPRPIGGVLIFAANCLLHLNQSKP------PYAESLNS 312



 Score = 75.9 bits (185), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 88/391 (22%), Positives = 159/391 (40%), Gaps = 100/391 (25%)

Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT--ESAQKTFSFAVRDSLVNIGPL 515
           + D P++K+LR       +++       LY + ++ T  ES  ++++F V D ++++GP 
Sbjct: 336 DTDEPTSKKLRTDDEKEDEELE-----KLYSAHTSCTAKESYLRSYTFEVCDRILHVGPC 390

Query: 516 KDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCKG 549
              + G        +T + ++S+ E+V                          +LPGC  
Sbjct: 391 ASIAIG------QISTFVQEESDVEVVICSGHDKNGALSVLNKGIKPQVVASYDLPGCVD 444

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
           +WTV  K  R ++ +        +  H +LIIS +  TM+L T   +TEV E + +  Q 
Sbjct: 445 MWTV--KDIRLNDENDGDFET--ENTHKFLIISRDNLTMILRTGKEITEV-EQLGFLTQT 499

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
           +T+ AGNL     +IQV      ++      Q L               S ++  S+ DP
Sbjct: 500 KTVFAGNLDNGNCIIQVTPYEVILVSKGEKIQQLEL----------ENESPIVFCSLQDP 549

Query: 670 YVLLGMSDGSIRLL---VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------- 715
           Y+ L +  GSI +L   + D     V +     +  S+  +++C L+ D           
Sbjct: 550 YISLLLEGGSIMMLAFELSDNGEKQVKLVNTTPLNHSR--IAACCLFQDNNGRMSVSDGI 607

Query: 716 --KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI--------------------- 752
             + P P    T+  A L       ID  +   LD  D                      
Sbjct: 608 SIRTPSP----TNEPAELMEDEKFTIDDDELLYLDVNDTNLQTNDVPVASTSYTDNLERK 663

Query: 753 ---YSVVCYESGALEIFDVPNFNCVFTVDKF 780
              +  +C ++G LE++ +P+++ V+TV+ F
Sbjct: 664 VSYWLFLCLDNGKLEVYSIPSYDKVYTVNGF 694


>gi|392585051|gb|EIW74392.1| hypothetical protein CONPUDRAFT_133073 [Coniophora puteana RWD-64-598
            SS2]
          Length = 1490

 Score =  169 bits (428), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 177/350 (50%), Gaps = 19/350 (5%)

Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTG 1135
            W T        +E    V VVTL   +T+  ++  +A+GT   +GED+A RG   +F   
Sbjct: 1142 WITLDGYEFAPNEFVNAVEVVTLETLSTETGSKEFVAVGTTINRGEDLAVRGATYIFEVV 1201

Query: 1136 RNADNPQNLVTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGI 1188
                +P + +   Y  ++      KG ++AL  + G+L+ + G KI +  +   E L G+
Sbjct: 1202 EVVPDPSSKLDRWYKLKMRVRDDAKGPVTALCGINGYLVSSMGQKIFIRAFDLDERLVGV 1261

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF DA  +YV SL  +KN +L+GD  KS++ ++++E   +L +L+KD       + +F  
Sbjct: 1262 AFLDAG-VYVTSLKALKNLLLIGDAVKSVWLVAFQEDPYKLVILSKDIRRQYAASVDFFF 1320

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
                LS+V  DE+  ++ + Y P   ES  GQ+LL   EFH     +  L +     +  
Sbjct: 1321 ANGELSIVTEDEEGVLRAYEYDPNDPESRSGQQLLCHTEFHGHKECSTTLTIARRTKTEH 1380

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                A          L+ G  DGS+  + P+DE  F+RLQ LQ +L  +V H+AGLNPR+
Sbjct: 1381 EIPQA---------KLISGFGDGSLSALTPVDEAAFKRLQLLQGQLTRNVQHIAGLNPRA 1431

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
            FR    N    +P    I+D +LLS +E   +  Q E+  Q GT R+ IL
Sbjct: 1432 FRIVR-NETVSKPLSKGILDGQLLSSFEAQGITRQGEMTRQIGTERTTIL 1480



 Score =  121 bits (303), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 196/913 (21%), Positives = 349/913 (38%), Gaps = 130/913 (14%)

Query: 57  NLVVTAANVIEIYVVR-------VQEEGSKESKN---------SGETKRRVLMDGI---- 96
           NLV   +N+I IY VR        Q E  KE K+          GE +     DG     
Sbjct: 40  NLVTARSNIIRIYEVREDAASLSSQVEAEKERKSHVRKGTEAVEGEVEMDTGGDGWVNMG 99

Query: 97  ----------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLE 146
                     +      V  + +HG V  +  + +  + N  R D ++++F+DAKI++LE
Sbjct: 100 SVKSTSSGPPTVTRFHFVREHVVHGIVTGMDCI-RTISSNEDRMDRLLVSFKDAKIALLE 158

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           + D+ H L   S+H +E  E   L        R  L +VDP  RC  + +    + IL  
Sbjct: 159 WSDAAHDLITVSIHTYERSE--QLMSIDAPLFRSSL-RVDPLSRCAALSLPNNALAILPF 215

Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVIL 263
            Q  +     E    + G        S +++L    D  + +V DF F+ G+  P + +L
Sbjct: 216 YQTQAEFDVIEGEGETEGMRDVPYSPSFILDLPVDVDSSLCNVIDFAFLPGFNNPTLAVL 275

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP------ 317
            + E TWAGR+     T ++   ++       P++ +   LP DA+ L     P      
Sbjct: 276 CQSEQTWAGRLKEHRDTTLVVTFTLDLLSCTFPILSTLRGLPSDAFSLSPATLPPDFTSG 335

Query: 318 -------IGGVLVVGANTIHYHSQSASCA-------------LALNNYAVSLDSSQELPR 357
                    GV+V+  + + Y  Q A C              L+++N  ++  ++++   
Sbjct: 336 LSGGASNAHGVVVLTPDAVLYADQ-ARCVGAAVSGWATRTSDLSISNAYLTGGTAKDAEG 394

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDIT 416
               + L+ A    L     LL  ++G++ ++ +V +GR V R+D+      +V+ + + 
Sbjct: 395 DVKPLALEGAFPLLLTPTALLLVLRSGEMHVVRLVTEGRSVGRVDVGPCVGQTVMPATVV 454

Query: 417 TIGNSLFFLGSRLGDS--------LLVQFTCGSGTSMLSSGLKEEFGDIEADA--PSTKR 466
            +      LG   G+         + V    G  T +LS+   EE      +    S   
Sbjct: 455 RVKAPQRALGQGQGEGEKAKERRMVFVGSIVGPAT-LLSAERVEETAAANGNGVNGSGAN 513

Query: 467 LRRSSSDALQDM--VNGEELSLYGSASNNTE----SAQKTFSFAVRDSLVNIGPLKDFSY 520
               + DA  +M     ++  LYG  +  ++    SA++   FA  D++   GP+ D ++
Sbjct: 514 GHVENKDAGMEMDVDLDDDDDLYGPTTLTSQPSSGSAEEALRFAFCDAIPAHGPILDMAF 573

Query: 521 GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTVYHKSS-RG 560
            L    D       ++TG      + L +             L G +GIW++  K S RG
Sbjct: 574 ALGKWGDRYVPELVASTGAEHLGGFTLFQRDLPIRTKRKLHVLGGARGIWSISVKQSPRG 633

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLF 618
             A S+      +  +  ++IS +A     V   A   T    ++   + G T+ AG  F
Sbjct: 634 SAASSAGAGPNPELANDTVVISTDANPSPGVSRIATRSTRTDLAIPTRIPGTTVGAGPFF 693

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           GR  ++ V     R+L+      D +   S  ++      + +   SI DP VL+   D 
Sbjct: 694 GRTAILHVMTNSIRVLE-----PDGTERQSIKDTDGNMPRAKIRWCSICDPVVLIIREDD 748

Query: 679 SIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKG-----PEPWLRKTSTDAWLS 732
           ++ L +G+P    +  +  + + E S + ++ C      G      +P     S+     
Sbjct: 749 TLGLFIGEPERGRIRRKDMSPMGEKSSRYIAGCFFADTSGLFEAFMDPKAAAASSKGDKD 808

Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
            G  + +        +    + V+    G LEI+ +P    VF+     +      D+Y 
Sbjct: 809 KGATQTMQSVVNAATNSQ--WLVLVRPQGVLEIWTLPKLTLVFSTTLIATLDNVCADSYD 866

Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
             AL                Q        + V  + M +    +  P L   L  G +  
Sbjct: 867 PAALS-------------LPQDPPRKPQELDVENIVMAQLGESNPTPHLMVFLRSGQVAI 913

Query: 853 YQAYLFEGPENTS 865
           Y+      P + S
Sbjct: 914 YETVHHPPPPDPS 926


>gi|258575565|ref|XP_002541964.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902230|gb|EEP76631.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1376

 Score =  169 bits (427), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 148/553 (26%), Positives = 253/553 (45%), Gaps = 86/553 (15%)

Query: 913  RITIFKNISGHQGFFLSGSRPCWCMVFRE------RLRVHPQLCDGSIVAFTVLHNVNCN 966
            R+    ++ G++  F+ GS PC+ M          RL+  P      + + +  H   C 
Sbjct: 876  RLRAIPDLCGYKTMFMPGSNPCFIMKSSTSSPHVLRLKGEP------VSSLSSFHMPACE 929

Query: 967  HGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLK 1026
             GF YV ++ ++++C+LP  + +DN W  +KI +      + YFA    Y L  S     
Sbjct: 930  KGFAYVDAKNMVRMCRLPGNTRFDNAWAARKIHIGEQVDCVEYFARSETYVLGTS----- 984

Query: 1027 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1086
              ++   L  D EV  +  +  +S +       ++   V++L P      W       + 
Sbjct: 985  -YHEDFKLPEDDEVHTEWRSEVISFMP-----QLDRGRVKLLSPRT----WSIIDCYDLG 1034

Query: 1087 SSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS---TGRNADNPQ 1142
            ++E  L ++ + +  +  T E + ++ +GTA V+GED+  RG + +F       + D P+
Sbjct: 1035 ATERILCLKTINMEVSEITHERQDMVVVGTAIVRGEDITPRGSIYVFEIIDVAPDPDRPE 1094

Query: 1143 -NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPL 1196
             N   ++++KE +KGA++A++ +  QG L+ A G K ++   K  G+ L  +AF D    
Sbjct: 1095 TNQKFKLFAKEDVKGAVTAISGIGGQGFLIAAQGQKCLVRGLKEDGSLLP-VAFMDMQ-C 1152

Query: 1197 YVVSLNIVKN--FILLGDIHKSIYFLSWK------------EQGAQLNLLAKDFGSLDCF 1242
            YV  L  ++     ++GD  K ++F  +             E+  +L L  KD   L   
Sbjct: 1153 YVSVLKELQGTGLCIMGDALKGLWFTGYSVQLSSAVDVETCEEPYKLTLFGKDSEYLQVV 1212

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
            A +FL                       P    S KG +LL R+ FH G H    L L  
Sbjct: 1213 AADFL-----------------------PDDPSSSKGDRLLHRSSFHTG-HFISTLTLIP 1248

Query: 1303 LATSSDRTGAAPGSDKTNR----FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1358
              TSS  TGA+  +   +     + ++  +  GS+G I PL E T+RRL +LQ +LV S+
Sbjct: 1249 QYTSSG-TGASEDNMDVDYMPAGYQVVVTSQSGSVGVITPLTEETYRRLSALQSQLVMSM 1307

Query: 1359 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
             H  GLNP+++R   S+G + R     +VD  LL  +  + ++ + EIA + G     I 
Sbjct: 1308 EHPCGLNPKAYRAVESDGFSGR----GLVDGNLLLRWLDMGVQRKAEIAGRVGADLQSIR 1363

Query: 1419 SNLNDLALGTSFL 1431
            ++L  +  G  FL
Sbjct: 1364 ADLERINGGLDFL 1376



 Score =  114 bits (284), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 186/745 (24%), Positives = 298/745 (40%), Gaps = 123/745 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   +V++++ +     G+  S ++ +  R   ++      L L+  Y L G V  L
Sbjct: 28  NLIVAKTSVLQVFSLVNVAYGASTSPSTDDKTR---VERQQYTRLVLLAEYDLPGTVTGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   HG+   S+H +E  E LH       
Sbjct: 85  GRVKT--LDSKSGGEALLVATRNAKLSLVEWDHERHGISTVSIHYYER-EDLHNSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD---EDTFG---------- 221
               P L+ VDP  RC  +L +G+  + IL   Q G  LV D   ED  G          
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDDFDEDLRGEKPEDMDNAL 200

Query: 222 ---SGGGFSAR----IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
              +     AR      SS V+ L  LD  + H     F++ Y EP   IL+    T   
Sbjct: 201 VESTAANDVARHKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHVATSFA 260

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHY 331
            +  +      +  ++    +    + +   LP D + ++ +P PIGG L++G+N  IH 
Sbjct: 261 LLGERKDVVSYAVFTLDIQQRTSTTLVTVSRLPSDLWNVVPLPPPIGGSLLIGSNELIHV 320

Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL---QNDVALLSTKTGDLVL 388
                + A+ +N +A          +S   + L+      L     D+AL+   +G + +
Sbjct: 321 DQAGKTNAVGVNEFARQASEFSMADQSDLELRLEGCVIEQLGTESGDIALV-LASGRMAI 379

Query: 389 LTVVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS 441
           +    DGR V     +L  ++   S+L +  +   ++G    FLGS   DS+LV +T  S
Sbjct: 380 VRFKVDGRSVSGIFVQLVSTQAGGSILKARPSCSASLGRGKIFLGSEETDSVLVGWTRPS 439

Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGSASNNT 494
                                S KRL+R SS       D   D  +  E  LY + +N T
Sbjct: 440 Q--------------------SIKRLKRDSSGPRAGETDTDDDEDDIYEDDLYSTPTNQT 479

Query: 495 ESAQKT----------FSFAVRDSLVNIGPLKDFSYGLRINA-DASATGISKQS-NYELV 542
              +            F F   D L ++GP+KD + G      D ++   SK S + ELV
Sbjct: 480 TVPKTVSQTNGLIKDEFVFRCHDRLWSLGPMKDITLGRTPGTRDQASKKTSKPSTDLELV 539

Query: 543 EL--PGCKGIWTVYHKSSRGHNADSSRMAAYDD-------------------EYHAYLII 581
                G  G  T+  K    +  DS +M   D                     Y  YL+ 
Sbjct: 540 VTHGQGDAGGLTILRKELDPYIIDSMKMDNVDGVWSVQIAPSNTSNPSTTSRNYDKYLVF 599

Query: 582 SLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARIL 634
           S ++R    E + + T     +D          +  T+  G L G  RV+QV     R  
Sbjct: 600 S-KSRGHAKEQSVVYTVGGNGIDEMKAPEFNPNEDHTVDIGTLAGGTRVVQVLTSEVRSY 658

Query: 635 DGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
           D      DL+     P   E    S+  +V   S A+PY+L+   D S+ LL  D S   
Sbjct: 659 D-----TDLALAQIYPVWDE--DTSDELSVTGASFAEPYLLITRDDQSLLLLQPDSSGDL 711

Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
             V     + +S K +  C LY DK
Sbjct: 712 DEVNIDGLL-TSNKWLCGC-LYFDK 734


>gi|147864212|emb|CAN80950.1| hypothetical protein VITISV_016701 [Vitis vinifera]
          Length = 262

 Score =  167 bits (422), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 89/148 (60%), Positives = 105/148 (70%), Gaps = 26/148 (17%)

Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLV 510
           +++ GDIE D PS KR RRSSSDALQDM N ++L LYG A N+TE++QKTFSF+V DSL+
Sbjct: 53  RKKVGDIEGDVPSAKRSRRSSSDALQDMFNSDKLPLYGLAPNSTETSQKTFSFSVSDSLI 112

Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYEL--------------------------VEL 544
           N+GPLKDF+YGLRINAD  ATGI KQSNYEL                          VEL
Sbjct: 113 NVGPLKDFAYGLRINADLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRPERITEVEL 172

Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYD 572
           PGCKGIWTVYHK++RGHNADS +M + D
Sbjct: 173 PGCKGIWTVYHKNTRGHNADSIKMVSAD 200


>gi|393907594|gb|EJD74706.1| hypothetical protein LOAG_18016 [Loa loa]
          Length = 398

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 108/329 (32%), Positives = 172/329 (52%), Gaps = 11/329 (3%)

Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1162
            +  LA+GTA   GE+V  RGR+++         P     ++ +  +Y KE KG +++L S
Sbjct: 72   QNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCS 131

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
              G+LL   G K+ +  +    L GI+F D    YV  L  V+N  L  D+++S+  L +
Sbjct: 132  CNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYVHQLIGVRNLALACDMYRSVALLRY 190

Query: 1223 KEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
            +E+   L+L ++D  S      A +F+ID   +  V+SDE  NI IF Y P+  ES  G+
Sbjct: 191  QEEYKALSLASRDMRSDVQPPMAAQFIIDNKQMGFVMSDEAANIAIFNYLPETLESLGGE 250

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
            KL  RAE ++G  V  F+R++   +S          +   R ++LF +LDGS G + PL 
Sbjct: 251  KLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLE---RQSVLFASLDGSFGFLRPLT 307

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            E  FRRL  LQ+ +   VP  AGLN +  R        H     ++VD +++  Y  L L
Sbjct: 308  EKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQYLHLSL 367

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
             E+ ++A + GT+R  I+ +L ++   T+
Sbjct: 368  PEKNDLARKLGTSRYHIIDDLIEICRVTA 396


>gi|392572878|gb|EIW66021.1| hypothetical protein TREMEDRAFT_70300 [Tremella mesenterica DSM 1558]
          Length = 1408

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 138/520 (26%), Positives = 237/520 (45%), Gaps = 41/520 (7%)

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRER-LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
            + N+ G  G F++G +P W M   +  LR++  L  G++      H  +    F+ +   
Sbjct: 916  YDNLEGQSGAFITGEKPYWIMSSEKHPLRLY-GLKQGAMAFGPTTHLGSMGEYFMKIDDG 974

Query: 976  GILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLY--PLIVSVPVLKPLNQVLS 1033
                IC  P     D   P  +  ++ T   + +      Y     +SVP          
Sbjct: 975  CF--ICYFPQSLNTDLTMPCDRYEMQRTYTNVVFDPPSGHYLGATAISVPFQA------- 1025

Query: 1034 LLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENA 1091
               D+E   Q+  +  NL    L+   ++E +        R   PW+         +EN 
Sbjct: 1026 --YDEEGEIQLGPEGENLVP-PLNERSSLELFS-------RGSDPWRVIDGYDFDQNENV 1075

Query: 1092 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN---LVTE 1147
            L+++ V L +++        +A+GT +  GED A RG V +F        P        +
Sbjct: 1076 LSMQSVLLESSSVPGGYRDFVAVGTGFDFGEDRATRGNVYIFEVVEVVPEPGQKSAWALK 1135

Query: 1148 VYSKE-LKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVK 1205
            +  K+  +  +SAL ++ G+LL ++GPK+ +      E L G+AF D   +Y+ S+ + K
Sbjct: 1136 LRCKDPCRNPVSALGNINGYLLHSNGPKMYVKGLDFDERLMGLAFVDVM-IYLTSIKVFK 1194

Query: 1206 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
            NFIL+ D+ KSI+FLS++E   +  +++KD   +   + +FL+    ++ +  D   +I+
Sbjct: 1195 NFILISDMVKSIWFLSFQEDPYKFTVISKDLMPISVTSADFLVHDGHVTFLTYDRSGDIR 1254

Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
            +  + P   ES  G++L+ R E+H G+ VT      M+A    R G     +   +  ++
Sbjct: 1255 MVDFDPANPESINGERLIVRTEYHGGSPVTV---STMIAR---RRGVE--EEFAPQTQII 1306

Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
                DGSI          FRRL  +  +L+ +  HVAGLNPR+FR    N    +P    
Sbjct: 1307 CAHADGSISTFVSTKPARFRRLHFVSDQLIRNAQHVAGLNPRAFRTVR-NDLVAKPLSRG 1365

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
            I+D ELL  + + P++ Q E+  Q GT    + S+L  L 
Sbjct: 1366 ILDGELLGRFAIQPIDRQREMLKQIGTDGGTVASDLQALG 1405



 Score =  123 bits (309), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 151/658 (22%), Positives = 275/658 (41%), Gaps = 98/658 (14%)

Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
           L L+C + LHG +  LA L   +   D     D ++++F+DAK+++LE+  S   +   S
Sbjct: 117 LHLLCQHTLHGWITGLAPLRTIESSVDG---LDRLLVSFKDAKMALLEW--SRGDIATVS 171

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
           +H +E  +   +  G   F   PL++ DP  R   + +    + IL   Q  S L   E+
Sbjct: 172 LHTYERCQ--QMVTGDLQFYT-PLLRSDPLSRLAVLTLPEDSLAILPVLQEQSDLDPLEN 228

Query: 219 TFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
            F     +S     S V++L D+   +K+++D +F+ G+  P + +L+    TWAGR   
Sbjct: 229 -FTKDAPYSP----SFVLSLADVAPTIKNLQDLLFLPGFHSPTLAVLYSPYHTWAGRYHS 283

Query: 277 KHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGV-LVVGANTIHYHSQ 334
           +  T  +   +   T    +PL+ S   LP D+  ++A P+ +GGV LV     +H    
Sbjct: 284 QRDTFCLEVRTFDITAGGSYPLLTSVSGLPSDSLYIVACPAELGGVVLVTTTGLLHIDQS 343

Query: 335 SASCALALNNY-----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
             + A ++N +      +  D S E    S  + L+ + + ++     LL  + GD+  +
Sbjct: 344 GRTVATSVNAWWSHITTLPCDKSSE----SRKISLEGSKSVFVTERDMLLVLQNGDVHQV 399

Query: 390 TVVYDGRVVQRLDLSKTNPSV-LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
               +GR +  + + + + +V   S + T GN   F+G   GDSLL         +    
Sbjct: 400 RFEMNGRAIGAIKVDEQSSNVPAPSSMVTTGNQAIFVGCAEGDSLLANVDIKRAVA---- 455

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE----SAQKTFSFA 504
            +++    IEA+A           D  +D+    ++ L   A+N  +    +       +
Sbjct: 456 -IEDRKPAIEAEA---------EVDWDEDLYGDIDVPLTNGATNGAKYQAITGPANIVLS 505

Query: 505 VRDSLVNIGPLKDFSYGLRINADASAT--------GISKQSNY-------------ELVE 543
             D L  +G + D  +G+    + + T        G SK+S +                E
Sbjct: 506 PADVLTGVGKIVDMEFGIASTDEGTRTYPQLVTIGGGSKRSTFNAFRRGIPISKRRRFNE 565

Query: 544 LPGCKGIWTVYHKSSRGHN-----ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
           L   + +W +  +   G +      D      +  E     I SL A+    +       
Sbjct: 566 LFNTESVWFLPIQRPSGQHLKSIPEDRRTTMLFSSEATQTRIFSLSAKPNPEQIGR---- 621

Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
                   + G+++  G  F R  V+ V +    +LD    TQ             G+E 
Sbjct: 622 --------ISGKSLTVGPFFQRSNVLVVTQTEVLLLDSDGKTQ----------QSIGNEG 663

Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHD 715
             ++S SI+DPYV++   +GS  + VGD     +S V+ P+  +S + P  +  ++ D
Sbjct: 664 EEIVSASISDPYVVIRRVNGSGSMFVGDTVARQLSEVKIPS--DSLQPPYQAIEVFSD 719


>gi|452825139|gb|EME32137.1| cleavage and polyadenylation specificity factor subunit-like protein
            [Galdieria sulphuraria]
          Length = 1454

 Score =  164 bits (414), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 156/578 (26%), Positives = 248/578 (42%), Gaps = 84/578 (14%)

Query: 906  PHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD-----GSIVAFTVL 960
            PH  P      F N+S H G FL+GS P   ++ +   + H  + D     G I++ T +
Sbjct: 877  PHLRP------FYNLSSHFGVFLTGSVPSIIVLSKGYPQKHEIMIDSGVEYGDILSITNM 930

Query: 961  HNVNCNHGFIYVTSQGILKICQLPSGS--TYDNYWPVQKIPLKATPHQITYFAEKNLYPL 1018
             +   N     + S G +   ++      + +  WPV+   +      + Y A    + +
Sbjct: 931  GDPENNRKLWILDSNGRIHFGEIRETQLESINWAWPVEVFRMNGCVKNVVYHATTGTFGV 990

Query: 1019 IVSVPV----LKPLNQVLSLLIDQE---VGHQIDNHNLSSVDLHRTY-------TVEEYE 1064
            +VS  V    L+   Q+       E   +G Q      ++ +             VE YE
Sbjct: 991  VVSSIVSMSRLERKRQIFERQKRDERAILGSQAPPEEENNTEFEENEPKNALPIEVEAYE 1050

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENAL--TVRVVTLFNTTTKEN--------------- 1107
            ++I   D     W+       +  E  L  T   V  +  T +EN               
Sbjct: 1051 LQIYRAD----TWELVDKFAFKEEEAVLSATFMQVDAYKITEEENNDDKSSRATQQQAEA 1106

Query: 1108 -------------ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN-------LVTE 1147
                         +  + IGT +++GED   RGR++LF   R     +         +  
Sbjct: 1107 AISQSSRSIKFKPKECIVIGTGFIKGEDAGTRGRLMLFEVARQEAYTEESGAFSAIQLML 1166

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTG-TELNGIAFYDAPPLYVVSLNIVKN 1206
            +  KELK  +S++A L+G++  A GPK+ ++K    +EL   +FY    L+  S+N VK 
Sbjct: 1167 IAEKELKSVVSSIARLEGYICCAVGPKVEIYKLVNESELVCCSFYSGFQLFSTSINTVKQ 1226

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            ++ +GD++K  YFL W+++   LN L KDF  +   +TEFLI    +  VVSD   N+ +
Sbjct: 1227 YVFVGDMYKGGYFLFWRDRNKSLNFLGKDFDPVQTLSTEFLILNEFILFVVSDNFGNLHL 1286

Query: 1267 FYYA-PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM---LATSSDRTGAAPGSDKTNRF 1322
              YA P   ES  G+KLL R   H+G   +  +RL+       S DR G+          
Sbjct: 1287 LEYAGPHEIESRGGEKLLRRGVLHLGTRSSSMIRLRTDWKENNSEDRAGSH--------- 1337

Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKK--LVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
             ++ GT DG + C+ PL +  + +   L KK  L     +VAGLNP+ FR      K  R
Sbjct: 1338 IVVLGTWDGGLACLLPLQQEEYEQKNELLKKVYLHSYSLYVAGLNPQEFRIPRGLSKKTR 1397

Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
               + ++D  LLS  + L     +EIA   G   SQ+L
Sbjct: 1398 TFGERLLDSTLLSSLQGLEYTSIVEIAKSCGLDASQLL 1435



 Score = 97.1 bits (240), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 120/528 (22%), Positives = 198/528 (37%), Gaps = 89/528 (16%)

Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           KVDP+     VL+    ++++        ++   D+  +    +  +    +++LR L  
Sbjct: 166 KVDPEHGLIAVLIRKKNLLLI----AKYPILSHRDSLSAECSSNKLLSDPVILDLRRLGH 221

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
                F F+ GY  P + +L E+  TW+G  S    + ++S +    + K+   IW    
Sbjct: 222 FETIHFCFMFGYSLPTLALLEEKTPTWSGSFSVTRDSRLVSVVQFDLSDKKMKRIWQVEE 281

Query: 304 LPHDAYKLLAVPS-PIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFS 361
           LPH+ + + +VP    GG LV G N I Y    +    L+ N+      S   L      
Sbjct: 282 LPHECFMVSSVPFLQGGGFLVFGWNIILYFRDGSFVDGLSCNDLGDVYLSKWSLRSQDAP 341

Query: 362 VELDA--------AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
           + LD         +H T+++N V +L  + G    L +   G     + L      +  S
Sbjct: 342 ISLDGCEVVSEFDSHDTFMKNPVIIL--RDGAFFELCIPKKGG-DSVISLRYCKILIQPS 398

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            ++  GN L FLGS +  S L++    + T +                           D
Sbjct: 399 TVSYCGNGLIFLGSHVSPSALLEIIWKNSTEL-----------------------HPEDD 435

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            L+        S +G +SN     +   S   RDSL  IGP++D      I   +    +
Sbjct: 436 ELE--------SFFGKSSNKNFVVETIDS---RDSLFCIGPIQDLEVFDNIIGSSRKMEL 484

Query: 534 SK---QSNYELV---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
                  NY  V                L  C+ IW V  +   G    S  +       
Sbjct: 485 IAAVGSRNYGAVIIFRRTVSPSLLTSIRLEDCQQIWNVLCQRKMGERNGSVPL------- 537

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
              LI+S +  T+VL  +D + E+ +S  +    RT+    +   R +IQVF+ G RIL 
Sbjct: 538 ---LILSTQRNTIVLSVSDTIDELVDS-QFQTSSRTLWVSRVLHDRYIIQVFDEGLRILG 593

Query: 636 GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
                  L   P             V    + DPYV+L +S   + +L
Sbjct: 594 NWDSLISLYELPP---------GDVVTQAFVCDPYVMLHLSSSYLVIL 632


>gi|402590016|gb|EJW83947.1| hypothetical protein WUBG_05142 [Wuchereria bancrofti]
          Length = 374

 Score =  164 bits (414), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 107/329 (32%), Positives = 172/329 (52%), Gaps = 11/329 (3%)

Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1162
            +  LA+GTA   GE+V  RGR+++         P     ++ +  +Y KE KG +++L S
Sbjct: 48   QNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCS 107

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
              G+LL   G K+ +  +    L GI+F D    Y+  L  V+N  L  D+++S+  L +
Sbjct: 108  CNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYIHQLIGVRNLALACDMYRSLALLRY 166

Query: 1223 KEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
            +E+   L+L ++D  S      A +FLID   +  ++SDE  NI IF Y P+  ES  G+
Sbjct: 167  QEEYKALSLASRDMRSDVQPPMAAQFLIDNKQMGFIMSDEAANIAIFNYLPETLESLGGE 226

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
            KL  RAE ++G  V  F+R++   +S          +   R ++LF +LDGS G + PL 
Sbjct: 227  KLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLE---RQSVLFASLDGSFGYLRPLT 283

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            E  FRRL  LQ+ +   V   AGLN +  R        H     ++VD +++  Y  L L
Sbjct: 284  EKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVVMQYLHLSL 343

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
             E+ ++A + GT+R  I+ +LN++   T+
Sbjct: 344  PEKNDLARKLGTSRYHIIDDLNEICRVTA 372


>gi|380494933|emb|CCF32776.1| cft-1, partial [Colletotrichum higginsianum]
          Length = 542

 Score =  162 bits (410), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 145/545 (26%), Positives = 242/545 (44%), Gaps = 48/545 (8%)

Query: 889  NLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
            N   +++P++    E  E P   P +      NI+G+   FL G+ P   +   +     
Sbjct: 1    NPTIAKSPVEVADDEANEQPRFVPLRPCA---NINGYSTVFLPGASPSLIVKSAKSSPKV 57

Query: 947  PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPH 1005
              L    +   +  H   C  GFIY  S+G  ++ QLP+ S +      V+KIP+     
Sbjct: 58   VGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSVRKIPIGDAVG 117

Query: 1006 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEV 1065
             I Y      Y +  S+      ++   L  D +   +      +S  L      E   V
Sbjct: 118  LIAYHPPMETYAVACSI------SEHFELPKDDDYHKEWAKETTTSYPL-----TERGIV 166

Query: 1066 RILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVA 1124
            +++ P      W    T+ ++  E A+ ++ + L  +  TKE   L+ IGTA  +GED+ 
Sbjct: 167  KLMSPTT----WSVIDTVELEPHEVAMCMKTLHLEVSEETKERRMLITIGTAINRGEDLP 222

Query: 1125 ARGRVLLFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKII 1176
             RGR+L++        P     N   ++ +KE   +GA++ L  +  QG +L+A G K +
Sbjct: 223  IRGRILVYDVVPVVPQPGRPETNKKLKLVAKEEIPRGAVTGLCEVGSQGLMLVAQGQKCM 282

Query: 1177 LH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLL 1232
            +   K  GT L  +AF D    YV ++  V+   + L+ D  K ++F+ + E+  ++ L 
Sbjct: 283  VRGLKEDGTLLP-VAFMDMN-CYVTAVREVRGTGYCLMTDAFKGVWFVGYAEEPYKMMLF 340

Query: 1233 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1292
             K  G+ +    +F++ G  L +VV D+   I +  + P+  +S +G  LL+RA F    
Sbjct: 341  GKSMGNFEVLTADFVVAGDELHIVVCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAP 400

Query: 1293 -HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
             H T  L L     S   T  +    K     LL  +  G++  + PL E  +RRL SL 
Sbjct: 401  NHPTITLSLPRTPISPSATSVS----KNPPTTLLLASPTGALASLTPLSEQAYRRLTSLA 456

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEI 1406
              +  ++PH A  NP+  R    +  A  PG D     SIVD  LL+ +  L    + E+
Sbjct: 457  NSIAGALPHAAATNPKGHRLQPLD--ARTPGVDTSAGRSIVDGALLARWNELGAGRRSEV 514

Query: 1407 AHQTG 1411
            A + G
Sbjct: 515  AGKGG 519


>gi|219109892|ref|XP_002176699.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411234|gb|EEC51162.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1678

 Score =  162 bits (409), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 203/839 (24%), Positives = 328/839 (39%), Gaps = 202/839 (24%)

Query: 740  DGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
            +G  G P     +Y  VC +SG LEI+ VP                 ++D   R   K S
Sbjct: 895  EGDGGNPA----LYIAVCRQSGQLEIYLVP-----------------LIDNIPRCCWKSS 933

Query: 800  ETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFE 859
               +  SS  G  + +     + KV           H+R   F               FE
Sbjct: 934  GCGLGVSSLTGQNESKVPLPKTYKV-----------HAREIRF---------------FE 967

Query: 860  GPENTSKSDDPVSTSRSLSVS------NVSASRL----------RNLRFSRTPLDAYTRE 903
                 SK+ D    +RSL ++      ++S  RL          R  +F    +   ++E
Sbjct: 968  CGPIPSKTLDTAKKNRSLCLAVDCSSGDLSVYRLAISQDHGFPPRFEKFRMKSVFRRSQE 1027

Query: 904  ETPH------------------GAPCQRITIFKNISGHQGFFLSGSRPCW-C-------M 937
            +  H                  G    R+  F  ISG  G F +  RP W C       M
Sbjct: 1028 QARHRTKLIRKRMVVDVNDGTGGFVYNRLYRFSGISGQAGMFAAVPRPFWLCAERGKPSM 1087

Query: 938  VFRERLRVHP---QLCDGSIVAFTVLHNVNCNHGFI-------YVTSQGILKICQLPSGS 987
            +F       P   +L   S     V+++ + N GFI        + SQ +     L    
Sbjct: 1088 LFHRTRHASPAGGKLRPVSGFCSAVINDKSGNGGFITLHERVGRIGSQRLTLFHGLAPAF 1147

Query: 988  TYDNYWP-----VQKIPLKATPHQITYF-------AEKNLYPLIVSVPVLKPLNQVLSLL 1035
                  P     V+KI    T   I +        +E  LY L+VS    K L    S L
Sbjct: 1148 GAHGLLPGGGMCVEKILFGMTVRHIQFINDPFVSTSEHPLYALLVS----KKLEVDQSDL 1203

Query: 1036 IDQ-----------------EVGHQIDNHNLSSVDLHRTYT--VEEYEVRILEPDRAGGP 1076
             D                  ++  Q++  +L   DL   +   +E  +   +E    G P
Sbjct: 1204 NDDGLTAQERKETEEEKENAKIKRQVE-ADLGGFDLENEWVEEIERDDCFAVEMQLGGAP 1262

Query: 1077 -----------------WQTRATIPMQSSENALTVRVVTLF---------NTTTKENETL 1110
                             W    +  +   E+ +T+ ++ L          N T    + L
Sbjct: 1263 PIPKEAFAVWIVDAANNWMVVDSFKLDEYEHGMTLSIMELTEFPEEPGSSNDTDVSGDEL 1322

Query: 1111 -----LAIGTAYV--QGEDVAARGRVLLFSTGRNADNPQNLVTEV------YSKEL-KGA 1156
                 +A+GT  +   GEDVA+RGR +L    R   + +    +V      Y KE+  GA
Sbjct: 1323 SKRMFVAVGTGVLDHNGEDVASRGRAILLELKRTNSSAKAAGRQVVELSFCYEKEIFHGA 1382

Query: 1157 ISALASL----QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
            +++L  L    +  LLI +G  I + +W   +L  + F+ A  + V+     K+F+LL D
Sbjct: 1383 VTSLVCLSSEGKNRLLIGAGADINVEQWGNAKLTQVGFFRAT-MQVLHTIPFKSFLLLSD 1441

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
             + S+YFL W+E    L LLAKD+  +  +A   +  G  ++ +  D+++N+Q F YAP 
Sbjct: 1442 AYDSLYFLIWRESDKSLTLLAKDYDPIPVYAAGVMSRGPAMTFLCHDDRQNLQFFQYAPG 1501

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTK----FLRLQMLATSSDRTGAAPG----------SDK 1318
             + +  G +L+ RA++H+G   T     F R  ++  S+  T               S++
Sbjct: 1502 EAAARGGNRLVCRADYHLGTQTTSFASHFCRSSLMIHSATPTSTLAALKQQDSYFGRSEE 1561

Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
              R    FGT DG +G + PL E  + RL +LQ  + +++     L PR++R +  + + 
Sbjct: 1562 DQRLGAYFGTADGGMGAVVPLSEPVYWRLTALQSIVANALESDCALAPRAWRLYRRSTR- 1620

Query: 1379 HRPGPDS------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
             R G  S      ++D +L+  Y  L + +Q +IA   G+T   IL NL +L  G+  L
Sbjct: 1621 -RGGCRSNDRKKGVIDGDLVLQYADLSISKQEDIASAIGSTVDLILDNLLELQCGSLVL 1678



 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 82/333 (24%), Positives = 135/333 (40%), Gaps = 79/333 (23%)

Query: 251 FVHGYIEPVMVILHE--RELTWAGRVSWKHHTC-----MISALSISTTLKQHPLIWSAMN 303
           F+ GY+EPV+V+LH       W+GR+  +          ++ALSIS    +  ++WS + 
Sbjct: 243 FLSGYLEPVLVLLHSDVEGPVWSGRLGRERGVAGAPPLFVTALSISVVHGRTAVLWSQV- 301

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSS------QELP 356
           +  DA K+L+      G LVVGANT +          +A+N +A S   +      Q  P
Sbjct: 302 VSADATKILSFGKT--GCLVVGANTLVILEIGKVQQVIAMNGWARSTCPAALQTALQANP 359

Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG--------------------- 395
               +++LD    TWL    A+++ +TG L +L    D                      
Sbjct: 360 VVKLAIQLDGCCVTWLSEHSAIMALRTGQLYVLQRTDDRWAVMPLGQTLGAVGEVAHLAS 419

Query: 396 ------RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
                 R ++++ + +   S +   +        F GSR GDSL + +      +M  + 
Sbjct: 420 LPIGGLRWLEKMKMDENKASEMQMGV-------LFAGSRTGDSLFLGYAL-EIVTMPWAA 471

Query: 450 LKEE---FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS----------ASNNTES 496
           +K E   F + E    S        ++ L  ++  EE +LYG+           S   E+
Sbjct: 472 IKSEGQTFINFEGSELSKVATTAPIANGLDRILQLEEEALYGTDRSTPLHIVRDSEEEET 531

Query: 497 AQ--------KTFSFAV------RDSLVNIGPL 515
           A         +  +F V       D LVN+GPL
Sbjct: 532 ADIPSDAKRLRPVAFTVVRTIVPLDVLVNLGPL 564


>gi|224000243|ref|XP_002289794.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975002|gb|EED93331.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1820

 Score =  161 bits (407), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 162/672 (24%), Positives = 278/672 (41%), Gaps = 141/672 (20%)

Query: 888  RNLRFSRTPLDAYTR---EETPHGAPCQRITI----------------FKNISGHQGFFL 928
            R+L FSR PL    R   E   H    +R  I                F +ISG  G F 
Sbjct: 1162 RSLEFSRVPLSCVARSSEEAARHFIKLRRKGIVTSTTHSDFRPNRLHRFCDISGEDGLFA 1221

Query: 929  SGSRPCWCMVFRERLRV------HPQLCDGSIVAF----TVLHNV--NCNHGFIYVTSQ- 975
            + +RP W +  R    V      H     G  V      T + ++  N N GF+ +  + 
Sbjct: 1222 ATARPLWFVSERGAPTVVSHKSRHVSPAGGRPVPVSGFCTTMPHIFQNANKGFVTLHERI 1281

Query: 976  GILKICQLPSGSTYDNYWPV--------------QKIPLKATPHQITYFAEKNLYPLIVS 1021
            G +   +L   + ++  W V              QK+P   T   I +  + ++      
Sbjct: 1282 GRIGSQRL---TLFNGLWDVFSTHGLLPGGGICVQKVPFGVTVRAIEFIDDASIS----- 1333

Query: 1022 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLH-RTYTVEEYEV----RILEPDRAG-- 1074
                 P   V +LLI +E+     + N   +D   R +  EE E     + +E D  G  
Sbjct: 1334 ----SPSRPVYALLISREIEADQSDLNDDGMDAEERKHIKEEKEAASIRKQVEADLGGFD 1389

Query: 1075 ----------------------------------------GPWQTRATIPMQSSENALTV 1094
                                                      W       +   E+A  +
Sbjct: 1390 VEQEWVEEIEREECFDVDTSIGLAPSIPTRQFEVWLVDAASQWTVLDKYQLCDFEHATAL 1449

Query: 1095 RVVTLFNTTTKENET-----LLAIGTAYVQ--GEDVAARGRVLLFS--TGRNADNPQNLV 1145
            +V+ L +     +E       +A+GT  ++  GED+A++GR+LLF+    ++  + +++ 
Sbjct: 1450 KVLFLTDVVEDSDEPPKKSLFVAVGTGRIERDGEDIASKGRILLFNLKKKKHQKDKRSMT 1509

Query: 1146 TEVYSKELK----GAISALASLQGH----LLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
             E++ K  K    G +++L+SL+      + + +G ++ + +W   +L  + FY A  + 
Sbjct: 1510 LELHLKHEKDITIGPVTSLSSLRSEDIFRVAVGAGAEVTVEQWGSGKLVQVGFYHAH-MQ 1568

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            V ++++ K F LL D + +++FL W+E    L LLAKD+     FA   +  G  +S V 
Sbjct: 1569 VQNISLFKTFFLLSDAYDALHFLVWRESDKSLTLLAKDYEPTQVFAAGMISRGGAMSFVC 1628

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF--------------LRLQML 1303
             D+++NIQ   YAP    +  G KL+ RA+FH+G+  T                     L
Sbjct: 1629 HDDRQNIQFLQYAPTDVAARGGNKLVCRADFHLGSQTTSLNSHWAQSSLLFNSCTVSSTL 1688

Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
            A+   +       D   RFA+ FGT DGS   I PL E T+ RL +LQ  + +++   A 
Sbjct: 1689 ASLKQQDSLFGRLDDDQRFAVNFGTTDGSFVSIIPLSEPTYWRLTALQSVMSNALESNAA 1748

Query: 1364 LNPRSFRQFHSN----GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
            L+ R++R +  +    G         ++D +L+  +  LPL EQ ++    G+T   ++ 
Sbjct: 1749 LSHRAWRLYRRSTRRGGCRTNDRKKGVIDADLVMKFVDLPLPEQEDLTSSIGSTVGLVMD 1808

Query: 1420 NLNDLALGTSFL 1431
            NL +L+   S +
Sbjct: 1809 NLLELSCAGSVV 1820



 Score = 57.8 bits (138), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 91/410 (22%), Positives = 145/410 (35%), Gaps = 141/410 (34%)

Query: 246 VKDFIFVHGYIEPVMVILHERE-----LTWAGRVSWKHHTCM------------------ 282
           + D  F+ GYIEP +++LH          WAGR+       +                  
Sbjct: 398 IVDIAFLSGYIEPTLLVLHSNPKRGGGRAWAGRLGRTEEVPLSNNGGSGESKDDYGEDID 457

Query: 283 -----------------------ISALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPSPI 318
                                  ++A+S++   ++  ++WS ++ LP DA+KL  VP P 
Sbjct: 458 LEGGDAAKKGPDLVSTGTKYGLSLTAISLAIHQRRSVVLWSLLDALPADAWKL--VPHPS 515

Query: 319 GGVLVVGANTIHYHSQSA--SCALALNNY------------------AVSLDSSQELPRS 358
            GV+V G NT  Y S     SCALA N +                  AV L+ +   P  
Sbjct: 516 DGVIVWGVNTAVYVSMGGKISCALAANGFAKIGCPIGLIPPSGRIGSAVYLEPNPS-PLP 574

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV--------------------------- 391
             +++LD A   ++  DVA++    G L  L +                           
Sbjct: 575 MLALQLDGARVGFVTEDVAIVCLGNGSLYSLELHRAKSMVSPSMFLSMSPLGHRVGGLGV 634

Query: 392 -------------------VYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFFLGSRL 429
                              + D   V+  D +K   +  SV    I + G  L F GSR+
Sbjct: 635 ASCLSVLAMACHSNSVGHFLVDNEGVKDEDHAKETISKESVSGPKIRSRG--LIFAGSRM 692

Query: 430 GDSLLVQFT---------------CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SD 473
           GD  L+ F+                G+G   L     E+   +    P+ K+L++   S 
Sbjct: 693 GDCSLLAFSLNVPIHLVITDVDSETGAGKRKLGGSRPEQLSSMP--EPAQKQLKKEEISP 750

Query: 474 ALQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           +  D  +GEE  +    S   +  +     + +  DSL  +GPL    YG
Sbjct: 751 SRTDSEDGEEDIVCAMSSPRRSVRTLSMFRTVSALDSLTGLGPLGQGCYG 800


>gi|150951283|ref|XP_001387581.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
            polyadenylation factor II complex, subunit CFT1 (CPSF
            subunit) RNA processing and modification [Scheffersomyces
            stipitis CBS 6054]
 gi|149388465|gb|EAZ63558.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
            polyadenylation factor II complex, subunit CFT1 (CPSF
            subunit) RNA processing and modification [Scheffersomyces
            stipitis CBS 6054]
          Length = 1341

 Score =  159 bits (403), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 136/587 (23%), Positives = 257/587 (43%), Gaps = 68/587 (11%)

Query: 836  HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
            H   +L  +   G +L Y+ Y F+G                    N    + ++L  +  
Sbjct: 786  HKEEYLTILTIGGEVLLYKLY-FDG-------------------ENYEFKKEKDLAITGA 825

Query: 896  PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
            P +AY     P G   +R +  F N++G+   F++G  P   +     +    Q      
Sbjct: 826  PENAY-----PIGTAVERRLAYFPNLNGYTCIFVTGVTPYLILKSLHSIPRIYQFSKIPA 880

Query: 955  VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKN 1014
            V+ +  H+    +G I++ +Q   +ICQLP    Y+N WP++ I +  +   ITY    +
Sbjct: 881  VSISPFHDSKVANGLIFLDNQQNARICQLPLDFNYENTWPMKLIHIGESIRAITYHESSH 940

Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY---TVEEYEVRILEPD 1071
             Y       V+     +     D+E G  I       V LH+     +  +  ++++ P 
Sbjct: 941  TY-------VVSTFKDIDYECFDEE-GKPI-------VGLHKDKPPSSAYKGSIKLISPF 985

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAIGTAYVQGEDVAARG 1127
                 W    TI +  +E  +TV+ + L   ++TK+    +  + IG+   + ED++A G
Sbjct: 986  N----WSVIDTIELADNELGMTVKSMILDVGSSTKKFKHKKEFIVIGSGKYRMEDLSANG 1041

Query: 1128 RVLLFSTGR---NADNPQ--NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
               ++         D P+  +   EV+ ++ KGA++++  + G  L++ G K+I+     
Sbjct: 1042 SFRIYEIIDIIPEPDRPETNHKFKEVFKEDTKGAVTSVCEVSGRFLVSQGQKVIVRDLQD 1101

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
              +  +AF D   +YV       N ++LGD  KS++ + +  +  ++ +L KD   LD  
Sbjct: 1102 DGVVPVAFLDTA-VYVSEAKSFGNMMILGDSLKSVWLVGFDAEPFRMIMLGKDLQGLDVN 1160

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
              +F+     + ++++D    + +  Y P+   +  GQ+LLS++ F + + VT    L  
Sbjct: 1161 CADFITKDEEVFILIADNNNVLHLVQYDPEDPTALNGQRLLSKSSFSINSFVTCLKSLPK 1220

Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
                 D          T  F  +  T+DGS   + P++E ++RR+  LQ++L D   H  
Sbjct: 1221 TEEKYD----------TGNFQTIGSTIDGSFFSVVPINEASYRRMYILQQQLTDKEYHYC 1270

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
            GLNPR  R    +  A+      I+D +++  Y  L  E +  +A +
Sbjct: 1271 GLNPRLNRFGGLSMTANDTNTKPILDYDVIRAYGKLNEERKKNLASK 1317



 Score = 97.1 bits (240), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 98/415 (23%), Positives = 183/415 (44%), Gaps = 55/415 (13%)

Query: 55  VPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
           V +LVV  A +++I+ V VQ + S  SK                  L+L+  ++LHG + 
Sbjct: 26  VKHLVVGKATLLQIFEV-VQLKSSTPSK--------------PQHRLKLIDQFKLHGLIT 70

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            +  +    + N    D ++++ + AK SV+++D  +H +   S+H +E+          
Sbjct: 71  DIKPIRTVESPNF---DYLLVSTKSAKFSVIKWDHHLHTISTVSLHYYENAIQ---NSTY 124

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGL----------QMIILKASQGGSGLVGDEDTFGSGG 224
           E  ++  L+ ++P G C  +    L          ++    A      +V  E      G
Sbjct: 125 EKLSKSELL-LEPYGSCSCLRFKNLLCFLPFETAEELDDDDADSENEDMVKSEKKEHENG 183

Query: 225 GFSARI--------ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
             +  +        ++S +I+ + LD  +  + D  F+  Y EP   IL +R+  WAG +
Sbjct: 184 TVNVPVTDQPGSFFDTSFLIDGQSLDSSIGSIIDMQFLFKYREPTFGILSQRQQAWAGNL 243

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
                      L++  T K    +    NLP+D  +++ +PSP+ G L++G N  IH  +
Sbjct: 244 PKIKDNVQFCILTLDLTTKSTVSVLKIDNLPYDVDRIVPLPSPLNGCLLLGCNEIIHVDN 303

Query: 334 QSASCALALNNYAVSLDSSQEL--PRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLT 390
                 +A+N +   + +S +    ++  +++L+      L ND  ALL   TG+   L 
Sbjct: 304 GGIVRRIAVNQFTSLITASTKAYQDQTHLNLKLEDCSVVALPNDHRALLVLSTGEFYYLN 363

Query: 391 VVYDGRVVQRLDLSKTNPSVLTSD--------ITTIGNSLFFLGSRLGDSLLVQF 437
              DG+ +++  +   +  +L SD        I T+ N+L F  +  G+S LVQF
Sbjct: 364 FEVDGKSIKKFTIESVD-KLLYSDIKLTFPGQIATLDNNLLFFANHNGNSPLVQF 417


>gi|402085944|gb|EJT80842.1| cft-1 [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 1450

 Score =  159 bits (403), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 144/574 (25%), Positives = 250/574 (43%), Gaps = 46/574 (8%)

Query: 867  SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-----GAPCQRITIFK--N 919
            S+D ++      ++  S S    LRF + P  A  + +         AP +R+ +    N
Sbjct: 871  SNDDLTIYEPFKIAESSQSLSGTLRFRKLPNPAVAKSQDTKVSDDAPAPMRRMPLRACGN 930

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            I+G+   FL G  P + +   +       L    + A +  H   C+ GFIY   +G+ +
Sbjct: 931  IAGYSCVFLPGHSPSFLIKSSKSTPRVIGLQGPGVRAMSPFHTKGCDRGFIYADYEGVAR 990

Query: 980  ICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1038
            + Q+P+  ++      V+K+PL      I Y     +Y  +V+    +P      L  D 
Sbjct: 991  VAQIPNDCSFAELGLSVKKVPLNMDADGIAYHTPSGVY--VVTCSFWEPFE----LPSDD 1044

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            E   +    N++          E   ++++ P      W    T     +E A+ ++ + 
Sbjct: 1045 ESHREWAKENITF-----KPQTEHSVLKVINPVN----WSEIWTEEFDKNEVAMCIKSLN 1095

Query: 1099 L-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF---STGRNADNPQ---NLVTEVYSK 1151
            L  + +T E   L+ +GTA  +GED+  RG V ++   S     D P+    L      +
Sbjct: 1096 LEVSQSTNERRHLITVGTAICKGEDLPVRGCVYVYDLASVVPQKDRPETDKKLKLMAKDE 1155

Query: 1152 ELKGAISALASL--QGHLLIASGPKIILHKW-TGTELNGIAFYDAPPLYVVSLNIVKN-- 1206
              +GA++AL+ +  QG +L+A G K ++        L  +AF D    YV     +    
Sbjct: 1156 VPRGAVTALSEIGTQGLMLVAQGQKCLVRGLGEDGRLLPVAFMDMN-CYVSCAKELPGTG 1214

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            F  + D  K ++F  + E   ++ +  K   +L+    +FL DG  L LV +D + N+ I
Sbjct: 1215 FCAMADAFKGVWFTGYTEGPYKMMIFGKSSTNLEVINVDFLPDGRNLLLVAADAEGNLHI 1274

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--- 1323
            F + P+  +S +G  LL+R  F  GAH  +   L M  TSS+ +  A   D +   A   
Sbjct: 1275 FQFDPEHPKSLQGHLLLNRTTFSTGAHHPQ-KSLLMPTTSSNPSQPATNGDASAAAAGPQ 1333

Query: 1324 -LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
             +L     G +  + PL +  + RL +L   L  SVPH A LNP+++R   +  +     
Sbjct: 1334 HILMAAPTGVLAAVQPLGQGVYTRLSALASNLAASVPHHAALNPKAYRMPPAPARNQVAA 1393

Query: 1383 PD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             D     ++VD  LL+ +  L    + E+A + G
Sbjct: 1394 VDISVGRAVVDGALLARWAELGSGRRAEVAGRAG 1427



 Score = 94.0 bits (232), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 157/686 (22%), Positives = 262/686 (38%), Gaps = 126/686 (18%)

Query: 91  VLMDGISAASLELVCHYRLHGNVESLA------ILSQGGADNSRRRDSIILAFEDAKISV 144
           V  D  S   + L+  + L G V  LA      +   GG   S   D +++AF+DAK+S+
Sbjct: 86  VRSDRASHTKIVLIAEFPLSGTVTGLARVKPPNVSKTGGG--SGVGDLLLIAFKDAKLSL 143

Query: 145 LEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGL 199
           + +D     L   S+H +E  E     W        +F     +  DP  RC  +     
Sbjct: 144 VAWDSERRSLETFSIHYYEQDELQGNPWECPLSDYANF-----LVADPGSRCAALKFGPR 198

Query: 200 QMIILKASQGGSGL-VGDEDTFGSGG------------GFSARIES-----SHVINLRDL 241
            + IL   Q    + +GD D    G               ++ IE      S V+ L +L
Sbjct: 199 SLAILPFKQADEDIGMGDWDEALDGPRPAQSQSAAVAINGTSTIEDTPYSPSFVLRLPNL 258

Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
           D  + H     F++ Y EP   IL    +T +  +  K H    +  ++    K    I 
Sbjct: 259 DPALLHPVHLAFLYEYREPTFGILSS-SITPSNCLDRKDH-LTYTVFTLDLQQKASTTIL 316

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRS 358
           S   LP D  +++A+P+P+GG L+VGAN  IH      +  +A+N +     S      S
Sbjct: 317 SVGGLPKDLTRVIALPAPVGGALLVGANELIHIDQSGKANGVAVNPFTKQCTSFGLADHS 376

Query: 359 SFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT 412
             ++ L+      L  ++   L+    G L  +T   DGR V  L +    P    ++L 
Sbjct: 377 DLNLRLEGCTIEVLSAEHGELLVVLDDGRLATITFHIDGRTVSGLKVRIIPPEAGGNILP 436

Query: 413 SDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
           + ++    IG +  F GS  GDS+++      G +  SS +  +   ++        L  
Sbjct: 437 TSVSCLSRIGRNAMFAGSERGDSIVI------GWNRKSSQVSRKKSRVQ-----DPDLDL 485

Query: 470 SSSDALQDMVNGEELSLYGSASNNT--------ESAQKTFSFAVRDSLVNIGPLKDFSYG 521
                  +    ++  LYG     T        ++  +   F   D L++I P++D +YG
Sbjct: 486 DIDFDDLEDDEDDDDDLYGDTEKTTTVAGLASGQAKLEDLVFRCHDRLISIAPIRDMAYG 545

Query: 522 LRINADASATGISK----QSNYELV--------------------------ELPGCKGIW 551
                    TG       QS  +LV                          + P  +G+W
Sbjct: 546 KPPPPAEGETGSRNSTPIQSELQLVAVVGRDRASSLAIMNREMTPVSIGRFDFPEARGLW 605

Query: 552 TVY-----------HKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETAD 594
           T+             K ++    D      YD     +++++       E+  + + TA 
Sbjct: 606 TLACQKPLPKVLQGEKGTKPVGGDFGVPVQYDK----FMVVAKEDDDNFESSNIYVLTAA 661

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
              ++  +      G TI AG +    ++IQV +   R  DG   +TQ +   P   E  
Sbjct: 662 GFEKLVGTEFEPAAGFTIEAGTMGNHTKIIQVLKSEVRCYDGDLGLTQII---PMLDEET 718

Query: 654 SGSENSTVLSVSIADPYVLLGMSDGS 679
           +    +T  S SIADPY+L+   D S
Sbjct: 719 NHEPRAT--SASIADPYLLIIRDDSS 742


>gi|189203597|ref|XP_001938134.1| conserved hypothetical protein [Pyrenophora tritici-repentis
            Pt-1C-BFP]
 gi|187985233|gb|EDU50721.1| conserved hypothetical protein [Pyrenophora tritici-repentis
            Pt-1C-BFP]
          Length = 1407

 Score =  159 bits (401), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 143/537 (26%), Positives = 243/537 (45%), Gaps = 69/537 (12%)

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPC 934
            S S S++    LR ++ S+  +  Y  +        +   +   ++ G+   F  G+ P 
Sbjct: 831  SRSASDLWTKNLRWVKLSQQHVPRYIEDNGSEDPGFESTLVALDDVCGYSTVFQRGTTPA 890

Query: 935  WCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            +  +F+E     P+   L    + + T  H   C  GF Y+ S   L+ICQLP  + Y +
Sbjct: 891  F--IFKEASSA-PRVIGLSGKPVKSLTSFHTSKCQRGFAYLDSTDTLRICQLPPQTHYGH 947

Query: 992  Y-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID----- 1045
              W  +++P+ +  H +TY     LY  IV                 Q   +Q+D     
Sbjct: 948  LGWATRRMPMDSEVHALTYHP-SGLY--IVGT--------------GQTEDYQLDPTETY 990

Query: 1046 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTT 1104
            +++L   DL    ++E   V++L+       W    T  +   E  L+++ + L  +  T
Sbjct: 991  HYDLPKEDLTFKPSIERGVVKLLDEKS----WTIIDTHILDPQEIVLSIKTLNLEVSEIT 1046

Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISA 1159
             + + L+A+GT+ V GED+A +G + +F        P    T      +   E+KGA+SA
Sbjct: 1047 HQRKDLIAVGTSVVHGEDLATKGCIRIFEVITVVPQPDRPETNKRLKLIVKDEVKGAVSA 1106

Query: 1160 LASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDI 1213
            ++ L  QG L++A G K ++   K  GT L  +AF D    YV  L  +     + +GD 
Sbjct: 1107 ISELGTQGFLIMAQGQKCMVRGLKEDGTLLP-VAFMDMQ-CYVSDLKNLPGTGMLAMGDA 1164

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
            ++ ++F  + E+  +++L A+   +L+  A +FL     L LVV+D   N+QI  + P  
Sbjct: 1165 YRGVWFTGYTEEPYKMSLFARSKHNLETIAVDFLPFDQQLHLVVADADMNLQILQFDPDN 1224

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRFAL------- 1324
             +S  G +LL +A FH G H+   L L    L   S    AA  S+  + FA+       
Sbjct: 1225 PKSEAGSRLLHKATFHTG-HLPTSLHLIHSHLKLPSATDFAATNSNPADAFAMDTSPNTT 1283

Query: 1325 -----------LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
                       L  T  G++  + PL E ++RRL +L   L +++     LNPR+FR
Sbjct: 1284 TDTPQQPFHQILHTTQSGTLALLTPLSEDSYRRLSNLTAYLANTLDSACSLNPRAFR 1340



 Score =  139 bits (350), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 168/697 (24%), Positives = 292/697 (41%), Gaps = 86/697 (12%)

Query: 57  NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NLVV   ++++I+ ++     V     + S+N+      E     L    + A L LV  
Sbjct: 28  NLVVAKNSLLQIFELKSTTTEVTPGAGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
           + L G V SLA +    A N++ + +++++AF DAK+S++E+D   + L   S+H +E+P
Sbjct: 88  FPLAGTVISLARVK---ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144

Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ----------- 208
           +      W    +   +F     +  DP  RC  +      + IL   Q           
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLVEDDYDSD 199

Query: 209 -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
             G      +   G+ G       SS V+ L +LD  + H     F+H Y EP   I+  
Sbjct: 200 ADGPKETKADQANGTNGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 259

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
              T    ++ +      S  ++    K    + S   LP+D  K++ +PSPIGG L+VG
Sbjct: 260 SRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALLVG 319

Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
            N  IH      +  +A+N +A +  S     +S  ++ L+      L  +    L+   
Sbjct: 320 GNEIIHVDQGGKTNGVAVNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIVLN 379

Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
            G L++LT   DGR V  + +                S  + +G    F+GS  G+S+++
Sbjct: 380 NGRLLILTFTLDGRTVSGMTIQTVAADHGGHLLKSAASCTSNLGRGRLFIGSEDGESVML 439

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
            +T       L++ L+ +  + + D            D   D+ N    +++  +A+ + 
Sbjct: 440 GWTG------LTNQLRRKLSNADLDG-EDDSEEEEIDDMEDDLYNDTAPTMHKITAAVSE 492

Query: 495 ESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL- 541
            +A  T++F + D L +I P+KD            + G + ++    A G     + EL 
Sbjct: 493 PTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDRELH 552

Query: 542 ------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EART 587
                  ELP   G+W V+ +       +     D+    A D +Y  YL++S   E  T
Sbjct: 553 PISVATKELPSAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGEDGT 612

Query: 588 MVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
           +V E   D LTE  +      +G T+  G L    +V+QV     RI D       +   
Sbjct: 613 VVYEVNGDKLTETDKGDFEREEGTTLLVGILAAGTKVVQVMRTEVRIYDSELNLVHIQSM 672

Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
               E GS  E + +++ S ADPY+L+   D S+++ 
Sbjct: 673 EEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708


>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
 gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
          Length = 1370

 Score =  157 bits (397), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 136/534 (25%), Positives = 242/534 (45%), Gaps = 60/534 (11%)

Query: 911  CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
            C+ +    ++ G++  F+SG  PC+ ++     R H     G  V + +  H   C  GF
Sbjct: 846  CKLLRALPDVCGYRTVFMSGHSPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 904

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
             YV                       + I L      I Y +    Y +  S        
Sbjct: 905  AYVD----------------------EDITLGEQVDSIVYSSASECYVIGTSA------K 936

Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
            +   L  D E   +  N  ++ +       +E   +++LEP      W T  +  ++ +E
Sbjct: 937  EDFKLPEDDESHTEWRNEFITFLP-----QLERGTIKLLEPRN----WSTIDSHELEPAE 987

Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNL 1144
                + V+ L  +  T E + ++ +G++ V+GED+  +G + +F        P    +N 
Sbjct: 988  RITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKNK 1047

Query: 1145 VTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1199
              ++++KE +KGA++AL+ +  QG L++A G K ++   K  G+ L  +AF D    YV 
Sbjct: 1048 KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYVN 1105

Query: 1200 SLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
             L  +K     ++GD  K ++F+ + E+  +L+L  K+  +L     +FL DG+ L ++V
Sbjct: 1106 VLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILV 1165

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDRTGAA 1313
            +D+  N+ +  Y P+   S KG +LL R+ FH G   +    L       ++  D     
Sbjct: 1166 ADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGGHTPSSPVDEDAMD 1225

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
              S   +++ +L     GSI  I PL E ++RRL +LQ +LV+++ H   LNPR +R   
Sbjct: 1226 TDSPPPSKYQILMTFQTGSIAIITPLGEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVE 1285

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
            S+G   + G   ++D  LL  +  +  + + EIA + G     I  +L  L  G
Sbjct: 1286 SDGMGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRVDLEKLHGG 1336



 Score =  105 bits (262), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 181/736 (24%), Positives = 287/736 (38%), Gaps = 100/736 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  +    +  R    D    A L L   Y + G +  L
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
             +      NS+   D+I+++  +AK+S++E+D   HG+   S+H +E  E        +
Sbjct: 85  QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVPD 141

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
             +    + VDP G C  +  +G+  + IL   Q G  LV D+               D 
Sbjct: 142 LGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDLVSDA 200

Query: 220 FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
             S  G +A  +    S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201 QKSAPGNTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
             +      S  ++    K    + +   LP D +K++ +P P+GG L++G N  +H   
Sbjct: 261 IDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
              + A+ +N +A    +      S   + L+      L +     LL    G + +L+ 
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADHSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380

Query: 392 VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
             DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+ ++  
Sbjct: 381 KVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLLGWSRP 436

Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
           S T+   S  K   G  E  A           D   D +  ++L     AS   E     
Sbjct: 437 SSTTKRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASTRQEKQVVN 494

Query: 497 --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
             +   F+F   D L ++GP +D + G    + +     S       +EL   +G     
Sbjct: 495 GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQQDSVPEIAAPLELVAARGFGKSG 554

Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
             TV  +    +  DS +M   DD Y  + I  L+ +   T +  + D   LL +     
Sbjct: 555 GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVLDPKSKDTGLSRSYDKYLLLAKAKGED 611

Query: 601 --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
             ESV Y V                +  TI  G L    RV+QV     R  D  Y    
Sbjct: 612 KEESVVYSVGSSGLDSIDTPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD--YNLGL 669

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
               P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ  AA 
Sbjct: 670 AQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726

Query: 701 ESSKKPVSSCTLYHDK 716
             S K +S C LY DK
Sbjct: 727 --SGKWISGC-LYEDK 739


>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
            mansoni]
          Length = 1958

 Score =  156 bits (394), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 144/576 (25%), Positives = 245/576 (42%), Gaps = 54/576 (9%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            ++ + + +G LEI+ +P+F  ++ V  F      ++D      +   +     ++ +   
Sbjct: 1103 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1159

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                ++I    ++E+ +        RP L  + T   I  ++A L   P+          
Sbjct: 1160 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1207

Query: 873  TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
             S  L        RLR  R     PL A  R  T P     Q   +        F+NI  
Sbjct: 1208 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1266

Query: 923  HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            H+G F+ G  P W       +LRV P   DG + +F  L+   C+ GF+Y T    +++ 
Sbjct: 1267 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1326

Query: 982  QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
             LP G +++ +  ++ I L   P+ + Y  E   Y  +V +   +P   V  L  +    
Sbjct: 1327 TLPPGYSFNEHLGIKWITLDPVPYYVQYHVESKTY-AVVGIHS-EPCKSVFRLNAEGNKE 1384

Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG------PWQTRATIPMQSSENALTVR 1095
              +     + V      T++ Y +++  P+          PW     IP    E      
Sbjct: 1385 EDVLVRPKTCV----LPTLDYYSLQMYAPNLNANHRNKQPPW---LLIPNTLIEFEPWEV 1437

Query: 1096 VVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
            V  L        ET       LA+G     GE++  RGR+L+         P   +T   
Sbjct: 1438 VTCLITAQLASEETFHGTKDYLALGANLTYGEEIPVRGRILILDVIDVVPEPGQPLTRHK 1497

Query: 1148 ---VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
               ++  E KG ++AL S QGHL+ A G KI +     T+L G+AF D+  LY+ +L  V
Sbjct: 1498 LKIIHDGEQKGPVTALTSCQGHLISAIGQKIYIWTLKNTDLVGVAFVDSE-LYIHNLLCV 1556

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
            KN +L  D+ KS+  L ++     L+++++D  S + + + F +DG  L  +VSDE  N+
Sbjct: 1557 KNLVLAADVLKSVQLLRFQSDLRVLSVVSRDNISREVYTSNFFVDGRRLGFMVSDELGNV 1616

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1300
             I+ Y P    S  G++L+  A+  + +  T  LR+
Sbjct: 1617 TIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRV 1652



 Score =  155 bits (391), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 147/586 (25%), Positives = 255/586 (43%), Gaps = 106/586 (18%)

Query: 4   AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
           A +K +  PT + NC    + H          +     +D+ L        + NLV+T  
Sbjct: 15  AVFKHISPPTAVDNCLYCHLKH----------ISPPTAVDNCLYCHLTHPKLKNLVITRG 64

Query: 64  NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
             IEIY V+        S  SGET+               V    ++ N+  +  +   G
Sbjct: 65  GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 104

Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
                   S++L+F +AK++V+ F+     LR  S+H +E   + +LK GR +F + P++
Sbjct: 105 DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 157

Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
           ++DP  RC  +LVY   + +L   +    +  + D    +  +   +  R  +  +    
Sbjct: 158 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 217

Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
                     + +V D  F++G+ EP +++L+E   TWAGRVS +  TC I ALS +   
Sbjct: 218 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 277

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
           + +P+IW   +LP D   +++VP PIGGV+V+ AN+I Y  Q+  SC L LN YA    +
Sbjct: 278 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 337

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
               Q++P S   + +D      L     L+ T++G+L LL++  +   + V  L   K 
Sbjct: 338 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 396

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
             +V    +  + +   F+GSR  DS+L++                      + G+  ++
Sbjct: 397 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 456

Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
             S L +    +E D                        + STKR     +D + D    
Sbjct: 457 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 516

Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
            +  ++ LYG +  +  S  +    +SF V D L+N+GP+   + G
Sbjct: 517 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 562



 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/130 (26%), Positives = 67/130 (51%), Gaps = 15/130 (11%)

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
            T++  +G  P   +  R ++ FG+ +GSI  I P+ +  + RL+  +K L+  +  + G+
Sbjct: 1835 TTNPNSGVDP---EKFRQSIYFGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGM 1891

Query: 1365 NPRSFRQFHSNGKAHRPGPD------SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
             P+S   +      +RP P+       + D +L+  Y  LP  ++LEIA ++G +   I+
Sbjct: 1892 PPKSCWSY------NRPQPELANPCGKVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIM 1945

Query: 1419 SNLNDLALGT 1428
             ++ +L   T
Sbjct: 1946 DDIAELIATT 1955


>gi|190348091|gb|EDK40482.2| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1320

 Score =  156 bits (394), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 130/549 (23%), Positives = 250/549 (45%), Gaps = 52/549 (9%)

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMV 938
            SN    + ++L  +  P +AY     P G   +R  ++   +SG    F++G  P +   
Sbjct: 787  SNFKLKKEKDLLITGAPDNAY-----PAGTSIERRLVYIPLVSGFSSIFVTGVVPYFITR 841

Query: 939  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKI 998
             R  +    +    +  +F    +   ++G I++ +    +IC+LP    YDN  PV+K+
Sbjct: 842  TRHSIPRIFKFTKIAAQSFASFSDSKVSNGLIFLDNAKNARICELPRDFNYDNNLPVKKV 901

Query: 999  PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDL 1054
            P+  T   +TY    N Y  +VS     P N      +D+E     G + D  + +S   
Sbjct: 902  PIGETVKSVTYHELSNTY--VVSTYREIPYNA-----LDEEGNPIAGLKKDKPSANSY-- 952

Query: 1055 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETL 1110
                   +  ++++ P      W    T+ ++ +E A+TV+ + L   ++TK     + L
Sbjct: 953  -------KGSLKLISPYN----WTVIETVELRDNEIAMTVKSMVLDIGSSTKRFKHRKEL 1001

Query: 1111 LAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASL 1163
            L +GT   + ED+ A G   ++         G+   N +    E  +++ KGA++++  +
Sbjct: 1002 LVVGTGRYRMEDLGANGAFKIYEIIDIIPEPGKPETNHK--FKEYNTEDTKGAVTSMCEV 1059

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             G  L+A G KII+       +  +AF D   +YV       N ++LGD  KS++   + 
Sbjct: 1060 SGRFLVAQGQKIIVRDVQDDGVVPVAFLDTS-VYVSEAKSFGNLVILGDTLKSVWLAGFD 1118

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
             +  ++ +L KD  S+D    EF+     + ++++     + +  + P+   S  GQ+L+
Sbjct: 1119 AEPFRMIMLGKDLQSVDVSCAEFISKDEEIYILIAGNNNVMHLVQFDPEDPTSSNGQRLV 1178

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
             RA F+V +  T    ++M+  + +       +  ++ F  +  T+DGS   + P++E T
Sbjct: 1179 HRASFNVSSSTTC---MRMVPKNEE-----INTQYSDVFQTVGSTIDGSFFTVFPVNEFT 1230

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            +RR+  +Q++L D   H  GLNPR  R         + G   I+D +++  Y  L  + +
Sbjct: 1231 YRRMYIIQQQLTDKEYHYCGLNPRLNRFGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRK 1290

Query: 1404 LEIAHQTGT 1412
              IA +  +
Sbjct: 1291 QTIAQKVSS 1299



 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 127/611 (20%), Positives = 237/611 (38%), Gaps = 68/611 (11%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L+  ++L+G V +L    +    +S   D I++A + AK+S++ +D   H +   S+H
Sbjct: 52  LRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESLH 108

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-----G 215
            +E           E+     L+ V+P       + +   +  L  S            G
Sbjct: 109 YYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDEG 164

Query: 216 DEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
           +        GF + +  SS +++ + L+  +  + D  F+H Y EP + IL  +  TW G
Sbjct: 165 ETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSSKAATWTG 224

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHY 331
            +           ++I    K    +    NLP D  +L+ +  P+ G L++G N  IH 
Sbjct: 225 LLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIHV 284

Query: 332 HSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVL 388
            +      LA+N Y   + +S +    ++  ++ L+      L ND   LLS  TG L  
Sbjct: 285 DNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLSTGSLFH 344

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
           +    D + ++R  L     +  +S D+T  G   F       DS L+     +G S L 
Sbjct: 345 INFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL- 397

Query: 448 SGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
                    +E      + +  +  S+  +DM   EEL    +A       Q    +   
Sbjct: 398 ---------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFKH 448

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT-------- 552
           D L+N GP+ DF+ G        A  I+   N   +   G      C  I+         
Sbjct: 449 DELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPIIR 508

Query: 553 ---VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
               + + +R  N ++  +   DD      I  +E     L++ D + +           
Sbjct: 509 SSLTFSQVNRMWNINNKYLITSDDVNSKSEIFQIEKSYSRLKSKDFIND----------E 558

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            TIA   L   + ++Q+  +   + +  +  + +SF     E     +   ++S ++ D 
Sbjct: 559 MTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHDD 614

Query: 670 YVLLGMSDGSI 680
           Y+++  + G +
Sbjct: 615 YLMIFFASGEV 625


>gi|224135031|ref|XP_002321966.1| predicted protein [Populus trichocarpa]
 gi|222868962|gb|EEF06093.1| predicted protein [Populus trichocarpa]
          Length = 180

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/152 (55%), Positives = 100/152 (65%), Gaps = 22/152 (14%)

Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
           TG+ EAIDGADGG  DQGDIY V+CYE+GALEIFDVPNFN VF VDKFVSG+TH+VD++M
Sbjct: 4   TGISEAIDGADGGAHDQGDIYRVICYETGALEIFDVPNFNSVFIVDKFVSGKTHLVDSFM 63

Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
            E  +D    +N   EE  G GRKE      +V L              F ILT GTILC
Sbjct: 64  GEPPRDLTKGMN---EEVAGAGRKE------IVLL-------------FFGILTYGTILC 101

Query: 853 YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA 884
           Y A LFEGP+  SK +DPVS   S+  S++SA
Sbjct: 102 YHACLFEGPDGNSKLEDPVSAQNSVGDSSISA 133



 Score = 67.0 bits (162), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 31/50 (62%), Positives = 41/50 (82%)

Query: 1019 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1068
            I +  V +P+NQVLS + DQE GHQI+N NLSSV++HRT +V+E+EVRIL
Sbjct: 131  ISAFAVQRPVNQVLSSMADQEFGHQIENPNLSSVEIHRTDSVDEFEVRIL 180


>gi|294659889|ref|XP_462318.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
 gi|218511978|sp|Q6BHK3.2|CFT1_DEBHA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|199434312|emb|CAG90824.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
          Length = 1342

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 121/500 (24%), Positives = 231/500 (46%), Gaps = 46/500 (9%)

Query: 881  NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
            N    + ++L  +  P +AY+   T      +R+  F N++G    F++G  P +     
Sbjct: 810  NFKLVKEKDLIITGAPDNAYSLGTTIE----RRLVYFPNVNGFTSIFVTGITPYYISKTT 865

Query: 941  ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL 1000
              +    +      V+F    +    +G IY+ +    +IC++P    Y+N WP++KIP+
Sbjct: 866  HSVPRIFKFTKLPAVSFAPYSDDKIKNGLIYLDNSKNARICEIPVDFNYENNWPIKKIPI 925

Query: 1001 KATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS--SVDLHRTY 1058
            K +   +TY    N +       V+    ++    +D+E G  I   + S  S + ++ Y
Sbjct: 926  KESIKSVTYHELSNTF-------VISTYEEIPYDCLDEE-GKPIVGVDKSKPSANSYKGY 977

Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAIG 1114
                  ++++ P      W    TI +   E  + V+ + L   ++TK+    + L+ IG
Sbjct: 978  ------IKLISPYN----WSVIDTIELVDGEIGMNVQSMVLDVGSSTKKFKNKKELIVIG 1027

Query: 1115 TAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            T   + ED++A G   +F         G+   N +    E++ ++ KGA++++  + G  
Sbjct: 1028 TGKYRMEDLSANGSFKIFEIIDIIPEPGKPETNHK--FKEIHQEDTKGAVTSICEISGRF 1085

Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
            L++ G KII+       +  +AF D   +YV       N ++LGD  KSI+   +  +  
Sbjct: 1086 LVSQGQKIIIRDLQDDGVVPVAFLDTS-VYVSEAKSFGNLLILGDSLKSIWLAGFDAEPF 1144

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
            ++ +L KD  SLD    +F+I    + ++++D    + +  Y P+   S  GQ+L+ +A 
Sbjct: 1145 RMVMLGKDLQSLDVNCADFIIKDEEIFILIADNNSTLHLVKYDPEDPTSSNGQRLIHKAS 1204

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
            F++ +  T       + +        P S  T  F  +  T+DGS   + P++E ++RR+
Sbjct: 1205 FNINSTPT------CIRSIPKNEEINPSS--TEVFQSIGSTIDGSFYTVFPINEASYRRM 1256

Query: 1348 QSLQKKLVDSVPHVAGLNPR 1367
              LQ+++ D   H  GLNPR
Sbjct: 1257 YILQQQITDKEYHFCGLNPR 1276



 Score = 92.0 bits (227), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 91/414 (21%), Positives = 178/414 (42%), Gaps = 64/414 (15%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           L+V  A V++++ +   E  +++ K                  L+LV  ++LHG +  + 
Sbjct: 29  LIVGKATVLQVFEIITTETKTQQYK------------------LKLVEQFKLHGLITDIK 70

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +     +NS+  D ++++ + AK+S++++D  ++ +   S+H +E+          E  
Sbjct: 71  AIRT--VENSQL-DYLLVSSKGAKMSLIKWDHHLNSISTVSLHYYENSIQ---SSTYEKL 124

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMII-----------------LKASQGGSGLVGDEDTF 220
               LV V+P   C  +    L   +                 +  S G      +++  
Sbjct: 125 TTTDLV-VEPNNNCTCLRFKNLLTFLPFETLDEEEEDDDDDEEMNGSSGSDKKATNKENG 183

Query: 221 GSGGG-FSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
            S G   S   ESS +I+ R LD +   + D  F++ Y EP + I+  +   WAG +   
Sbjct: 184 NSNGEEVSELFESSFMIDGRTLDSRIGDIIDMQFLYNYREPTIAIIFSKAHAWAGNLPKV 243

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSA 336
                   LS+    K    +    NLP D  K++ +P P+ G L++G N  IH  +   
Sbjct: 244 KDNINFIVLSLDLVTKASTTVLKIDNLPFDIDKIIPLPQPLNGSLLMGCNEIIHVDNGGI 303

Query: 337 SCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVY 393
           +  LALN +  S+ +S +    +S  +++L+      + ND   L+    GD   +    
Sbjct: 304 TRRLALNQFTSSITTSLKNYHDQSDLNLKLENCSVKPIPNDNKVLMILNNGDFYYINFKI 363

Query: 394 DGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
           DG+ +++            D+  T P     +I T+ N+L F+ ++ G++ L++
Sbjct: 364 DGKTIKKFFVEKVSDLNYDDIQLTYP----GEIATLDNNLMFISNKNGNNPLLE 413


>gi|330919204|ref|XP_003298516.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
 gi|311328242|gb|EFQ93393.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
          Length = 1388

 Score =  154 bits (390), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 131/495 (26%), Positives = 222/495 (44%), Gaps = 62/495 (12%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT 973
            +    +I G+   F  G+ P + +           L    + + T  H  +C  GF Y+ 
Sbjct: 870  LVALDDICGYSTVFQRGTTPAFILKEASSAPRVIGLSGKPVKSLTSFHTSSCQRGFAYLD 929

Query: 974  SQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVL 1032
            S   L+ICQLP  + Y +  W  +++P+ +  H +TY     LY  IV            
Sbjct: 930  STDTLRICQLPPQTHYGHLGWATRRMPMDSEVHTLTYHP-PGLY--IVGT---------- 976

Query: 1033 SLLIDQEVGHQID-----NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
                 Q   +Q+D     +++L   DL    ++E   +++L+       W    T  +  
Sbjct: 977  ----GQAEDYQLDPTETYHYDLPKEDLTFKPSIERGVIKLLDEKS----WTIIDTHVLDP 1028

Query: 1088 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1146
             E  L+++ + L  +  T + + L+A+GT+ V GED+A +G + +F        P    T
Sbjct: 1029 QEVVLSIKTLNLEVSEITHQRKDLIAVGTSVVHGEDLATKGCIRIFEVITVVPQPDRPET 1088

Query: 1147 E-----VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLY 1197
                  +   E+KGA+SA++ L  QG L++A G K ++   K  GT L  +AF D    Y
Sbjct: 1089 NRRLKLIVKDEVKGAVSAISELGTQGFLIMAQGQKCMVRGLKEDGTLLP-VAFMDMQ-CY 1146

Query: 1198 VVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            V  L  +     + +GD ++ ++F  + E+  +++L A+   +L+  A +FL     L L
Sbjct: 1147 VSDLKNLPGTGMLAMGDAYRGVWFTGYTEEPYKMSLFARSKHNLETIAVDFLPFDQQLHL 1206

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAA 1313
            VV+D   N+QI  + P   +   G +LL +A FH G H    L L    L   S    AA
Sbjct: 1207 VVADADMNLQILQFDPDNPKGEAGSRLLHKATFHTG-HFPTSLHLIHSHLKLPSATDFAA 1265

Query: 1314 PGSDKTNRFAL------------------LFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1355
              ++  + FA+                  L  T  G++  + PL E ++RRL +L   L 
Sbjct: 1266 TNNNPADAFAMDTSPNTTTDTPQQPFHQILHTTQSGTLALLTPLSEDSYRRLSNLSAYLA 1325

Query: 1356 DSVPHVAGLNPRSFR 1370
            +++     LNPR+FR
Sbjct: 1326 NTLDSACSLNPRAFR 1340



 Score =  135 bits (341), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 168/698 (24%), Positives = 290/698 (41%), Gaps = 88/698 (12%)

Query: 57  NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NLVV   ++++I+ ++     V     + S+N+      E     L    + A L LV  
Sbjct: 28  NLVVAKNSLLQIFELKSTTTEVTPGSGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + L G V SLA +     +   + +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88  FPLAGTVISLARVK--ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145

Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE--- 217
                 W    +   +F     +  DP  RC  +      + IL   Q    LV D+   
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQ--RDLVEDDYDS 198

Query: 218 -----------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
                          + G       SS V+ L +LD  + H     F+H Y EP   I+ 
Sbjct: 199 DAEVPKETKADQANDTSGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVA 258

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
               T    ++ +      S  ++    K    + S   LP+D  K++ +PSPIGG L+V
Sbjct: 259 ASRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALLV 318

Query: 325 GAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
           G N  IH      +  +ALN +A +  S     +S  ++ L+      L  +    L+  
Sbjct: 319 GRNEIIHVDQGGKTNGVALNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIVL 378

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLL 434
             G L++LT   DGR V  + +                S  + +G    F+GS  G+S++
Sbjct: 379 NNGRLLILTFTLDGRTVSGMTIQTVAADHGGHLVKSAASCTSNLGRGRLFIGSEDGESVM 438

Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNN 493
           + +T       L++ L+ +  + + D            D   D+ N    +++  +A+ +
Sbjct: 439 LGWTG------LTNQLRRKLSNADLDG-EDDSDEEEIDDMEDDLYNDTAPTMHKITAAVS 491

Query: 494 TESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL 541
             +A  T++F + D L +I P+KD            + G + ++    A G     + EL
Sbjct: 492 EPTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDREL 551

Query: 542 -------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EAR 586
                   ELP   G+W V+ +       +     D+    A D +Y  YL++S   E  
Sbjct: 552 HPISVATKELPLAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGEDG 611

Query: 587 TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
           T+V E   D LTE  +      +G T+  G L    +V+QV     RI D       +  
Sbjct: 612 TVVYEVNGDQLTETDKGDFEREEGTTLLVGVLAAGTKVVQVMRTEVRIYDSELNLVHIQS 671

Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
                E GS  E + +++ S ADPY+L+   D S+++ 
Sbjct: 672 MEEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708


>gi|146415762|ref|XP_001483851.1| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1320

 Score =  154 bits (389), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 129/548 (23%), Positives = 249/548 (45%), Gaps = 52/548 (9%)

Query: 881  NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMVF 939
            N    + ++L  +  P +AY     P G   +R  ++   +SG    F++G  P +    
Sbjct: 788  NFKLKKEKDLLITGAPDNAY-----PAGTSIERRLVYIPLVSGFSSIFVTGVVPYFITRT 842

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
            R  +    +    +  +F    +   ++G I++ +    +IC+LP    YDN  PV+K+P
Sbjct: 843  RHSIPRIFKFTKIAAQSFASFSDSKVSNGLIFLDNAKNARICELPRDFNYDNNLPVKKVP 902

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLH 1055
            +  T   +TY    N Y  +VS     P N      +D+E     G + D  + +S    
Sbjct: 903  IGETVKSVTYHELSNTY--VVSTYREIPYNA-----LDEEGNPIAGLKKDKPSANSY--- 952

Query: 1056 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLL 1111
                  +  ++++ P      W    T+ ++ +E A+TV+ + L   ++TK     + LL
Sbjct: 953  ------KGSLKLISPYN----WTVIETVELRDNEIAMTVKSMVLDIGSSTKRFKHRKELL 1002

Query: 1112 AIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQ 1164
             +GT   + ED+ A G   ++         G+   N +    E  +++ KGA++++  + 
Sbjct: 1003 VVGTGRYRMEDLGANGAFKIYEIIDIIPEPGKPETNHK--FKEYNTEDTKGAVTSMCEVS 1060

Query: 1165 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
            G  L+A G KII+       +  +AF D   +YV       N ++LGD  KS++   +  
Sbjct: 1061 GRFLVAQGQKIIVRDVQDDGVVPVAFLDTS-VYVSEAKSFGNLVILGDTLKSVWLAGFDA 1119

Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
            +  ++ +L KD  S+D    EF+     + ++++     + +  + P+   S  GQ+L+ 
Sbjct: 1120 EPFRMIMLGKDLQSVDVSCAEFISKDEEIYILIAGNNNVMHLVQFDPEDPTSSNGQRLVH 1179

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
            RA F+V +  T    ++M+  + +       +  ++ F  +  T+DGS   + P++E T+
Sbjct: 1180 RASFNVSSSTTC---MRMVPKNEE-----INTQYSDVFQTVGSTIDGSFFTVFPVNEFTY 1231

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
            RR+  +Q++L D   H  GLNPR  R         + G   I+D +++  Y  L  + + 
Sbjct: 1232 RRMYIIQQQLTDKEYHYCGLNPRLNRFGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRKQ 1291

Query: 1405 EIAHQTGT 1412
             IA +  +
Sbjct: 1292 TIAQKVSS 1299



 Score = 80.5 bits (197), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 127/611 (20%), Positives = 237/611 (38%), Gaps = 68/611 (11%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L+  ++L+G V +L    +    +S   D I++A + AK+S++ +D   H +   S+H
Sbjct: 52  LRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESLH 108

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-----G 215
            +E           E+     L+ V+P       + +   +  L  S            G
Sbjct: 109 YYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDEG 164

Query: 216 DEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
           +        GF + +  SS +++ + L+  +  + D  F+H Y EP + IL  +  TW G
Sbjct: 165 ETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSLKAATWTG 224

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHY 331
            +           ++I    K    +    NLP D  +L+ +  P+ G L++G N  IH 
Sbjct: 225 LLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIHV 284

Query: 332 HSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVL 388
            +      LA+N Y   + +S +    ++  ++ L+      L ND   LLS  TG L  
Sbjct: 285 DNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLLTGSLFH 344

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
           +    D + ++R  L     +  +S D+T  G   F       DS L+     +G S L 
Sbjct: 345 INFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL- 397

Query: 448 SGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
                    +E      + +  +  S+  +DM   EEL    +A       Q    +   
Sbjct: 398 ---------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFKH 448

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT-------- 552
           D L+N GP+ DF+ G        A  I+   N   +   G      C  I+         
Sbjct: 449 DELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPIIR 508

Query: 553 ---VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
               + + +R  N ++  +   DD      I  +E     L++ D + +           
Sbjct: 509 SSLTFSQVNRMWNINNKYLITSDDVNLKSEIFQIEKSYSRLKSKDFIND----------E 558

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            TIA   L   + ++Q+  +   + +  +  + +SF     E     +   ++S ++ D 
Sbjct: 559 MTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHDD 614

Query: 670 YVLLGMSDGSI 680
           Y+++  + G +
Sbjct: 615 YLMIFFASGEV 625


>gi|403178252|ref|XP_003336695.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|375164075|gb|EFP92276.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1149

 Score =  154 bits (389), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 110/357 (30%), Positives = 170/357 (47%), Gaps = 18/357 (5%)

Query: 1075 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1132
            G W T      Q +E   ++  V L   +T       + +GT   + ED+AARG + +F 
Sbjct: 800  GKWVTIDGYEFQQNEWVTSMANVELDSRSTVSGRRQFVGVGTTCNRAEDLAARGGIYVFE 859

Query: 1133 ----STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNG 1187
                +  +N       +   Y +E K  ++A+ ++ G+ L   G K+    +   E L  
Sbjct: 860  IVVVNPAQNHRTYNRALRLRYYEETKACVTAVDAINGYFLHTMGQKLYAKCFEQDERLLA 919

Query: 1188 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
            + F D  P Y   + I KNFILLGD  K I  ++++E+  +L  L   +  L C   +FL
Sbjct: 920  VGFLDIKP-YTTCMRIFKNFILLGDAVKGITLVAFQEEPYKLIELGHTYVDLKCSTIDFL 978

Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
            +    L++V +D    I+IF Y P   ES  GQKLL R+EF+  + +T  ++        
Sbjct: 979  VIDGKLAIVATDLNGVIRIFEYNPTNIESQGGQKLLCRSEFNTSSEMTCSMQF------G 1032

Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
             R  A    D+       F +LDGSI  + P  E  ++RLQ +Q +L   + H AGLNP+
Sbjct: 1033 KRLSA---KDEAKVMGTFFASLDGSISSLVPAKEAVYKRLQLVQTRLTRHIQHFAGLNPK 1089

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              R    N    R     I+D ELL  + +L + +Q EIA   G+ R  +L NL +L
Sbjct: 1090 GHRTVR-NDLVSRAINRGILDGELLIKFHLLSVTQQAEIAGLAGSDRETVLVNLLNL 1145


>gi|403170487|ref|XP_003329830.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|375168746|gb|EFP85411.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1513

 Score =  153 bits (387), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 110/357 (30%), Positives = 170/357 (47%), Gaps = 18/357 (5%)

Query: 1075 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1132
            G W T      Q +E   ++  V L   +T       + +GT   + ED+AARG + +F 
Sbjct: 1164 GKWVTIDGYEFQQNEWVTSMANVELDSRSTVSGRRQFVGVGTTCNRAEDLAARGGIYVFE 1223

Query: 1133 ----STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNG 1187
                +  +N       +   Y +E K  ++A+ ++ G+ L   G K+    +   E L  
Sbjct: 1224 IVVVNPAQNHRTYNRALRLRYYEETKACVTAVDAINGYFLHTMGQKLYAKCFEQDERLLA 1283

Query: 1188 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
            + F D  P Y   + I KNFILLGD  K I  ++++E+  +L  L   +  L C   +FL
Sbjct: 1284 VGFLDIKP-YTTCMRIFKNFILLGDAVKGITLVAFQEEPYKLIELGHTYVDLKCSTIDFL 1342

Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
            +    L++V +D    I+IF Y P   ES  GQKLL R+EF+  + +T  ++        
Sbjct: 1343 VIDGKLAIVATDLNGVIRIFEYNPTNIESQGGQKLLCRSEFNTSSEMTCSMQF------G 1396

Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
             R  A    D+       F +LDGSI  + P  E  ++RLQ +Q +L   + H AGLNP+
Sbjct: 1397 KRLSA---KDEAKVMGTFFASLDGSISSLVPAKEAVYKRLQLVQTRLTRHIQHFAGLNPK 1453

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              R    N    R     I+D ELL  + +L + +Q EIA   G+ R  +L NL +L
Sbjct: 1454 GHRTVR-NDLVSRAINRGILDGELLIKFHLLSVTQQAEIAGLAGSDRETVLVNLLNL 1509



 Score =  122 bits (306), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 173/771 (22%), Positives = 308/771 (39%), Gaps = 167/771 (21%)

Query: 48  SKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKE---SKNSGETKRRVLMDGISAASLELV 104
           SK    P+ NL+V  + +++++ + + E+   E   ++N  E K +          L  +
Sbjct: 37  SKTRPRPITNLIVARSTLLQVFELCLVEDDQAENNHTRNHHELKNK-------NYKLFHL 89

Query: 105 CHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
           C +RLHG V  L  L+          D ++++F+DAK+++LE+ +S   L   S+H FE 
Sbjct: 90  CEHRLHGRVTGLQRLTTLDTQEDGL-DRLLVSFQDAKMTLLEWSNSAADLVPISLHTFEK 148

Query: 165 -PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
            P+       R+   +   ++VDP  RC  +L+    + +L   Q    L    D+ G  
Sbjct: 149 LPQITQGDLPRDFQGQ---LEVDPLSRCAVLLLPQATLAVLPFFQDQLDL----DSLGLS 201

Query: 224 GGFSARI------------ESSHVINLRD----------LDMKH--VKDFI---FVHGYI 256
           GG  + +             SS +++             LD +H  +K  I   F+ G+ 
Sbjct: 202 GGLKSALGSEQQRFQTFPYASSFILDFNQQLLNHLPPSSLDSQHRPIKSVIALKFLPGFS 261

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           EP + +L++ + TW+ R+    +T  +  L++       P+I    NLP+DA+ L+A P 
Sbjct: 262 EPTLAVLYQSQYTWSARLENHANTAALIVLTLDLGSNHFPIISHTTNLPYDAHGLVACPK 321

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS------------------ 358
            + GVLV+ A+ I +  QS+       N  V   S  ++PR                   
Sbjct: 322 ELAGVLVLCADMILHVDQSSKIIGLATNGWVKHTSELQIPRQDTVRLITPTNKISGHRST 381

Query: 359 ---------------------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                                         V L+ A   + + D A +  +TG++  L  
Sbjct: 382 TNKSDERPEDLEDGEEQDESGVPEGHEKLLVRLENAKIVFSRADRAFVFLRTGEVFSLQF 441

Query: 392 VYDGRVVQRLDLSKTN-PSVLTSDITTIGNSLFFLGSRLGDSLL-----------VQFTC 439
           + DGR + +L L K +  S++ S +  + N   F+GS  GDS L                
Sbjct: 442 LRDGRTLTKLVLEKLDLLSIIPSTVLKVNNECLFVGSMAGDSALYILDHLRPRSSSDDDN 501

Query: 440 GSGTSMLSSGLKE-----------EFG-DIEADAPSTKRLRRSSSDALQD--MVNGEELS 485
             G  + SS + +           +F  DI  D   T  +RR+    L D    NG +  
Sbjct: 502 DDGHQLPSSSIIQPDKAAKNQSSLDFDEDIYGDRTETDPVRRTDHSQLYDDRPSNGADDG 561

Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
             G+ ++  E   +     + D +   GP++DF+         +ATG+        +EL 
Sbjct: 562 RPGAGAHLAEPFLR-----LGDVIQAHGPIRDFTM--------AATGVENMP----LELL 604

Query: 546 GCKGI-----WTVYH-----KSSRGHNADSSRMAAYDDEYHAYLII-----SLEARTMVL 590
            C G       TV+H     +  R  + +S   +  +  +   L++     S E + + +
Sbjct: 605 ACTGTGDLGGLTVFHREIPLRKRRKLSFESPSASHINALFFTSLVVESGGLSEERKVVWM 664

Query: 591 ETADLLTEVT---ES-----VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
             +   TE+    ES     ++ F + +T+A    FG++ V+QV     ++   S     
Sbjct: 665 GRSGPRTEIATYGESGELSLINTFPE-KTLAVSPFFGKQFVVQVTNTAIKLFTSSL---- 719

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
                  ++         +L  SI D YV+L    G   +  GD  + T+S
Sbjct: 720 -----EEAQVIQPEPAVKILRASIVDDYVMLETHCGLKLIYQGDHDSKTLS 765


>gi|358056450|dbj|GAA97624.1| hypothetical protein E5Q_04302 [Mixia osmundae IAM 14324]
          Length = 1305

 Score =  153 bits (387), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 169/673 (25%), Positives = 300/673 (44%), Gaps = 98/673 (14%)

Query: 55  VPNLVVTAANVIEIY-----VVRVQE---EGSKESKNSGETKRRVLMDGISAASLELVCH 106
           V NLVV  +N +++Y      V VQ    +GS  S    +T+            L+L+  
Sbjct: 37  VRNLVVARSNFLQVYEVLEEPVPVQSSVTDGSSASMREDQTR------------LQLLAE 84

Query: 107 YRLHGNVESLAILSQGGADNSRR--RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
           +  HG V  LA LS     ++R+  R  ++++F DAK++V+E+ D +H L   SMH FE 
Sbjct: 85  HVCHGIVTGLARLS---TLDTRQDGRHRLVISFRDAKMTVMEWSDQLHDLAPVSMHSFE- 140

Query: 165 PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGG 224
                L +G +  A   +++VD   RC  +L+    + IL   Q  S L   ED    G 
Sbjct: 141 -RLPQLSQG-DLGAFQAVLRVDQASRCVALLLPDNTLGILPFFQDLSEL---EDMTREGL 195

Query: 225 GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCM 282
             S     S  I+L ++   +++V DF F+ G+ EP + IL +R+ TW GR+ +      
Sbjct: 196 Q-SLPYAPSLTIDLSEIGPGIRNVVDFAFLPGFSEPTIAILFQRKPTWTGRIDFAKDITS 254

Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALA 341
           +  +++    + +P+I+ A  LP+DA  L   P  +GGV+++ AN+ +H    S    +A
Sbjct: 255 LVMVTLDIGSRNYPVIFEADGLPYDALSLSVCPRELGGVVILCANSLVHIDQSSKMTGIA 314

Query: 342 LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
           +N +  +L  ++   R +  + L+ A   ++   VA+L T+TG+   L +  DGR V  +
Sbjct: 315 VNGWTSTLTDARLDSRPTLRLVLEGAQCAFVGQQVAVLCTRTGETFSLHLEKDGRNVSSM 374

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
           D      + + + I T+G +  F+GS  G S+L+++   SG             DI    
Sbjct: 375 DCRPRAVTCIPACIETVGAAYVFVGSAQGQSVLLRWASQSGAG----------ADILDIT 424

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSA-SNNTESAQ-----KTFSFAVRDSLVNIGPL 515
            S   L +  SDA+ D        LY +A ++N    Q     K     + D+L   G +
Sbjct: 425 ESGTGLVQ--SDAMDD-------DLYATAGAHNGNGHQIAPTGKDVQLELCDTLPGYGTI 475

Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
           +  +       D ++  + + S      +    G+ T++         D     A D  +
Sbjct: 476 RHIAV-----LDHTSASLDEPSLVACTGVQAMAGLTTIHRHVPSVRQVDLDLPTARDIRH 530

Query: 576 HAYLIISLEAR----------TMVLETADLLTEVTESVDYFVQGRT---------IAAGN 616
                + LE R           ++  T    + +  ++D   Q  T         +AAG+
Sbjct: 531 --IWTVGLEQRQKMGRGPITHQIICSTGS--SSMVYTLDQDTQAATLARKSAEVPLAAGS 586

Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
            F R +V++V E   R+            G   +E+  G  ++  + V+++DP+V +  +
Sbjct: 587 FFSRSQVLEVTEDMLRLYSPD--------GQITTEAPHGQADA--IDVTVSDPFVAVLSA 636

Query: 677 DGSIRLLVGDPST 689
             ++ +  GDP+T
Sbjct: 637 ARNVTVFFGDPTT 649



 Score =  125 bits (314), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 127/528 (24%), Positives = 230/528 (43%), Gaps = 48/528 (9%)

Query: 912  QRITIFKNISGHQGFFLSGSRPCWCMVFRE---RLRVHPQLCDGSIVAFTVLHNVNCNHG 968
            +R+  F + +G  G F++GS P + +  R    RL   P    G   AF      +    
Sbjct: 806  RRLVSFISTTGRSGVFITGSAPFYLLTDRAGIARLYRAPY---GRASAFGAFDPPSSTP- 861

Query: 969  FIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
             + V + G +    L   ++     PV  +   AT    T  A  +    +V+  V+   
Sbjct: 862  -LLVLADGAMHTYDLSDQASLARELPVTHV---ATSKCFTSTAYHDSSHTLVAARVV--- 914

Query: 1029 NQVLSLLIDQEVG-HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
            N    L  D+    ++  + ++ S  + R+       + +L P    G W          
Sbjct: 915  NAPFELFDDEGAPVYRAPSEDMISPTVFRSC------LELLVP----GSWDCIDGHEFPQ 964

Query: 1088 SENALTVRVVTLFNTTTKENETLLAIG-TAYVQGEDVAARGRVLLF--------STGRNA 1138
            +E+ L +   TL + T         I  T   +GED+  RG + +F        +    A
Sbjct: 965  NESILQLICATLPSATDPSGRARFVIASTCNNRGEDLQTRGGLYVFRISTTESTAASDQA 1024

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLY 1197
                  ++ V++ +L+  + A+  + GH++ + G K+ +  +   + L  + F D   L 
Sbjct: 1025 QARSAKLSLVHADDLRHPVGAICEVNGHIIHSLGQKVFIKAFDSDQRLITVGFLDVG-LD 1083

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            V ++  +KN +++GD     YF++++E   +L LL K+    D +  +FL+  + L L+ 
Sbjct: 1084 VSAMRSIKNLLIIGDSLTGTYFVAFQEDPFKLVLLGKEARKTDVYCVDFLVQENRLGLLS 1143

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDRTGAAPGS 1316
               +  ++   Y P  +ES  G++LL R E+H+G  +   L   + L+T  D   +    
Sbjct: 1144 VSRKGLLRQLEYNPGNAESRAGERLLDRTEYHLGKQIIDSLSFAKRLSTDEDLRQSG--- 1200

Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
                   +L G  DGS+  + P+ E+ +RRL  L+++L   +PH AGLNPR+FR    N 
Sbjct: 1201 ------VMLVGA-DGSLTWVTPVREVVYRRLALLERQLHRQLPHFAGLNPRAFRTAR-ND 1252

Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
               RP    ++D +LL+ Y  L    Q  +A    +    +  NL +L
Sbjct: 1253 YYSRPLARGMLDGDLLAIYANLHASRQQSLASHINSDPDTLSVNLGNL 1300


>gi|12697776|dbj|BAB21613.1| polyadenylation specificity factor [Homo sapiens]
          Length = 216

 Score =  153 bits (386), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 81/212 (38%), Positives = 122/212 (57%), Gaps = 12/212 (5%)

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            KSI  L ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  
Sbjct: 2    KSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAK 61

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTL 1329
            ES+ G +LL RA+FHVGAHV  F R       +   GA  G  K      N+    F TL
Sbjct: 62   ESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATL 114

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
            DG IG + P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D 
Sbjct: 115  DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDG 174

Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            ELL+ Y  L   E+ E+A + GTT   IL +L
Sbjct: 175  ELLNRYLYLSTMERSELAKKIGTTPDIILDDL 206


>gi|321260384|ref|XP_003194912.1| cleavage and polyadenylation specific protein [Cryptococcus gattii
            WM276]
 gi|317461384|gb|ADV23125.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
            gattii WM276]
          Length = 1431

 Score =  152 bits (384), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 130/529 (24%), Positives = 228/529 (43%), Gaps = 48/529 (9%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
            I  F NI G  G F++G +P W +        HP     L   ++      H       F
Sbjct: 931  IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
            I +       IC LP     D   P  +  ++ T   IT+      Y    S+ V  P  
Sbjct: 987  IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERTYTNITFDPTSAHYVGAASIEV--PFQ 1042

Query: 1030 QVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
                   D+E   Q+  D  +L      R+ T+E +        +   PW+         
Sbjct: 1043 AY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWKVIDGYEFDQ 1089

Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1144
            +E  +++  V L +          +A+GT +  GED A RG   +F   +          
Sbjct: 1090 NEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGP 1149

Query: 1145 -------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPL 1196
                   +        +  ++A+  + G+LL  +GPK+ +  +    +L G+AF D   L
Sbjct: 1150 GSVPGWKLVRRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGFDYDAQLMGLAFLDIQ-L 1208

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
            Y  ++ + KNF+L+GD+ KS +F+S +E   +   ++KD   +     +FL+    ++ +
Sbjct: 1209 YATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTFI 1268

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
             SD   ++++  + P   +S  G++L+ R E+H G+  T    +    T+ +    AP +
Sbjct: 1269 SSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAATVSKVIARRKTTEEE--FAPQT 1326

Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
                   +++ T DG++  +  + +  F+RLQ +  +LV +  HVAGLNPR+FR    N 
Sbjct: 1327 Q------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-ND 1379

Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               RP    I+D +LL+ + + P+  Q E+  Q GT    + S+L  L 
Sbjct: 1380 LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428



 Score =  110 bits (274), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 160/733 (21%), Positives = 298/733 (40%), Gaps = 121/733 (16%)

Query: 45  ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
           + P  + IG   NLVV  A  + ++ +R +     E  K  ++  E K+ V M+ +    
Sbjct: 39  DTPDVKVIG---NLVVAGAEALRVFEIREESVPIIEKVKLEEDVAEGKKDVQMEEVGDGF 95

Query: 97  --------------SAASLELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDA 140
                         +   L L+  + L+G V  LA     +   D     D +I++F+DA
Sbjct: 96  FDDGHAERAPLKYQTTRRLYLLAQHELNGTVTGLAATRTLESAIDG---LDRLIVSFKDA 152

Query: 141 KISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ 200
           K+++LE+  S   +   S+H +E    ++     +S+   PL++ DP  R   + +    
Sbjct: 153 KMALLEW--SRGDIATVSLHTYERCPQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDS 207

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEP 258
           + +L   Q  S L    D    G    A    S V++L D+   +K+++D +FV G+  P
Sbjct: 208 LAVLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFVPGFHSP 263

Query: 259 VMVILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
            + +L     TW+GR+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS
Sbjct: 264 TIALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPS 322

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQN 374
            +GG+++V +  I +  Q    A A  N   S  +S +   +S S  + L+ +   ++  
Sbjct: 323 ELGGIVLVTSTGIVHIDQGGRVAAACVNAWWSRITSLKCSMASVSQKLTLEGSRCVFVTP 382

Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLD-LSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
              LL  + G +  +    +GR V  ++ L K       SD+   G+   F+GS  GDS 
Sbjct: 383 HDMLLILQNGAVHQVRFSMEGRAVGLIEVLDKGCVVPPPSDLIVTGDGAVFVGSAEGDSW 442

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
           L +                            +R+ R+     +  V+ +E  LYG  ++ 
Sbjct: 443 LAKVNV-----------------------VRQRVERAEEKKDEMEVDWDE-DLYGDINDA 478

Query: 494 T--ESAQKTF-----SFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSN 538
              E AQ+ F     + +  D L  +G + D  +G+  +           + +G S+ S 
Sbjct: 479 ALDEKAQEQFGPAAITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNST 538

Query: 539 YELV-------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
           + +              EL   +G+W +      G      +     +   A +++S E 
Sbjct: 539 FNVFRRGIPITKRRRFNELLNAEGVWFLSIDRQTGQ-----KFKDIPEAERATILLSSEG 593

Query: 586 ---RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
              R   L +     ++       + G+T++A   F R  ++ V      +LD +     
Sbjct: 594 NATRVFALSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNN----- 643

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
              G          +   +++ SI+DP+ ++  +D S+   VGD    TV+ + P   E 
Sbjct: 644 ---GKIIQTVCPRGDGPKIVNASISDPFAIIRRADDSVTFFVGDTVARTVA-EAPIVSEG 699

Query: 703 SKKPVSSCTLYHD 715
                 +  ++ D
Sbjct: 700 ESPVCQAVEVFTD 712


>gi|257215708|emb|CAX83006.1| Cleavage and polyadenylation specificity factor subunit 1
           [Schistosoma japonicum]
          Length = 462

 Score =  152 bits (384), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 115/402 (28%), Positives = 197/402 (49%), Gaps = 55/402 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLV+T +  IEIY ++        S  SGET+     + +   S+        + N+  +
Sbjct: 41  NLVITRSGFIEIYNIK--------SSVSGETR----FNWVYGTSV--------YENIADI 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G        S++L+F +AK++V+ F+     LR  S+H +E   + +LK GR +
Sbjct: 81  VSVRFAGDLLD----SLLLSFSEAKVAVMNFNPITFELRTLSLHNYE---FENLKSGRMN 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG------------- 223
           F + P++++DP  RC  +LVY   + +L   +    +  + D    G             
Sbjct: 134 FTKLPILRLDPYQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHIGVRNFLLWQQRATA 193

Query: 224 ---GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
                F+  + +S         + +V D  F+HG+ EP +++L+E   TWAGRVS +  T
Sbjct: 194 PLLATFTTCLSTS-----TGEKINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDT 248

Query: 281 CMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCA 339
           C I ALS +   + +P+IW   +LP D   ++ VP PIGGV+++ AN+I Y  Q+  SC+
Sbjct: 249 CCIVALSFNLQKRTNPVIWFQESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCS 308

Query: 340 LALNNYA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD-- 394
           L LN YA    +    Q++P S   + +D      L     L+ T++G+L LL++  +  
Sbjct: 309 LPLNCYAQISTNFPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQA 367

Query: 395 GRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
            + V  L   K   +V    +  + +   F+GSR  DS+L++
Sbjct: 368 TQTVTSLLFHKVGHAVPPHCMVLLESKYLFIGSRFCDSVLMK 409


>gi|156040479|ref|XP_001587226.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980]
 gi|154696312|gb|EDN96050.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1447

 Score =  152 bits (383), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 137/588 (23%), Positives = 258/588 (43%), Gaps = 49/588 (8%)

Query: 872  STSRSLSVSNVSASRLRNLRFSRTP-LDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
            STS +L  S +   ++ N   ++ P + A  + +       + +    N+ G+   F+ G
Sbjct: 879  STSPNLLSSTLQFLKIHNTHLAQAPDVSAEEQADETQQTSDKPMRAVSNLGGYSVVFMPG 938

Query: 931  SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY- 989
              P + +   + L     L    +   +  H   C+ GFIY  ++GI+++ Q P  +T+ 
Sbjct: 939  GSPSFIVKSSKTLPKVLSLQGTGVRGLSSFHTEGCDRGFIYADTEGIVRVAQFPPTTTFA 998

Query: 990  DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
            D    ++K+ +    H + Y +    Y +  S               D E+    D+H  
Sbjct: 999  DIGMALRKVEIGEDVHAVAYHSPLQTYVIGTST------------FTDFELPKD-DDHRR 1045

Query: 1050 S--SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF-NTTTKE 1106
            S    D+    ++E+  ++++ P      W    TI ++  E    ++ + L  +  T E
Sbjct: 1046 SWQEEDIAFKPSIEKSSLKLISPVN----WSVIDTIELEPCEVITCIKTMNLVVSEVTNE 1101

Query: 1107 NETLLAIGTAYVQGEDVAARGRVLLFSTG---RNADNPQN------LVTEVYSKELKGAI 1157
             + LL +GTA  +GED+A  GR+ ++         D P+       +  E  ++   G +
Sbjct: 1102 RKPLLVVGTAITKGEDLATTGRLYVYDVVIVVPEPDRPETNKKLKLISAETITRGAGGPV 1161

Query: 1158 SALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLG 1211
            + L+ +  QG +L+A G K ++   K  GT L  +AF D    YV S+  +      ++ 
Sbjct: 1162 TGLSEIGTQGFMLVAQGQKCMVRGLKEDGTNLP-VAFMDTN-CYVTSIKELPGTGLCVIA 1219

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D  K ++F  + E+  ++ L  K    ++    + L DG  L +V +D   N+ I  Y P
Sbjct: 1220 DALKGVWFAGYTEEPYKMLLFGKSATRMEVLCADLLPDGKDLFIVAADADGNLHIMQYDP 1279

Query: 1272 KMSESWKGQKLLSRAEFHVGAH-------VTKFLRLQMLATSSDRTGAAPGSDK--TNRF 1322
            +  +S +G  LL R  F +GAH       +     L  L T+S  + +    +   +   
Sbjct: 1280 EHPKSLQGHLLLHRTTFSLGAHHPTTMTLLPAIPSLHPLTTASSSSLSPSPQEDSPSPSQ 1339

Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
            +LL  +  G+   ++PL E  +RR  +L   L +++ H  GLNPR++R    +      G
Sbjct: 1340 SLLLTSRTGTFALLSPLTESQYRRFGTLVSHLTNTLYHPCGLNPRAYR-VDKDANEGIVG 1398

Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
              +I+D  +L  +  L  + + E+A + G    ++   L++L  G  F
Sbjct: 1399 GRTIIDGGVLGRWMELGSQRRGEVAGRVGVDVLELRDELSELRRGLEF 1446



 Score =  129 bits (324), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 184/745 (24%), Positives = 307/745 (41%), Gaps = 153/745 (20%)

Query: 57  NLVVTAANVIEIYV-----VRVQEEGSKES---KNSGETKRRVL-MDGISAA-------- 99
           NLVV  A++++I+      V + E   K+S   K+   T  R    DG+ A+        
Sbjct: 28  NLVVAKASLLQIFTTKTVSVDLDELSGKDSSTVKDVTSTDPRAHDEDGVEASFLGADSIL 87

Query: 100 ---------SLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
                     L L+  Y L G V SL     I S+ G +      ++++ F+DAK+S++E
Sbjct: 88  PRSELARTTKLVLIAEYNLSGTVTSLVRVKTISSKTGGE------ALLVGFKDAKLSLVE 141

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           +D    G+   S+H +E  E                + VDP  RC  +      + IL  
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201

Query: 207 SQGGSGLVGDED--------------TFGSGGGFSARIESSHVINLRDLDMKHV--KDFI 250
            Q     + D D              +    G  +    SS V+ L  LD   +      
Sbjct: 202 KQDEDVNMDDWDEELDGPRPAKISQKSAAENGILATPYGSSFVLRLSSLDPSLIFPIHLE 261

Query: 251 FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
           F++ Y EP   IL       +  +  +  H T M+  L I    K    I S   LP+D 
Sbjct: 262 FLYEYREPTFGILSSTMAPSSALLQERKDHLTYMVFTLDIHQ--KASTTILSVGGLPYDL 319

Query: 309 YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           + ++ +  P+GG L+VGAN  IH      +  +A+N +A    +   L +S  ++ L+  
Sbjct: 320 FMIVPLAPPVGGALLVGANELIHIDQAGKANGVAVNMFAKQCTNFSLLDQSDLALRLEGC 379

Query: 368 HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLT---SDITTI 418
               L  +N   L+   +GD+ +L+   DGR V  L + + +      +LT   S ++++
Sbjct: 380 KIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGDILTGAASCVSSL 439

Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
           G    F+GS + DS+++ ++  SG                   PS ++ R  SS A+ D+
Sbjct: 440 GAGALFVGSEVSDSVILGWSRKSGQ------------------PSRRKSRLDSS-AIADV 480

Query: 479 ----------------VNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
                           + G+  ++  +A+N T S  K   ++F++ DS+VNI P+ + ++
Sbjct: 481 DEAMLDEEDLEDDDDDLYGDGPTISPTAANVTASNSKAGDYTFSIHDSMVNIAPITNITF 540

Query: 521 G-----------LRINADAS------ATGISKQSNYELV------------ELPGCKGIW 551
           G           L++N   S      A G  K  +  ++            ELP  +GIW
Sbjct: 541 GEVALSSDKEEELKLNGVQSELQLLAAVGREKGGSLAVINRNIQPNVIGRFELPEARGIW 600

Query: 552 TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
           T+  K  + +G   +  +         D +Y   +I+S   EA   + E+A     +   
Sbjct: 601 TMSAKKPAPKGLQVNKEKTVIGGDYGVDAQYDRLMIVSKASEAEDAIDESAVYALTNAGF 660

Query: 598 EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
           E     ++    G TI AG L    RVIQV +   R  DG   + Q L     + E+G+ 
Sbjct: 661 EALSGTEFEPAAGSTIEAGTLGNGMRVIQVLKSEVRSYDGDLGLAQILPM--LDDETGA- 717

Query: 656 SENSTVLSVSIADPYVLLGMSDGSI 680
                ++S S ADP++LL   D SI
Sbjct: 718 --EPKIISASFADPFLLLIRDDASI 740


>gi|167526060|ref|XP_001747364.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774199|gb|EDQ87831.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1324

 Score =  150 bits (379), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 174/734 (23%), Positives = 304/734 (41%), Gaps = 114/734 (15%)

Query: 739  IDGADGGP-LDQGDIYSVVCY------ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTY 791
             D A+ GP +   D Y+   Y      ESGAL I  VP+    F    F  G + + D  
Sbjct: 661  FDAAELGPSVATSDDYAAPTYWALITTESGALYICTVPDLKIAFHCPSFGDGHSLVWDRL 720

Query: 792  MREALK---DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
              +A+    D     + +  +    G +E I    ++ L   +      RP L A  +D 
Sbjct: 721  PNQAIPQAGDGADAPDEARNDDDAHGSEEYIVETLLIGLGQGQ------RPHLLARTSDH 774

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L Y+ +             PV       V +V+ + +R                    
Sbjct: 775  HLLMYEVF-------------PV-------VPSVTEASVR-------------------- 794

Query: 909  APCQRITIFKNISGHQGFFLSGSRPCW--CMVFRERLRVHPQLCDGSIVAFTVLHNVNCN 966
                R+  F+NI+G  G  ++G RP    C    + + + P   + ++  F  LH  +  
Sbjct: 795  ----RLKPFQNIAGCDGVCVTGPRPLLVACGHQLKAITIVPLALEDAVKTFHPLHMDDVE 850

Query: 967  HGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLK 1026
            +GFIY T  G L     P G   +     ++  L  T  +I +  +  L  L++  P  +
Sbjct: 851  NGFIYFTKAGTLCCATAPDGLMLNRGVLARRAVLGRTIQKIAFDLDSRLAALLLMEP--R 908

Query: 1027 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHR-TYTVEE-------YEVRILEPDRAGGPWQ 1078
            P           E+     N++  S +L   +Y  +E       +++++L P        
Sbjct: 909  P-----------ELKPSRGNNDPPSNELPNISYRPDEPKALTPFFQLQLLSPKSMKLLPD 957

Query: 1079 TRATIPMQSSENALT-VRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1137
            TR    +     +   VR+ +  N+T K+N   +A+G   ++G+     G V  ++   +
Sbjct: 958  TRIEYDLHHHVTSFAAVRLSSSLNSTGKQN--YIAVGVTLLEGQRATTTGFVDFYTVDVH 1015

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIAS-----GPKIILHKWT-GTELNGIAFY 1191
             D  +  + +  S +  G +SA+   +   L+A+     G KI +  +  G EL  +A++
Sbjct: 1016 -DGKETRLEKRASCKQPGCVSAMDCTEDGFLVAAVGQRLGSKIYVWNFQDGQELQPLAYF 1074

Query: 1192 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSW-KEQGAQ--------------LNLLAKDF 1236
            +A  +Y   + ++KN  ++GD    +  L + +++G Q              L  +  D 
Sbjct: 1075 EAG-IYTSCIRVIKNLAIVGDYESGVQLLRFSRQKGLQQMPVFRGTKHRFYSLVKVGADP 1133

Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
               +C+  +F++  S L+++  D   N+    Y     ++  G+ L+  A FH+G  ++ 
Sbjct: 1134 HKSNCYCADFVVRESDLAMIYGDADGNLVALDYDADSPDTRGGRILVRSANFHLGTRLSA 1193

Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
             LRLQ  A    R        +       FG ++G  G + PL E  +RRL+ LQKKLV 
Sbjct: 1194 MLRLQ--AAPVVRAPGGLAEAQKCHVVHTFG-IEGQQGVVIPLHEAEYRRLEMLQKKLV- 1249

Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1416
            S   +AGL+P  FR F S+    R     I+D  LL  Y  L   EQL++A Q G +  Q
Sbjct: 1250 SHSSLAGLHPFQFRAFKSSIWRPRSFAQGILDGALLRQYFCLGRREQLDVAEQLGVSAQQ 1309

Query: 1417 ILSNLNDLALGTSF 1430
            +  ++   AL  +F
Sbjct: 1310 LERDMAH-ALDAAF 1322



 Score =  126 bits (316), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 84/291 (28%), Positives = 142/291 (48%), Gaps = 29/291 (9%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LEL   +RL+G   ++  ++       + RD+++L+F DAKIS ++F+ S   L    + 
Sbjct: 34  LELAASFRLNGVATAMVAITL----PKQLRDTVVLSFADAKISAIQFEPSTRTLITQKLI 89

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
             E  E ++  +        P+++ DP  RC G LVYG +++I+ A              
Sbjct: 90  NLEI-EAVYGSKVNADLP--PVLQADPLHRCIGALVYGCRLVIIPAH------------- 133

Query: 221 GSGGGFSARIESS-HVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
                   R      VI+L  L   +   K F F+ GY  P  ++LHE    W GR +  
Sbjct: 134 ----ALQPRTNVQFRVIDLEKLSSPLGQAKSFCFLTGYTTPTALLLHEPRPVWVGRHAVG 189

Query: 278 HHTCMISALS--ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
             +C++SALS  + TT    P +W+  +LP D + L+  P P+GG L+V  N + + +Q+
Sbjct: 190 RDSCVLSALSCELDTTDDFAPTVWAKDSLPSDCFALVPTPQPLGGALIVSPNMVLHTNQA 249

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
           +S A+A+N  A          ++  S+ LD A  T++ +  A+ S ++G L
Sbjct: 250 SSSAVAVNAIAARATGYPHTTQAGLSLNLDNARVTFITSVDAIFSLQSGQL 300


>gi|58268668|ref|XP_571490.1| cleavage and polyadenylation specific protein [Cryptococcus
            neoformans var. neoformans JEC21]
 gi|134113364|ref|XP_774707.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|338817789|sp|P0CM63.1|CFT1_CRYNB RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|338817790|sp|P0CM62.1|CFT1_CRYNJ RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|50257351|gb|EAL20060.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|57227725|gb|AAW44183.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
            neoformans var. neoformans JEC21]
          Length = 1431

 Score =  150 bits (379), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 129/529 (24%), Positives = 228/529 (43%), Gaps = 48/529 (9%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
            I  F NI G  G F++G +P W +        HP     L   ++      H       F
Sbjct: 931  IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
            I +       IC LP     D   P  +  ++     IT+      Y    S+ V  P  
Sbjct: 987  IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEV--PFQ 1042

Query: 1030 QVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
                   D+E   Q+  D  +L      R+ T+E +        +   PW+         
Sbjct: 1043 AY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWKVIDGYEFDQ 1089

Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1144
            +E  +++  V L +          +A+GT +  GED A RG   +F   +          
Sbjct: 1090 NEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGP 1149

Query: 1145 -------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPL 1196
                   + +      +  ++A+  + G+LL  +GPK+ +      ++L G+AF D   L
Sbjct: 1150 GSVPGWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDSQLMGLAFLDIQ-L 1208

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
            Y  ++ + KNF+L+GD+ KS +F+S +E   +   ++KD   +     +FL+    ++ +
Sbjct: 1209 YATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTFI 1268

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
             SD   ++++  + P   +S  G++L+ R E+H G+  T    +    T+ +    AP +
Sbjct: 1269 SSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAATVSKVIARRKTAEEE--FAPQT 1326

Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
                   +++ T DG++  +  + +  F+RLQ +  +LV +  HVAGLNPR+FR    N 
Sbjct: 1327 Q------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-ND 1379

Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               RP    I+D +LL+ + + P+  Q E+  Q GT    + S+L  L 
Sbjct: 1380 LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428



 Score =  112 bits (279), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 161/716 (22%), Positives = 300/716 (41%), Gaps = 108/716 (15%)

Query: 57  NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---------------- 96
           NLVV  A V+ ++ +R +     E  K  ++  E ++ V M+ +                
Sbjct: 48  NLVVAGAEVLRVFEIREESVPIIENVKLEEDVAEGEKDVQMEEVGDGFFDDGHAERAPLK 107

Query: 97  --SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
             +   L L+  + L+G +  LA  ++         D +I++F+DAK+++LE+  S   +
Sbjct: 108 YQTTRRLHLLTQHELNGTITGLAA-TRTLESTIDGLDRLIVSFKDAKMALLEW--SRGDI 164

Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
              S+H +E    ++     +S+   PL++ DP  R   + +    + +L   Q  S L 
Sbjct: 165 ATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDSLAVLPLIQEQSEL- 220

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDM--KHVKDFIFVHGYIEPVMVILHERELTWAG 272
              D    G    A    S V++L D+ +  K+++D +F+ G+  P + +L     TW+G
Sbjct: 221 ---DPLSEGFSRDAPYSPSFVLSLSDMSITIKNIQDLLFLPGFHSPTIALLFSPMHTWSG 277

Query: 273 RV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
           R+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS +GG+++V +  I 
Sbjct: 278 RLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSELGGIVLVTSTGIV 336

Query: 331 YHSQ----SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
           +  Q    +A+C  A  +   SL  S  +   S  + L+ +   ++     LL  + G +
Sbjct: 337 HVDQGGRVTAACVNAWWSRITSLKCS--MASVSQKLTLEGSRCVFVTPHDMLLVLQNGAV 394

Query: 387 VLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
             +    +GR   V++ LD     P    SD+T  G+   F+GS  GDS L +       
Sbjct: 395 HQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSWLAKVNVVRQV 452

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
              S   K+E  +++ D    + L    +DA  D    E   L+G A+          + 
Sbjct: 453 VERSEKKKDEM-EVDWD----EDLYGDINDAALDEKAQE---LFGPAA---------ITL 495

Query: 504 AVRDSLVNIGPLKDFSYGL-----------------------RINADASATGISKQSNYE 540
           +  D L  +G + D  +G+                        IN       I+K+  + 
Sbjct: 496 SPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTINVFRRGIPITKRRRFN 555

Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
             EL   +G+W +      G      +     +   A +++S E          L ++ T
Sbjct: 556 --ELLNAEGVWFLPIDRQTGQ-----KFKDIPEAERATILLSSEGNAT--RVFALFSKPT 606

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENS 659
                 + G+T++A   F R  +++V      +LD +  + Q +         G G +  
Sbjct: 607 PQQIGRLDGKTLSAAPFFQRSCILRVSPLEVVLLDNNGKIIQTV------CPRGDGPK-- 658

Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
            +++ SI+DP+V++  +D S+   VGD    TV+ + P   E       +  ++ D
Sbjct: 659 -IVNASISDPFVIIRRADDSVTFFVGDTVARTVA-EAPIVSEGESPVCQAVEVFTD 712


>gi|298715584|emb|CBJ28137.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 255

 Score =  150 bits (379), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 83/244 (34%), Positives = 140/244 (57%), Gaps = 12/244 (4%)

Query: 1188 IAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1246
            I F+D P +YV+SL+++K+ FIL+GD + S+  + W+E+   L  L+KD      F+ E+
Sbjct: 23   IGFHD-PRVYVMSLSVIKHKFILVGDAYGSVQLVVWREEDHSLTALSKDHEDCQVFSAEY 81

Query: 1247 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS 1306
            LID   +++VV+D ++N+++  YAP  + S  G KLL +++F++G+ V K  R +     
Sbjct: 82   LIDEPGMAIVVADGRRNVKVLQYAPNATNSRGGTKLLCQSDFYLGSRVGKLTRRRTRGNL 141

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1366
             D  GA        R+ LL GTLDG +G + P+DE  FRRL +LQ  + +++ H    NP
Sbjct: 142  RD--GA--------RYCLLAGTLDGGLGAVLPVDERVFRRLYALQGIMSNALGHNGAANP 191

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
            R++R F            +++D  LL  +  L  + Q ++    GTT  ++++NL D+ L
Sbjct: 192  RAYRLFDHGPTFRYETKQNMLDGSLLWRFVGLDAKTQHDLTRAIGTTVDRVMANLLDIDL 251

Query: 1427 GTSF 1430
             + F
Sbjct: 252  ASLF 255


>gi|302652143|ref|XP_003017931.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
 gi|291181517|gb|EFE37286.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
          Length = 429

 Score =  150 bits (378), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 200/392 (51%), Gaps = 33/392 (8%)

Query: 1060 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYV 1118
            +E   V++LEP      W T  +  ++ +E    + V+ L  +  T E + ++ +G++ V
Sbjct: 51   LERGTVKLLEPRN----WSTIDSHELEPAERITCIEVIRLEISELTHERKDMVVVGSSIV 106

Query: 1119 QGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE-LKGAISALASL--QGHLLIAS 1171
            +GED+  +G + +F        P    ++   ++++KE +KGA++AL+ +  QG L++A 
Sbjct: 107  KGEDIVPKGFIRVFEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQGFLIVAQ 166

Query: 1172 GPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGA 1227
            G K ++   K  G+ L  +AF D    YV  L  +K     ++GD  K ++F+ + E+  
Sbjct: 167  GQKCMVRGLKEDGSLLP-VAFKDTQ-CYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPY 224

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
            +L+L  K+  +L     +FL DG+ L ++V+D+  N+ +  Y P+   S KG +LL R+ 
Sbjct: 225  KLDLFGKENENLAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSV 284

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAP--------GSDKTNRFALLFGTLDGSIGCIAPL 1339
            FH G     F     L     RT ++P         S   +++ +L     GS+  I PL
Sbjct: 285  FHTG----HFASTMTLLPHGARTPSSPVDEDAMDTDSPPPSKYQILMTFQTGSVAVITPL 340

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
             E ++RRL +LQ +LV+++ H   LNPR +R   S+G   + G   ++D  LL  +  + 
Sbjct: 341  GEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDGMGGQRG---MIDGNLLLRWLDMG 397

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
             + + EIA + G     I  +L  L  G ++L
Sbjct: 398  AQRKAEIAGRVGADVGAIRVDLEKLHGGLAYL 429


>gi|405121446|gb|AFR96215.1| cleavage and polyadenylation specific protein [Cryptococcus
            neoformans var. grubii H99]
          Length = 1431

 Score =  148 bits (374), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 128/529 (24%), Positives = 227/529 (42%), Gaps = 48/529 (9%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
            I  F NI G  G F++G +P W +        HP     L   ++      H       F
Sbjct: 931  IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
            I +       IC LP     D   P  +  ++     IT+      Y    S+ V  P  
Sbjct: 987  IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEV--PFQ 1042

Query: 1030 QVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
                   D+E   Q+  D  +L      R+ T+E +        +   PW+         
Sbjct: 1043 AY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWRVIDGYEFDQ 1089

Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1144
            +E  +++  V L +          +A+GT +  GED A RG   +F   +          
Sbjct: 1090 NEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGP 1149

Query: 1145 -------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPL 1196
                   + +      +  ++A+  + G+LL  +GPK+ +       +L G+AF D   L
Sbjct: 1150 GSVPGWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDAQLMGLAFLDIQ-L 1208

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
            Y  ++ + KNF+L+GD+ KS +F+S +E   +   ++KD   +     +FL+    ++ +
Sbjct: 1209 YATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTFI 1268

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
             SD   ++++  + P   +S  G++L+ + E+H G+  T    +    T+ +    AP +
Sbjct: 1269 SSDRNGDMRMLDFDPTDPDSLNGERLMLKTEYHAGSAATVSKVIARRKTAEEE--FAPQT 1326

Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
                   +++ T DG++  +  + +  F+RLQ +  +LV +  HVAGLNPR+FR    N 
Sbjct: 1327 Q------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-ND 1379

Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               RP    I+D +LL+ + + P+  Q E+  Q GT    + S+L  L 
Sbjct: 1380 LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428



 Score =  111 bits (278), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 160/726 (22%), Positives = 301/726 (41%), Gaps = 107/726 (14%)

Query: 45  ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
           + P  + IG   NLVV  A V+ ++ +R +     E +K  ++  E ++ V M+ +    
Sbjct: 39  DTPDVKVIG---NLVVAGAEVLRVFEIREESVPIIEKAKLEEDVAEGEKDVQMEEVGDGF 95

Query: 97  --------------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKI 142
                         +   L L+  + L+G V  LA  ++         D +I++F+DAK+
Sbjct: 96  FDDGHAERAPLKYQTTRRLHLLTQHELNGTVTGLAA-TRTLESTIDGLDRLIVSFKDAKM 154

Query: 143 SVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
           ++LE+  S   +   S+H +E    ++     +S+   PL++ DP  R   + +    + 
Sbjct: 155 ALLEW--SRGDIATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLWRLAVLTLPEDSLA 209

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
           +L   Q  S L    D    G    A    S V++L D+   +K+++D +F+ G+  P +
Sbjct: 210 VLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFLPGFHSPTI 265

Query: 261 VILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            +L     TW+GR+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS +
Sbjct: 266 ALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSEL 324

Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQNDV 376
           GG+++V +  I +  Q    A A  N   S  +S +   +S S  + L+ +   ++    
Sbjct: 325 GGIVIVTSTGIVHVDQGGRVAAACVNAWWSRITSLKCSTASVSQKLTLEGSRCVFVTPHD 384

Query: 377 ALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
            LL  + G +  +    +GR   V++ LD     P    SD+T  G+   F+GS  GDS 
Sbjct: 385 MLLVLQNGAVHQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSW 442

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
           L +          +   K+E  +++ D    + L    +DA  D    E+   +G A+  
Sbjct: 443 LAKVNVVRQVVERAEKKKDEM-EVDWD----EDLYGDINDAALDEKAQEQ---FGPAA-- 492

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSNYELV--- 542
                   + +  D L  +G + D  +G+  +           + +G S+ S + +    
Sbjct: 493 -------ITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTFNVFRRG 545

Query: 543 ----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA---RTMV 589
                     EL    G+W +      G      +     +   A +++S E    R   
Sbjct: 546 IPITKRRRFNELLNADGVWFLPIDRQTGQ-----KFKDIPEAERATMLLSSEGNATRVFA 600

Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
           L +     ++       + G+T++A   F R  ++ V      +LD +        G   
Sbjct: 601 LSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNN--------GKII 647

Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
                  +   +++ SI+DP+V++  +D S+   VGD    TV  + P   E       +
Sbjct: 648 QTVCPRGDGPKIVNASISDPFVIIRRADDSVTFFVGDTVARTVG-EAPIVSEGESPVCQA 706

Query: 710 CTLYHD 715
             ++ D
Sbjct: 707 VEIFTD 712


>gi|344305212|gb|EGW35444.1| pre-mRNA 3'-end processing factor CF II [Spathaspora passalidarum
            NRRL Y-27907]
          Length = 1348

 Score =  148 bits (374), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 133/591 (22%), Positives = 256/591 (43%), Gaps = 66/591 (11%)

Query: 836  HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
            H   +L  +   G ++ Y+ Y F+G                    N    + ++LR +  
Sbjct: 783  HKEEYLTILTIGGEVIMYKLY-FDG-------------------ENYIFKKEKDLRITGA 822

Query: 896  PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
            P +AY     P G   +R +  F N++G+   F++G  P   M     +    Q      
Sbjct: 823  PENAY-----PLGTTIERRLVYFPNLNGYTSIFVTGIIPYLIMKPMHSIPRIFQFSKIPA 877

Query: 955  VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKN 1014
            ++ +   +    +G I++ +    +IC+L    TY+  WP+++I +  +   ITY    N
Sbjct: 878  LSISAFSDSKIKNGLIFLDNSKNARICELSLDFTYEFNWPMRQIHIGDSIKSITYHETSN 937

Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDR 1072
             Y  +VS     P +           G   D   +      +T     Y+  ++++ P  
Sbjct: 938  TY--VVSTFREIPYD-----------GLDEDGKLIVGTLPDKTPRPVAYKGSIKMISPLN 984

Query: 1073 AGGPWQTRATIPMQSSENALTVRVVTLFNTTT----KENETLLAIGTAYVQGEDVAARGR 1128
                W    TI +  +E A+ V+ + L   ++    K  +  + IG+   + ED+ A G 
Sbjct: 985  ----WTVIDTIELDDTEVAMNVQSMMLDVGSSMKKFKNKKEFIVIGSGKYRNEDLVANGS 1040

Query: 1129 VLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
              +F        P      +   EV+ ++ +GA++++  L G LLIA G K+I+      
Sbjct: 1041 FKIFEIVDIVPEPGKPETNHKFKEVFQEDTRGAVTSICGLSGRLLIAQGQKVIVRDVQDD 1100

Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
             +  +AF D   +YV     + N ++LGD  KS + + +  +  ++ +L KD   L+   
Sbjct: 1101 GVVPVAFLDTA-VYVSESKSLGNLLMLGDPLKSCWLVGFDAEPFRMIMLGKDLHHLNVSC 1159

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
             +F+     + ++++D    + +  Y P   +S  GQ+L+S++ F + + V+   +L  +
Sbjct: 1160 GDFITKDEDIYMLIADNNNILHLIQYDPDDPQSLNGQRLISKSAFEIESTVSCMRKLPKI 1219

Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
             +S +++         + F ++  T DGS   + P+DE ++RR+  LQ++L D   H  G
Sbjct: 1220 ESSFEKSEIK--FSPIDEFQIIGSTSDGSFFNVFPVDESSYRRMYILQQQLTDKEYHYCG 1277

Query: 1364 LNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
            LNPR  R     +   N    +P    I+D  L+  Y  L  + +  +A +
Sbjct: 1278 LNPRLNRFGGAIELRDNETNTKP----ILDFGLIKRYAQLNEDRKRNLASK 1324



 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 100/500 (20%), Positives = 196/500 (39%), Gaps = 77/500 (15%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           L+V   N+++I+   + ++ S  +K                  L+++  ++L+G +  L 
Sbjct: 29  LIVAKGNLLQIFEPVLIKQQSTPTK--------------PKYKLQIIGQFKLNGLITDLH 74

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR--E 175
            L     +N    D +I++ + AK S+++++  +H +   S+H +E     H  R    E
Sbjct: 75  PLRT--VENPHL-DYLIVSTKYAKFSIIKWNHHLHTISTVSLHYYE-----HAIRNSTFE 126

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE----------------DT 219
                 L+ V+P       L +   +  L  +        D+                D 
Sbjct: 127 KLGISELI-VEPTFNSCSCLRFKNLLCFLPFAVSDEEEEEDDEEDMDLDNKKEKKEKLDI 185

Query: 220 FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
            G      +  +SS +I+ + LD  ++ V D  F+H Y EP + IL  +   WAG +   
Sbjct: 186 NGKPADAVSFYDSSFIIDAQTLDSSIETVVDIQFMHNYREPTIAILSSKSNVWAGNLLKV 245

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSA 336
                   +++    K    ++   NLP++  +++ +PSP+ G L++G N I H  +   
Sbjct: 246 KDNVSFQVMTLDLVSKSTVSVFKIDNLPYEIDRIIPLPSPLNGCLLLGCNEIFHVDNGGI 305

Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
              +A+N++      S  S Q+    S S+E D        +   L+   TG    +   
Sbjct: 306 IKRIAVNSFTSLVTASTKSYQDQTDLSLSLE-DCCIIPIPGDHRVLMVLTTGQFFYINFE 364

Query: 393 YDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
            DG+ ++++ +   + ++ +        ++  + ++L F  +  G+S LVQF        
Sbjct: 365 LDGKAIKKVHIDTVDQALYSQIKLCYPGEVAVLDHNLLFFANENGNSPLVQF-------- 416

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ----KTF 501
                   + D+  D     +         +     +E  LY    N  E  Q       
Sbjct: 417 -------RYTDV--DQKRITQEAAKEEKKEEKDDEEDEDDLYMDEENEEEQKQIISNSPI 467

Query: 502 SFAVRDSLVNIGPLKDFSYG 521
            F   D L+N GP+  F+ G
Sbjct: 468 EFIHHDELINNGPISSFTLG 487


>gi|344229600|gb|EGV61485.1| hypothetical protein CANTEDRAFT_109087 [Candida tenuis ATCC 10573]
          Length = 1300

 Score =  148 bits (374), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 128/527 (24%), Positives = 234/527 (44%), Gaps = 60/527 (11%)

Query: 918  KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
            +N+SG  G F+SG  P + +     +    +     I++F        N+  I++  +  
Sbjct: 814  ENLSGLTGIFVSGDVPYYIVKTNHSIPRIFKFARIPIMSFGKF----ANNQLIFLDDKKN 869

Query: 978  LKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1037
             +IC++PS   Y+N WP ++I +  T   + Y    N +  ++S     P N      +D
Sbjct: 870  TRICEIPSEFNYENNWPARQINIGETIKDVAYHETSNTF--VISTYKEIPYN-----CLD 922

Query: 1038 QE----VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1093
            +E    VG   D  +  S          +  ++++ P      W       +  +E    
Sbjct: 923  EENVPIVGIMEDKPSALSY---------KGSIKLVSP----ISWTVIDEFELDDNEVGTK 969

Query: 1094 VRVVTL-FNTTT---KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNL 1144
            V  + L   ++T   K     + IGT  ++ ED+AA G   +         P      + 
Sbjct: 970  VSSMVLDVGSSTRRFKSKREFVVIGTGKLRMEDLAANGSFKVLEIIDVIPEPGHPETNHK 1029

Query: 1145 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
              E Y +E KGA++A++ + G  L++ G KII+       +  +AF D   +YV      
Sbjct: 1030 FKEFYKEETKGAVTAVSDVSGRFLVSQGQKIIVRDLQDDGVVPVAFLDCS-VYVSESKSY 1088

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
             NF+LLGD  KS++   +  +  ++ +L KD  S+D    +F++    L ++V D    +
Sbjct: 1089 GNFVLLGDTLKSVWLAGFDAEPYRMIMLGKDLKSIDVNCADFIVKDEELYIIVGDNNNIL 1148

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
             +  Y P+   S  GQ+L+ +A F++ A VT   +L+ L    D + +  GS        
Sbjct: 1149 HLLKYDPEDPNSSNGQRLVEKAAFNLNAKVT---QLKQLPNLMDNSTSCIGS-------- 1197

Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHR 1380
               T++GS   + P++E ++RR+  LQ++L D   H  GLNPR  R    +  +N   ++
Sbjct: 1198 ---TIEGSFFTVFPINESSYRRMYILQQQLTDKAYHHCGLNPRLNRFGGLKLTANESNNK 1254

Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
            P    I+D +++  Y  L  + +  I  +     S+I  ++ +   G
Sbjct: 1255 P----ILDYDVIKLYAKLNEDRRRNIGAKVSREGSEIWRDMLEFEAG 1297



 Score =  106 bits (264), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 105/463 (22%), Positives = 201/463 (43%), Gaps = 45/463 (9%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  Y+L G + S+  +       + + D +++A + AKIS++ +D + H +R  S+H
Sbjct: 51  LNLVDQYKLFGTITSIKPIR---TIENPKLDYLLVATQLAKISLVRWDHASHSIRTVSLH 107

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            +E+   +      +      L+ V+P+  C  V    L   +             ++  
Sbjct: 108 YYEN---VIQTSTFDKLNSAELI-VEPKNACLCVRYKNLLTFLPFTRLKTEEDEYADEED 163

Query: 221 GS-GGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
           G+    +    +SS +IN ++LD +   + D  F+H Y +P + +L  ++  WAG + +K
Sbjct: 164 GAVTNSYDGIYDSSFLINGQNLDSRIGTIVDADFLHNYRQPTVALLSSKDQVWAGNLFFK 223

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
                   LS+    K+   +    +LP+D  +L+++PSP+ G L+VGAN  IH  +   
Sbjct: 224 KDNISYIVLSLDLNTKKSTTVLKIDDLPYDIDRLISLPSPLNGSLLVGANQLIHIDNGGI 283

Query: 337 SCALALNNYA--VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVY 393
           +  +++N +    + +S   +  S  ++ L+      L N+  +L    TG+  +LT   
Sbjct: 284 TRKISVNPFTDLTTKNSKNYINYSHMNLRLENCSVVPLPNENKVLVILSTGEFYMLTFEI 343

Query: 394 DGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
           DG+ ++RL       S               + N+L F+G++ G+S L+Q+         
Sbjct: 344 DGKTIKRLTFEVVETSRYNGINVTFPGQFAALDNNLLFVGNKNGNSPLIQYKY------- 396

Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
             G KE+   ++ DA   +           D    +  S            ++   F + 
Sbjct: 397 -EGAKEK--AVKEDAKDEEDNDGDEELYEDDEEKVKSFS------------KEKLDFTLC 441

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
           D L+N GP+  F++G   N    +  I+   NY+ V +    G
Sbjct: 442 DELINHGPISAFTFGFYSNEKFKSNLIN--PNYQEVSIFSNSG 482


>gi|302403950|ref|XP_002999813.1| cft-1 [Verticillium albo-atrum VaMs.102]
 gi|261361315|gb|EEY23743.1| cft-1 [Verticillium albo-atrum VaMs.102]
          Length = 1349

 Score =  148 bits (373), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 137/558 (24%), Positives = 234/558 (41%), Gaps = 109/558 (19%)

Query: 886  RLRNLRFSRTPLDAYTREETPHGAPCQRITIFK---NISGHQGFFLSGSRPCWCMVFRER 942
            ++ N   +++P++A   E         R+   +   NI G+   F+ G+ P + +   + 
Sbjct: 846  KVSNSHLAKSPVEAAEDEAVQE----NRVIPLRACDNIGGYSTVFVPGASPSFILKSSKS 901

Query: 943  LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST---------YDNYW 993
                  L    +   +  H   C  GFIY  S+G  ++ Q P  +          Y   W
Sbjct: 902  TPKVIGLQGLGVNGMSSFHTEGCERGFIYADSKGCARVTQFPDAANVAELGVDDDYHKEW 961

Query: 994  PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
              ++ P+   P    + + K   P+  +V             ID+               
Sbjct: 962  AKEECPM---PPMKEHGSIKLYSPITWNV-------------IDE--------------- 990

Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1113
                + +E+YEV +              T+ ++ SE              TKE   L A+
Sbjct: 991  ----FELEQYEVAM-----------CMKTLLLEVSEE-------------TKERRMLFAV 1022

Query: 1114 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKEL-KGAISALASLQGHL 1167
            GTA ++GED+  RGR+L+F        P    T+     +  +E+ +GA+++L       
Sbjct: 1023 GTAILRGEDLPVRGRILVFDVVHVIPQPDRPETDRKLKLIAKEEIPRGAVTSLC----EK 1078

Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1225
             +  G    L +W       +A  D    YVV+++ ++N  + L+ D +  ++F+ + E+
Sbjct: 1079 CMVRG----LRRWHAA---AVALPDLS-TYVVAVHELRNTGYCLMADANMGVWFVGYSEE 1130

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
              ++ L  K    L C   +FL+ G+ LS+V SDE   + I  + P+   S +G  LL+R
Sbjct: 1131 PYRMTLFGKSGTQLKCLTADFLVAGNDLSIVASDEDGVLHILQFDPEHPRSLQGHLLLNR 1190

Query: 1286 AEFHVGA-HVTKFLRLQMLAT------SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
            A F V   H    L L    T      S   T AA   ++T    LL  +  G+I  + P
Sbjct: 1191 ASFSVAPNHAWVTLALPRTTTRPYLPQSEPATNAAGSQNRTQ--TLLLASASGAIASLNP 1248

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLS 1393
            + E  +RRL SL   L +++PH AG+NP++ R    +G A  P  D     +IVD  LL+
Sbjct: 1249 ITEHAYRRLTSLTTSLANALPHAAGMNPKAHRLPPQDGAARPPAVDVSAGRTIVDGALLA 1308

Query: 1394 HYEMLPLEEQLEIAHQTG 1411
             +  L   ++ E A + G
Sbjct: 1309 RWNELGARQRAEAAGKGG 1326



 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 117/507 (23%), Positives = 199/507 (39%), Gaps = 75/507 (14%)

Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S V+ L  LD  + H   F F+H Y EP + I+              H T  +   ++  
Sbjct: 197 SFVLALPQLDPEILHPVHFAFLHEYREPTLGIISSSNRRLKMEPQMDHFTFKV--FTVDL 254

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
             K    I +  NLP    K++A+  P+GG L++G N  IH      +  +A+N YA  +
Sbjct: 255 LQKASTAILTVSNLPQSLKKVVALSKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKM 314

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL---- 403
                  +S   + L+      +  D    LL T+ G++ ++T   DGR V  + +    
Sbjct: 315 TKFPLADQSELKLRLEHCEVELMSPDNGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVA 374

Query: 404 SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
           ++    +L    + +T +  +  F G+  GDS ++ +   S   + ++  K    D   +
Sbjct: 375 TENGGDILPFRAACLTKVSKNSMFYGTIGGDSQVIGW---SRQHVQTARKKARLLD---E 428

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +           D   D + GE       ++    +      F V DSL+++ P+ D +Y
Sbjct: 429 SLDYDLDEDELDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMAY 488

Query: 521 G----------------LRINAD-ASATGISKQSNYELV------------ELPGCKGIW 551
           G                +R   D   A G  +  +  L+            E P  +G W
Sbjct: 489 GKTAFFPGSEEAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFEFPEARGFW 548

Query: 552 TV-----YHKSSRGHNADSSRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTE 601
           T        KS +G     + +A  +D     +Y  ++I++ +      ET+D+      
Sbjct: 549 TTRVQKTIAKSLQGEKG--ANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGA 605

Query: 602 SVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
             +           G TI AG +    R+IQV     R  DG   ++Q L     + E+G
Sbjct: 606 GFEALSGTEFDPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETG 663

Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSI 680
           +      V+S SI DPY+LL   D SI
Sbjct: 664 A---EPRVISASIVDPYLLLLREDSSI 687



 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 35/138 (25%), Positives = 66/138 (47%), Gaps = 30/138 (21%)

Query: 57  NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
           NL+V+  ++++I+ V+         +  +K S  +GET  R + D  G+ +A        
Sbjct: 28  NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSSSKAGETYDRRINDDDGLESAFLGGDGML 87

Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
                     L LV  Y +HG +  LA +      +SR   +++++    A++S+L++D 
Sbjct: 88  MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144

Query: 150 SIHGLRITSMHCFESPEW 167
             HG+   S+H +E  EW
Sbjct: 145 EKHGVEDVSIHFYEKEEW 162


>gi|328848896|gb|EGF98089.1| hypothetical protein MELLADRAFT_96156 [Melampsora larici-populina
            98AG31]
          Length = 1427

 Score =  147 bits (371), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 112/384 (29%), Positives = 173/384 (45%), Gaps = 45/384 (11%)

Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFS 1133
            G W T      Q +E   ++++V+L + + +      +  GT   + ED+AARG V +F 
Sbjct: 1051 GSWDTIDGHEFQQNEWVTSMKLVSLDSKSKRSGRRDFIGAGTTCNRAEDLAARGGVYVFE 1110

Query: 1134 TGRNADNPQNL-----VTEVYSKELKGAISALASLQGHLLIASG-------PKIILHKWT 1181
                  +P++      +   Y +  K  ++A+  L G+ +   G       P+    K++
Sbjct: 1111 VIEIVPDPKHPERNRGLRLRYHETTKACVTAVDGLNGYFIHTMGQKVDPGYPRSPTRKYS 1170

Query: 1182 GT---------------------ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
                                    L  + F D  P Y  +L ++KNFI+LGD  K I  +
Sbjct: 1171 DILADQIIAFYSKLYAKCFEQDERLLAVGFLDIRP-YTTTLKVLKNFIVLGDAVKGITLV 1229

Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
            +++E+  +L  L   F  L C   +FL+  + LS+V SD    I+IF Y P   ES  G 
Sbjct: 1230 AFQEEPYKLIELGHTFVDLRCSTIDFLVLENKLSIVTSDLGGTIRIFEYNPTNIESQGGL 1289

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
            KLL R EF     +   L      +S +   +            LF  LDGSI  + P+ 
Sbjct: 1290 KLLCRTEFGTAGEMGSSLGFGKRLSSKEEAKS---------IGTLFAGLDGSISSLVPVK 1340

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            E  F+RLQ +Q +L+  + H AGLNPR FR    N    R     I+D E++  +  L L
Sbjct: 1341 EAVFKRLQIVQTRLIRHLDHFAGLNPRGFRTVR-NDLVSRAMNRGIIDGEIIERFGALKL 1399

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDL 1424
            +EQ  I    G+ R+ IL NLN+L
Sbjct: 1400 DEQDSIGKLAGSDRNTILINLNNL 1423



 Score =  104 bits (260), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 193/933 (20%), Positives = 350/933 (37%), Gaps = 171/933 (18%)

Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
           V  ++LHG V  L  ++     +    D ++++F+DAKI++LE+      L   S+H FE
Sbjct: 73  VLEHQLHGIVTGLQPITTIDT-HVDGLDRLLVSFKDAKITLLEWSHQQSDLVPISLHTFE 131

Query: 164 SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
               +        F +   ++ DPQ RC  + +    + +L   Q  +    D +T  S 
Sbjct: 132 KLPQITQGDFPTIFDQ---LETDPQSRCAILKLPQSTIAVLPFFQENN---LDLETLFSN 185

Query: 224 GGFSA---RIES-----SHVINLRD------------------LDMKHVKDFIFVHGYIE 257
              SA   RI+S     S +I+L                      +K +  F F+ G+ +
Sbjct: 186 SNPSANNQRIQSFPYAPSFIIDLNQSQSFKSQTQTHSQTQTQQKSIKSIISFKFLPGFSQ 245

Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
           P + IL+  + TWAGR+     +C +  +++  +     +I+   NLP+ A+ ++A P  
Sbjct: 246 PTLAILYTYQHTWAGRLENTTDSCSLIFITLDLSSNHFTIIFQIDNLPYHAHSIMACPKE 305

Query: 318 IGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP---------------RSSFSV 362
           +GGVLV+ A+ I +  QS+       N    L +  ++P                    V
Sbjct: 306 VGGVLVICADMILHIDQSSKLIGIATNGWSKLSTHLDVPTQQMVKIVTEDGQDQEERLKV 365

Query: 363 ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN-PSVLTSDITTIGNS 421
            L+ +   ++  D AL+    G +  L +  DGR + +L L K    SV+ S    I + 
Sbjct: 366 RLENSKLVFVTIDRALMFLTDGQIFRLCLYQDGRTLIKLCLEKFPVVSVIPSVAVKISDH 425

Query: 422 LFFLGSRLGDSLLVQFTC------------------------GSGTSMLSSGLKEEFGDI 457
             F+GS LGDS+++                            G+   +  +   E +G  
Sbjct: 426 SVFVGSMLGDSIVMGIEFEGEKEVEVVEEVEVEVEAEVVHQNGNEMEIDQAEEDEIYGKE 485

Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
           E D   TK       D +  ++           + N +  ++  S  + DS+   GP++D
Sbjct: 486 EPDDKKTK-----DQDGIDSIIK----------ATNKKIHREIRSLRLHDSISGHGPIRD 530

Query: 518 FSYGLRINADASATGISKQSNYE-LVELPGCKGI-----WTVYHKS---SRGHNADSSRM 568
           F+             +SK   +E  +E+ GC G       T+++K     +    DS+  
Sbjct: 531 FT-------------MSKIGGFEDSLEMVGCTGSGETGGLTIFYKEMPLMKRKKLDSTNE 577

Query: 569 A---------AYDDEYHA----YLIISLEARTMVLETADLLTEVTESVDY----FVQGRT 611
           +         A++D   +       IS+  RT +        E   + D      +   T
Sbjct: 578 SMKITNLNSIAFNDPTGSPGCELAWISIHDRTKIFSMIKNPEEGNRTSDLKFMKTLNAST 637

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           I     F +   +Q+     ++L      +     P  +E    ++ + ++   +   Y+
Sbjct: 638 IYVAMFFDQTCFLQITSYEIKLLKVVGFGEVQVIRPIETE----NKKNKIIRAKVVQDYI 693

Query: 672 LLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
           LL  SD  + L  G   + T+  +Q P       KPV+  +L+    P  +     T+  
Sbjct: 694 LLETSDHRVMLYKGQVDSLTIDRIQLPQL----SKPVTYASLFSAHLP-LYDHDDQTN-- 746

Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF-----VSGRT 785
              G+G   D     P      +  V    G L I  +P    VFTV        +    
Sbjct: 747 ---GIGLDNDEDAEKP------WLFVTDLGGVLHILSLPELEIVFTVKGIENLPDLLDED 797

Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
              +   + A++    + +   EE      KEN     +         A  +RP L+  L
Sbjct: 798 EDEEQQQQPAIEYEHEDGDVKMEEDEKVEPKENSSIQMIYGFVT---GAKVARPHLYVEL 854

Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF-SRTPLDAYTREE 904
            +G +  YQ  +        K  DP ++       ++  +++   +F S  P+    R  
Sbjct: 855 NNGALAVYQISI----AYDRKPGDPSTSKPRRQALSIRLNKVLGYQFESSEPISNLDR-- 908

Query: 905 TPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM 937
                   ++ + K  +   G  LSG  P W +
Sbjct: 909 --------KVKVVKKNATFSGIHLSGLEPIWIV 933


>gi|448105510|ref|XP_004200513.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|448108635|ref|XP_004201144.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|359381935|emb|CCE80772.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|359382700|emb|CCE80007.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
          Length = 1344

 Score =  145 bits (365), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 133/607 (21%), Positives = 264/607 (43%), Gaps = 69/607 (11%)

Query: 824  VVELAMQRWSAHHSRPFLFAILT-DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
            +  +        HS+     ILT  G ++ Y+ + F+G                    N 
Sbjct: 774  IKNIVFNELGDEHSKDEYLTILTIGGEVIIYKLF-FDG-------------------DNF 813

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMVFRE 941
               + ++L+ +  P +AY     P G   +R  ++  N++G+   F++G  P +      
Sbjct: 814  KFIKEKDLKITGAPDNAY-----PLGTTLERRLVYVPNVNGYSSIFVTGIIPYFITKTVH 868

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
             +    +      V+F+   + N  +GFIY+ +    ++C++P    Y+N WP++KI + 
Sbjct: 869  SVPRIFRFTKLPAVSFSSYSDSNIKNGFIYLDNSKNARMCEIPLDFNYENNWPIKKIQMP 928

Query: 1002 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE 1061
             T   I Y    N +       V+    ++    +D+E G  I       +D  +    E
Sbjct: 929  ETVKAIAYHELSNTF-------VVSSYEEIPYDCLDEE-GKPI-----VGIDKSKP-PAE 974

Query: 1062 EYE--VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL----FNTTTKENETLLAIGT 1115
             Y+  +R++ P      W    TI +  +E  + V  + L         K  + L+ +G+
Sbjct: 975  SYKGYLRLISPYN----WSVIDTIVLADNEIGMNVLSMVLDVGSSTKKFKSKKELIVLGS 1030

Query: 1116 AYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
               + ED+++ G   +F         G+   N +    EV+ ++ +GA++++  + G LL
Sbjct: 1031 GKYRIEDLSSNGSFKIFEIIDIIPEPGKPETNHK--FKEVHIEDTRGAVTSICEVSGRLL 1088

Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
            +  G KII+       +  +AF D   +YV       N ILLGD  KS++   +  +  +
Sbjct: 1089 VTQGQKIIIRDLQDDGVVPVAFLDTA-VYVSEAKSFGNLILLGDSLKSVWLAGFDAEPFR 1147

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            + LL+KD  +LD    +F++    + ++ +D    + +  + P+   S  GQ+L+ +  F
Sbjct: 1148 MILLSKDIQTLDVSCADFIVKDEEIFILFADNNNVLHVVKFDPEDPLSSNGQRLVHKTSF 1207

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1348
            ++ +  T F   + +  + +   +      T  F  +  T+DGS   + P++E T+RR+ 
Sbjct: 1208 NINSAATCF---RTIPKNEENYPSL-----TTSFQSIGSTIDGSFFTVFPINESTYRRMY 1259

Query: 1349 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1408
             LQ++L D   H+ GLNPR  R    N          +++ +++  +  L  + +   A 
Sbjct: 1260 ILQQQLTDKEFHICGLNPRLNRFGGLNETNSDANSKPMLEYDVIKKFVNLNSDRKKNFAS 1319

Query: 1409 QTGTTRS 1415
            + G+  S
Sbjct: 1320 KIGSKNS 1326



 Score = 93.2 bits (230), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 105/502 (20%), Positives = 196/502 (39%), Gaps = 83/502 (16%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL- 116
           LVV  + +++++ +    + SKE K                  L+LV  ++LHG +  L 
Sbjct: 29  LVVGKSTLLQVFDIVQSNKKSKEYK------------------LKLVEQFKLHGLITDLK 70

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A+ +    D     D ++++ + AK+S++++D   + +   S+H +E+          E 
Sbjct: 71  AVRTVENPD----LDYLLVSTKSAKMSLVKWDHHENSISTVSLHYYENSIQ---SSTYEK 123

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMII-------------LKASQGGSGLVGDEDTFGSG 223
                L+ ++P   C  +    L   +              +   G SG  G  D   + 
Sbjct: 124 LTTTELI-MEPNNTCACLRFKNLLTFLPFEMPDEDDEEDGYENVDGASGSRGKHDNKATQ 182

Query: 224 GGFS-ARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
              + A   SS VI+ ++LD +  +V D  F++ Y EP + I+  +  TW G +      
Sbjct: 183 QDENQALFYSSFVIDAQNLDSRIGNVIDMKFLYNYKEPTLAIISSKNHTWTGLLPLTKDN 242

Query: 281 CMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSASCA 339
                LS+    K    +    NLP D   ++ +P P+ G L++G N  IH      +  
Sbjct: 243 ISFIVLSLDLVTKTSTTVLKIDNLPFDIDTIVPLPKPLNGTLLIGCNEIIHVDHGGITRR 302

Query: 340 LALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGR 396
           LA+N +  S+ SS +  R  S  +++L+      + ND    L  K GD   +    DG+
Sbjct: 303 LAVNQFTSSITSSIKNYRDQSELNLKLENCCVKPIPNDHRVFLILKNGDFYYINFAIDGK 362

Query: 397 VVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
            ++   L K N             D+  + N+L F+ ++ G+S L++             
Sbjct: 363 TIKNFYLEKVNSINQNEIGISYPEDVVHLDNNLMFICNKNGNSPLIELKF---------- 412

Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES----------AQK 499
                    +++   +   +     +QD  NG          ++                
Sbjct: 413 ---------SESKDNQNAEQQKDTEMQDTENGTTDKNDNDDDDDIYEDDEDNEKVLIKNS 463

Query: 500 TFSFAVRDSLVNIGPLKDFSYG 521
              F   D L+N GP+  F++G
Sbjct: 464 VIEFTKHDELINNGPVSSFTFG 485


>gi|121925707|sp|Q0UUE2.1|CFT1_PHANO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
          Length = 1375

 Score =  145 bits (365), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 133/521 (25%), Positives = 226/521 (43%), Gaps = 44/521 (8%)

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
            P  +S  L   N+   +L      R   D    E          +    NI+G+      
Sbjct: 836  PSRSSSDLWTHNLRWVKLSQQHVPRYMEDGAQEEAADEPGFESTLLALDNINGYSTVIQR 895

Query: 930  GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            G  P + +           L    + + T  H  +C  GF Y+ S   L+I QLP  + Y
Sbjct: 896  GRSPAFILKESSSAPRVIGLSGNPVKSLTRFHTSSCQRGFAYLDSTDTLRISQLPPSTHY 955

Query: 990  DNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVLSLLIDQEVGHQID 1045
             +  W  +++P+ A  H + Y     LY +    P    L P +     L  +E   +  
Sbjct: 956  GHLGWAARRMPMDAEVHALAYHP-SGLYVIGTGQPEEYTLDPNDTFHYELPKEETSFKPK 1014

Query: 1046 -NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTT 1103
              H +  V   +T+TV   +  +L+P                  E  L ++ + L  + T
Sbjct: 1015 VEHGIIKVMDEKTWTV--IDTHVLDP-----------------QEVILCIKTLNLEVSET 1055

Query: 1104 TKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAIS 1158
            T + + ++A+GTA V GED+A +G + +F        P +  T      +   E+KG +S
Sbjct: 1056 THQRKDVIAVGTAIVLGEDLATKGNIRIFEVITVVPEPDHPETNKRLKLIVKDEVKGTVS 1115

Query: 1159 ALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGD 1212
            A++ L  QG L++A G K ++   K  GT L  +AF D    YV +L  + N   +L+GD
Sbjct: 1116 AISDLGTQGFLIMAQGQKSMVRGLKEDGTLLP-VAFMDMQ-CYVTTLKTLPNTGMLLMGD 1173

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
             +K  +F  + E+  ++ L  +    L+C   +FL     L ++V+D   N+Q+  + P 
Sbjct: 1174 AYKGAWFTGYTEEPYKMMLFGRSKHHLECITADFLPFEEQLHIIVADADMNLQVLQFDPD 1233

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGAAPGSDKTNRFALLFGTL 1329
              +S  G +LL ++ FH G   +    LQ    + T+S+ T +   S   ++  +L  + 
Sbjct: 1234 HPKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSSLPLHQ--ILCTSQ 1291

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
             G++  I PL E ++RRL  L   L   +    GLN ++FR
Sbjct: 1292 SGTLALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFR 1332



 Score =  132 bits (331), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 167/727 (22%), Positives = 295/727 (40%), Gaps = 142/727 (19%)

Query: 57  NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NL+V   ++++++     V  V   G  E+ N+      E     L    + A L LV  
Sbjct: 28  NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + L G V SLA +     +   R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88  FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145

Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
                 W    +   +F     +  DP  RC  +      + IL                
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
           +A+Q G      E   G+ G  + +    SS V+ L +LD  + H     F+H Y EP  
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            ++   + T A  ++ +      +  ++    K    + S   LP+D  +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316

Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
            L+VG+N  IH      +  +A+N  A +  S     ++  ++ L+      L  D    
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI---TTIGNSLFFLGSRLG 430
           ++    G + +LT   DGR V  + +    +    ++L +     T +G    F+GS  G
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPVPADNGGNILKTRASCSTNLGRGRLFIGSEDG 436

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
           +S+L+ +T                        ++ +LRR  S+  Q   + E++S     
Sbjct: 437 ESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEEE 472

Query: 491 S---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD----------- 517
                     N+T +  K             T++F V D L +I P++D           
Sbjct: 473 EVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTES 532

Query: 518 FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------GH 561
            + G + ++    A G     N EL        ELP   G+W V+ K           G 
Sbjct: 533 LTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFGQ 592

Query: 562 NADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAAG 615
           +A+++  A+ D +Y  YL++S         T+V E        TE  D+   +G T++ G
Sbjct: 593 DAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSVG 650

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
            L    +V+QV     R  D     + +   P   E      N  +++ S ADPY+L+  
Sbjct: 651 VLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQR 706

Query: 676 SDGSIRL 682
            D S+++
Sbjct: 707 EDSSVKI 713


>gi|169603229|ref|XP_001795036.1| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
 gi|160706354|gb|EAT88382.2| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
          Length = 1338

 Score =  144 bits (362), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 125/474 (26%), Positives = 214/474 (45%), Gaps = 44/474 (9%)

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
              NI+G+      G  P + +           L    + + T  H  +C  GF Y+ S  
Sbjct: 846  LDNINGYSTVIQRGRSPAFILKESSSAPRVIGLSGNPVKSLTRFHTSSCQRGFAYLDSTD 905

Query: 977  ILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVL 1032
             L+I QLP  + Y +  W  +++P+ A  H + Y     LY +    P    L P +   
Sbjct: 906  TLRISQLPPSTHYGHLGWAARRMPMDAEVHALAYHP-SGLYVIGTGQPEEYTLDPNDTFH 964

Query: 1033 SLLIDQEVGHQID-NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENA 1091
              L  +E   +    H +  V   +T+TV   +  +L+P                  E  
Sbjct: 965  YELPKEETSFKPKVEHGIIKVMDEKTWTV--IDTHVLDP-----------------QEVI 1005

Query: 1092 LTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1147
            L ++ + L  + TT + + ++A+GTA V GED+A +G + +F        P +  T    
Sbjct: 1006 LCIKTLNLEVSETTHQRKDVIAVGTAIVLGEDLATKGNIRIFEVITVVPEPDHPETNKRL 1065

Query: 1148 --VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSL 1201
              +   E+KG +SA++ L  QG L++A G K ++   K  GT L  +AF D    YV +L
Sbjct: 1066 KLIVKDEVKGTVSAISDLGTQGFLIMAQGQKSMVRGLKEDGTLLP-VAFMDMQ-CYVTTL 1123

Query: 1202 NIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
              + N   +L+GD +K  +F  + E+  ++ L  +    L+C   +FL     L ++V+D
Sbjct: 1124 KTLPNTGMLLMGDAYKGAWFTGYTEEPYKMMLFGRSKHHLECITADFLPFEEQLHIIVAD 1183

Query: 1260 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGAAPGS 1316
               N+Q+  + P   +S  G +LL ++ FH G   +    LQ    + T+S+ T +   S
Sbjct: 1184 ADMNLQVLQFDPDHPKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSS 1243

Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
               ++  +L  +  G++  I PL E ++RRL  L   L   +    GLN ++FR
Sbjct: 1244 LPLHQ--ILCTSQSGTLALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFR 1295



 Score =  132 bits (331), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 167/727 (22%), Positives = 295/727 (40%), Gaps = 142/727 (19%)

Query: 57  NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NL+V   ++++++     V  V   G  E+ N+      E     L    + A L LV  
Sbjct: 28  NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + L G V SLA +     +   R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88  FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145

Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
                 W    +   +F     +  DP  RC  +      + IL                
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
           +A+Q G      E   G+ G  + +    SS V+ L +LD  + H     F+H Y EP  
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            ++   + T A  ++ +      +  ++    K    + S   LP+D  +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316

Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
            L+VG+N  IH      +  +A+N  A +  S     ++  ++ L+      L  D    
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI---TTIGNSLFFLGSRLG 430
           ++    G + +LT   DGR V  + +    +    ++L +     T +G    F+GS  G
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPVPADNGGNILKTRASCSTNLGRGRLFIGSEDG 436

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
           +S+L+ +T                        ++ +LRR  S+  Q   + E++S     
Sbjct: 437 ESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEEE 472

Query: 491 S---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD----------- 517
                     N+T +  K             T++F V D L +I P++D           
Sbjct: 473 EVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTES 532

Query: 518 FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------GH 561
            + G + ++    A G     N EL        ELP   G+W V+ K           G 
Sbjct: 533 LTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFGQ 592

Query: 562 NADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAAG 615
           +A+++  A+ D +Y  YL++S         T+V E        TE  D+   +G T++ G
Sbjct: 593 DAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSVG 650

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
            L    +V+QV     R  D     + +   P   E      N  +++ S ADPY+L+  
Sbjct: 651 VLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQR 706

Query: 676 SDGSIRL 682
            D S+++
Sbjct: 707 EDSSVKI 713


>gi|402219312|gb|EJT99386.1| hypothetical protein DACRYDRAFT_17537 [Dacryopinax sp. DJM-731 SS1]
          Length = 1620

 Score =  142 bits (359), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 101/343 (29%), Positives = 171/343 (49%), Gaps = 29/343 (8%)

Query: 1094 VRVVTLFNTTTKENET----LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVY 1149
            + VV   N  T  +E+     +A+GT+  +GED+A RG   +F        P + +    
Sbjct: 1291 INVVKSVNLETLSSESGFKDYIAVGTSTFRGEDLAVRGATYIFEVIEVVSYPDDPLPPYR 1350

Query: 1150 SK-----ELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNI 1203
             K     E K  ++A+  L G+L+ + G K+ +  +   E L G+AF DA  + V SL  
Sbjct: 1351 LKLLCRDEAKAPVNAICGLNGYLVSSQGFKVFVRAFEQDERLVGVAFMDAG-VCVTSLTR 1409

Query: 1204 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE--FLIDGSTLSLVVSDEQ 1261
            +KN +L+GD  +S+ F++++E   +L            + T+  FL D    S++ +D++
Sbjct: 1410 LKNLLLIGDAKRSVSFVAFQEDPFKLR---------PTYVTDAAFLFDEGDFSILAADDE 1460

Query: 1262 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1321
              +++F + P ++ +  G  L+   EF+  +  T  L +      + R    P   +   
Sbjct: 1461 GTLRLFEFDPNLTGATHGNPLICETEFNGQSEHTHILAI------AGRGREDPEEMQIPE 1514

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
              L+FGT+DG++G I+P+ +  F+RLQ L  +L+ SV H AGLNPR+FR    N    RP
Sbjct: 1515 AQLIFGTIDGTLGTISPVPDECFKRLQLLSGQLMRSVQHFAGLNPRAFRTVR-NDLLSRP 1573

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                ++D +LL  +  L +  Q  I  Q GT    IL ++  L
Sbjct: 1574 LNKGMLDYDLLHAFRELDIRRQATITKQIGTDTITILRDIRSL 1616



 Score =  122 bits (306), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 162/667 (24%), Positives = 267/667 (40%), Gaps = 112/667 (16%)

Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
           L LV  +R+HG V  L  +     G D     D ++++F+DAKI++LE+ D+I+ L   S
Sbjct: 136 LHLVREHRMHGFVTGLEKVRTLASGEDGM---DRLLVSFKDAKIALLEWSDAIYDLSTVS 192

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD-- 216
           +H +E    +      E     PL++ DP+ RC  +L+    + IL   Q     + D  
Sbjct: 193 LHTYERSSQVSTSEASE---HRPLLRADPESRCAALLLPKDALAILPFVQRTGLDLADPA 249

Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
            D       ++     S+V  L D D  ++HV DF F+  +  P + IL++    W GR+
Sbjct: 250 RDKEREHQPYTP----SYVFPLSDADDTLRHVLDFCFLPSFHTPTLAILYQPAQNWTGRL 305

Query: 275 SWKHHTCMISALSISTTLK----------QHPLIWSAMNLPHDAYKLLAVP--SPIGGVL 322
           S       ++ +++    K             +I     LP+DA+ LL     S  GGV+
Sbjct: 306 SQTKDNTSLAIVTLDLVGKGAAAGGGAGGGGAVISRTHGLPYDAFSLLPAREGSTFGGVV 365

Query: 323 VVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--------LDAAHATWLQ 373
           V+  N++ H         LA + +     S+   P  +F+ E        L+ +   W  
Sbjct: 366 VLAGNSVLHVDPAGRIVGLAASGWHAQ-SSALRFPLWAFTAEEGETEERKLEGSRLCWAG 424

Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLD----LSKTN-PSVLT-------SDITTIGNS 421
               +L    G    L V  +GR V  L     L +T+ P+VL          +   G  
Sbjct: 425 EQQLILVGAQGWARELKVGVEGRNVSSLSAGRRLGRTSAPAVLCPVGEQSGRALKPTGRD 484

Query: 422 LFFLGSRLGDSLLVQFTCGSG--TSMLSSGLKEEF--GDIEADAPSTKRLRRSSSDALQD 477
           L +L S  G S+L+Q   G      +  +G ++E    D+E DA S K      +D L D
Sbjct: 485 LVWLASEAGQSVLLQVHKGEPRVEEVKPNGEEKEIEGEDMEIDADSDK------NDDLAD 538

Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
           +     L    ++      A    +  V D+L   G + D S+ L        +G  + +
Sbjct: 539 IYGDSGLPAAAASGVTAGPALPWLTLEVLDALQGHGQIADMSFALSFR-----SGPDRPT 593

Query: 538 NYELVELP-GCKGIWTVYHK--------------SSRG---------------HNADSSR 567
              +   P G +G WTVY                 +RG                  +   
Sbjct: 594 PKLVCSTPEGERGAWTVYENGLPIRVKRRVPAVAGTRGIWSLRVRRGDRARRGGRRERGE 653

Query: 568 MAAYDDEYHAYLIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
               D E    LI+S +A       RT+ +++   L  ++      +   T+AAG  F  
Sbjct: 654 REWADGEERDNLIVSTDATPSPGISRTITVDSRGELQIISR-----LPALTLAAGVFFSH 708

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
             V+QV      +LDG    ++L     N       E S ++   + DP+V++   +GS+
Sbjct: 709 TCVMQVTPDSLHLLDGD--GKELQVLKDNE---GNKEASPIIKACVEDPWVVVTRENGSV 763

Query: 681 RLLVGDP 687
            L +GDP
Sbjct: 764 ALYLGDP 770


>gi|260941626|ref|XP_002614979.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
 gi|238851402|gb|EEQ40866.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
          Length = 1363

 Score =  139 bits (349), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 135/555 (24%), Positives = 248/555 (44%), Gaps = 70/555 (12%)

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
            +L  +  P +AY+     HG   +R  I F ++SG     ++G  P   M+ R R   H 
Sbjct: 837  DLPITGAPFNAYS-----HGTSIERRMIYFPDVSGTTCIMVTGVIPY--MITRSR---HS 886

Query: 948  QL-----CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKA 1002
            Q+         IV+F         +G IY+ ++   +I +LPS  +YD  WP++K+ +  
Sbjct: 887  QVKVFKFSKIPIVSFVPFSTDKIKNGLIYLDTKKNARIVELPSEFSYDYNWPIRKVSIGE 946

Query: 1003 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLHRTY 1058
            T   + +    N   L+VS     P N      ID+E    VG  I  +  S++    + 
Sbjct: 947  TVKSVAFHEGSN--TLVVSTLKEIPYN-----CIDEEGNPIVG--IKPNKPSAISYKGS- 996

Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAIG 1114
                  ++++ P      W     I +  +E  L V+ + L   + TK     +  + +G
Sbjct: 997  ------IKLISPVN----WSVIDNIELADNEVGLHVKSMPLDVGSETKRFKSKKEFVLVG 1046

Query: 1115 TAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            T   + ED+A  G   L          G+   N +    E   ++ +GA++++  + G  
Sbjct: 1047 TGKYRLEDLACNGSYKLLEIIDIIPEPGKPETNHK--FKEFTQEDTRGAVTSICEVSGRF 1104

Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
            L+A G KII+          +AF D   ++V       N ++LGD  KS++   +  +  
Sbjct: 1105 LVAQGQKIIVRDIKDNSAVSVAFLDTS-VFVSESKSFGNLVVLGDTLKSVWLAGFDAEPF 1163

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
            ++ +L KD   LD  + +FL+    + ++V+D  +++ +  Y P+   S  GQ+LL ++ 
Sbjct: 1164 RMIMLGKDLQGLDVSSADFLVKDEEIYILVADNNRSLHVLQYNPEDPASSNGQRLLHKSS 1223

Query: 1288 FHVGAHVT---KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
            F      T      + + L+T  D   A P       F  +  T++G++  + P+ E T+
Sbjct: 1224 FTTNYLTTCTKSVPKHEQLSTWFD-PQAIP-------FQTVGSTVEGAMYVVFPISEPTY 1275

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFR--QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
            RR+  +Q++L+D   H  GLNPR  R  +  S   A+     +++DCEL+  +  L  + 
Sbjct: 1276 RRMYIMQQQLIDKEYHHCGLNPRLNRIGRIESVNYANL---RAMLDCELIRRFSKLNEDR 1332

Query: 1403 QLEIAHQTGTTRSQI 1417
            +  ++ +  T   Q+
Sbjct: 1333 KRTLSSKISTKNVQV 1347



 Score = 86.7 bits (213), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 60/220 (27%), Positives = 110/220 (50%), Gaps = 15/220 (6%)

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
           SS ++    LD K   + D  F+H Y +P + +L +++ TWAG +       + S LS+ 
Sbjct: 224 SSFILEASALDNKIGDIIDLQFLHHYRQPTIAVLSQQKSTWAGLLPQTKDNVIFSVLSLD 283

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVS 348
              +    +    NLP+D  K++A+PSP+ G L++G N  IH  +   +  +A+N Y   
Sbjct: 284 MQTRLTTTVLQIENLPYDLEKIIALPSPLNGSLLIGCNELIHVDTGGITRRIAVNQYTED 343

Query: 349 LDSSQE--LPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVYDGRVVQRLDLSK 405
           + +S +    ++S  ++L+      + ND  LL   +TG++  +    DG+ ++R+ + +
Sbjct: 344 ITASLKNYADQTSLDLKLEDCSILPIPNDNKLLMVLRTGEMYFIVFEVDGKTIKRMSVEE 403

Query: 406 TNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLVQF 437
             PS   S I         ++ N+L FL  R  +S LV+ 
Sbjct: 404 I-PSETYSQIKLMDPSSFASLDNNLLFLTGRSSNSHLVEL 442


>gi|33411764|emb|CAD58787.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
          Length = 180

 Score =  137 bits (346), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 72/184 (39%), Positives = 105/184 (57%), Gaps = 12/184 (6%)

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
            D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL RA+FHVGAHV
Sbjct: 1    DAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHV 60

Query: 1295 TKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
              F R       +   GAA G  K      N+    F TLDG IG + P+ E T+RRL  
Sbjct: 61   NTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLM 113

Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
            LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L   E+ E+A +
Sbjct: 114  LQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKK 173

Query: 1410 TGTT 1413
             GTT
Sbjct: 174  IGTT 177


>gi|426235955|ref|XP_004011942.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Ovis aries]
          Length = 819

 Score =  137 bits (345), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 117/398 (29%), Positives = 179/398 (44%), Gaps = 82/398 (20%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T  +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAK--------------ISVLEFDDSIHGLRITSMHCF 162
           A +   GA    +RD+++L+F+DAK              +    FD +   +  TSM   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTM 140

Query: 163 ESPEWLHL-----------------------------------KRGRESFARGPLVKVD- 186
           E P +L L                                   K+ R     G   +V  
Sbjct: 141 E-PGYLFLGSRLGNSLLLKYTEKLQEPPASTTREAADKEEPPSKKKRVDATTGWAGRVRE 199

Query: 187 ---PQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSH---VINLRD 240
              PQ     + VYG +      +Q G+ L     T+     F  R  S     +I++R 
Sbjct: 200 GELPQDEVDEIEVYGSE------AQSGTQLA----TYS----FEVRWGSEWLPGIIDVRA 245

Query: 241 LDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
           LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K HP+I
Sbjct: 246 LDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVI 305

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPR 357
           WS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +     +
Sbjct: 306 WSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQ 365

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
               + LD A A ++  D  ++S K G++ +LT++ DG
Sbjct: 366 EGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDG 403



 Score =  112 bits (280), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 94/339 (27%), Positives = 148/339 (43%), Gaps = 31/339 (9%)

Query: 907  HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
            +G    R+ + K    H   F+ G  P W +V  R  LR+HP   DG I +F   HN+NC
Sbjct: 486  YGGRHHRLALHKPPLHH--VFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINC 543

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
              GF+Y   QG L+I  LP+  +YD  WPV+KIPL+ T H + Y  E  +Y +  S    
Sbjct: 544  PRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTST- 602

Query: 1026 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1085
             P  +V  +      G + +   +   + +     E + ++++ P      W+      +
Sbjct: 603  -PCTRVPRM-----TGEEKEFETIERDERYVHPQQEAFCIQLISPVS----WEAIPNARI 652

Query: 1086 QSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1145
            +  E              ++ +   +  G+   +GE+V  RGR+L+         P   +
Sbjct: 653  ELEEXXXXX------XXGSRGHVYSVPAGSCLKEGEEVTCRGRILIMDVIEVVPEPGQPL 706

Query: 1146 TE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTG-TELNGIAFYDAPPLYVV 1199
            T+     +Y KE KG ++AL    GHL+ A G K           LN  AF        V
Sbjct: 707  TKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKXXXXXLPPHAGLNPRAFRMLHVDRRV 766

Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
              N V+N +L G++     +LS  E+G     LAK  G+
Sbjct: 767  LQNAVRN-VLDGELLNRYLYLSTMERGE----LAKKIGT 800



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/160 (33%), Positives = 76/160 (47%), Gaps = 22/160 (13%)

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDL-VLLTVVYDG-RVVQRLDLSKTNPSVLTSDI 415
           S  SV+L  A     + D  LLS K      +LT++ DG R V+     K   SVLT+ +
Sbjct: 83  SMASVQLAGA-----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSM 137

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL-------- 467
            T+     FLGSRLG+SLL+++T         +    E  D E      KR+        
Sbjct: 138 VTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTTREAADKEEPPSKKKRVDATTGWAG 195

Query: 468 RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVR 506
           R    +  QD V+  E+ +YGS A + T+ A  T+SF VR
Sbjct: 196 RVREGELPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVR 231



 Score = 48.9 bits (115), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 40/71 (56%)

Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
            QK     +P  AGLNPR+FR  H + +  +    +++D ELL+ Y  L   E+ E+A + 
Sbjct: 739  QKXXXXXLPPHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKI 798

Query: 1411 GTTRSQILSNL 1421
            GTT   IL +L
Sbjct: 799  GTTPDIILDDL 809


>gi|68471460|ref|XP_720278.1| likely Cleavage and Polyadenylation Specificity Factor subunit
            fragment [Candida albicans SC5314]
 gi|46442138|gb|EAL01430.1| likely Cleavage and Polyadenylation Specificity Factor subunit
            fragment [Candida albicans SC5314]
          Length = 758

 Score =  136 bits (343), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 125/562 (22%), Positives = 243/562 (43%), Gaps = 53/562 (9%)

Query: 881  NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
            N    + ++L  +  P +A+     P+G   +R +  F N++G    F++G  P   +  
Sbjct: 193  NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 247

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
               +    Q    + ++ +   +    +G I++ +Q   +IC+LP    Y+   P++ + 
Sbjct: 248  VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVD 307

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI--------DNHNLS- 1050
            +  +   I Y    +         VL    Q+    +D+E G  I        D   +S 
Sbjct: 308  IGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEE-GKPIAGIIKDIKDTPAMSF 359

Query: 1051 --SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1108
              S+ L   Y     E   LE +  G   ++        S + L     +L     K+  
Sbjct: 360  KGSIKLVSPYNWTVIETIELEDNEVGMTLKSMILDVGSESGSTLGSDPNSLIKKYNKKKR 419

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALA 1161
              + IG    + ED+AA G   ++         G+   N +    E++ +E +GAI+++ 
Sbjct: 420  EYIPIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKEETRGAITSIC 477

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
             L G  L++ G K+I+          +AF D P +YV       N ++LGD+ K  + + 
Sbjct: 478  ELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILGDLLKGCWLVG 536

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            +  +  ++ +L KD   +     +F+I+   + ++V+D    + +  Y P   +S  G K
Sbjct: 537  FDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTK 596

Query: 1282 LLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTNRFALLFGTLDG 1331
            LL++A F + + ++    L ++    S +T A          P +  +N F ++  T DG
Sbjct: 597  LLTKASFELNSTISCLRSLPLIDIEESVQTDALTNIAVPPPLPPNTTSNYFQVIGSTQDG 656

Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHRPGPDSIV 1387
            S   + P++E  +RR+  LQ++L+D   H  GLNPR  R    +  +N    +P    I+
Sbjct: 657  SFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKP----IL 712

Query: 1388 DCELLSHYEMLPLEEQLEIAHQ 1409
            D +L+  +  L  + +  +A++
Sbjct: 713  DYDLIRSFTKLSDDRKRNLANK 734


>gi|453082807|gb|EMF10854.1| CPSF_A-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 1349

 Score =  135 bits (341), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 122/486 (25%), Positives = 226/486 (46%), Gaps = 46/486 (9%)

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLI 1019
            +H   C HGF+ + +   ++  QLP  + Y   W +Q++ +      I Y AE+ +Y   
Sbjct: 875  VHRPGCEHGFLTINADEEVQENQLPEKTWYGTGWSIQQVDIGEDVRHIAYHAEREVY--- 931

Query: 1020 VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT 1079
                V+     +      ++  H   +      D+     V +Y + ++    +    Q 
Sbjct: 932  ----VVATCRDIDFYFAGEDGRHPEQD------DIELRPQVPQYTIHLV----SAKSHQR 977

Query: 1080 RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS---TG 1135
              ++ +   E    ++V++L  +  T E + L+ + TA  +GED+ A+G V+L+      
Sbjct: 978  LQSVELGYLETVTALKVMSLEVSENTHEQKDLVVVSTAAQRGEDMPAKGAVILYDIIDVV 1037

Query: 1136 RNADNPQN--LVTEVYSKELKGAISALAS-LQGHLL-IASGPKIILH--KWTGTELNGIA 1189
             + D P++   + ++  ++ +GAI+++A  L G  L  A G K+++   K  GT L  +A
Sbjct: 1038 PDPDVPESGFQLHQLAREQARGAITSIAGPLPGGFLGTAQGLKLMVRGLKEDGTCLP-VA 1096

Query: 1190 FYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS-LDCFATEF 1246
            F DA   Y  +L ++  +   L GD  K ++F  + E+  +L ++ K   S ++    EF
Sbjct: 1097 FLDAQS-YTHTLKVLPGRGMWLAGDAWKGLWFGGFTEEPYKLTVMGKSPKSKMEVMTAEF 1155

Query: 1247 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLA 1304
            L     L +++ D   ++ +  Y P+  +S  G +LL R+ FH+G  VT  L +   +  
Sbjct: 1156 LPFDGALYILIMDADNDLHVLQYDPENPKSVGGMRLLHRSTFHIGHLVTNMLLVPSSLKP 1215

Query: 1305 TSSDRTGAAPGSDKTNRFA---------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1355
              S     A G++  N  A         +L  +  GS+G I PLDE  +RRL +LQ  L 
Sbjct: 1216 FESQDRDMANGTNGNNEEATRAPPSLHHILATSRSGSVGLITPLDEAAYRRLSALQTHLT 1275

Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
              + H AGLNPR++R   +       G   +VD  L++    L   ++ ++  + G+   
Sbjct: 1276 AILEHAAGLNPRAYRAVEAESFG---GARGVVDGSLVNRIGELGAAKRADVLGRAGSDGW 1332

Query: 1416 QILSNL 1421
             + S+L
Sbjct: 1333 VVRSDL 1338



 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 144/680 (21%), Positives = 266/680 (39%), Gaps = 93/680 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   ++++++       G K + N G  ++ VL           V  Y L G V S+
Sbjct: 28  NLVVAKTSLLQVF-------GVKAAGNDGGNEKLVL-----------VGEYSLAGTVTSI 69

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +     D     ++++L+F+DAK+S++E+D   + +   S+H +E    +    G   
Sbjct: 70  ARVKT--LDTKSGGEAVLLSFKDAKLSLVEWDPENYRISTISLHFYEGDNVISAPFGPPL 127

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQ---------------GGSGLVGDEDT-- 219
                ++ VDP  RC  +     Q+ IL   Q                   L   + T  
Sbjct: 128 ADCDSILTVDPSSRCAALKFGARQLAILPFRQFGDELAGEEEEGEFDADHALATSKRTES 187

Query: 220 --FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
               +G       ++S  + L  LD  + H     F+H Y EP   IL          + 
Sbjct: 188 VPHANGDTEHTPYKASFTLALTALDPSVSHAVHLAFLHEYREPTFGILSATVEPSYSLLE 247

Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
            +      + L++    +    + S   LP   ++++ +P P+GG L++G N + +  QS
Sbjct: 248 ERKDILTYTVLTLDLEQRASTNLISVPKLPSTLWEVVPLPLPVGGALLLGTNELVHVDQS 307

Query: 336 ASC-ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVV 392
               A A+N +A          +S  +++L+      L +     L+ T  G L +L+  
Sbjct: 308 GKANATAVNEFAKLESDFGMADQSHLNLKLEDCRVEVLDSKTGELLIVTNDGSLAILSFQ 367

Query: 393 YDGRVVQRLDLSKTNPSVLTSDITT-------IGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
             GR +  L++ +      ++ I T       +  S  F+GS  G S L+ ++    TS 
Sbjct: 368 MHGRSISALNVKRATSENGSTTIHTAPSCMARLEGSKIFIGSEDGASSLLGWS--RPTSA 425

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
           L+   K     +       +       D        E      S +  T +AQ TFS  +
Sbjct: 426 LNR--KRSHAQMLDKEADDEDEEMEEDDDDLYDAAPEPKKRASSETAVTSTAQYTFS--I 481

Query: 506 RDSLVNIGPLKDFSYG--------LRINADASATGISKQS--NYELV-------ELPGCK 548
            D L++ GP+ +   G        L I A A     S+ +  + ++V       +L   +
Sbjct: 482 IDELLSTGPIHEVCLGRSGPWKDRLEIAAGAGRKQASRLTLMHRDIVPTVRRKCKLGAAR 541

Query: 549 GIWTVYHKSSRGHNADSSRMA-AYD-DEYHAYLIISL--EARTMVLETADLLTEVTESVD 604
             W +  K       +   +   +D D+   Y I S   +  +    +A       E++D
Sbjct: 542 ATWALRPKQRNAALPEYDNLLFVFDGDDTKVYDIPSQDEDGSSYTERSAPEFESAGETLD 601

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARI-LDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
                 T+A G +  + R  ++    A++ LD           P   E     E+ +++ 
Sbjct: 602 M----ATVADGTIVVQTRRTELRTYNAKLGLD--------QIIPMTDE--ETDEDLSIVH 647

Query: 664 VSIADPYVLLGMSDGSIRLL 683
           ++++DPYVL+   D S+++L
Sbjct: 648 IAVSDPYVLVIRGDNSVQVL 667


>gi|147772179|emb|CAN73417.1| hypothetical protein VITISV_017053 [Vitis vinifera]
          Length = 609

 Score =  134 bits (337), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 69/122 (56%), Positives = 80/122 (65%), Gaps = 26/122 (21%)

Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
           F V DSL+N+GPLK F+Y LRINAD  ATGI KQSN+EL                     
Sbjct: 430 FEVNDSLINVGPLKVFAYALRINADLKATGIVKQSNFELMCCSGHGKNGALCILQQSIRP 489

Query: 542 -----VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
                VEL GC+ IWTVYHK++RGHNADS++M   DDEY AYLIIS E+RTMVLET +LL
Sbjct: 490 EMITEVELSGCERIWTVYHKNTRGHNADSTKMVTKDDEYCAYLIISPESRTMVLETVELL 549

Query: 597 TE 598
            E
Sbjct: 550 GE 551


>gi|238881599|gb|EEQ45237.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1423

 Score =  133 bits (334), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 125/562 (22%), Positives = 242/562 (43%), Gaps = 53/562 (9%)

Query: 881  NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
            N    + ++L  +  P +A+     P+G   +R +  F N++G    F++G  P   +  
Sbjct: 858  NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 912

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
               +    Q    + ++ +   +    +G I++ +Q   +IC+LP    Y+   P++ + 
Sbjct: 913  VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVD 972

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI--------DNHNLS- 1050
            +  +   I Y    +         VL    Q+    +D+E G  I        D   +S 
Sbjct: 973  IGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEE-GKPIAGIIKDIKDTPAMSF 1024

Query: 1051 --SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1108
              S+ L   Y     E   LE +  G   ++        S + L     +L     K+  
Sbjct: 1025 KGSIKLVSPYNWTVIETIELEDNEVGMTLKSMILDVGSESGSTLGSDPNSLIKKYNKKKR 1084

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALA 1161
              + IG    + ED+AA G   ++         G+   N +    E++ +E +GAI+++ 
Sbjct: 1085 EYIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKEETRGAITSIC 1142

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
             L G  L++ G K+I+          +AF D P +YV       N ++LGD  K  + + 
Sbjct: 1143 ELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILGDPLKGCWLVG 1201

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            +  +  ++ +L KD   +     +F+I+   + ++V+D    + +  Y P   +S  G K
Sbjct: 1202 FDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTK 1261

Query: 1282 LLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTNRFALLFGTLDG 1331
            LL++A F + + ++    L ++    S +T A          P +  +N F ++  T DG
Sbjct: 1262 LLTKASFELNSTISCLRSLPLIDIEESVQTDAFTNIVVPPTLPPNTTSNYFQVIGSTQDG 1321

Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHRPGPDSIV 1387
            S   + P++E  +RR+  LQ++L+D   H  GLNPR  R    +  +N    +P    I+
Sbjct: 1322 SFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKP----IL 1377

Query: 1388 DCELLSHYEMLPLEEQLEIAHQ 1409
            D +L+  +  L  + +  +A++
Sbjct: 1378 DYDLIRRFTKLSDDRKRNLANK 1399



 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/238 (23%), Positives = 110/238 (46%), Gaps = 19/238 (7%)

Query: 216 DEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
           +ED  G+      R+  +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WA
Sbjct: 203 EEDKNGTTTNQEPRLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWA 262

Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
           G +           L++    K    ++   NLP++  +++ +PSP+ G L+VG N  IH
Sbjct: 263 GNLIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIH 322

Query: 331 YHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGD 385
             +      +A+N +      S  S Q+  +S  +++L+      + +D   LL  +TG+
Sbjct: 323 VDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGE 380

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
              +    DG+ ++R+ +   +             ++  +  ++ F+ +  G+S L+Q
Sbjct: 381 FYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 438


>gi|149237256|ref|XP_001524505.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
 gi|146452040|gb|EDK46296.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
          Length = 1380

 Score =  133 bits (334), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 120/521 (23%), Positives = 236/521 (45%), Gaps = 67/521 (12%)

Query: 912  QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS---IVAFTVLHNVNCNHG 968
            +R+  F N++G+   F++G  P    +  + L   P++   S    V+ +   +    +G
Sbjct: 876  RRLVYFPNLNGYTCIFVTGVIP---FIIIKSLHSIPRIFQFSKIPAVSISAFSDSKIKNG 932

Query: 969  FIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
             I + +    +IC+L    TY+   P++++ L      + Y  + +   ++ S     P 
Sbjct: 933  LICLDNNKNARICELSLDYTYEFNLPIKRVDLGELVRSLAYHEQSD--TVVASTFKEVPY 990

Query: 1029 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1088
            N      +D+E G+ I       +    T+   +  ++++ P      W    +  ++ +
Sbjct: 991  N-----CVDEE-GNIIPGVYKEKLPHALTF---KSSIKLISPHN----WTVIDSFDLEDN 1037

Query: 1089 ENALTVRVVTLFN-------TTTKENETLLAIGTAYVQGEDVAARGRVLLFST------- 1134
            E  +TV+ + L            K     + IGT  ++ ED+AA G   ++         
Sbjct: 1038 EVGMTVKSMILDRGSGAASLKKFKSKREYIVIGTGKLRMEDLAANGSFKIYEIIDIIPEP 1097

Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
            G+   N +    EV+ ++ +GA++A+  L G L++  G KII+       +  +AF D  
Sbjct: 1098 GKPETNHK--FKEVFQEDARGAVTAICDLSGRLMVGQGQKIIVRDIEDDGVVPVAFLDTS 1155

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
             +Y+       N ++LGD  KS++ + ++ +  ++ +L KD   LD    +F+I    + 
Sbjct: 1156 -VYISEAKSFGNLLILGDPLKSVWLVGFEAEPYRMVMLGKDRQHLDVECADFIIKDEDIF 1214

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1314
            ++V+D    I +  Y P   +S  G  LL++A F + +           AT+  R  + P
Sbjct: 1215 ILVADNNNCIHLIQYDPDDPKSINGTILLNKASFELNS-----------ATTCLR--SIP 1261

Query: 1315 GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR------- 1367
              +K + + ++  TLDG++  + P++E T+RR+  LQ+++ D V H  GLNPR       
Sbjct: 1262 KGEKGD-YQIIGSTLDGALYNVFPVNEFTYRRMYILQQQISDKVYHFCGLNPRLNRFGGS 1320

Query: 1368 -SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
             + R   +N K        I+D  L+  +  L L+ Q ++A
Sbjct: 1321 VTLRDRETNTKP-------ILDYGLIRRFSKLNLDRQQQLA 1354



 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 71/328 (21%), Positives = 139/328 (42%), Gaps = 60/328 (18%)

Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
           +SS +I   +LD  +  + D  F+H Y +P + +L  R  +WAG +        +  +S+
Sbjct: 223 DSSFIIEAGNLDSSIDTIIDLQFLHNYRDPTIALLSSRSHSWAGSLLKSKDNVHLEVMSL 282

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNY-- 345
               K    I+   NLP++  +++ + +P+ G L+VG N I H  +   +  +++N++  
Sbjct: 283 DLLTKLSTSIFKIENLPYEVDRIVPLSAPLNGCLLVGCNEIMHVDNGGIAKRISVNDFTS 342

Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
               S+ S+Q+  +S+  ++L+      + +D   L+ T+ G         DG+ ++R+ 
Sbjct: 343 LTTASVKSNQD--QSNLGLKLENCSVVQIPDDHRVLIVTEQGSFYFANFELDGKSIKRVF 400

Query: 403 LSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
           +   + ++          +I  +  +L F+ +  GDS LVQ             +K    
Sbjct: 401 IDVVDKNMYDKIKFTFPGEIAVLSKNLLFMSNLNGDSPLVQ-------------VKYRNS 447

Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA----------------QK 499
            I  D   T+R+ +           G E +    +SN  +                  QK
Sbjct: 448 KILEDTRGTRRVEKGK---------GAEKNKNNVSSNEVDDDDDDDDDLYKEEEEEEQQK 498

Query: 500 TFS-----FAVRDSLVNIGPLKDFSYGL 522
             S     F ++D L+N  P+  F+ GL
Sbjct: 499 VLSKSHIEFILQDRLINNSPISTFTLGL 526


>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
 gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
          Length = 1212

 Score =  132 bits (333), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 96/287 (33%), Positives = 143/287 (49%), Gaps = 23/287 (8%)

Query: 1120 GEDVAARGRVLLF-------STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
            GED +++G + +F       S G  A +   L   + ++E++  ++AL  L G L+ A G
Sbjct: 902  GEDRSSKGHMYVFDVVECVPSEGMAASDALRL-QLLCTEEMRAPVTALHDLNGFLVAAVG 960

Query: 1173 PKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
             K+++  W   E L  +AF D   +Y  S+  VKNF+LL D ++S YF++++E  A+L L
Sbjct: 961  QKLLIRSWEYCEWLVTVAFLDMG-MYTTSIQRVKNFLLLTDYYQSAYFVAFQEDPARLVL 1019

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
            L +D+         FLID + LS+V  D    +++  Y P    S  GQ+LL+R E+H  
Sbjct: 1020 LGRDYIPTSVTCGAFLIDRARLSIVTCDMNGCLRLMDYHPSNPTSLGGQRLLARCEYHAP 1079

Query: 1292 AHVT--KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
              V   + L    LATS        G   T+   L     +G++  + P+ E  F  LQ 
Sbjct: 1080 GEVVRARMLHGPYLATS--------GECLTSEIVL--AKRNGAVDVLVPVTEKIFPTLQL 1129

Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
             Q +LV  V H AGLNPR FR    N    RP    I+D  LL   E
Sbjct: 1130 FQSQLVRMVRHTAGLNPRGFRAVF-NQHISRPLAKGILDGTLLHTAE 1175



 Score =  120 bits (301), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 152/591 (25%), Positives = 264/591 (44%), Gaps = 75/591 (12%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           +RL G V  +  + Q  A     RD ++++F DAK++++E+DD    L   S+H FE   
Sbjct: 22  HRLFGQVTGIQSV-QTLASQVDGRDRLLVSFRDAKLALMEWDDVYGDLNSISIHTFERAP 80

Query: 167 WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF 226
            L +     SF   P + VDP  RC  +L+    + IL   Q  S L G +D   +    
Sbjct: 81  QL-VDGLPPSFV--PRLLVDPASRCAALLLPQDALAILPFVQEASEL-GADDPRDAALLD 136

Query: 227 SARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
            A    S +++     D  +++V+D +F+ G+ +P++ +L+E ELTW G +S    T  +
Sbjct: 137 QAPYAPSFILSFSEDVDASIRNVRDCVFLPGFQKPMLAVLYEPELTWTGSLSRARLTTRV 196

Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALAL 342
             +++  T+ ++P+  ++  LP+D   L+A P  +GGVLVV  + + +  Q+A    L++
Sbjct: 197 CFITLDLTVTKYPVTVTSEALPYDTLYLVACPDSLGGVLVVTPSALLHLDQTARLVGLSV 256

Query: 343 NNYAVSLDSSQELPRSSFSV---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
           + +     S   LP ++ ++   +L ++  T+ + +  LL  + G ++      +GR V 
Sbjct: 257 SRWTDFTSSELMLPNATATLGDCDLQSSVLTFTEANGGLLVLRDGRMLTFQCALEGRTVT 316

Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
            L L+     VL  +    G + F     L + L++  +    T + +  L E   +I A
Sbjct: 317 SLSLN----VVLVPERQ--GGASFV--QALPERLILCASFQDDTYLYAMNLLEAPTEIAA 368

Query: 460 D-APSTKRLR-RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
              P  + L   +  DA    + G+      + S   + A       V D L  +GPL D
Sbjct: 369 STGPDQQSLEPDADVDADALDLYGDSFKPDVATSKQAQPA----GLDVLDVLPTLGPLND 424

Query: 518 FSYGLRINADASA---TGISKQSNYELVE----------LPGCKGIWTVYHKSSRGHNAD 564
            +YG+  NA   A      + Q +  ++E          +     IWTV        N  
Sbjct: 425 MTYGVVRNAHGKAHPHMVATMQHHLAVIEPRLRCDVVQNIAPAHAIWTV------SINGK 478

Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
              + A+D+E    L+ SLE+ +            T  +   +Q RTIA G+   +  VI
Sbjct: 479 WLLLTAWDEE---CLVYSLESNS------------THFLSQHLQ-RTIACGS--TQAGVI 520

Query: 625 QVFERGARILD--GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           +V  + A +LD  G  MT   +F   ++ +  G         SI D YV L
Sbjct: 521 RVTSKRAEVLDEHGRIMT---TFAECDANASYG-------DASIQDSYVAL 561


>gi|68471006|ref|XP_720510.1| likely Cleavage and Polyadenylation Specificity Factor subunit
            [Candida albicans SC5314]
 gi|74591422|sp|Q5AFT3.1|CFT1_CANAL RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|46442380|gb|EAL01670.1| likely Cleavage and Polyadenylation Specificity Factor subunit
            [Candida albicans SC5314]
          Length = 1420

 Score =  132 bits (332), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 120/572 (20%), Positives = 246/572 (43%), Gaps = 73/572 (12%)

Query: 881  NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
            N    + ++L  +  P +A+     P+G   +R +  F N++G    F++G  P   +  
Sbjct: 855  NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 909

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
               +    Q    + ++ +   +    +G I++ +Q   +IC+LP    Y+   P++ + 
Sbjct: 910  VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVD 969

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLH 1055
            +  +   I Y    +         VL    Q+    +D+E     G   D  +  ++   
Sbjct: 970  IGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEEGKPIAGIIKDIKDTPAMSFK 1022

Query: 1056 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV-----------------T 1098
             +       ++++ P      W    TI +  +E  +T++ +                 +
Sbjct: 1023 GS-------IKLVSPYN----WTVIETIELGDNEVGMTLKSMILDVGSESGSTLGSDPNS 1071

Query: 1099 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSK 1151
            L     K+    + IG    + ED+AA G   ++         G+   N +    E++ +
Sbjct: 1072 LIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKE 1129

Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            E +GAI+++  L G  L++ G K+I+          +AF D P +YV       N ++LG
Sbjct: 1130 ETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILG 1188

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ K  + + +  +  ++ +L KD   +     +F+I+   + ++V+D    + +  Y P
Sbjct: 1189 DLLKGCWLVGFDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDP 1248

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTNR 1321
               +S  G KLL++A F + + ++    L ++    S +T A          P +  +N 
Sbjct: 1249 DDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDALTNIAVPPPLPPNTTSNY 1308

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGK 1377
            F ++  T DGS   + P++E  +RR+  LQ++L+D   H  GLNPR  R    +  +N  
Sbjct: 1309 FQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNET 1368

Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
              +P    I+D +L+  +  L  + +  +A++
Sbjct: 1369 NTKP----ILDYDLIRSFTKLSDDRKRNLANK 1396



 Score = 77.0 bits (188), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 104/221 (47%), Gaps = 17/221 (7%)

Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
           +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WAG +           L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY-- 345
              LK    ++   NLP++  +++ +PSP+ G L+VG N  IH  +      +A+N +  
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336

Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
               S  S Q+  +S  +++L+      + +D   LL  +TG+   +    DG+ ++R+ 
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394

Query: 403 LSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
           +   +             ++  +  ++ F+ +  G+S L+Q
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 435


>gi|414587800|tpg|DAA38371.1| TPA: hypothetical protein ZEAMMB73_571351 [Zea mays]
          Length = 108

 Score =  132 bits (331), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 68/98 (69%), Positives = 77/98 (78%)

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
           MVL+T D L EVTE+VDY VQG TIAAGNLFGR RVIQV+ +GAR+LDGS+MTQ+L+F  
Sbjct: 1   MVLQTGDDLGEVTETVDYNVQGSTIAAGNLFGRCRVIQVYAKGARVLDGSFMTQELNFSM 60

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
             SES   SE     S SIADPYVLL MSDGSIRLL+G
Sbjct: 61  HTSESSLNSEPLAAASASIADPYVLLKMSDGSIRLLIG 98


>gi|115490949|ref|XP_001210102.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114196962|gb|EAU38662.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 908

 Score =  132 bits (331), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 176/731 (24%), Positives = 297/731 (40%), Gaps = 148/731 (20%)

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
           ++++LAF +AK+S++E+D   HG+   S+H +E  +        +  + G ++ V+P  R
Sbjct: 62  EAVLLAFRNAKLSLIEWDPERHGISTISIHYYERDDLTCSPWVPDLSSCGSILDVEPSSR 121

Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGD-------------EDTFGSGGGFSARIESSHVI 236
           C  V  +G++ + I+   Q G  LV D             ++T      ++    SS V+
Sbjct: 122 CA-VFNFGIRNLAIIPFHQPGDDLVMDDYDSDLDERKHVDQETTRESPAYATPYASSFVL 180

Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            L   D  + H     F+H Y EP   IL+ +  T    +  +      S  ++    + 
Sbjct: 181 PLTAFDPSILHPISLAFLHEYREPTFGILYSQVATSNALLHERKDVVFYSVFTLDLEQRA 240

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQ 353
              + S   LP D + ++A+P P+GG L++G+N  +H      + A+ +N ++  + +  
Sbjct: 241 STTLLSVARLPSDLFHVVALPPPVGGSLLIGSNELVHVDQAGKTNAVGVNEFSRQVSAFS 300

Query: 354 ELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRL---------- 401
              +S  ++ L+      L ++    +L   TG++VL+    DGR V  +          
Sbjct: 301 MTDQSDLALRLEGCRVERLADNSGDMILILSTGNMVLIKFKLDGRSVSGISVHPVPVHAG 360

Query: 402 -DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
            DL K+      S    +GN   FLGS   DSLL+      G S LSSG           
Sbjct: 361 GDLMKS----AASSSAFLGNGEVFLGSEDADSLLL------GWSDLSSG----------- 399

Query: 461 APSTKRLR------RSSSDALQDMVNGEEL---SLYGSASNNTESAQKT---------FS 502
              TKRLR        S D   D ++ +++    LY ++ + T   ++          ++
Sbjct: 400 ---TKRLRSHKNDANDSGDVSDDNMSDDDVYEDDLYSTSPDATADGRRVSADPSSFGLYN 456

Query: 503 FAVRDSLVNIGPLKDFSYG--------LRINADASATGISKQS---NYELVEL------- 544
           F + D L+NI PL+D + G         + N  A    ++ Q    N  L+ +       
Sbjct: 457 FRINDRLLNIAPLRDITLGKPSTFDKDRKDNVSAELELVASQGSDRNGGLIAMRREIDPE 516

Query: 545 -------PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA---- 593
                       +WT   +SS G ++            H Y+I+S +      ET     
Sbjct: 517 VLASFTIDSANCVWTACVESSGGKDS------------HQYVIVSKQTNIDKEETEIFRV 564

Query: 594 ---DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---P 647
              DL       V+   +  TI  G L  + RV+QV +   R  D      DL      P
Sbjct: 565 DGLDLKPIKAPEVNP-NEEVTIDVGTLAKQSRVVQVLKNEVRCYDA-----DLGLAQIYP 618

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
              E    S+    +S S+ DPYV +   D ++ LL  D S     V+ P  + ++ K +
Sbjct: 619 VWDE--DTSDEHPAVSASVTDPYVAILRDDSTLLLLHVDDSGDVDEVEMPDNM-AAHKWL 675

Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
           SSC LY DK                TGV  +     G    Q D++  +  +   L I+ 
Sbjct: 676 SSC-LYLDK----------------TGVFASNTDTKGS--RQNDMFLFLLGQDCRLFIYR 716

Query: 768 VPNFNCVFTVD 778
           +P+   V T+D
Sbjct: 717 LPDLLLVSTID 727



 Score = 46.6 bits (109), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 7/76 (9%)

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
            R+R  P  C   + AF    N    +GFI++ S+  L++CQLP  + +D  WP+++IP+ 
Sbjct: 820  RMRGAPIQC---LDAF----NSPSGNGFIFLDSENALRMCQLPRETHFDYQWPMRRIPIG 872

Query: 1002 ATPHQITYFAEKNLYP 1017
                 + Y A +   P
Sbjct: 873  EQIDHLAYSAAEWKLP 888


>gi|241954348|ref|XP_002419895.1| subunit of the mRNA cleavage and polyadenylation factor, putative
            [Candida dubliniensis CD36]
 gi|223643236|emb|CAX42110.1| subunit of the mRNA cleavage and polyadenylation factor, putative
            [Candida dubliniensis CD36]
          Length = 1420

 Score =  132 bits (331), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 122/560 (21%), Positives = 242/560 (43%), Gaps = 50/560 (8%)

Query: 881  NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
            N    + ++L  +  P +A+     P+G   +R +  F N++G    F++G  P   +  
Sbjct: 856  NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 910

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
               +    Q    ++++ +   +    +G I++ +Q   +IC+LP    Y+   P++ + 
Sbjct: 911  IHSIPRIFQFSKIAVMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVD 970

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE-------VGHQIDNHNLS-- 1050
            +  +   I Y    +         VL    Q+    +D+E       + +  D   +S  
Sbjct: 971  IGESIKSIAYHETSD-------TVVLSTFKQIPYECLDEEGKPIAGIIKNIKDTPAISFK 1023

Query: 1051 -SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET 1109
             SV L   Y     E   L  +  G   ++        S++ +     +L     K+   
Sbjct: 1024 GSVKLVSPYNWTVIENIELGDNEVGMTIKSMILDVGSESKSTVGTDPNSLIKKYNKKKRE 1083

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALAS 1162
             + IG    + ED+AA G   ++         G+   N +    E++ ++ +GAI+++  
Sbjct: 1084 YIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKEDTRGAITSICE 1141

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            L G  L++ G K+I+          +AF D P +YV       N ++LGD  K  + + +
Sbjct: 1142 LSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLVILGDPLKGCWLVGF 1200

Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
              +  ++ +L KD   +     +F+I+   + ++V+D    + +  Y P   +S  G KL
Sbjct: 1201 DAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKL 1260

Query: 1283 LSRAEFHVGAHVTKFLRL------QMLATSSDRTGAA--PGSDKT-NRFALLFGTLDGSI 1333
            L++A F + + ++    L      + +   +D   AA  P  + T N F ++  T DGS 
Sbjct: 1261 LTKASFELNSTISCLRSLPLKDIDEKVQNETDAAAAATIPLPNNTQNNFQVIGSTQDGSF 1320

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHRPGPDSIVDC 1389
              + P++E  +RR+  LQ++L+D   H  GLNPR  R    +  +N    +P    I+D 
Sbjct: 1321 FNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKP----ILDY 1376

Query: 1390 ELLSHYEMLPLEEQLEIAHQ 1409
            +L+  +  L  + +   A++
Sbjct: 1377 DLIRRFTKLSDDRKRNFANK 1396



 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 55/236 (23%), Positives = 110/236 (46%), Gaps = 17/236 (7%)

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
           +EDT G+        +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WAG 
Sbjct: 197 EEDTNGTNKESHLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGN 256

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +           L++    K    ++   NLP++  +++ +PSP+ G L+VG N  IH  
Sbjct: 257 LIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIVPLPSPLNGTLLVGCNELIHVD 316

Query: 333 SQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLV 387
           +      +A+N +      S+ S Q+  +S  +++L+      + +D   LL  +TG+  
Sbjct: 317 NGGVLKRIAVNKFTRLITASIKSFQD--QSDLNLKLENCSIVPIPDDHRVLLILQTGEFY 374

Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
            +    DG+ ++R+ +   +             ++  +  ++ F+ +  G+S L+Q
Sbjct: 375 FINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAVLDKNMLFIANSNGNSPLIQ 430


>gi|380488833|emb|CCF37111.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
          Length = 1062

 Score =  132 bits (331), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 234/1047 (22%), Positives = 402/1047 (38%), Gaps = 174/1047 (16%)

Query: 69   YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
            Y  R+ ++   ES   G     V  D      L LV  Y + G V  LA +    S+ G 
Sbjct: 66   YDHRLNDDDGLESSFLGGDGMLVRADRAINTKLVLVAEYPIFGIVTGLAKIKLQYSKSGG 125

Query: 125  DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
            +      ++++A   A++S++++D   H L   S+H +E  E         S   GPL  
Sbjct: 126  E------ALLIATRVARLSLVQWDPEKHALEDISIHYYEKEEL------EGSPFDGPLNN 173

Query: 183  ----VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES------ 232
                +  DP  RC  +      +  L   Q    +  D+      G   A+  S      
Sbjct: 174  YRTHLAADPGSRCAALRFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPSATAATN 233

Query: 233  ------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                        S+V+ L  LD  + H     F+H Y EP   I+   +          H
Sbjct: 234  GTSNIADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSTQRRSNTLPRKDH 293

Query: 279  HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSAS 337
             +  +  L +    +    I S  NLP D +K++A+P P+GG L+VG N  IH       
Sbjct: 294  FSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPVGGALLVGTNELIHIDQSGKP 351

Query: 338  CALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDG 395
              +A+N +     +     +S   + L+  +   +  +N   L+    G L ++T   DG
Sbjct: 352  NGVAVNPFTKETTNFPLADQSDLDLRLEHCYIELMSAENGELLMILSDGRLAIITFKIDG 411

Query: 396  RVVQ----RLDLSKTNPSVLTSDITTIGN---SLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
            R V     +L  ++    ++   ++TI     ++FF+G+   DSL++ +T     +    
Sbjct: 412  RTVSGVGVKLVPTEVGGGIVQCSVSTISRLSRNVFFVGTTGSDSLVLGWTRKQAQNARK- 470

Query: 449  GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVR 506
              K    D   D+           D   D + GE  +  +   A+ N  S     +F V 
Sbjct: 471  --KTRLVD---DSFEYDLEDEDMEDDDDDDLYGETTTTMIQPGATANGVSKGGDLTFRVH 525

Query: 507  DSLVNIGPLKDFSYGLRI-------------------------NADASATGISKQSNYEL 541
            DSL++I P+KD + G +                            +A A  I  Q+    
Sbjct: 526  DSLLSIAPVKDMTSGKQAFIPDSEEEKNSVGVVADLQLACVVGRGNAGAVAIVNQNIQPK 585

Query: 542  V----ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMV 589
            V    E P  +G WT+       KS +G    ++ +A+  D   +Y  ++I+S +     
Sbjct: 586  VIGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVASEFDASSKYDKFMIVS-KVDLDG 644

Query: 590  LETADLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQ 641
             ET+D+        +           G T+ AG +    R+IQV +   R  DG   ++Q
Sbjct: 645  YETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQ 704

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
             L     + E+G+      V+S SI DPY+LL   D SI +   D +     V+      
Sbjct: 705  ILPM--LDEETGA---EPRVISASITDPYLLLVRDDSSIMVAQIDNNCELEEVEKQDDTI 759

Query: 702  SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
             S K ++ C LY D                +TG+   +    G P  Q + +  +   +G
Sbjct: 760  LSTKWLAGC-LYTD----------------TTGLFAPMQTDKGTPEGQ-NTFMFLLSAAG 801

Query: 762  ALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
            AL I+ +PN +    V    +G T+ V  ++           + +   GT Q   E +  
Sbjct: 802  ALYIYALPNLSKPVYV---AAGLTY-VPPFL---------SADYAVRRGTVQ---ETLTE 845

Query: 822  MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
            + V +L         + P+L     +  +  Y+    E  + T      +  +++L    
Sbjct: 846  LLVADLG----DTTATSPYLIVRHANDDLTIYEPIRLESQDKT------LGLAKTLHFQK 895

Query: 882  VSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
            ++     N   +++P++    E  E P   P +      NI+G+   FL G+ P   +  
Sbjct: 896  IT-----NPALAKSPVEVADDEANEQPRFVPLRPCA---NINGYSTVFLPGASPSLIV-- 945

Query: 940  RERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PV 995
             +  +  P+   L    +   +  H   C  GFIY  S+G  ++ QLP+ S +      V
Sbjct: 946  -KSAKSSPKVVGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSV 1004

Query: 996  QKIPLKATPHQITYFAEKNLYPLIVSV 1022
            +KIP+      I Y      Y +  S+
Sbjct: 1005 RKIPIGDAVGLIAYHPPMETYAVACSI 1031


>gi|255720869|ref|XP_002545369.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240135858|gb|EER35411.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 1351

 Score =  129 bits (324), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 112/497 (22%), Positives = 216/497 (43%), Gaps = 53/497 (10%)

Query: 888  RNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
            ++L  +  P +A+     P+G   +R +  F N++G    FL+G  P   +     +   
Sbjct: 831  KDLTITGAPENAF-----PYGTSIERRLVYFPNLNGFTCIFLTGVIPYLILKTIHAIPRI 885

Query: 947  PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQ 1006
             Q      V+ +   +    +G I++ ++   +IC+LP    Y+   P++ +P+  +   
Sbjct: 886  FQFTKIPAVSISAFSDSKIKNGLIFLDNEQNARICELPLDYNYEFNLPMKHVPIGESIKA 945

Query: 1007 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVR 1066
            + Y    +   ++VS     P N      +D+E G  I    +  ++     T  +  ++
Sbjct: 946  MAYHEASDC--VVVSTFKEIPYN-----CVDEE-GKLI----VGVMEDKPAATSFKGSIK 993

Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL------FNTTTKENETLLAIGTAYVQG 1120
            ++ P      W    TI +  +E  ++++ + L           K     + +GT   + 
Sbjct: 994  LISPYN----WSVIDTIELDDNEVGMSLKSMVLDIGSSSLIKKFKNKREYIVVGTGKYRM 1049

Query: 1121 EDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1173
            ED+AA G   +F         G+   N +    E + + +KGA++++  L G  L++ G 
Sbjct: 1050 EDLAANGAFKIFEIIDIIPEPGKPETNHK--FKETFQENIKGAVTSVCELSGRFLVSQGQ 1107

Query: 1174 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1233
            K+I+          +AF D P +YV       N ++LGD  K  + + +  +  ++ +L 
Sbjct: 1108 KVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILGDPLKGCWLIGFDAEPFRMIMLG 1166

Query: 1234 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
            KD   L     +F+I    + ++V+D    + +  Y P   +S  G KLL++A F + + 
Sbjct: 1167 KDTQHLSVECADFIIKDDEVYILVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELASP 1226

Query: 1294 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1353
            ++    L             P  D  N F ++    DGS   + P++E T+RR+  LQ++
Sbjct: 1227 ISCLRTL-------------PIDD--NNFQIIGSCQDGSFFNVFPINESTYRRMYILQQQ 1271

Query: 1354 LVDSVPHVAGLNPRSFR 1370
            L +   H  GLNPR  R
Sbjct: 1272 LTEKEYHYCGLNPRLNR 1288



 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 29/318 (9%)

Query: 222 SGGGFSAR--IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
           +G  F  R   +SS +I+   LD  +  V D  F+H Y EP + +L  +   WAG +   
Sbjct: 198 NGNSFEPRQFYDSSFIIDATTLDSTVGTVIDMQFLHNYREPTIGVLSSKSEVWAGNLLKS 257

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
                   L++    K    ++   NLP++  +++ +PSP+ GV++VG N  IH  +   
Sbjct: 258 KDNIQFQVLTLDLNSKSTVSVFKIDNLPYEIDRVIPLPSPLNGVILVGCNELIHVDNGGV 317

Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTV 391
              +A+N +      S+ S Q+  +S  +++L+ +    + ND   LL  KTG+   +  
Sbjct: 318 MKRIAVNKFTGLTTASIKSFQD--QSDLNLKLEDSTIVPIPNDHRVLLVLKTGEFYYINF 375

Query: 392 VYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
             DG+ ++R+ +   +  +          ++  +  +L F  +  G+S LVQ       S
Sbjct: 376 ELDGKSIKRVHIDVIDKKLYEKVKLTYPGEVAVLDKNLLFFANSSGNSPLVQVKYRDSLS 435

Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
               G   E  D E +               ++    E+ +L          ++    F 
Sbjct: 436 DAKIGAPIEESDEEDETQKADEDDDEDDLYKEEEEEEEQKNL----------SKTHIEFV 485

Query: 505 VRDSLVNIGPLKDFSYGL 522
             D L+N GP   F+ G+
Sbjct: 486 YHDELINNGPSSSFTLGV 503


>gi|226290902|gb|EEH46330.1| cleavage and polyadenylation specificity factor subunit A
           [Paracoccidioides brasiliensis Pb18]
          Length = 1343

 Score =  127 bits (320), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 182/745 (24%), Positives = 303/745 (40%), Gaps = 109/745 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V    ++++Y +     GS   ++  +T+ +        + L LV  Y L G V  L
Sbjct: 28  NLIVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGTVTDL 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    ++I++A  +AK+S++E+D   H +  TS+H +E  + +H+     +
Sbjct: 81  GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GDEDTFGSGGGFSARIESS 233
            A  P  + VDP  RC  VL +G + + IL   Q G  LV GD   F S      +I+++
Sbjct: 138 LAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGD---FDSDHDEERQIDTN 193

Query: 234 HVINLRD--------------------------LDMKHVKDFIFVHGYIEPVMVILHERE 267
           H    RD                            M H     F++ Y EP   IL+ + 
Sbjct: 194 HTAEERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQV 253

Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
              +  +  +      S  ++    +    + S   LP+D +K++ +P P+GG L+VG+N
Sbjct: 254 AASSALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSN 313

Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
             +H      + A+ +N +A    S     +S   + L+      L  +N   LL    G
Sbjct: 314 ELVHVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNG 373

Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
            + +++   DGR V  + L            +T PS        +G    F GS  GDS+
Sbjct: 374 VMAVVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEEGDSM 429

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
           L+ ++  S  + +     E   D  A+    +       DA +D +    ++  G  S N
Sbjct: 430 LIGWSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGINSRN 487

Query: 494 TESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELVELPG 546
           T S   T    + F + D L N+GP++D + G      + D   +  S  +  ELV   G
Sbjct: 488 TTSVNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELVTTQG 547

Query: 547 --------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
                                       G+ +V+ K  +      S        Y  YL+
Sbjct: 548 YGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYDHYLL 607

Query: 581 ISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARIL 634
           +S      + +++V + +    E T + ++   + RTI  G L G  RV+QV +   R  
Sbjct: 608 LSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGEVRSY 667

Query: 635 DGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
           D + +   L      P   E    SE  +V+  S ADPYVL+   D SI LL  D S   
Sbjct: 668 DSANLHLGLGLAQIYPVWDE--DTSEERSVVHASFADPYVLIIRDDSSILLLQADESGDL 725

Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
             ++T   IES+     S +LY DK
Sbjct: 726 DEIETDGIIESTT--WISGSLYQDK 748



 Score =  124 bits (311), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 122/523 (23%), Positives = 221/523 (42%), Gaps = 94/523 (17%)

Query: 919  NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
            ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++
Sbjct: 902  DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 961

Query: 979  KICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1038
            ++C+ P  + +D  W  +KI L      + Y +    Y L  S            L  D 
Sbjct: 962  RMCRFPRNTHFDGSWAARKIGLGEQVDSVEYSSSSETYVLGTS------QKADFKLPEDD 1015

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            E+  +  N  +S         +++  V++L P      W    +  ++++E  + V+ + 
Sbjct: 1016 EIHPEWRNEVISFFP-----QIDKGSVKLLNPRT----WSIIDSYQLRTAERVMCVKCLN 1066

Query: 1099 L-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKE 1152
            L  +  T E + ++A+GTA  +GED+AARG + +F   +     D P+    +  +  +E
Sbjct: 1067 LEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIKVVPEVDRPETNRKLKLIAKEE 1126

Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
            +KGAI                         T L+GI                + F++   
Sbjct: 1127 VKGAI-------------------------TSLSGIGG--------------QGFLIAAQ 1147

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
              K I     KE G+   LL   F  + C+ +            V  E K          
Sbjct: 1148 GQKCI-VRGLKEDGS---LLPVAFMDMQCYVS------------VLKELKGTD------- 1184

Query: 1273 MSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKTNRF-ALLFGT 1328
               S KG +LL R+ FH G   + +T   R  +L+   +    A   D +     +L  +
Sbjct: 1185 -PGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLSQGPEAEANAMDLDSSGPLHQVLVTS 1243

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
              GSI  I P+ E+ +RRL +LQ ++++++ H  GLNPR+FR   S+G   R     +VD
Sbjct: 1244 ETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGGR----GMVD 1299

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
             +L+  +  L  + + EIA + G    +I ++L   A+G + L
Sbjct: 1300 GDLVQKWLDLGTQRKAEIASRVGADVWEIRADLE--AIGKAGL 1340


>gi|312077399|ref|XP_003141287.1| hypothetical protein LOAG_05705 [Loa loa]
          Length = 316

 Score =  124 bits (312), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 80/265 (30%), Positives = 132/265 (49%), Gaps = 34/265 (12%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE +   RL   V+S AI        +   DS++L F+DAK+S++  + +   L+  S+H
Sbjct: 62  LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G       P+++VDP  RC  +LV+G  + +L  +  G+ L       
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168

Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+ + L  +D  + +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
            T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337 -SCALALNNYAVSLDSSQELPRSSF 360
             C ++LN+    +D   + P   F
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDF 297


>gi|320583269|gb|EFW97484.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
            [Ogataea parapolymorpha DL-1]
          Length = 1309

 Score =  124 bits (312), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 129/548 (23%), Positives = 227/548 (41%), Gaps = 61/548 (11%)

Query: 837  SRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTP 896
            S+ +L A+   G +L Y+ +            DP+  +  L   N            + P
Sbjct: 750  SKDYLVALTFGGEVLIYETFF-----------DPIERTYKLMKIN---------EMCQFP 789

Query: 897  LDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV 955
            +         H    +R  I   N  G++   ++G+     +     +    Q    S +
Sbjct: 790  IVGAPDNSYAHATKIERYLISVDNFQGYKAVLVTGASAFVILKEYNSIPRMLQFTKRSSL 849

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL-KATPHQITYFAEKN 1014
             F   +   C +G I +      +ICQL S  TY N  P+ K  +   T ++I Y +  N
Sbjct: 850  YFAEYNTDRCPNGVISIDETKACRICQLDSSYTYSNRLPIAKYKIGDKTINKIRYHSLSN 909

Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQI----DNHNLSSVDLHRTYTVEEYEVRILEP 1070
             Y  I+S     P N V       E G  +    D+  L S  L  T       V ++ P
Sbjct: 910  TY--IISTLEEGPYNPV------DEDGEPLPGLRDDRKLKSTSLKGT-------VHLVSP 954

Query: 1071 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL 1130
                  W    TI ++ +E   ++ V+ L  + T   +T++ IGTA  + ED+A  G   
Sbjct: 955  ----ANWTIIDTIELEDNEYVTSIEVIELKVSETIATKTVVLIGTARCRNEDLATHGSWK 1010

Query: 1131 LFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE- 1184
            ++        P     +N +  + S+  +G + ++ ++ G   I  G ++++      + 
Sbjct: 1011 IYEVIDIVPEPGRPEAKNRLKMITSETARGPVLSICNVSGRFAIVQGQRMLVRTLQKDDN 1070

Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
            +  +AF D   +Y   +   KN +L+GD  +S+    +     ++    KD  +++  A 
Sbjct: 1071 VAPVAFTDTS-IYSKEVKTFKNLVLIGDSFQSVSLYGFDAAPYRMLHFGKDEQNVELRAA 1129

Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1304
            +FL+    L L+V+DE     +  Y P    S KG KLL R+     A  TK     M++
Sbjct: 1130 DFLVHDGNLHLLVADEDSVFHLLQYDPYDGNSMKGLKLLRRSLLRSNALTTK-----MIS 1184

Query: 1305 TSSDRTGAAPGSDKTNR----FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
             + DR+  +  S   +     + ++   +DGS   + P++E  +RRL S+Q  L D   H
Sbjct: 1185 VARDRSLFSMVSTLNHEDDLGYEIIGSNIDGSFYKVMPVNEYQYRRLYSIQNYLYDKELH 1244

Query: 1361 VAGLNPRS 1368
              GLNP+S
Sbjct: 1245 WLGLNPKS 1252



 Score =  105 bits (261), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 107/464 (23%), Positives = 212/464 (45%), Gaps = 46/464 (9%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L+L+  YRL+G + ++    +  ++ +   D +I++ + AK+SV+++D  +H +   S+H
Sbjct: 51  LQLIGEYRLNGQIINI---DKFRSNENESLDYLIVSTKLAKLSVIKWDSQLHAISTVSLH 107

Query: 161 CFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII--LKASQGGSGLVGDE 217
            +++  + L +++  ++  +    + DP   C  + +  L   +   K       L  D 
Sbjct: 108 YYDTALDALTVEKLEKTSVQH---RTDPNSLCTCLRLNELFTFLPFYKEYLDEEELKDDA 164

Query: 218 DTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERE-LTWAGRV 274
           +              S ++N   L  D+K++ D+ F+H Y +P M IL+  E +TWAG +
Sbjct: 165 EEAKDIKKRKKLFTESFILNASSLYPDIKNIVDYQFLHSYRDPTMAILYAPETMTWAGHL 224

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
                T  +  LS+    K+   I    NLP+D   +  + SP  G L+VG+N  IH +S
Sbjct: 225 PKAKDTLKVIVLSLDLENKKASAIMELTNLPYDVDYIYPLESPTNGFLLVGSNEIIHVNS 284

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
             +   +  N Y   + + +   +SS  + L+ +    ++ D  L+ T++G+   L    
Sbjct: 285 LGSVRGIYTNEYFTDISNLKLKDQSSLGLMLENSRVGLVKEDQVLIITESGEFYQLNFEK 344

Query: 394 DG-----RVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
            G       +Q+++ S       N  ++ + + ++   LFF+  + GDS L++ +  SG 
Sbjct: 345 IGGNSTITGLQKVETSNYKGIIVNHPIMITSVPSL--DLFFVCCQGGDSSLIRISSKSG- 401

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
            +L    KE+ GD +        L            + E+   + S+  N++       F
Sbjct: 402 -VLPQETKEQNGDTKETKDDDDWL-----------YDEEDQKSHKSSLVNSQ-------F 442

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
              D+++N GPL DF+ G R++ +    G+   +  E V +  C
Sbjct: 443 KKMDNILNCGPLVDFTLG-RVSIEQKIMGLPNPNYNEDVLVAAC 485


>gi|343962533|dbj|BAK62854.1| cleavage and polyadenylation specificity factor 160 kDa subunit [Pan
            troglodytes]
          Length = 269

 Score =  123 bits (308), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 68/198 (34%), Positives = 111/198 (56%), Gaps = 7/198 (3%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1139
            A I +Q  E+   ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+        
Sbjct: 41   ARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVP 100

Query: 1140 NPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
             P   +T+     +Y KE KG ++AL    GHL+ A G KI L     +EL G+AF D  
Sbjct: 101  EPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ 160

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
             LY+  +  VKNFIL  D+ KSI  L ++E+   L+L+++D   L+ ++ +F++D + L 
Sbjct: 161  -LYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLG 219

Query: 1255 LVVSDEQKNIQIFYYAPK 1272
             +VSD  +N+ ++ Y P+
Sbjct: 220  FLVSDRDRNLMVYMYLPE 237


>gi|448530371|ref|XP_003870046.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis Co
            90-125]
 gi|380354400|emb|CCG23915.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis]
          Length = 1327

 Score =  122 bits (307), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 111/512 (21%), Positives = 228/512 (44%), Gaps = 53/512 (10%)

Query: 912  QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
            +R+  F N++G+   F++G  P   +     +    Q      V+ +   +    +G I+
Sbjct: 827  RRLVYFPNLNGYTSIFVAGVIPFLIIKSCHSIPRIFQFSKIPAVSISAFSDSKIKNGLIF 886

Query: 972  VTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV 1031
            + +    +IC+L     Y+   P++++ +  +   + Y  + +   +++S     P N  
Sbjct: 887  LDNNQNARICELSLDYNYEFNLPIRRVHIGESIRSVAYHEQSD--TVVISTFKEIPYN-- 942

Query: 1032 LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENA 1091
                +D+E G  I       +      T  +  ++++ P      W+   TI +Q +E  
Sbjct: 943  ---CVDEE-GKPI----AGVLKDKPPATSFKGSIKLVSPFN----WKVIDTIELQDNEVG 990

Query: 1092 LTVRVVTL-FNTTTKENET---LLAIGTAYVQGEDVAARGRVLLFST-------GRNADN 1140
            + ++ + L   ++ K+ +T    + +GT  ++ ED+AA G   ++         G+   N
Sbjct: 991  MAIKSMVLDVGSSMKKFKTKREYIVVGTGKLRMEDLAANGSFKIYDIIDIIPEPGKPETN 1050

Query: 1141 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1200
             +    E++ ++ +GA++++  L G  L+  G K+I+       +  +AF D P +YV  
Sbjct: 1051 HK--FKEIFQEDTRGAVTSVCDLSGRFLVGQGQKVIVRDLEDDGVVPVAFLDTP-VYVSE 1107

Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
                 N  LLGD  KSI+ + ++    ++ +L KD   L     +F++    + ++V+D 
Sbjct: 1108 AKSFGNLFLLGDPLKSIWLVGFEADPFRMVMLGKDRQHLRVECADFIVKDEEIFILVADV 1167

Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
              ++ +  + P   +S  G  L+++A F   +  T  LR            + P  + T 
Sbjct: 1168 NNSLHLIQFDPDDPKSINGTILINKASFETNSQTT-CLR------------SVPKGE-TG 1213

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSN 1375
             +  +  T+DG+   + P++E T+RR+  +Q+++ D   H  GLNPR  R     Q   N
Sbjct: 1214 DYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISDKEYHYCGLNPRLNRFGGAVQIRDN 1273

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
                +P    I+D  L+  +  L L+ Q  I 
Sbjct: 1274 DTNAKP----ILDYNLIKEFAKLNLDRQKNIT 1301



 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 83/375 (22%), Positives = 157/375 (41%), Gaps = 57/375 (15%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L+LV  ++L G V  L  L      +    D ++++ + AK S++ ++  +H +   S+H
Sbjct: 57  LKLVEQFKLQGTVSGLKALRTSECPH---LDYVVVSTKYAKFSIIRWNHQLHNISTVSLH 113

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
            +E+          E  A   L  V+P       L Y   +  L                
Sbjct: 114 YYEN---CIQHSTFEKLAISDLT-VEPTYSSVSCLRYKNLLCFLPFEGVHEEDDEDDTDD 169

Query: 205 ----KASQGGS----GLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHG 254
                  +GGS    GL  +   F          ++S +I+   LD  +  V D  F+H 
Sbjct: 170 EDIDNDKKGGSITKNGLSYENQPF---------YDASFIIDAGILDSTIDTVLDVQFLHN 220

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y EP + IL  +  +WAG +           +++    K    +++  NLP+D  +++ +
Sbjct: 221 YQEPTIAILSAKSNSWAGNLIKNKDNVQFQVMTLDVQSKSTLPVFNIDNLPYDIDRVIPL 280

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHA 369
           P+P+ G L++G N  IH  +   +  +A+N +      S+ S Q+   S  +++L+    
Sbjct: 281 PNPLNGCLLIGCNELIHVDNGGIAKRIAVNAFTSLITASVKSYQD--ESDLNLKLENCAI 338

Query: 370 TWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNS 421
             + +D   LL   TG+   L    DG+ ++++ L   +  +  S        + ++  +
Sbjct: 339 VPIPDDHRVLLILATGEFYYLNFDLDGKSIKKIHLELVDQKMYDSIRLTYPGQVASLDKN 398

Query: 422 LFFLGSRLGDSLLVQ 436
           L F  +  GDS LV+
Sbjct: 399 LLFFANLNGDSSLVE 413


>gi|406602601|emb|CCH45811.1| hypothetical protein BN7_5397 [Wickerhamomyces ciferrii]
          Length = 1287

 Score =  122 bits (307), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 111/457 (24%), Positives = 215/457 (47%), Gaps = 49/457 (10%)

Query: 969  FIYVTSQGILKICQLPSGSTYDNY---WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
            F+Y+      +IC LP G  + NY    P++ + L  TP+++TY     L+       ++
Sbjct: 844  FMYIDIDKTARICSLPIGENF-NYSQNLPIEIVSLGQTPNKVTYHETSGLF-------IV 895

Query: 1026 KPLNQVLSLLIDQE----VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1081
                ++    ID++    VG        S  +  +    + + ++++ P      W    
Sbjct: 896  STFEEISYNAIDEDGVPIVG--------SESEKPKAKNFKGF-LKLINPIN----WTIID 942

Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR---VLLFSTGRN 1137
             I M+ +E    VR + L  ++ +K+ +  +  G    + ED++  G    + + S   +
Sbjct: 943  EIEMEENEIINDVRSINLTISSRSKKKKEFIIFGIGKYRLEDLSVFGEFKIIDIISIVPD 1002

Query: 1138 ADNPQNL--VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAP 1194
               P+ +    E++ + +KGA++ +  + G  L + G KII+           +AF D  
Sbjct: 1003 PTKPEAIYKFKEIFQEVVKGAVTTINEISGRFLTSQGQKIIIRDLQQDNSTVPVAFMDCA 1062

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
              Y+       N +L+ D  KSI+FL +  +  +L LL KD    +   T+F++D   + 
Sbjct: 1063 T-YLSDSKSFGNLLLISDSMKSIWFLGFDAEPYRLLLLGKDQQRFNAITTDFIVDDGEIY 1121

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1314
             +V+D+++++ +  Y P   +S  GQKLL ++ F   + +T  L+L       D+     
Sbjct: 1122 FLVADDEESLHLLTYQPDDPKSLSGQKLLQKSTFTTNS-ITTCLKLVPKFNEFDQGSIT- 1179

Query: 1315 GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
                   +  +   +DGSI  + P+DE+++RRL  LQ++L D + H  GLNPRS R F +
Sbjct: 1180 ------SYQNIGVNVDGSIFKMIPIDEISYRRLYILQQQLSDKIAHYVGLNPRSNR-FSA 1232

Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
            N +  +P    I++  LL  +  L ++++ + + + G
Sbjct: 1233 NEQGQKP----IIEFGLLKWFINLNVDKRKQFSAKVG 1265



 Score =  112 bits (279), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 129/613 (21%), Positives = 261/613 (42%), Gaps = 62/613 (10%)

Query: 96  ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
           I + + +L+ ++    N + + I S    D+   + +I+ +   AK+S++ FD  ++ ++
Sbjct: 42  IDSKNDKLILNHEFKLNGKIIGIKSIKLPDSQYDQLAILTSL--AKLSIVSFDHDLNTIQ 99

Query: 156 ITSMHCFESPEWLH-LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
             S+H +ES  +   + +  ES      +K+DP  +   ++VY   +  L   Q    ++
Sbjct: 100 TNSLHYYESEFYTKSISKINES-----QLKIDPNNQTS-LVVYNDLLAFLPFKQDDDEII 153

Query: 215 GDEDTFGSGGGFSARIESSH--VI---NLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
            D+    S       IE  H  +I   N  +  + ++ D  F+H Y +P + ILH +E T
Sbjct: 154 DDDHHTQSNDQQQQNIELFHNSIILPANKLESTVSNIIDCDFLHSYRDPTLAILHNKEQT 213

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-T 328
           WA  +S K  T     LS+         I    NLP+D + +  +P PI G L++G N  
Sbjct: 214 WASDLSIKKDTVNFVVLSLDLLNDSSTAILLVENLPYDLWFVKPLPDPINGTLLIGCNEI 273

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           IH  +   +  + LN Y   +   +   +S  ++ L+ +    L +   L+  + G+   
Sbjct: 274 IHIDNSGNTKGIGLNKYYQDITDFKLKDQSDLNIFLEHSKVEILNDKNILIIDQFGESYN 333

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L    DG+ V+ L ++K    +       IT I     F+G +  DS+L+++        
Sbjct: 334 LQFFIDGKSVKDLLITKFEKDLQIRSPISITNIDEQNIFIGCQSSDSILIKY-------- 385

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
               LK+E  + +   P+  +      D           +      N        F+  +
Sbjct: 386 --EKLKQETNEAKPTTPAATKTNNDDDDEDLYEDEDLNNNNDDELIN--------FNLQI 435

Query: 506 RDSLVNIGPLKDFSYGLRINADASATGIS--KQSNYELVEL--PGCKGIWTVYHKSSRGH 561
           +D L N GPL  F+ G +IN ++   G++   Q++  +V     G +G  T++++S +  
Sbjct: 436 KDKLFNAGPLSSFTLG-KINPNSLIQGLTNPNQNDVSIVGTSGEGKQGKLTLFNQSIQPK 494

Query: 562 NADSSRMAAYDDEY---HAYLIIS-LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
              S +    +  +   + YLI + L+     +   +   +  +S D+     TI    +
Sbjct: 495 IHSSLKFNNINKTWNILNKYLITTDLQNFKSEIFLINENFKNFQSFDFKNNNITINIDTI 554

Query: 618 FGRRRVIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
             ++R++Q+      + D ++   +  +  F               +++  I DP++++ 
Sbjct: 555 QSQKRILQITSNNVYLFDLNFKKLLQINFDF--------------EIINGKIFDPFIIIT 600

Query: 675 MSDGSIRLLVGDP 687
            S G +++   DP
Sbjct: 601 SSKGEVKIFEMDP 613


>gi|354547787|emb|CCE44522.1| hypothetical protein CPAR2_403250 [Candida parapsilosis]
          Length = 1334

 Score =  122 bits (307), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 110/512 (21%), Positives = 229/512 (44%), Gaps = 53/512 (10%)

Query: 912  QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
            +R+  F N++G+   F++G  P   +     +    Q      V+ +   +    +G I+
Sbjct: 834  RRLVYFPNLNGYTTIFVTGVIPFLIIKSCHSIPRIYQFSKIPAVSVSAFSDSKIKNGLIF 893

Query: 972  VTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV 1031
            + +    +IC+L    +Y+   P++K+ +  +   + Y  + +         V+    ++
Sbjct: 894  LDNNQNARICELSWDYSYEFNLPIRKVHIGESIKSVAYHEQSD-------TVVISTFKEI 946

Query: 1032 LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENA 1091
                +D+E G  I      ++      T  +  ++++ P      W+   T+ +  +E  
Sbjct: 947  PYDCVDEE-GKPI----AGALKDKPPATSFKGSIKLVSPYN----WKVIDTVELSDNEVG 997

Query: 1092 LTVRVVTL-FNTTTKENET---LLAIGTAYVQGEDVAARGRVLLFST-------GRNADN 1140
            ++++ + L   ++ K+ +T    + IGT+ ++ ED+AA G   ++         G+   N
Sbjct: 998  MSIKSMVLDVGSSLKKFKTKREYIVIGTSKLRMEDLAANGSFKIYDIIDIIPEPGKPETN 1057

Query: 1141 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1200
             +    E++ ++ KGA++++  L G  L+  G K+I+       +  +AF D P +YV  
Sbjct: 1058 HK--FKEIFQEDTKGAVTSICDLSGRFLVGQGQKVIVRDLEDDGVVPVAFLDTP-VYVSE 1114

Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
                 N  LLGD  KSI+ + ++    ++ +L KD   L     +F++    + ++V+D 
Sbjct: 1115 AKSFGNIFLLGDALKSIWLVGFEADPFRMVMLGKDRQHLHVECADFIVKDEEIFILVADI 1174

Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
               + +  + P   +S  G  L+++A F   +  T  LR            + P  D+  
Sbjct: 1175 NNGLHLIQFDPDDPKSINGTILVNKASFETNSQTT-CLR------------SVP-KDEAG 1220

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSN 1375
             +  +  T+DG+   + P++E T+RR+  +Q+++ D   H  GLNPR  R     Q   +
Sbjct: 1221 DYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISDKEFHHCGLNPRLNRFGGAIQIRDS 1280

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
                +P    I+D  L+  +  L L+ Q  IA
Sbjct: 1281 DTNAKP----ILDYNLIREFAKLNLDRQRNIA 1308



 Score = 87.4 bits (215), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 98/458 (21%), Positives = 181/458 (39%), Gaps = 64/458 (13%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L+LV  ++L G V  L  L       + + D II++ + AK S+++++  +H +   S+H
Sbjct: 57  LKLVEQFKLQGTVTGLKPLR---TSENPQLDYIIVSTKYAKFSIIKWNHQLHSISTVSLH 113

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
            +E+          E  A   L+ V+P       L Y   +  L                
Sbjct: 114 YYEN---CIQHSTFEKLAISDLI-VEPTYSSVSCLRYKNLLCFLPFEGVNDHDDDDDDDD 169

Query: 205 -----KASQG-GSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
                   +G    + G + + G+        +SS +I+   L+  +  V D  F+H Y 
Sbjct: 170 DDDDTDDEKGVAENVAGVDKSNGASNDNQPFYDSSFIIDAGTLESSVDSVLDLQFLHHYQ 229

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           E  + IL  +  +WAG +           +++    K    +++  NLP+D  +++ +  
Sbjct: 230 ETTIAILSSKSNSWAGNLIKNKDNVQFQVMTLDIQSKSTLPVFTIDNLPYDIDRIIPLSK 289

Query: 317 PIGGVLVVGAN-TIHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATW 371
           P+ G L++G N  IH  +   +  +A+N +      S+ S Q+    +  +E D +    
Sbjct: 290 PLNGCLLLGCNEIIHVDNGGIAKRIAVNAFTSLITASVKSYQDESELNLKLE-DCSIVPI 348

Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFF 424
            ++   LL   TG+   L    DG+ ++R+ L               + ++ T+ N+L F
Sbjct: 349 PEDHRVLLILATGEFYFLNFELDGKSIKRIHLEAVEQKAYDAIKLTYSGEVATLDNNLLF 408

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
             +  GDS LV+    S   +           +E      K       D   +   GEE 
Sbjct: 409 FANMNGDSPLVEIKYSSSAKV-----------VEKQVLDKKEEDSDEEDLYNEDEEGEEQ 457

Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
            +   +            F + DSL+N GP+  F+ GL
Sbjct: 458 KVMRKSH---------IEFKLHDSLINNGPVSSFTLGL 486


>gi|146096490|ref|XP_001467824.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania infantum JPCM5]
 gi|134072190|emb|CAM70891.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania infantum JPCM5]
          Length = 1542

 Score =  122 bits (305), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 130/576 (22%), Positives = 252/576 (43%), Gaps = 59/576 (10%)

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
            SA  +   L  IL+ G ++ Y+    +  GP    K    +   +  + V     +R + 
Sbjct: 930  SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKR 989

Query: 890  LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCM--VFRERLRVH 946
            L+  R  L + T ++  H +  +R+  F+ +   H+G ++ G  P + +      +L   
Sbjct: 990  LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRHKGIYVCGQTPVFLVYHAATNQLVCT 1046

Query: 947  PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYDNY-WPVQKIPLKATP 1004
                  ++  F   H+ + + GF+Y   +G +    + P G    +  W ++++ L  TP
Sbjct: 1047 RHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGSSGWWLERVRLGCTP 1105

Query: 1005 HQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSVD 1053
            HQ+ Y    +   ++ S      P   P +  L ++ D+E   V H I+  +L   S+  
Sbjct: 1106 HQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSATS 1165

Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------N 1107
                 T E YEV+      +   WQ    + +  +E  L+  ++ +   TT +       
Sbjct: 1166 GSPVPTNERYEVQFF----STLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRSTT 1221

Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGH 1166
              + A+ TAY  GEDV  RGR+LL +T +        +  ++ + +KG ++A+  + +  
Sbjct: 1222 APVCALATAYPLGEDVTTRGRILLLTTSQQGGQGMQQLRTLHEEPMKGPVTAITRVGEDC 1281

Query: 1167 LLIASGPKIILHKW----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            + +A G  + ++++    +  E   I +  A   YV  L   +N++++GD+  S+ F  +
Sbjct: 1282 VAVAVGGTVRVYRYDTNKSTMETMAILYAGA---YVTCLQAFRNYLVIGDLFNSVLFARY 1338

Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQK 1281
             E+   + +L +D  ++   + + L   +   L+V+D+ +N+    Y P+ + E  K  K
Sbjct: 1339 SEEIHTITILGRDTNAISVVSNDMLYHDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPPK 1398

Query: 1282 LLSR-----AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
            +L        E+ +   V   L++  L  +S R          N    ++ T  G IG +
Sbjct: 1399 ILESLLTVTGEYRLAGGV--LLKMMRLRAASTR----------NSSVAIYVTNMGEIGYL 1446

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
             PL + T R  Q + ++L   V H  GL PR F  F
Sbjct: 1447 VPLGDQTSRTGQWVGRRLQSEVAHAGGLPPRMFLGF 1482



 Score = 41.2 bits (95), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)

Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
           +   A+  G G    +     GG  S  +    V + R  D+K    +++D  FV    E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321

Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
           P++  L E++ TWAGRV    W+         TC I    ++++ +   H L  S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381

Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
           P+D   +  +P+    P     V     +H  ++S             A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441

Query: 351 ------SSQELPRSSFSVELDAAHAT 370
                 +SQ L      V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467


>gi|398020786|ref|XP_003863556.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania donovani]
 gi|322501789|emb|CBZ36871.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania donovani]
          Length = 1542

 Score =  122 bits (305), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 131/577 (22%), Positives = 254/577 (44%), Gaps = 61/577 (10%)

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
            SA  +   L  IL+ G ++ Y+    +  GP    K    +   +  + V     +R + 
Sbjct: 930  SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKR 989

Query: 890  LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRV 945
            L+  R  L + T ++  H +  +R+  F+ +   H+G ++ G  P + +V+     +L  
Sbjct: 990  LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRHKGIYVCGQTPVF-LVYHAATNQLVC 1045

Query: 946  HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYDNY-WPVQKIPLKAT 1003
                   ++  F   H+ + + GF+Y   +G +    + P G    +  W ++++ L  T
Sbjct: 1046 TRHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGSSGWWLERVRLGCT 1104

Query: 1004 PHQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSV 1052
            PHQ+ Y    +   ++ S      P   P +  L ++ D+E   V H I+  +L   S+ 
Sbjct: 1105 PHQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSAT 1164

Query: 1053 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------ 1106
                  T E YEV+      +   WQ    + +  +E  L+  ++ +   TT +      
Sbjct: 1165 SGSPVPTNERYEVQFF----STLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRST 1220

Query: 1107 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QG 1165
               + A+ TAY  GEDV  RGR+LL +T +        +  ++ + +KG ++A+  + + 
Sbjct: 1221 TAPVCALATAYPLGEDVTTRGRILLLTTSQQGGQGMQQLRTLHEEPMKGPVTAITRVGED 1280

Query: 1166 HLLIASGPKIILHKW----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
             + +A G  + ++++    +  E   I +  A   YV  L   +N++++GD+  S+ F  
Sbjct: 1281 CVAVAVGGTVRVYRYDTNKSTMETMAILYAGA---YVTCLQAFRNYLVIGDLFNSVLFAR 1337

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQ 1280
            + E+   + +L +D  ++   + + L   +   L+V+D+ +N+    Y P+ + E  K  
Sbjct: 1338 YSEEIHTITILGRDTNAISVVSNDMLYHDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPP 1397

Query: 1281 KLLSR-----AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
            K+L        E+ +   V   L++  L  +S R          N    ++ T  G IG 
Sbjct: 1398 KILESLLTVTGEYRLAGGV--LLKMMRLRAASTR----------NSSVAIYVTNMGEIGY 1445

Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
            + PL + T R  Q + ++L   V H  GL PR F  F
Sbjct: 1446 LVPLGDQTSRTGQWVGRRLQSEVAHAGGLPPRMFLGF 1482



 Score = 41.2 bits (95), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)

Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
           +   A+  G G    +     GG  S  +    V + R  D+K    +++D  FV    E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321

Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
           P++  L E++ TWAGRV    W+         TC I    ++++ +   H L  S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381

Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
           P+D   +  +P+    P     V     +H  ++S             A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441

Query: 351 ------SSQELPRSSFSVELDAAHAT 370
                 +SQ L      V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467


>gi|71654693|ref|XP_815961.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
            strain CL Brener]
 gi|50363265|gb|AAT75335.1| cleavage polyadenylation specificity factor CPSF160 [Trypanosoma
            cruzi]
 gi|70881056|gb|EAN94110.1| cleavage and polyadenylation specificity factor, putative
            [Trypanosoma cruzi]
          Length = 1436

 Score =  120 bits (302), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 125/543 (23%), Positives = 228/543 (41%), Gaps = 67/543 (12%)

Query: 912  QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
            +RI  F  I G+ G ++ G  P    W    RE L  +     G +  F     +N    
Sbjct: 910  RRIVPFDAIGGNTGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968

Query: 965  -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV- 1022
             C  GF+   S      C+ P+G      W  ++I L  TPH + Y        ++ S  
Sbjct: 969  YCCEGFVDFASMDTY--CR-PTGQG----WLTRRIHLGVTPHFVVYHPPARSCFVVTSKK 1021

Query: 1023 ----PVLKPLNQVLSLLIDQEVG------HQIDNHNLSSVDLH---RTYTVEEYEVRILE 1069
                P   P +  L+++ D+E G       +    N+  +  +   R    + +E+R++ 
Sbjct: 1022 EPFRPQRAPFDVQLNIVYDEESGGVQSITTEAPVSNMPPIAPNAGIRVPMADRFEIRLM- 1080

Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE---NETLLAIGTAYVQGEDVAAR 1126
               +   W    T+ ++ +E  L  +++ +      E      +  + TA+  GED+  R
Sbjct: 1081 ---STTDWACTDTLLLEENERVLGAQMMEIQCERDAEGLHTAPVCVVSTAFPLGEDITCR 1137

Query: 1127 GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK--WTGTE 1184
            GR+LL +T       +  +   +S+ L G  +A+  ++ H+ +A G  I L +  W+  +
Sbjct: 1138 GRILLLAT--ICTKKKRKIVLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWSNRK 1195

Query: 1185 LN-GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
            L  G   Y     YV  ++  +N+++ GD+ +S     + E+   L++L KD  ++    
Sbjct: 1196 LVVGALLYAG--TYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDRNAVSVVH 1253

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG--QKLLSR-----AEFHV-GAHVT 1295
             + +       L+ SD+++N+ +  Y P++ E+  G   K+L        E+ + G  + 
Sbjct: 1254 CDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEYRLSGGCLV 1313

Query: 1296 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1355
            K LR + LA +S  T              L+ T  G IG I P+ E   R    L ++L 
Sbjct: 1314 KSLRFRSLAGNSSVT--------------LYVTNYGEIGFIVPIGEQANRTASWLMRRLQ 1359

Query: 1356 DSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
              +PH AGL PR F      + +      + +V   LL+ +  L +  +  IA    T  
Sbjct: 1360 IDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSRKTIASAAYTQL 1419

Query: 1415 SQI 1417
             ++
Sbjct: 1420 ERV 1422



 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 62/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQNDV---AL 378
           NN  +                   D   E   + F V L  A+ T     + N++    +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S I  IG+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
           C +   M  S + + F  IE+
Sbjct: 480 CIAKVDMFHSDVAKRFQIIES 500


>gi|407850337|gb|EKG04765.1| cleavage and polyadenylation specificity factor, putative
            [Trypanosoma cruzi]
          Length = 1436

 Score =  120 bits (301), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 128/545 (23%), Positives = 228/545 (41%), Gaps = 71/545 (13%)

Query: 912  QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
            +RI  F  I G+ G ++ G  P    W    RE L  +     G +  F     +N    
Sbjct: 910  RRIVPFDAIGGNAGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968

Query: 965  -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV- 1022
             C  GF+   S      C+ P+G      W  ++I L  TPH + Y        ++ S  
Sbjct: 969  YCCEGFVDFASMDTY--CR-PTGQG----WLTRRIHLGVTPHFVVYHPPARSCFVVTSKK 1021

Query: 1023 ----PVLKPLNQVLSLLIDQEVG--HQIDNH----NLSSVDLH---RTYTVEEYEVRILE 1069
                P   P +  L ++ D+E G    I       N+  +  +   R    + +E+R++ 
Sbjct: 1022 EPFRPQRSPFDVQLKIVYDEESGGVQSITTEAPVCNMPPIAPNAGIRVPMADRFEIRLM- 1080

Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL-----LAIGTAYVQGEDVA 1124
               +   W    T+ ++ +E  L  +++ +     K+ E L       + TA+  GED+ 
Sbjct: 1081 ---STTDWACTDTLLLEENERVLGAQMMEI--QCEKDAEGLHTAPVCVVSTAFPLGEDIT 1135

Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK--WTG 1182
             RGR+LL +T       +  +   +S+ L G  +A+  ++ H+ +A G  I L +  W  
Sbjct: 1136 CRGRILLLAT--MCTKKKRKIVLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWNN 1193

Query: 1183 TELN-GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
             +L  G   Y     YV  ++  +N+++ GD+ +S     + E+   L++L KD  ++  
Sbjct: 1194 RKLVVGALLYAG--TYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDRNAVSV 1251

Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG--QKLLSR-----AEFHV-GAH 1293
               + +       L+ SD+++N+ +  Y P++ E+  G   K+L        E+ + G  
Sbjct: 1252 VHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEYRLSGGC 1311

Query: 1294 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1353
            + K LR + LA +S  T              L+ T  G IG I P+ E   R    L ++
Sbjct: 1312 LVKSLRFRSLAGNSSVT--------------LYVTNYGEIGFIVPIGEQANRTASWLMRR 1357

Query: 1354 LVDSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
            L   +PH AGL PR F      + +      + +V   LL+ +  L +  +  IA    T
Sbjct: 1358 LQIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSRKTIASAAYT 1417

Query: 1413 TRSQI 1417
               ++
Sbjct: 1418 QLERV 1422



 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 61/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQNDV---AL 378
           NN  +                   D   E   + F V L  A+ T     + N++    +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S I  +G+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTDCYCSGIARLGDQIVFLGSACGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
           C +   M  S + + F  IE+
Sbjct: 480 CIAKVDMFHSDVAKRFRIIES 500


>gi|50288865|ref|XP_446862.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74609915|sp|Q6FSD2.1|CFT1_CANGA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|49526171|emb|CAG59795.1| unnamed protein product [Candida glabrata]
          Length = 1361

 Score =  120 bits (301), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 82/337 (24%), Positives = 157/337 (46%), Gaps = 23/337 (6%)

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKG 1155
            +T TK     + +G  Y   EDV   G   ++        P    T     E++ ++++G
Sbjct: 1026 DTRTKRKREYIIVGIGYATMEDVPPTGEFHIYDITEVVPEPGKPNTNFKLKEIFKEDIRG 1085

Query: 1156 AISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
             +S +  + G  LI+   KI++        +  +AF D P ++V SL    N I++GD  
Sbjct: 1086 IVSVVNGISGRFLISQSQKIMVRDVQQDNSVIPVAFLDVP-VFVTSLKTFGNLIVIGDAM 1144

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            + I F+ +  +  ++  L       +  + EFL++   +  +V+D    + +  YAP   
Sbjct: 1145 QGIQFVGFDAEPYRMITLGSSITKFEVISVEFLVNNGDIYFLVTDRDSIMHVLKYAPDQP 1204

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR-FALLFGTLDGSI 1333
             +  GQ+L+  + F++ +         ML   +D     P   + +R F  +   +DGSI
Sbjct: 1205 NTLSGQRLVHCSSFNLHS----LNNCTMLLPKNDE---FPRDQRYSRSFQTITAQVDGSI 1257

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ---FHSNGKAHRPGPDSIVDCE 1390
              I P+ E T+RRL  +Q++++D  P +AGLNPR  RQ   ++  G + RP    ++D  
Sbjct: 1258 SKIVPVKEETYRRLYFIQQQIIDKEPQLAGLNPRMERQDNKYYHLGHSLRP----MLDFN 1313

Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRS-QILSNLNDLAL 1426
            ++  ++ + +  +  I  + G   + ++  +L DL  
Sbjct: 1314 IIKRFKDMSMNRRSHIVQKLGKNSNLEVWRDLIDLEF 1350



 Score = 57.4 bits (137), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 132/674 (19%), Positives = 265/674 (39%), Gaps = 122/674 (18%)

Query: 96  ISAASLELVCHYRLHGNVESLAIL---SQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
           I +  L L+  ++L G +  +A++   S G   N      ++L+   AK+S+L +++   
Sbjct: 43  IRSGRLYLMEEHKLSGRINDVALIPKHSNGSNGNGINLSYLLLSTGVAKLSLLMYNNMTS 102

Query: 153 GLRITSMHC----FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
            +   S+H     FES   L L       AR   ++++P G     +++   ++ +    
Sbjct: 103 SIETISLHFYEDKFESATMLDL-------ARNSQLRIEPNGNYA--MLFNNDVLAILPFY 153

Query: 209 GGSGLVGDED----------------TFGSGGGFSARIESSH---VINLRDL--DMKHVK 247
            G     DED                 F    G +   + +H   +IN  +L   +K++K
Sbjct: 154 TGINEDEDEDYINNDKSKINDNSKKSLFKRKKGKTQNNKVTHPSIIINCSELGPQIKNIK 213

Query: 248 DFIFVHGYIEPVMVILHERELTWAGR---VSWKHHTCMIS---ALSISTTLKQHPLIWSA 301
           D  F+ G+ +  + +L++ +L W G    V    +  +IS     SI  T     +I   
Sbjct: 214 DIQFLCGFTKSTIGVLYQPQLAWCGNSQLVPLPTNYAIISLDMKFSIDATTFDKAIISEI 273

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVS-LDSSQELPRS 358
             LP D +    +   + G L++G N I +   +      L LN+Y+   L   + + +S
Sbjct: 274 SQLPSDWH---TIAPTLSGSLILGVNEIAFLDNTGVLQSILTLNSYSDKVLPKVRVIDKS 330

Query: 359 SFSVELDAAHATWL----QNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLS------ 404
           S  V  +      L    +N+ +    LL  + G +  + +  +GR++ + +++      
Sbjct: 331 SHEVFFNTGSKFALIPSNENERSVENILLFDENGCIFNVDLKSEGRLLTQFNITKLPLGE 390

Query: 405 -----KTNP---SVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFG 455
                K+NP   S++ +D   +     F+G + GD+ +++     S   +      +++ 
Sbjct: 391 DVLSQKSNPSSVSIIWAD-GRLDTYTIFIGFQSGDATMLKLNHLHSAIEVEEPTFMKDYV 449

Query: 456 DIEADAPSTKRLRRSS-------SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
           + +A A                 SD   D VN +    +G+  SN   +AQ+        
Sbjct: 450 NKQASAAYNNEDDDDDDDDFNLYSDEENDQVNNKNDRTFGTNESNEPFTAQELM------ 503

Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
            L NIGP+     G   + + +  G+   +  E+        + T  +      NA  + 
Sbjct: 504 ELRNIGPINSMCVGKVSSIEDNVKGLPNPNKQEI------SIVCTSGYGDGSHLNAILAS 557

Query: 568 MAAYDDEYHAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR----- 610
           +    ++   ++ I+      ++ +   L T D       + E+  +     QGR     
Sbjct: 558 VQPRVEKALKFISITKIWNLHIKGKDKFLITTDSTQSQSNIYEIDNNFSQHKQGRLRRDA 617

Query: 611 -TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            TI    +   +R++QV      + D ++               +   +  V+ VS+ DP
Sbjct: 618 TTIHIATIGDNKRIVQVTTNHLYLYDLTF-----------RRFSTIKFDYEVVHVSVMDP 666

Query: 670 YVLLGMSDGSIRLL 683
           YVL+ +S G I++ 
Sbjct: 667 YVLITLSRGDIKVF 680


>gi|449019486|dbj|BAM82888.1| similar to cleavage and polyadenylation specificity factor subunit
            [Cyanidioschyzon merolae strain 10D]
          Length = 1880

 Score =  120 bits (301), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 88/324 (27%), Positives = 162/324 (50%), Gaps = 12/324 (3%)

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFST----GRNADNPQNLVTEVYS---KELKGAISALAS 1162
            +L +GT +++GED + RGR+L+F      GR     Q  + ++ +    E+KGA+SA+A 
Sbjct: 1546 VLVVGTCFLRGEDTSIRGRLLVFEISRQEGRQHHQHQRTLYQMQTLAATEVKGAVSAVAP 1605

Query: 1163 LQGHLLIAS-GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
            ++G  +  S GP++ ++K    E++ I+FY    L+   +  +K +IL  D+   + FL 
Sbjct: 1606 VKGGFVCCSAGPRLEVYKLIEDEMSCISFYPGINLFFSHVGTLKQYILASDMRYGVSFLF 1665

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN-IQIFYYAPKMSESWKGQ 1280
            W+ +    N L +D    +  A+E+L+ G+  +++ +D   N I++   +P   ES  G 
Sbjct: 1666 WRSRNVSQNFLCRDEAQRELVASEWLMHGTKANVLSADMLGNIIELSIPSPVDPESAGGT 1725

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
            ++   A FHVG+      R+++   S++       S   N   +L GT+DG I  ++PL 
Sbjct: 1726 RMTFEAGFHVGSRPNAVRRVRIDDPSAETPPPNEPSSLWNTHVILLGTVDGMITMVSPLL 1785

Query: 1341 ELTFRRLQ-SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH--RPGPDSIVDCELLSHYEM 1397
                ++L+ + Q  +++       L  RS+R   S   A   R    SI+D ++L  Y  
Sbjct: 1786 RGVAKKLELAAQDLMLEPELRKWCLYARSWRVMRSLTVAAGLRKPKRSILDGDVLQLYGS 1845

Query: 1398 LPLEEQLEIAHQTGTTRSQILSNL 1421
            L    + EIA + G  +  +   +
Sbjct: 1846 LDTPRRKEIARRIGMPQEALFEAI 1869



 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 112/499 (22%), Positives = 194/499 (38%), Gaps = 128/499 (25%)

Query: 243 MKHVK--DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI----STTLKQHP 296
           + HV+  D  F+ G   P MV+L+E   TWAGRV    ++C ++A+ +    +    + P
Sbjct: 345 LGHVRILDCCFLTGTALPTMVMLYEERPTWAGRVEAVSNSCALAAIVLPPLPAGAAGEEP 404

Query: 297 LI-WSAMNLPHDAYKLLAVPS------PIGGVLVVGANTIHYHSQSASCALAL--NNYA- 346
           L+ W    LP DA K++ +PS         G+L++ AN + +   +     +L  N++  
Sbjct: 405 LVAWRIQGLPFDAEKVVPLPSVEWDRAAEQGLLLIAANVLFWIRGNGQIGASLSGNHFGD 464

Query: 347 --VSLDSSQELP---------------RSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
             + LD  Q LP               R+S  +    A    ++     L    G++  L
Sbjct: 465 TFMELDGCQ-LPGALYGGTDSDIISRCRTSQVLHFRGACIAPVRLHRYGLFLADGNVYQL 523

Query: 390 TVVYDGRVVQRLDL------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
            +  D     RL+       S+  P+ L  D   +   L F+ + LG S+L + T     
Sbjct: 524 ALHADAEYPLRLEALRVRGESRLAPAPL--DAKLLSRDLLFVAAHLGSSVLYRMT----- 576

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
                             P  +R R S+++              G+   N  + +  +  
Sbjct: 577 ---------------QVHPHGRRTRTSAAE-------------NGTLHKNATTKEAQWEL 608

Query: 504 AVRDSLVNIGPLKDF---------SYGLRINADA--SATGISKQS------------NYE 540
             RD++  +GP+ D            G  ++     +ATG   QS             ++
Sbjct: 609 QQRDTIFQLGPIVDLVVIPPRYSPPAGTLLDPGEILAATGHQHQSCLARCTYQVQTREWQ 668

Query: 541 LVELPGCKGIWTVY--HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
            +   GC+ +W++Y  H  +  H  +     A+     +   + L+ R    + AD  T 
Sbjct: 669 RIPSAGCRRVWSLYADHDGTGMHQEEQ----AFLLLSLSKSSVILDIRRGFEQAAD--TR 722

Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQDL---SFGPSNSES 652
           V       +   TIAAGNL  RR + QV   G R+LD +      +D+   +  P  + S
Sbjct: 723 V------LLPSPTIAAGNLAQRRLIAQVHRTGIRLLDANLDVVYEEDMLLAALEPGTAVS 776

Query: 653 GSGSENSTVLSVSIADPYV 671
           G+          S+ DPY+
Sbjct: 777 GA----------SVVDPYI 785


>gi|255718033|ref|XP_002555297.1| KLTH0G05984p [Lachancea thermotolerans]
 gi|238936681|emb|CAR24860.1| KLTH0G05984p [Lachancea thermotolerans CBS 6340]
          Length = 1307

 Score =  120 bits (300), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 148/320 (46%), Gaps = 21/320 (6%)

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKG 1155
            N+ T+     L +G  +V+ ED+   G   L+        P    T     +++ +E +G
Sbjct: 978  NSRTRRKREYLVVGNTFVRDEDIGTMGSFCLYDITEVVPEPGKPDTNYKLKQIFYEEFRG 1037

Query: 1156 AISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            A+S++  + G  LI+   K+++        +  +AF D P ++V       N +++GD  
Sbjct: 1038 AVSSVCEISGRFLISQSQKVLVRDVQEDNSVVPVAFLDVP-VFVTDSKSCGNLLIIGDAM 1096

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +   F+ +  +  ++  L K     +  + EFL++  ++  +VSD    + I  YAP   
Sbjct: 1097 QGFQFVGFDAEPYRMIPLGKSVSKFEVMSLEFLVNNGSIYFLVSDRSNILHILKYAPDEP 1156

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             S  GQKL+    F++  H T      +L      T   P +     F  +    DGS+ 
Sbjct: 1157 NSLSGQKLVHCTSFNL--HSTNTCMKLLLKNDEFPTLGEPPA-----FQAIGAQTDGSLF 1209

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCEL 1391
             + PL E ++RRL  +Q++L++   H+ GLNP+  R    F+  G   RP    ++D  +
Sbjct: 1210 NVVPLSESSYRRLYMVQQQLIEKDVHLCGLNPKMERLQNDFYQLGHLMRP----MLDFTV 1265

Query: 1392 LSHYEMLPLEEQLEIAHQTG 1411
            +  +  LPL ++ +IA + G
Sbjct: 1266 IKSFATLPLNKRKQIAAKAG 1285



 Score = 84.0 bits (206), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 127/651 (19%), Positives = 262/651 (40%), Gaps = 131/651 (20%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI-HGLRITSM 159
           L L+  ++LHG +  +A++ Q         D ++++   AK+S++ FD S+   L   S+
Sbjct: 47  LVLLHEFKLHGQITGMALVPQMEGP----LDCLVVSTGKAKLSLVRFDPSMPMCLETLSL 102

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG---LQMIILKASQGGSGLVGD 216
           H +E+      ++     A+   +++DP+ RC  VL++    L ++ L  ++       D
Sbjct: 103 HYYEAE---FTRKNLIELAKTSKLRLDPERRC--VLLFNSDVLALLPLNINEEDE----D 153

Query: 217 EDTFGSGGGFSARIES---------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
           ++   +      ++E+         S V+++ DL  ++K+V D  F++ + +P + +L++
Sbjct: 154 DNQEPTHQAKKRKVENGDARRLAKQSSVLHVSDLSAELKNVVDIQFLNSFSQPTLAVLYQ 213

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
             L W+G         M   ++I+   K++  I+    LPHD + ++ + +     ++VG
Sbjct: 214 PRLAWSGNDKVAGKGSM-RLMAITPHEKKNTTIYQVKELPHDVHTIIPLAN---SCVLVG 269

Query: 326 ANTIHY--HSQSASCALALNNYAVSLDSSQELPRSSFSVELD-----AAHATWLQNDVAL 378
            N I    ++ +    + LN+++     S+++  SS  V        A+       ++ +
Sbjct: 270 VNEIVSVDNTGAIQSTIQLNSFSPKFTGSKQIDNSSLEVMFTEPIVWASAMVSKDREILI 329

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV--------LTSDITTIGNSL------FF 424
           L     D+  +T+  +GR++    L +  P V        L + I  +   +      FF
Sbjct: 330 LMDHKADMYSITLQSEGRLLIDFTLVRL-PIVNDIFKDQNLPTCIVALSGGIRLKTCQFF 388

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +G   GD+++V+          S+ L+  F     +A           DAL     G++ 
Sbjct: 389 IGFSSGDAVVVK----------SNNLRSAFESQYREAIELPNDEDEDYDALY----GDDE 434

Query: 485 SLYGSASNNTESAQKTFSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYEL- 541
            L    ++N  + +    F +   DSL+N+GP+     G   + +A+  G+   +  EL 
Sbjct: 435 DLARPVNDNKATVETAVPFEIELMDSLINVGPITSICTGRVSSINATIEGLPNPNRNELA 494

Query: 542 ---------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
                                      ++      IW +  +    +   +   A   D 
Sbjct: 495 IVSTSGHDSGTYLNVMEPSVRPLVQQALKFTSVTKIWNLKIRKKDKYLVTTDSGAEKSDV 554

Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           Y       + A+   ++       VT          T+    L G +R++QV  +   + 
Sbjct: 555 YE------IGAKIASIKPKHFKRNVT----------TVEIAILGGGKRIVQVTTKAVYLF 598

Query: 635 DGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
           +  +   MT    F               V+ VSI DP++LL  S G I++
Sbjct: 599 NLGFKKLMTISFDF--------------EVVHVSILDPFILLTNSKGEIKI 635


>gi|313215162|emb|CBY42850.1| unnamed protein product [Oikopleura dioica]
          Length = 228

 Score =  119 bits (299), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 65/220 (29%), Positives = 115/220 (52%), Gaps = 2/220 (0%)

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
            KN+ L+GDI + I  L  + +   ++ +++   + +  A   L+DG+ + LV +D Q+N+
Sbjct: 4    KNYALVGDIQQGITLLRHQGERNCISQISRARRAGEVTAVGILLDGNQVGLVSTDMQRNL 63

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
            Q++ Y P   ES  G++L+ +A+ ++G  V        L   +D       ++   R   
Sbjct: 64   QVYMYKPDQKESNGGKQLVRQADINLGKRVISIW--NSLGRQNDTFTKVALTENDARHVT 121

Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
             +  LDGSIG I P+ E  FRRL+ LQ  +   +PH  GLNPR +R   +  +       
Sbjct: 122  FYAGLDGSIGDIVPVSEKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAK 181

Query: 1385 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +I+D +LL  +  L   EQ +++ + G TR  +L ++ D+
Sbjct: 182  NIIDGDLLERFNGLSFTEQTDLSRKIGVTREALLDDMMDV 221


>gi|389602597|ref|XP_001567507.2| cleavage and polyadenylation specificity factor-like protein
            [Leishmania braziliensis MHOM/BR/75/M2904]
 gi|322505515|emb|CAM42945.2| cleavage and polyadenylation specificity factor-like protein
            [Leishmania braziliensis MHOM/BR/75/M2904]
          Length = 1536

 Score =  119 bits (297), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 122/528 (23%), Positives = 236/528 (44%), Gaps = 65/528 (12%)

Query: 912  QRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRVHPQLCDGSIVAFTVLHNVNCNH 967
            +R+  F  + G H+G ++ G  P + +V+     +L         ++  F   H+ + + 
Sbjct: 1003 ERLVPFCALQGRHKGIYVCGQTPVF-LVYHYATNQLVCTRHHATSAVRGFAPFHSRHVHG 1061

Query: 968  GFIYVTSQGILKICQL-PSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVS---- 1021
            GF+Y   +G +    + P G    +  W ++++ L  TPHQ+ Y    +   ++VS    
Sbjct: 1062 GFVYC-GEGFVHFATMQPFGELLGSSGWWLERVRLGCTPHQVIYSPAAHGCFVVVSRPQP 1120

Query: 1022 -VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSVDLHRTYTVEEYEVRILEPDRAG 1074
              P   P +  L ++ D+E   V H ++  +L   S+       T   YEV++     + 
Sbjct: 1121 FSPKRAPFDVQLRMVEDEEGNRVPHVVEPVSLPPLSATSGSPVPTNGRYEVQLF----ST 1176

Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKE------NETLLAIGTAYVQGEDVAARGR 1128
              WQ    + +  +E  L+  ++ +   TT +         + A+ TAY  GEDV  RGR
Sbjct: 1177 LDWQRVDCLALDVNEKVLSATLMQVSRDTTMDVAYRSATAPVCALATAYPLGEDVTTRGR 1236

Query: 1129 VLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGT---- 1183
            VLL +T +        +  ++ + +KG ++A+  + +  + +A G  + ++++  +    
Sbjct: 1237 VLLLATSQQGGQGMQKLRILHEEPMKGPVTAITRIDEDCIAVAVGGTVRVYRYDASKGVM 1296

Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
            E   I +  A   YV  L  +++++++GD+  S+ F  + E+   + +L +D  ++   +
Sbjct: 1297 ETTAILYAGA---YVTCLQALRDYLVIGDLFHSVLFARYSEEIHTITILGRDTNAISVVS 1353

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQKLLSR-----AEFHV-GAHVTK 1296
            ++ L   +   L+V+D+ +N+    Y P+ + E  K  K+L        E+ + G  + K
Sbjct: 1354 SDMLYHDTRFGLLVADDARNLMCMSYKPRLLEEPGKPPKVLESLLSVTGEYRLAGGVLLK 1413

Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
             +RL+  A  S                 ++ T  G IG + PL + T R  Q + ++L  
Sbjct: 1414 MMRLRASAARSSSV-------------AIYVTNMGEIGYLVPLGDQTSRTGQWVVRRLQS 1460

Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
             V H  GL PR F  F  +         S+   E + H+   PL EQL
Sbjct: 1461 EVAHAGGLPPRMFLGFPQDDPLR-----SLKGDEWMLHF---PLLEQL 1500



 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 68/138 (49%), Gaps = 20/138 (14%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSIST 290
           +++++D  FV    EP++  L E++ TWAGRV    W+         TC I    ++++ 
Sbjct: 298 LRNIRDVQFVASAGEPLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLAN 357

Query: 291 TLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSASCALALNNY 345
           T   H L  S ++ LP+DA  +  +P+    P   VL V  N + + S  +   + +N  
Sbjct: 358 TAAPHMLSLSEVDGLPYDATSMTPLPAFQDVP-SAVLCVSRNMMVHVSTKSGYGVYVN-- 414

Query: 346 AVSLDSSQELPRSSFSVE 363
           A+  + ++ L  S+ S E
Sbjct: 415 AMGEEQARSLKSSAVSCE 432


>gi|407410979|gb|EKF33219.1| cleavage and polyadenylation specificity factor, putative
            [Trypanosoma cruzi marinkellei]
          Length = 1436

 Score =  119 bits (297), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 127/543 (23%), Positives = 229/543 (42%), Gaps = 67/543 (12%)

Query: 912  QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
            +RI  F +I G+ G ++ G  P    W    RE L  +     G +  F     +N    
Sbjct: 910  RRIVPFDSIGGNAGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968

Query: 965  -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV- 1022
             C  GF+   S      C+ P+G      W  ++I L  TPH + Y        ++ S  
Sbjct: 969  YCCEGFVDFASMDTY--CR-PTGQG----WLTRRIHLGVTPHFVVYHPPARSCFVVTSKK 1021

Query: 1023 ----PVLKPLNQVLSLLIDQEVG--HQIDNH----NLSSVDLH---RTYTVEEYEVRILE 1069
                P   P +  L+++ D+E G    I       N+  +  +   R    + +E+ ++ 
Sbjct: 1022 EPFRPQRAPFDVQLNIVYDEESGGVQSITTEAPVCNMPPIPPNAGIRVPMADRFEICLM- 1080

Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE---NETLLAIGTAYVQGEDVAAR 1126
               +   W    T+ ++ +E  L  +++ +      E      +  + TA+  GED+ +R
Sbjct: 1081 ---STTDWACTDTLLLEENERVLGAQMMEIHCEKDAEGLHTAPVCVVSTAFPLGEDITSR 1137

Query: 1127 GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK--WTGTE 1184
            GR+LL ST       + L+   +S+ L G  +A+  ++ H+ +A G  I L +  W   +
Sbjct: 1138 GRILLLSTMCTKKKRKILL--FHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWENRK 1195

Query: 1185 LN-GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
            L  G   Y     YV  ++  +N+++ GD+ +S     + E+   L++L KD  ++    
Sbjct: 1196 LVVGALLYAG--TYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDRNAVSVVH 1253

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG--QKLLSR-----AEFHV-GAHVT 1295
             + +       L+ SD+++N+ +  Y P++ E+  G   K+L        E+ + G  + 
Sbjct: 1254 CDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEYRLSGGCLV 1313

Query: 1296 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1355
            K LR + LA +S  T              L+ T  G IG I P+ E   R    L ++L 
Sbjct: 1314 KSLRFRSLAGNSSVT--------------LYVTNYGEIGFIVPIGEQANRTASWLMRRLQ 1359

Query: 1356 DSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
              +PH AGL PR F      + +      + +V   LL+ +  L +  +  IA    T  
Sbjct: 1360 MDLPHNAGLTPRMFLGLSQGSPRTALRAKEMLVSASLLNEFFFLDIHSRKTIASAAYTQL 1419

Query: 1415 SQI 1417
             ++
Sbjct: 1420 ERV 1422



 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 63/260 (24%), Positives = 105/260 (40%), Gaps = 54/260 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
           NN  +                   D   E   + F V L  A  T     + N+   + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVSWYDPKMETSTALFKVNLSLASCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S IT IG+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQDIRIAILGTGCYCSGITRIGDQIVFLGSAFGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIE 458
           C +   M  S   + F  IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499


>gi|401426989|ref|XP_003877978.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322494225|emb|CBZ29522.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 1542

 Score =  117 bits (293), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 112/495 (22%), Positives = 220/495 (44%), Gaps = 55/495 (11%)

Query: 912  QRITIFKNISG-HQGFFLSGSRPCWCM--VFRERLRVHPQLCDGSIVAFTVLHNVNCNHG 968
            +R+  F+ +   H+G ++ G  P + +      +L         ++  F   H+ + + G
Sbjct: 1009 ERLVPFRGLQDRHKGMYVCGQTPVFLVYHAATNQLVCTRHHATNAVRGFAPFHSRHVHGG 1068

Query: 969  FIYVTSQGILKICQL-PSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVS----- 1021
            F+Y   +G +    + P G    +  W ++++ L  TPHQI Y    +   ++ S     
Sbjct: 1069 FVYC-GEGFVHFATMQPFGELLGSSGWWLERVRLGCTPHQIIYSPAAHGCFVVASRPQPF 1127

Query: 1022 VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSVDLHRTYTVEEYEVRILEPDRAGG 1075
             P   P +  L ++ D+E   V H I+  +L   S+       T E YEV+      +  
Sbjct: 1128 SPKRAPFDVQLRMVEDEEGNRVPHVIEAVSLPPLSAASGSPVPTNERYEVQFF----STL 1183

Query: 1076 PWQTRATIPMQSSENALTVRVVTLFNTTTKE------NETLLAIGTAYVQGEDVAARGRV 1129
             WQ    + + ++E  L+  ++ +   TT +         + A+ TAY  GEDV  RGR+
Sbjct: 1184 DWQCMGRLVLDANEKVLSATLMQVTRDTTMDAANRSTTAPVCALATAYPLGEDVTTRGRI 1243

Query: 1130 LLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS-GPKIILHKW----TGTE 1184
            LL +T +   +    +  ++ + +KG ++A+  +    + A+ G  + ++++    +  E
Sbjct: 1244 LLLTTTQQGGHGMQHLRTLHEEPMKGPVTAITRVGEDCVAAAVGGTVRVYRYDTYKSTME 1303

Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
               I +  A   YV  L   ++++++GD+  S+ F  + E+   + +L +D  ++   + 
Sbjct: 1304 TMAILYAGA---YVTCLQAFRDYLVIGDLFNSVLFARYSEEIHTITILGRDTNAISVVSN 1360

Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQKLLSR-----AEFHV-GAHVTKF 1297
            + L   +   L+V+D+ +N+    Y P+ + E  K  K+L        E+ + G  + K 
Sbjct: 1361 DMLYHDTRFGLLVTDDARNLMCMSYKPRVLEEPGKPPKVLESLLTVTGEYRLAGGVLLKM 1420

Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
            +RL+  +  S                 ++ T  G IG + PL + T R  Q + ++L   
Sbjct: 1421 MRLRAASAHSSS-------------VAIYVTNMGEIGYLVPLGDQTSRTGQWVVRRLQSE 1467

Query: 1358 VPHVAGLNPRSFRQF 1372
            V H  GL PR F  F
Sbjct: 1468 VAHAGGLPPRMFLGF 1482



 Score = 40.4 bits (93), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 84/185 (45%), Gaps = 37/185 (20%)

Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
           GGG S  +    V + R  D+K    +++D  FV    EP++  L E++ TWAGRV    
Sbjct: 283 GGGTSLLLRIGTVTHWRLQDVKTALRNIRDIQFVESAGEPLLAFLFEKQPTWAGRVKLLE 342

Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLA------VPSPIG 319
           W+         TC I    ++++ +   H L  S ++ LP+D   +        VPS + 
Sbjct: 343 WRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGLPYDVTSMTPLTAFQDVPSAVF 402

Query: 320 GV---LVVGANT-----IHYHSQSASCALALNNYAVSLD------SSQELPRSSFSVELD 365
            V   ++V  +T     ++ ++     A +L + AVS +      +SQ L      V L+
Sbjct: 403 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSFEAVQWRSASQALSTDLVKVNLN 462

Query: 366 AAHAT 370
            ++AT
Sbjct: 463 FSNAT 467


>gi|388856288|emb|CCF50097.1| related to cleavage and polyadenylation specificity factor, 160 kDa
            subunit [Ustilago hordei]
          Length = 1568

 Score =  117 bits (293), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 97/312 (31%), Positives = 148/312 (47%), Gaps = 29/312 (9%)

Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFS-----TGRNADNPQNLVTEVYSKELKGA-ISALA 1161
            +  +A+GT    GED   +G V LF      + R     ++L  ++  ++   A ++ALA
Sbjct: 1153 KQFIAVGTTTYHGEDRTCKGSVYLFEIIQVVSSRRFQVGRDLRLKLICRDGSNAPVTALA 1212

Query: 1162 SLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
             L G LL  SG K+ +      E L  +AF D P  Y+ S+ +VKNF+LL D  K ++FL
Sbjct: 1213 ELHGFLLSTSGQKLYVRALEKEEWLISVAFLDCP-FYITSIRVVKNFVLLSDAKKGLWFL 1271

Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS------------DEQKNIQIFY 1268
            +++E   +   L            EFL+    LSLV +             +   I+++ 
Sbjct: 1272 AFQEDPYRFVDLGSALDGHCANLGEFLVYNDKLSLVSTSGVALGGFSGFGQDSGVIRLYE 1331

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRFALLF 1326
            Y P    S  GQ+LL R E+   +  T  L    + L+ S  R     G ++  R  LL 
Sbjct: 1332 YNPSSPTSLGGQRLLLRTEYSTPSSTTCSLSAPGRWLSDSELR-----GREQL-RNKLLL 1385

Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1386
               +GS+  +A ++E   +RL  LQ +LV SV H A LNPR+FRQ   N    RP    +
Sbjct: 1386 SKSNGSLDSLASVEEKVAKRLHLLQGQLVRSVLHTAALNPRAFRQVR-NDFVSRPLYKGV 1444

Query: 1387 VDCELLSHYEML 1398
            +D  LL  ++ L
Sbjct: 1445 LDARLLDAFKGL 1456



 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 91/358 (25%), Positives = 164/358 (45%), Gaps = 61/358 (17%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L+  + L G V  L  + Q  + +   RD ++++F DAK+++LE++ +   L   S+H
Sbjct: 84  LVLIRKHSLFGTVTGLQRI-QTLSTSKDSRDRLLVSFTDAKLALLEWNHTTDDLETVSIH 142

Query: 161 CFE-SPEWL----HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS---- 211
            +E +P+ L    HL +        P + +DP  RC  +L+    + IL   +  +    
Sbjct: 143 TYERAPQLLNGIPHLFQ--------PNLNIDPLSRCAALLLPHDALAILPFYRDAAEFEF 194

Query: 212 --GLVGDEDTFGSGGGFSA-----RIES-----SHVINLRDLD--MKHVKDFIFVHGYIE 257
             GL  D +   +G   +A     +IES     S V+ +R++D  ++++KDF F+ G+ +
Sbjct: 195 DHGLHLDLNLDFAGEDKAAMQAAVQIESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQK 254

Query: 258 PVMVILHERELTWAGRVSWKHHT----------------CMISALSIS----TTLKQHPL 297
           P + +L     T  G ++ +                    M+ + S S    T    HP+
Sbjct: 255 PTVALLFAHSPTCTGLLAERKDNFSVYLFTLDLAASLDGAMLGSASYSFDDATLRSMHPV 314

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY-----AVSLDS 351
           + ++ +LP+D   +L  P  +GGVLVV  ++I +  QS    A ALN +     A+  +S
Sbjct: 315 LTTSSSLPYDCLYMLPCPQTLGGVLVVCMSSILHVDQSGRVVATALNGWFNLVSAIQPES 374

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             +LP  +   +L  +   +      +L+   GD+   T   DGR +Q   L +   S
Sbjct: 375 LLDLPEIA---DLQGSQLVFTAETEGVLTLVHGDVYTFTCQMDGRNIQGFRLERMQQS 429


>gi|254564833|ref|XP_002489527.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
            [Komagataella pastoris GS115]
 gi|238029323|emb|CAY67246.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
            [Komagataella pastoris GS115]
 gi|328349950|emb|CCA36350.1| Protein cft1 [Komagataella pastoris CBS 7435]
          Length = 1388

 Score =  117 bits (292), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 121/561 (21%), Positives = 242/561 (43%), Gaps = 62/561 (11%)

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF---FLSGSRPCWCMV-FRERLR 944
            N+  +  P +AY     P G   +R  I  N  G       F+ G +  W        + 
Sbjct: 862  NITITGAPDNAY-----PQGTKLERRLIKLNNIGDSKLSTLFVVGVKSFWITKRHSSSIN 916

Query: 945  VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATP 1004
            +H Q    S ++    +   C +G + + +    ++ ++PS        P++++P+  T 
Sbjct: 917  IH-QFTKLSTISCARFNTSRCKNGLMIIDTNKAARMVEIPSNLELSQRLPIRRVPVGCTI 975

Query: 1005 HQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLHRTYTV 1060
              + +   K     +VS     P N      +D+E    VG  +DN       +++  + 
Sbjct: 976  KCVAF--HKASRTFVVSTVEETPYN-----CVDEEGNPIVG--VDN------TINKPASS 1020

Query: 1061 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT----KENETLLAIGTA 1116
             +  ++++ P      W    +  ++     ++++ +TL NT+     K  +  L +G +
Sbjct: 1021 FKSSIKLISP----ISWTVIDSFDLEDEHVCMSLKSMTL-NTSRIPMFKNLKEYLVLGIS 1075

Query: 1117 YVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              + ED+A+ G++ +          G+   N +    +++    KGA+++++ + G  +I
Sbjct: 1076 NYRMEDLASNGQIRIVDVVDIIPEPGKPETNHK--FKDIFQDATKGAVTSVSDISGRFVI 1133

Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
              G KII+   +   T L  + F D P  YV      +N +L+GD   S+  + +  +  
Sbjct: 1134 GQGQKIIVRDLQEDNTAL-PVGFVDTP-FYVSETKSFQNLLLVGDSMHSVILVGFDAEPY 1191

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
            ++  L KD   +D  A +F++    L ++++DE   + +  Y P+   S +GQ+LL R+ 
Sbjct: 1192 RMISLGKDVAHVDVCAADFVVFEGNLFIIIADEDGMLHLIQYDPEDPASMQGQRLLRRSI 1251

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN---RFALLFGTLDGSIGCIAPLDELTF 1344
            F    + T      M           P +  TN    F ++    DGS   + P+ E T+
Sbjct: 1252 FKTNQYTT-----CMKMRERKYVIKPPKNQFTNFSEAFEVVAANSDGSFYKVTPISEATY 1306

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
            RRL  +Q+++ D   H  GLNPR  R  + + +   P    I+D + +  +      ++ 
Sbjct: 1307 RRLYVIQQQIFDQENHKCGLNPRENR--YLSDQYSIPNQRLILDFDNIRRFLEFDEIKKR 1364

Query: 1405 EIAHQTG-TTRSQILSNLNDL 1424
            ++ H+ G  T S+   +L +L
Sbjct: 1365 DLVHKLGRNTYSEFYRDLLNL 1385



 Score = 90.1 bits (222), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 105/444 (23%), Positives = 177/444 (39%), Gaps = 51/444 (11%)

Query: 93  MDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
           +D      L LV  Y+L G V  L  +           D +++A +  K S++++D S +
Sbjct: 80  IDFSQNVKLSLVAEYKLDGLVTDLCKIR---TIEDSHHDYVLVATKGVKFSMIKWDQSSN 136

Query: 153 GLRITSMHCFESPEWLHLKRGRES-----FARGPLVKVDPQGRCGGVLVYGLQMIILKAS 207
            +   S+H        H K+  E+     F     +  DP   C  +L   + +  L   
Sbjct: 137 SISTVSLH--------HYKKIVENSLIDKFNVDTKLIADPNNHCSCLLANEI-LFFLPFL 187

Query: 208 QGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
           Q       DE+  G          ++ +    DL  ++K + D  F+HGY EP + +L+ 
Sbjct: 188 QHEV----DEELDGKFVENKKLYSNTFLQFSNDLQPNIKTIIDIEFLHGYSEPTLAVLYT 243

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
              T  G +     T  +   S++   K    I    NLP+D  ++L + SP+ G L++G
Sbjct: 244 SFPTCTGALPKAKDTVSLQVFSLNLQNKASTSIIEVNNLPYDTDRILPLSSPLNGCLLIG 303

Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTG 384
           AN  IH +S   +  ++ N +A    + +   +S+  + L+      + ND  +L T+ G
Sbjct: 304 ANQIIHLNSMGTAKGISCNLFAAKCSNFKLSDQSNLDLRLEKCVLGQVYNDKVILITEKG 363

Query: 385 DLVLLTVVYDGRV-----VQRLDLSKTNPSVLT--SDITTIGNSLFFLGSRLGDSLLVQF 437
                +    G V     +Q++   K    VL+  +  T I    FF+G +  DS+L   
Sbjct: 364 AFYAFSFDIVGGVSSINEIQKIAAEKYQGLVLSLPTMFTNIDGKTFFIGCQGSDSVLF-- 421

Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
                      G K        D     ++  +  DAL       E  LY     N    
Sbjct: 422 -----------GSKARLNTQNVDVNGKSKV-ITEEDALY------EEDLYADDIQNVAQG 463

Query: 498 QKTFSFAVRDSLVNIGPLKDFSYG 521
                F   DSL+NIGP+ +F+ G
Sbjct: 464 IDHIDFVKLDSLLNIGPITNFTTG 487


>gi|261335516|emb|CBH18510.1| cleavage and polyadenylation specificity factor-like protein,
            putative [Trypanosoma brucei gambiense DAL972]
          Length = 1452

 Score =  116 bits (291), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 134/573 (23%), Positives = 250/573 (43%), Gaps = 63/573 (10%)

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
            + ++ A ++R L+  RT ++  T +   H +  +RI  F  ++G  G ++ G  P + M 
Sbjct: 895  IESIEAKKMR-LQSERTMIENDT-QSVRHCS--RRIIPFAAVAGQSGAYVCGQHPLFLM- 949

Query: 939  FRERLR---VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS-GSTYDNYWP 994
            +  R R    +     G +  F    +++   GFIY   +G +    + +  S   N W 
Sbjct: 950  WDNRTRQLVAYRHQAPGPVRGFVPFTSMS--GGFIYCC-EGFVDFAVMNTYCSPGGNGWL 1006

Query: 995  VQKIPLKATPHQITYFAEKNLYPLIVSVPV-LKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
             ++I + ATPH I Y        ++ S  V  +P  Q  S  +  ++ +  D++ + SV 
Sbjct: 1007 RRRIHIGATPHFIVYDPPGRSCFVVTSKKVPFRP--QRASFDVQLKIQYDEDSNTVQSVT 1064

Query: 1054 LH---------------RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
                             R    E +EVR+    + G  W     + +  +E  L  ++V 
Sbjct: 1065 TEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFKKG--WDCTDKLMLDENEKVLGAQMVE 1122

Query: 1099 LF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
            +    N        +  + TA+  GEDV  RGR++L ++ RN    +++V +++S+ L G
Sbjct: 1123 IHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIILLAS-RNIKGRRSIV-QLHSEPLNG 1180

Query: 1156 AISALASLQGHLLIASGP--KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
              +A+A +   + +A G   KI  + W   +L   AF  A  +Y   L++ +N+I+ GD+
Sbjct: 1181 PATAVAGICSQIAVAVGGTIKIFRYDWETKKLVVSAFLYAG-MYATRLSVFRNYIIYGDL 1239

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
             +S     + E+   L +L +D  ++     + +       ++ SD+++N+ I  Y P++
Sbjct: 1240 CRSCSMARFNEENHTLTVLGRDRSAVSVVHCDMMYHDRAFGILCSDDERNVLIMGYTPRV 1299

Query: 1274 SESWKG------QKLLS-RAEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
             E+  G      + +LS   E+ +    + K LR +  A +S  T              L
Sbjct: 1300 QETDAGTHPKVLESVLSLDGEYRLPSGSLVKSLRFRSTAGNSSVT--------------L 1345

Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH-SNGKAHRPGPD 1384
            + +  G IG I P+ E   R    + ++L   +P  AGL PR F   + S+ +    G +
Sbjct: 1346 YVSNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGLTPRMFLSLNQSSPRNSLRGKE 1405

Query: 1385 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
             +V   LL     L L  +  IA    T   ++
Sbjct: 1406 MLVPAPLLRGLFSLDLRSRKAIARAAYTQLDRV 1438



 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
           +++V+D  F+    EP++ IL ER+ TWAGRV    W+      + LS       IS T 
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
              P   L+     +P++   +L V S    + GV+  G NTI +               
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373

Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
                 S+S++ +    N+    L+SS  L R + S+    A     ++D        +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
              G +++L  +  G  +  + ++        S IT I   L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487


>gi|443894082|dbj|GAC71432.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
            [Pseudozyma antarctica T-34]
          Length = 1543

 Score =  115 bits (289), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 92/316 (29%), Positives = 148/316 (46%), Gaps = 28/316 (8%)

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK-----ELKGAISALASLQ 1164
             +A GT+   GED  ++G V LF           L  ++  K     + +  ++A+A L 
Sbjct: 1169 FIAAGTSTFHGEDRTSKGSVYLFEVIEVVSGKYQLGRDLRLKLVCRDDARAPVTAIAELN 1228

Query: 1165 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
            G LL   G K+ +      E L  +AF D P  Y+ SL ++KNF+L+ D  KS+  L+++
Sbjct: 1229 GFLLSTCGQKLYVRALEKEEWLISVAFLDGP-FYMTSLRVLKNFVLVSDAKKSLCLLAFQ 1287

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE------------QKNIQIFYYAP 1271
            E+  +   L ++    +    +FL+    LSLV + +               I+++ YAP
Sbjct: 1288 EEPYRFVDLGREINDHNASMAQFLVYNDRLSLVSTSDVPLGGISGFGASAGVIRLYEYAP 1347

Query: 1272 KMSESWKGQKLLSRAEFHVGAHV--TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
             ++ +  G +LL R+EF   A    +   R + L+ S  R G   G  K     L+    
Sbjct: 1348 HVATTLGGHRLLLRSEFQTPAAAVGSTVCRGRWLSDSELR-GREEGRSK-----LVLAKA 1401

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
            +G++  ++ LD+   +RL  LQ +LV SV H A LNPR+FR    N    R     I+D 
Sbjct: 1402 NGALDSLSALDDKVAKRLHLLQGQLVRSVQHTAALNPRAFRAVR-NDFVPRSLAKGILDA 1460

Query: 1390 ELLSHYEMLPLEEQLE 1405
             LL  +  L   + LE
Sbjct: 1461 RLLDRFVWLSRPKMLE 1476



 Score = 94.7 bits (234), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 101/406 (24%), Positives = 170/406 (41%), Gaps = 58/406 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            LV    +++ IY V  +      +  S  T      D     +L +   + L G V  L
Sbjct: 46  QLVTARDDLLTIYDVYDRSSSQSAASTSNGTANGTAGDAKPRHTLIVTRRHSLFGTVTGL 105

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +    +D   R   ++++F DAK+++LE++D+   L   S+H +E      L  G   
Sbjct: 106 QRVDTLASDKDARH-RLLVSFADAKLALLEWNDTTDDLETVSIHTYERAT--QLLNGTPP 162

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS--------- 227
             R P + VDP  RC  +L+    + IL   +  +     E  F  G GF          
Sbjct: 163 LFR-PNLNVDPLSRCAALLLPHDALAILPFYRDNA-----EFDFDDGLGFDLANDALDAS 216

Query: 228 --------ARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
                   A +ES     S V+ +R++D  ++++KDF F+ G+ +P + +L +   TW G
Sbjct: 217 DAAAMAAAAHMESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFDHSPTWTG 276

Query: 273 RVSWKHHTCMIS--ALSISTTL------------------KQHPLIWSAMNLPHDAYKLL 312
            ++ +  +  +    L +S +L                    HP++ ++  LP+D   +L
Sbjct: 277 LLTHRKDSFAVYLFTLDLSASLDGATLGSAAALLDDGNMRSAHPVVTTSSQLPYDCLYML 336

Query: 313 AVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV----ELDAAH 368
             P  +GGVLVV  + I +  QS    +   N      S+ E P S   V    +L A+ 
Sbjct: 337 PCPQSLGGVLVVCMSAILHVDQSGRVVVTALNRWFKTTSAIE-PESVLDVPGLADLQASQ 395

Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
             +  +  A+LS   GDL  L    DGR V+   L + +     SD
Sbjct: 396 LVFTTDTDAVLSLSNGDLYRLRCHMDGRSVEGFRLERIDQLTAGSD 441


>gi|343425828|emb|CBQ69361.1| related to cleavage and polyadenylation specificity factor, 160 kDa
            subunit [Sporisorium reilianum SRZ2]
          Length = 1567

 Score =  115 bits (288), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 138/561 (24%), Positives = 226/561 (40%), Gaps = 57/561 (10%)

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP--CQRITIFKNISGHQGFF 927
            P   +RSL    ++ +  R       P +  TR E  HGA   C  + +       Q   
Sbjct: 965  PAGGARSLDADGIATTGYRAGAVKLEPFEIGTRAEK-HGADGDCNALAVLGGGREAQASV 1023

Query: 928  LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG-FIYVTSQGILKICQLPSG 986
            L     CW      RL   P+   G +     ++    N   F Y    G L + + P G
Sbjct: 1024 L-----CWTEQGGYRLLDWPE---GDLCCIASIYTPRANDADFAYCDRAGQLWLARAPHG 1075

Query: 987  STYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1046
               +  W    +    T  + T     +    +V+  +     Q    ++  E G  I +
Sbjct: 1076 LYAETSWMSSVV---RTGREYTRVVAHDATHTVVAASI-----QPCRFVLFDEDGEPIAD 1127

Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTK 1105
                      T      E+ I E DR             +++E    + +VTL   +T  
Sbjct: 1128 PGADEALPSTTAQRGALELFISE-DRTTAA----DGYEFEANETVTALEIVTLDAPSTAS 1182

Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLF------STGRNADNPQNLVTEVYSKELKGAISA 1159
              +  +A GT    GED  A+G V LF      ++ R        +  V   + +G ++A
Sbjct: 1183 GRKQFVAAGTTTFHGEDRTAKGCVYLFEVIEVVASARYQVGRDLRLKLVCRDDSRGPVTA 1242

Query: 1160 LASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            +A L G L+   G K+ +      E L  IAF D P LYV  + +VKNF+LL D  KS++
Sbjct: 1243 IAQLNGFLVSTCGQKLYVRALEKEEWLISIAFLDCP-LYVTGIRVVKNFVLLSDARKSLW 1301

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD------------EQKNIQI 1266
             L+++E+  +   L +D         ++L+    L+LV +             +   +++
Sbjct: 1302 LLAFQEEPYRFVDLGRDIHDHHATLGQYLVYNERLALVSTSGAALGGSTAFGRDAGVVRL 1361

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL--RLQMLATSSDRTGAAPGSDKTNRFAL 1324
            + YAP ++ +    +L+ R EF   +  T  +  R + L+ S  R G   G +K     L
Sbjct: 1362 YEYAPHVASA--NTRLVLRTEFQTASPATASVACRGRWLSDSELR-GREHGRNK-----L 1413

Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
            +    +G++  +A  D+   +RL  LQ +LV SV H A LNPR+FR    N    R    
Sbjct: 1414 VLAKANGALETLAAADDRVAKRLHVLQGQLVRSVLHTAALNPRAFRAVR-NDFVSRALGK 1472

Query: 1385 SIVDCELLSHYEMLPLEEQLE 1405
             ++D  LL  +  L   + LE
Sbjct: 1473 GVLDARLLDSFVYLSRPKMLE 1493



 Score = 89.0 bits (219), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 85/349 (24%), Positives = 158/349 (45%), Gaps = 48/349 (13%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  + L G V  L  + Q  A +   RD ++++F+DAK+++LE++D    L   S+H
Sbjct: 92  LVLVRRHTLFGVVTGLQRV-QTLATDKDARDCLLVSFKDAKLALLEWNDLTDDLETVSIH 150

Query: 161 CFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-------KASQGGSG 212
            +E +P+ L+   G  +    P++ VDP  RC  +L+    + +L               
Sbjct: 151 TYERAPQLLN---GTPNLFH-PILNVDPLSRCAALLLPHDALAVLPFYRDAADFDFDLDD 206

Query: 213 LVGDEDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
            +       +    +A +E+     S V+ +R++D  ++++KDF F+ G+ +P + +L  
Sbjct: 207 RLDLAKDDAAAVAAAAEMETLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFS 266

Query: 266 RELTWAGRVSWKHHTCMI-------------------SALSISTTLKQHPLIWSAMNLPH 306
              TW G ++ +  T  +                    AL   T    HP++ ++  LP+
Sbjct: 267 HTPTWTGLLAERKDTFSVYLFTLDLSASLDGTLSSAADALDDGTVRSAHPVVTTSTALPY 326

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSF 360
           D   +++ P  +GGVLVV  +++ +  QS    + ALN +     A+  +S  +LP    
Sbjct: 327 DCLYMVSCPQTLGGVLVVCMSSVLHVDQSGRVVVTALNGWFKTISAIEPESVLDLPEIP- 385

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             +L  +   +      +L+   GDL       DGR V+   L + + S
Sbjct: 386 --DLQGSQLVFTAETAGVLALVDGDLYRFRCQMDGRSVEGFRLERMDQS 432


>gi|74025892|ref|XP_829512.1| cleavage and polyadenylation specificity factor-like protein
            [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
 gi|70834898|gb|EAN80400.1| cleavage and polyadenylation specificity factor-like protein,
            putative [Trypanosoma brucei brucei strain 927/4
            GUTat10.1]
          Length = 1452

 Score =  115 bits (288), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 136/578 (23%), Positives = 247/578 (42%), Gaps = 73/578 (12%)

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
            + ++ A ++R L+  RT ++  T +   H +  +RI  F  ++G  G ++ G  P + M 
Sbjct: 895  IESIEAKKMR-LQSERTMIENDT-QSVRHCS--RRIIPFAAVAGQSGAYVCGQHPLFLM- 949

Query: 939  FRERLRV-------HPQLCDGSIVAFTVLHN--VNCNHGFIYVTSQGILKICQLPSGSTY 989
            +  R R         P L  G  V FT +    + C  GF+      ++     P G   
Sbjct: 950  WDNRTRQLVAYRHQAPGLVRG-FVPFTSMPGGFIYCCEGFV---DFAVMNTYCSPGG--- 1002

Query: 990  DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPV-LKPLNQVLSLLIDQEVGHQIDNHN 1048
             N W  ++I + ATPH I Y        ++ S  V  +P  Q  S  +  ++ +  D++ 
Sbjct: 1003 -NGWLRRRIHIGATPHFIVYDPPGRSCFVVTSKKVPFRP--QRASFDVQLKIQYDEDSNT 1059

Query: 1049 LSSVDLH---------------RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1093
            + SV                  R    E +EVR+    + G  W     + +  +E  L 
Sbjct: 1060 VQSVTTEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFKKG--WDCTDKLMLDENEKVLG 1117

Query: 1094 VRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYS 1150
             ++V +    N        +  + TA+  GEDV  RGR++L ++ RN    +++V +++S
Sbjct: 1118 AQMVEIHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIILLAS-RNIKGRRSIV-QLHS 1175

Query: 1151 KELKGAISALASLQGHLLIASGP--KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
            + L G  +A+A +   + +A G   KI  + W   +L   AF  A  +Y   L++ +N+I
Sbjct: 1176 EPLNGPATAVAGICSQIAVAVGGTIKIFRYDWETKKLVVSAFLYAG-MYATRLSVFRNYI 1234

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            + GD+ +S     + E+   L +L +D  ++     + +       ++ SD+++N+ I  
Sbjct: 1235 IYGDLCRSCSMARFNEENHTLTVLGRDRSAVSVVHCDMMYHDRAFGILCSDDERNVLIMG 1294

Query: 1269 YAPKMSESWKG------QKLLS-RAEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
            Y P++ E+  G      + +LS   E+ +    + K LR +  A +S  T          
Sbjct: 1295 YTPRVQETDAGTHPKVLESVLSLDGEYRLPSGSLVKSLRFRSTAGNSSVT---------- 1344

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG-KAH 1379
                L+ +  G IG I P+ E   R    + ++L   +P  AGL PR F   +    +  
Sbjct: 1345 ----LYVSNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGLTPRMFLSLNQRSPRNS 1400

Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
              G + +V   LL     L L  +  IA    T   ++
Sbjct: 1401 LRGKEMLVPAPLLRGLFSLDLRSRKAIARAAYTQLDRV 1438



 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
           +++V+D  F+    EP++ IL ER+ TWAGRV    W+      + LS       IS T 
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
              P   L+     +P++   +L V S    + GV+  G NTI +               
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373

Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
                 S+S++ +    N+    L+SS  L R + S+    A     ++D        +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
              G +++L  +  G  +  + ++        S IT I   L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487


>gi|366994686|ref|XP_003677107.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
 gi|342302975|emb|CCC70752.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
          Length = 1340

 Score =  115 bits (288), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 81/326 (24%), Positives = 155/326 (47%), Gaps = 23/326 (7%)

Query: 1096 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYS 1150
            ++ L + + K+ E ++A G  ++  ED+ + G   ++        P    T     E++ 
Sbjct: 1003 LIQLDSKSRKKREYIVA-GITFIGTEDLPSTGAFHIYDLTEVIPEPGKPDTNFKLKEIFK 1061

Query: 1151 KELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1209
            ++++G+++++  + G  LI    KI++        +  +AFYD P ++V       NF++
Sbjct: 1062 EDIRGSVNSVCDISGRFLINQSQKIMVRDVQEDNSVVPVAFYDTP-IFVSDAKSFGNFLI 1120

Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
            LGD  +   FL +  +  ++  L +   S +  + EFLI+   ++  ++D +  + +  Y
Sbjct: 1121 LGDSMQGFQFLGFDAEPYRMIPLGRSVSSFETVSVEFLINAGEINFAITDREDILHVLKY 1180

Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-NRFALLFGT 1328
            AP    +  GQKL+  + F++ +  T  L L        R      SDK   +F  + G 
Sbjct: 1181 APDEPNTLSGQKLVHCSSFNLYSSNTCMLMLP-------RNDEFETSDKAPPKFQAIGGQ 1233

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDS 1385
            +DG I  I PL E T+RRL  +Q++++D    + GLNPR  R    F+      RP    
Sbjct: 1234 VDGGIFKIIPLKEDTYRRLYVVQQQIIDKEVQLGGLNPRMERLDNDFYQLTHVMRP---- 1289

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTG 1411
            ++D  ++  +  L +E +   A + G
Sbjct: 1290 MIDFNIIRRFSELSIERRTHFAQKAG 1315



 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 83/386 (21%), Positives = 165/386 (42%), Gaps = 64/386 (16%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  + L+  +  +A++ Q  +  S     +++A   AKIS++ FD   + L   S+H
Sbjct: 48  LNLVEEFNLNAKITDIALIPQEKSPLS----CLVIASGVAKISIVRFDAVTNSLETLSLH 103

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
            +E            + A+   ++VDP  R   +L++    I L     G+         
Sbjct: 104 YYEDKLS---DISLVTLAKTSKLRVDPMNR--ALLLFNNDSIALLPLFSGNHEDEDEDDE 158

Query: 212 ----GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
                +   E T          +  S + ++++L  ++++V D  F++ + +P + +L++
Sbjct: 159 EDDYDVTRGEVTTKRSKKNEKHVGQSKIFHVKELHQELQNVLDIQFLNDFTKPTLAVLYQ 218

Query: 266 RELTWAGRVSWKHH--TCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
            +LTW G         + MI  L++ T    T     +I +  +L  D ++LL +     
Sbjct: 219 PKLTWVGNTELNPQPTSFMIFTLNLRTNELETAFDVVIIATLHDLSWDWFQLLPISR--- 275

Query: 320 GVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSFSVELDA---AHATW--- 371
           G +V+G N + Y   +      + LN++A   D S +  R     EL+       T+   
Sbjct: 276 GCVVMGNNEMAYIDNTGVLQSIIHLNSFA---DKSLQRARIIDETELEVFFNEKVTYFWS 332

Query: 372 -------LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----------TNPS-VLT 412
                  + ++  L+   + +L  + +  +GR++ + DL K           +NP+ V  
Sbjct: 333 ASTDKKNIDDETLLIIDASANLYYVRLEAEGRLLTKFDLIKLPIVNDALKDTSNPTCVAR 392

Query: 413 SDITTIGNSL-FFLGSRLGDSLLVQF 437
            D  +  +S+  F+G   GDSL+V+ 
Sbjct: 393 VDPNSSNSSMDLFIGYLSGDSLVVRL 418


>gi|342186481|emb|CCC95967.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1456

 Score =  114 bits (286), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 123/523 (23%), Positives = 229/523 (43%), Gaps = 60/523 (11%)

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
            + ++ A + R L+  RT ++  T + T H A  +RI  F  ++G  G ++ G  P + + 
Sbjct: 899  IESIEARKCR-LQRERTMIENDT-QSTRHCA--RRIIPFACVAGQSGAYVCGQHPVFLLW 954

Query: 939  FRERLRV--HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-----PSGSTYDN 991
             + + R+  +     G++  F     +    GFIY   +G +   ++     P+G     
Sbjct: 955  DKRKRRIAAYRHQSPGAVRGFVSFPQMA--GGFIYC-CEGFVDFARMNTYCAPNGQG--- 1008

Query: 992  YWPVQKIPLKATPH-----------------QITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
             W  ++I + ATPH                 + T+  ++  + + + +   + LN V S+
Sbjct: 1009 -WLTRRIAIGATPHFLVYDPPGKSCFVVTSEKKTFRPQRAFFDVQLKIHYDEELNTVQSV 1067

Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
              +  V H    +  + V   R   VE++EVR+L     G  W+      ++ +E  L  
Sbjct: 1068 TAEPPVCHMPPINPGAGV---RVPMVEQFEVRLL--STTGEQWECTHKFALEENEKVLGA 1122

Query: 1095 RVVTLFNTTT---KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK 1151
            + V L          +  +  + TA+  GEDV  RGR++L ++       +  + +++S+
Sbjct: 1123 QAVELRQDEAIAGAPSAPVCVLCTAFPLGEDVTCRGRIILLAS--KTVKKKRAIVQLHSE 1180

Query: 1152 ELKGAISALASLQGHLLIASGP--KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1209
             L G  +A+  +   + +A G   KI  + W   +L   AF  A  +Y   L+  +N+I+
Sbjct: 1181 PLNGPATAVTGICSQIAVAVGGTIKIFRYDWETKKLVVSAFLYAG-VYATRLSAFRNYII 1239

Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
             GD+ +S     + EQ   L +L KD  ++     + +    T  ++ S++Q+++ +  Y
Sbjct: 1240 YGDLCRSCAMARFNEQNHTLTVLGKDHNAVSVVHCDMMYHDRTFGILCSNDQRDLLLMGY 1299

Query: 1270 APKMSESWKGQKLLSR---AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLF 1326
             P++ ES  G+   SR   + F +       L    LA S     AA  S  T     ++
Sbjct: 1300 TPRVQES--GEHTPSRVLESPFSLDGEYR--LPSGCLAKSLRFRSAAGNSSVT-----VY 1350

Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
             +  G +G I PL E   R    + ++L   +P  AGL PR F
Sbjct: 1351 ISNYGEVGFIVPLGEQANRTALWITRRLQVDLPCDAGLTPRMF 1393



 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 130/334 (38%), Gaps = 77/334 (23%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
           +++V+D  FV    EP++ +L ER  TWAGRV    W+      + LS       IS  L
Sbjct: 254 LRYVRDLQFVGSSGEPLLGVLCERRPTWAGRVKLVEWRTKAVDTNTLSMQVAWVQISGAL 313

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPSP---IGGVLVVGANTIHYHSQSASCALALNNYA 346
             HP   L+    ++P++   ++ V S      GV+  G NT+ + +      +  N+  
Sbjct: 314 TTHPKLLLVGEVDSVPYNVTHMIPVESSSQTPSGVICFGINTVMHITTKRGYGVYFNSTG 373

Query: 347 V---------------------SLDSSQELPRSSFSVELDAAHATWLQN------DVALL 379
           +                      L+SS  L R +FS  L    AT           +  +
Sbjct: 374 MEECGSNKSSAMSYGKMSWCDAKLESSTALFRVNFS--LANCTATIFSPRSSDSLQILAV 431

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
           S + G + +L  +  G  V  + +S        S +T I ++LFFLGS       V F+C
Sbjct: 432 SEEDGVVAVLEFLSQGANVHDIQISVLASGCYCSSLTPISDNLFFLGS------AVSFSC 485

Query: 440 GSGTSMLSSGLKEEFGDIE--------------------ADAPSTKRLRRSSSDALQDMV 479
            +  +  +SG   +F  +E                    AD  S  R  +S+S  L+D  
Sbjct: 486 IASITPTNSGAIGKFKVVESIEAIGSIRDVDVVDCSNDAADCISGPRGNQSNSSWLEDTP 545

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIG 513
             E       A N T       S A R +++++ 
Sbjct: 546 FAE------LAGNTTLDPMPNLSVAQRRAIMDLA 573


>gi|367001853|ref|XP_003685661.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
 gi|357523960|emb|CCE63227.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
          Length = 1357

 Score =  114 bits (286), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 79/325 (24%), Positives = 155/325 (47%), Gaps = 19/325 (5%)

Query: 1096 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYS 1150
            ++   N+ T   + L+ +G + V  ED+   G   +++T     +P    T     +V+ 
Sbjct: 1018 MIIQLNSKTNFKKELIVVGISNVGTEDLPPTGSFYIYNTNEVVPDPSKPDTNYRFKDVFH 1077

Query: 1151 KELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFIL 1209
            +++KG I+ +  + G  ++    K+++      E +  +AF+D P ++V  +    N  +
Sbjct: 1078 EQVKGTINNVCEISGRFMVNQSQKLLVRDIQEDESVVPVAFHDVP-VFVADIKSFGNLFI 1136

Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
            +GD  +   F+ +  +  ++ +L +        A +F++    +  VVSD    + I  Y
Sbjct: 1137 VGDSMQGFQFVGFDAEPYRMIMLGRSVSKFKTMALDFVVRNGEIYFVVSDTDDILHILKY 1196

Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
            +P    S  GQ+L   + F++ +  T       L  ++D      G + ++ F  +   L
Sbjct: 1197 SPDEPNSLSGQRLAHYSSFNIHSTNTSM----HLLPANDEFIENKG-NGSSIFQTIGANL 1251

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSI 1386
            DGSI  I PL E +FRRL  +Q++++D+  H AGLNPR  R   +++      RP    +
Sbjct: 1252 DGSIFKILPLSEDSFRRLYVIQQQIIDTEVHAAGLNPRMERLSNEYYQLTNVTRP----L 1307

Query: 1387 VDCELLSHYEMLPLEEQLEIAHQTG 1411
            +D  L+  Y  L ++++  IA + G
Sbjct: 1308 LDFNLIRRYSNLSIKKRKSIAQKAG 1332



 Score = 44.3 bits (103), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 70/365 (19%), Positives = 153/365 (41%), Gaps = 71/365 (19%)

Query: 97  SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
           S   L L   ++L+G V  +A++ Q  +    + D +I+    AK+S++ F+   + L  
Sbjct: 45  STNKLHLNYEFKLNGRVSDIALIKQVDS----KLDYLIILTATAKLSLVNFNVFTNSLET 100

Query: 157 TSMHCFESPEWLH--LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
            S+H +E     +  LK  +ES  R     +D    C  VL++    I +      +   
Sbjct: 101 ISLHYYEDKFRQNSILKLAKESKLR-----IDQAKNC--VLLFNNDNIAILPISSTTDEF 153

Query: 215 GDED-----------------TFGSGGGFSARIESSHVINLRDLDM----KHVKDFIFVH 253
            DED                  F S      +I +S +I L+  ++    +++ D  F+ 
Sbjct: 154 EDEDLGQESSAKTVKRGNMSIKFPSQSQKKNKITNSSII-LKSTELNSKIQNIIDIQFLS 212

Query: 254 GYIEPVMVILHERELTWAGR-----VSWKHHTCMISAL-------------SISTTLKQH 295
            + +P + +L++ +L W G      +  ++    ++ L             S++  L + 
Sbjct: 213 NFSKPTLSVLYQPKLAWIGNSNLVTLPTQYMILTLNILERENIKSQENGENSLNQDLIET 272

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--HSQSASCALALNNY--AVSLDS 351
            +I     LP++ + ++ + +   G  +VG+N I Y  H+      + +N +    +L  
Sbjct: 273 TIIGQVSELPYELHTIIPLNN---GSTLVGSNEIIYIDHTGVLQSLIIINQFQDKETLKK 329

Query: 352 SQELPRSSFSVELDA------AHATWLQNDV-----ALLSTKTGDLVLLTVVYDGRVVQR 400
            + + +S  ++ L+       A +    N+V      L+  +  ++ L+ +  +GR++  
Sbjct: 330 GRVIDKSKQNIILNKPIKFINAGSRVESNNVDDKNNVLIFDENNNIYLVNITLEGRLLIN 389

Query: 401 LDLSK 405
            D++K
Sbjct: 390 FDINK 394


>gi|443919095|gb|ELU39366.1| cleavage factor protein [Rhizoctonia solani AG-1 IA]
          Length = 788

 Score =  114 bits (284), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 198/869 (22%), Positives = 335/869 (38%), Gaps = 166/869 (19%)

Query: 609  GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS-IA 667
            G T+ AG  F +  V QV     R+L+     + L           G++    + V+ +A
Sbjct: 33   GLTMVAGAFFQQTCVAQVTTNSIRLLEPDGAERQLYL------DAEGNKPRPKIKVAHVA 86

Query: 668  DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
            DP++++   D +  L VGD +   V  +  +    +    +S + + D      +RK   
Sbjct: 87   DPFIVVLREDDTFGLFVGDTAKGRVRRKDVSHFGENGTICASASFFTDHTDLFQIRKPGD 146

Query: 728  DAWLSTGVGE--------------AIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
                 +G G               A   + GGP     + ++V  E G  +   V   N 
Sbjct: 147  VVATHSGPGSHRRRPGESSSSKRRASRKSRGGPAASTTLENIVDAEHGT-QWLVVLRKNG 205

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
             F V    S R H +     + L D    ++S          + +I  + +  + + R  
Sbjct: 206  FFEVSTRTSTRQHQL-----QGLPD--VLVDSGQAHTVCTDGETDIEHVIIAPIGITR-- 256

Query: 834  AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
                +P L  I    T+  Y+      P ++S++  PV    ++    V         FS
Sbjct: 257  ---PKPHLVVITKSRTLAIYEPVPAPPPPDSSENSAPVRDQLTVQFVKV---------FS 304

Query: 894  RT-PLDAYT------REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRV 945
            R  PLD +       R   P  +P        N+SG    F++G  P W +      LR+
Sbjct: 305  RALPLDMHDTKRVAGRSLVPFKSP--------NLSG---IFVTGDHPFWLLRTDASALRI 353

Query: 946  HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPH 1005
            +P                   H   YV S G   +  LP            +IP +    
Sbjct: 354  YP-------------------HAAQYVNSFGTTVVEWLPDVDIS------HEIPCR---- 384

Query: 1006 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD-LHRTYTVEEYE 1064
              +Y ++       V+  V        S L      +  D++ L + D  H    +    
Sbjct: 385  --SYASDDGRVYTSVAYDVSTRHILAASALRTTFAYYDEDSNELYTPDATHPNPEIHCSA 442

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDV 1123
            + ++ PD     W T        +E    V  + L   +T+   +  + +GT   +GED+
Sbjct: 443  LELITPDT----WTTVDGYEFAQNEFVNAVESIPLETLSTERGLKDYVVVGTTISRGEDL 498

Query: 1124 AARGRVLLFSTGR-----NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
            A +G   +F          +   Q  +  +  ++ KGA++AL  + G+L+ + G KI + 
Sbjct: 499  AVKGATYVFEVVEVVPEPGSKTRQYRLRLLCREDSKGAVTALCGMNGYLVSSMGQKIFVR 558

Query: 1179 KWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
             +   E L GIAF D   + V SL  +KN +L+GD+ KS++F++++E+  +L  L KD  
Sbjct: 559  AFDLDEKLTGIAFMDVG-VCVTSLRPLKNLLLVGDMVKSVWFVAFQEEPFKLVPLGKDRQ 617

Query: 1238 SLDCFATEFLIDG-STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT- 1295
             L     +F     + LS  V D   +  +F           G +L+  +EFH   HVT 
Sbjct: 618  QLSVTHADFFFGSQAQLSFAVLD---DFGVF-----------GLRLICSSEFH--THVTH 661

Query: 1296 --------------KFLRLQMLATSSD----RTGAAPGSDKTNRFALLFGTLDGSIGCIA 1337
                            + +Q L T S      T   P     +   +  G+ DG+I  + 
Sbjct: 662  RGVLSVSRKADFDSDVMSIQSLGTESSLIFGETKPYPFHQHNSILTMCGGSSDGTIASLT 721

Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
            PL+E  F RLQ LQ +L+              R  H NG         ++D  LL+ +E 
Sbjct: 722  PLNESEFGRLQLLQGQLI--------------RNVH-NG---------VLDGNLLAAFEE 757

Query: 1398 LPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
            LP+ +Q+E+  Q G  R +IL++L  L +
Sbjct: 758  LPVSKQVEMTQQIGAEREKILNDLLKLRI 786


>gi|50305395|ref|XP_452657.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74606921|sp|Q6CTT2.1|CFT1_KLULA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|49641790|emb|CAH01508.1| KLLA0C10274p [Kluyveromyces lactis]
          Length = 1300

 Score =  112 bits (281), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 164/338 (48%), Gaps = 30/338 (8%)

Query: 1088 SENALTVRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP--- 1141
            SEN++   + T+    N+ T+    L+ IG+++V+ ED  + G +L+         P   
Sbjct: 954  SENSMVNDIKTMLIQLNSKTRRKRELVIIGSSFVKEEDQPSTGCLLVLDITEVVAEPGKP 1013

Query: 1142 -QNL-VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNG---IAFYDAPPL 1196
              N    +++ +E++G+++A+  + G  +I    K ++      E N    +AF D P +
Sbjct: 1014 DSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQSSKALVRDMQ--EDNSAVPVAFLDMP-V 1070

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
            ++       N +++GD  +   F+ +  +  ++ +L K          EFL++   ++ +
Sbjct: 1071 FITDAKSFSNLMIIGDSMQGFTFVGFDAEPYRMIVLGKSTSKFQVMNLEFLVNNGNINFI 1130

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
            V+D Q ++ +  YAP  + S  GQ+L+    F++      +++L        R     GS
Sbjct: 1131 VTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFT-TNNYMKLV-------RKHVEFGS 1182

Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFH 1373
              +N  AL   T DGSI  + PL+E ++RR   +Q++L+D    +AG N +  R   +++
Sbjct: 1183 KTSNYIALGCQT-DGSIFRMIPLNEASYRRFYLVQQQLLDHEIPLAGFNTKMERLDNEYY 1241

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
              G + RP  DS    ++L  Y  LP+ ++  I ++ G
Sbjct: 1242 HKGHSLRPTLDS----QVLKKYIHLPITKRTTIENRVG 1275



 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 136/639 (21%), Positives = 257/639 (40%), Gaps = 111/639 (17%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           A  L L   ++L G +  + +L Q G   S  +   IL+   +K+S++ FD     L   
Sbjct: 45  AQKLVLAYEWKLAGKIIDMQLLPQIG---SPLKMLAILS-SKSKVSLVRFDPVAESLETL 100

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD 216
           S+H +   ++++L     S     ++ VDP  RC  +LV+   ++ IL        +  D
Sbjct: 101 SLHYYHD-KFVNL--STSSLKTESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEID 155

Query: 217 EDTFGSGGGFSARIESSHVINLRDLDM---------KHVKDFIFVHGYIEPVMVILHERE 267
           ED  G     + R++ +  I    + M         KHV D  +++ + +P + IL++  
Sbjct: 156 EDENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPV 215

Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
           L W G      +T     LS+    ++  +I    +LP+D + L  VP   G VL +G N
Sbjct: 216 LAWCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLPNDLHTL--VPLKRGYVL-IGVN 272

Query: 328 TIHYHSQSA---SCALALNNYAVSLDSSQELPRSSFSVELDAA----HATWLQNDVALLS 380
            + Y S S    SC + LN +A S  +++    S  ++ L  +    +    ++D+ +L 
Sbjct: 273 ELLYISASGALQSC-IRLNTFATSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILI 331

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL-----GD---- 431
            +   +  +    +G ++ + D  +            I N + F  SRL     GD    
Sbjct: 332 DENCRMYNIITESEGNLLTKFDCVQ----------VPIVNEI-FKNSRLPLSVCGDLNLE 380

Query: 432 --SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
              +L+ F  G    +    LK  F        + ++L  +  D        E  +LYG 
Sbjct: 381 TGRVLIGFLSGDAMFLQLKNLKVAFA-------AKRQLVETVDDDDD-----EYSALYGE 428

Query: 490 ASNNTES----AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
           + NNT +     Q+ F  ++ DS+ NIGPL   + G   + + +   +   +  E   + 
Sbjct: 429 SQNNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEF-SIV 487

Query: 546 GCKGI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
              G+      T  H + + H   + +  +    ++    + ++ +   L T D   E +
Sbjct: 488 ATSGVGRGSHLTALHSTVQPHIEQALKFTSATRIWN----LKIKGKDKYLVTTDADKEKS 543

Query: 601 E------------SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-----MTQDL 643
           +            + D+    RTI    +   +R++QV   G  + D  +     +T D+
Sbjct: 544 DVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTIDI 603

Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
                            ++   I DPY+L   + G+I++
Sbjct: 604 E----------------IVHACIIDPYILFTDARGNIKI 626


>gi|401624207|gb|EJS42273.1| cft1p [Saccharomyces arboricola H-6]
          Length = 1356

 Score =  112 bits (280), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 81/346 (23%), Positives = 154/346 (44%), Gaps = 25/346 (7%)

Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
            W+   +   P  S  N +   ++ + + T K+ E ++A G A    ED    G   ++  
Sbjct: 1000 WKVIDKIDFPNNSVVNEMRSSMIQVNSKTKKKREYIIA-GVANATTEDTPPTGAFHIYDV 1058

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
                  P        + E++ +E+ G +S +  + G  +I+   K+++        +  +
Sbjct: 1059 TEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1118

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF D P ++V       N +L+GD  +   F+ +  +  ++ LL +        + EFL+
Sbjct: 1119 AFLDIP-VFVTDSKSFGNLLLIGDAMQGFQFIGFDAEPYRMILLGRSISKFQTMSLEFLV 1177

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G  +    +D  +N+ +  YAP    S  GQ+L+  + F + +  +  L L        
Sbjct: 1178 NGGDMYFSATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTLHSINSCMLLLP------- 1230

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  GS +   F  + G +DGSI  I PL E T+RRL  +Q++++D    + GLNPR 
Sbjct: 1231 -KNEEFGSSQVPSFQNVGGQVDGSIFKIVPLSEETYRRLYVIQQQIIDREIQLGGLNPRM 1289

Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             R    F+  G + RP    ++D  ++  +  L ++ +   A + G
Sbjct: 1290 ERLANDFYQMGHSMRP----MLDFNVIRRFSELAIDRRKNTAQKAG 1331


>gi|156847699|ref|XP_001646733.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156117413|gb|EDO18875.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 1337

 Score =  112 bits (279), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 156/330 (47%), Gaps = 24/330 (7%)

Query: 1091 ALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT---- 1146
            ++T++V    N+ TK+   LL +G A +  ED+ + G   +         P    T    
Sbjct: 998  SMTIQV----NSKTKKKRELLVVGVASIGTEDLPSAGSFHVIDINEVVPEPGKPDTNYKF 1053

Query: 1147 -EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIV 1204
             E++ + ++G ++++  + G  +I    K+++      E +  +AF D P +YV      
Sbjct: 1054 KEIFQETVRGNVNSVCEISGRFMINQSQKLLVRDIQEDESVVPVAFLDVP-VYVTDTKSF 1112

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
             N +++GD  +   F+ +  +  ++  L +        A EFL++   +  +VSD    +
Sbjct: 1113 SNLMIVGDSMQGFQFVGFDAEPYRMIPLGRSVSKFKTVALEFLVNNGDIFFIVSDRNDIL 1172

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
             +  YAP    S  GQ+L   + F++ +  T  +    L  S++   ++P    T  F  
Sbjct: 1173 HVLKYAPDEPNSLSGQRLAHYSSFNIHSTNTSMI----LLPSNNEFQSSPNGQAT--FQS 1226

Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRP 1381
            +   +DGSI  + PLDE +FRRL  +Q++++D+     GLNPR  R   +++      RP
Sbjct: 1227 VGSCVDGSIFKVIPLDEDSFRRLYVIQQQVIDTEIQAGGLNPRMERLSNEYYQLVHLMRP 1286

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
                ++D  ++  +  L + ++ +IA + G
Sbjct: 1287 ----MLDFNIIRRFSNLSITKRTKIAQKAG 1312


>gi|157873900|ref|XP_001685450.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania major strain Friedlin]
 gi|68128522|emb|CAJ08654.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania major strain Friedlin]
          Length = 1541

 Score =  112 bits (279), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 135/610 (22%), Positives = 264/610 (43%), Gaps = 71/610 (11%)

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
            SA  +   L  IL+ G ++ Y+    +  GP    K    +   +  + V     +R + 
Sbjct: 929  SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVMESIKARKKR 988

Query: 890  LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRV 945
            L+  R  L + T ++  H +  +R+  F+ +   ++G ++ G  P + +V+     +L  
Sbjct: 989  LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRYKGIYVCGQTPVF-LVYHAATNQLVC 1044

Query: 946  HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYD-NYWPVQKIPLKAT 1003
                   ++  F   H+ + + GF+Y   +G +    + P G     + W ++++ L  T
Sbjct: 1045 TRHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGCSGWWLERVRLGCT 1103

Query: 1004 PHQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSV 1052
            PHQ+ Y    +   ++ S      P   P +  L ++ D+E   V H I+  +L   S+ 
Sbjct: 1104 PHQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSAT 1163

Query: 1053 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------ 1106
                  T E YEV+          WQ    + +  +E  L+  ++ +   TT +      
Sbjct: 1164 SGSPVPTNERYEVQFFSTLN----WQCMGRLVLDGNEKVLSATLMQVTRDTTMDAANRST 1219

Query: 1107 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QG 1165
               + A+ TAY  GEDV  RGR+LL +T + +      +  ++ + ++G ++A+  + + 
Sbjct: 1220 TAPVCALATAYPLGEDVTTRGRILLLTTSQQSGQGMQQLRTLHEEPMEGPVTAITRVGED 1279

Query: 1166 HLLIASGPKIILHKWTGT----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
             + +A G  + ++++       E   I +  A   YV  L   + ++++GD+  S+ F  
Sbjct: 1280 CVAVAVGGTVRVYRYDANKSTMETMAILYAGA---YVTCLQAFREYLVIGDLFNSVLFAR 1336

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQ 1280
            + E+   + +L +D  ++   + + L   +   L+V+D+ +N+    Y P+ + E  K  
Sbjct: 1337 YSEEIHTITILGRDTSAISVVSNDMLYHDTRFGLLVTDDARNLMCMSYKPRVLEEHGKPP 1396

Query: 1281 KLLSR-----AEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            K+L        E+ + G  + K +RL+  +  S                 ++ T  G IG
Sbjct: 1397 KVLESLLTVTGEYRLAGGVLLKMMRLRAASARSSSV-------------AIYVTNMGEIG 1443

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             + PL + T R  Q + ++L   V H  GL PR F  F  +         S+   E + H
Sbjct: 1444 YLVPLGDQTSRTGQWVVRRLQSEVAHAGGLPPRMFLGFPQDDPLR-----SLKGEEWMLH 1498

Query: 1395 YEMLPLEEQL 1404
            +   PL EQL
Sbjct: 1499 F---PLLEQL 1505



 Score = 43.9 bits (102), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 81/185 (43%), Gaps = 37/185 (20%)

Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
           GGG S  +    V + R  D+K    +++D  FV    EP++  L E++ TWAGRV    
Sbjct: 282 GGGTSLLLRVGTVTHWRLQDVKSALRNIRDVQFVQSAGEPLLAFLFEKQPTWAGRVKLLE 341

Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGV 321
           W+         TC I    ++++ +   H L  S ++ LP+D   +  +P+    P    
Sbjct: 342 WRSKTVESHMLTCSIEWMKVTLANSATPHMLSLSEVDGLPYDVTSMTPLPAFQDLPSAVF 401

Query: 322 LVVGANTIHYHSQSA----------SCALALNNYAVSLD------SSQELPRSSFSVELD 365
            V     +H  ++S             A +L + AVSL+      +SQ L      V L+
Sbjct: 402 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLEAVQWRSASQALSTDLVKVNLN 461

Query: 366 AAHAT 370
            A+AT
Sbjct: 462 FANAT 466


>gi|254580509|ref|XP_002496240.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
 gi|238939131|emb|CAR27307.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
          Length = 1331

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 151/321 (47%), Gaps = 20/321 (6%)

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1154
             ++ T+  +  + +G A+V+ ED+   G + +F        P    T     EV+ + ++
Sbjct: 999  LDSRTRRRKEYVIVGVAHVETEDLPPSGSLSVFDITEVVPEPGKPDTNFKLGEVFKENIR 1058

Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
            G +S++  + G  LI    K+I+        +  +AF D P ++V  +    NF+++GD 
Sbjct: 1059 GTVSSVCDISGRFLINQSQKVIVRDVQEDNSVVPVAFLDVP-VFVTDVKSFGNFLIIGDS 1117

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
             +   F+ +  +  ++  L +    L+  A EFL++G  +   V+D    + IF YAP  
Sbjct: 1118 MQGFQFIGFDAEPYRMIPLGRSVSKLETVALEFLVNGGDIFFAVTDTSNILHIFKYAPDE 1177

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
              S  GQ+L+    F++ +  T  + L            + G    +   ++ G  DGS+
Sbjct: 1178 PNSLSGQRLVHCTSFNLHSTNTCMVLL------PKNEEFSVGEKSLSPVQVVGGQTDGSL 1231

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1390
              + PL E T+RRL  LQ++L +    + GLNPR  R   +++    A RP    +++  
Sbjct: 1232 FKLVPLREDTYRRLYVLQQQLTEKEVQLGGLNPRMERLSNEYYHLTHAVRP----MLEFN 1287

Query: 1391 LLSHYEMLPLEEQLEIAHQTG 1411
            ++  +  L +E++ + A + G
Sbjct: 1288 VIRRFNTLSVEKRKQTAQKAG 1308



 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 140/656 (21%), Positives = 270/656 (41%), Gaps = 124/656 (18%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L   ++  G +  LA++ Q  +      D ++L    AKISV+ +D++ + +   S+H
Sbjct: 48  LILTHEFKFEGRITDLAVVPQKDSP----LDCLLLCTSIAKISVVRYDEASNSIETLSLH 103

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
            +E        R     A+   ++VDP  RC   L++   +I L   Q  S         
Sbjct: 104 YYEDS---FKDRSILELAKESTMRVDPGKRCA--LLFNNDVIALLPLQTTSLNDGEEEDE 158

Query: 212 ---GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
                  D+    + G  +A    S + N ++L  DM +V D  F+  +  P + ++ E 
Sbjct: 159 DMDDERPDKRQKNNKGRITA---PSAIFNAKELHQDMNNVIDVTFLRNFTRPTLAVIFEN 215

Query: 267 ELTWAGR-------VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
           +  WAG        V++   T  +++   ST +K   +I +   L  D + ++ + +   
Sbjct: 216 KPVWAGTSQVLPLPVTYMAFTLEVTSNEQSTDIKS-TVIATVKELSWDFHTMIPIAN--- 271

Query: 320 GVLVVGANTIHYHSQSASCA--LALNNYA-VSLDSSQELPRSSFSVELDAAHA-TWLQND 375
           G ++VG+N + Y   + S    + LN+YA  ++  ++ + RS   + L       W  +D
Sbjct: 272 GCIIVGSNEMAYIDNTGSLQSIIFLNSYANKNMKKARIVDRSKSKILLHKPTTYNWSVSD 331

Query: 376 VALLSTKTGDLVLLT----------VVYDGRVVQRLDL-----------SKTNPSVLTSD 414
                ++TG+ +L+           + Y+GR++ + D+           + +N + ++  
Sbjct: 332 ---QKSETGETLLIMDHQAAFYYIQLEYEGRLLTKFDIINLPIVNDTLKNNSNATCISRL 388

Query: 415 ITTI-GNSL-FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
            +T+ GN +  F+G R GD+ +++       + L + ++      E  +P    + +   
Sbjct: 389 NSTLSGNYVDLFVGFRSGDASVLRL------NNLKAAIESRDEHKEITSPPENDIEKFED 442

Query: 473 DALQDMVNGEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADAS 529
              +D +  EE S       N E   +T   F   V  SL NI P+   + G   + D  
Sbjct: 443 ---EDDLYSEEASDADKEKENKEVVVETVLPFDIEVLSSLRNIAPITSLTPGKICSVDKF 499

Query: 530 ATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-EARTM 588
             G+S  +  E V L    G  T       G +    +M+   +   A   IS+ +   +
Sbjct: 500 VEGLSNPNRNE-VSLVATSGNGT-------GSHLTEIQMSVRPEVQLALKFISITQMWNL 551

Query: 589 VLETADLLTEVTES---------VD----YFVQGR-----TIAAGNLFGR-RRVIQVFER 629
            ++  D     T+S         +D     + +GR     T  + ++FG  +R++QV   
Sbjct: 552 KIKNKDKYLITTDSNKNKSDIYLIDKNFALYKEGRFRRDATTVSISMFGSDKRIVQVTTN 611

Query: 630 GARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
              + D ++    T    F               V+ VS+ DPY+L+ +S G I++
Sbjct: 612 HLYLYDTNFKRLTTMKFEF--------------EVVHVSVMDPYILITVSRGDIKV 653


>gi|430810872|emb|CCJ31592.1| unnamed protein product [Pneumocystis jirovecii]
 gi|430814599|emb|CCJ28188.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 203

 Score =  111 bits (278), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/192 (34%), Positives = 101/192 (52%), Gaps = 2/192 (1%)

Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1290
            L  KD  SL   + +FL+D   L  V+ D+  NI +F Y P+  +S+ GQKLL R +FHV
Sbjct: 3    LFGKDHSSLSVSSADFLVDDEHLYFVIGDDDGNIHVFNYDPENPQSFSGQKLLKRGDFHV 62

Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL-LFGTLDGSIGCIAPLDELTFRRLQS 1349
            G+H+   L L   A   +           N+ +L L  + DGS+G +  L E T+RRL  
Sbjct: 63   GSHIKSILMLPKEAFPQNVNDKEETRASKNQDSLCLCASQDGSMGVLISLPEKTYRRLYF 122

Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
            +Q +L+++   VAGLNP S+R      K   P    I+D +LL  Y  L   +Q ++A +
Sbjct: 123  IQGQLINTEDKVAGLNPISYRTSTYVSKTSNPAR-GILDGKLLYQYNNLERNKQKDMARK 181

Query: 1410 TGTTRSQILSNL 1421
            +G     I+ +L
Sbjct: 182  SGMPVETIIYDL 193


>gi|401841121|gb|EJT43641.1| CFT1-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 1355

 Score =  111 bits (277), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 151/346 (43%), Gaps = 25/346 (7%)

Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
            W+   +   P  S  N +   ++ + N+ TK     +  G A    ED    G   ++  
Sbjct: 999  WKVIDKIDFPKNSVVNEMRSSMIQI-NSKTKRKREYIVAGVANATTEDTPPTGSFYIYDV 1057

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
                  P        + E++ +E+ G +S +  + G  +I+   K+++        +  +
Sbjct: 1058 IEVVPEPGKPDTNYKLKEIFQEEVNGTVSTVCEISGRFMISQSQKVLVRDIQEDNSVIPV 1117

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF D P ++V       N +++GD  +   F+ +  +  ++ LL +        + EFL+
Sbjct: 1118 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMILLGRSVSKFQTMSLEFLV 1176

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G  +    +D  +N+ I  YAP    S  GQ+L+  + F V +  +  + L        
Sbjct: 1177 NGGDMYFAATDADRNVHILKYAPDEPNSLSGQRLVHCSSFTVHSINSCMMLLP------- 1229

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  GS +   F  + G +DGS+  I PL E T+RRL  +Q++++D    + GLNPR 
Sbjct: 1230 -KNQEFGSSQVPSFQNVGGQVDGSVFKIVPLSEETYRRLYLIQQQIIDRELQLGGLNPRM 1288

Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             R    F+  G + RP    ++D  ++  +  L ++ +   A + G
Sbjct: 1289 ERLANDFYQMGHSMRP----MLDFNVIRRFSGLSIDRRKNTAQKAG 1330


>gi|356527660|ref|XP_003532426.1| PREDICTED: disease resistance response protein 206-like [Glycine
           max]
          Length = 281

 Score =  110 bits (275), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 56/106 (52%), Positives = 76/106 (71%), Gaps = 8/106 (7%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
           MSFAAYKMM  PTGI NC  GF+THSR+D+VP    +Q +++D E PS+    +G +PNL
Sbjct: 1   MSFAAYKMMQCPTGIDNCAVGFLTHSRSDFVP----LQPDDIDVEWPSRPCHHVGSLPNL 56

Query: 59  VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELV 104
           +VT ANV+E+Y VR+QE+ S   K + +++   L+DGI  ASLELV
Sbjct: 57  IVTVANVLEVYAVRLQEDQSP--KAAIDSRSDTLLDGIVGASLELV 100


>gi|71021721|ref|XP_761091.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
 gi|46100541|gb|EAK85774.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
          Length = 1597

 Score =  109 bits (272), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 97/341 (28%), Positives = 158/341 (46%), Gaps = 30/341 (8%)

Query: 1085 MQSSENALTVRVVTLFN-TTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1143
             +++E   ++ +VTL + +T    +  +A GT    GED  A+G V LF           
Sbjct: 1190 FEANETVTSLEIVTLDSPSTVSGRKQFVAAGTTTFHGEDRTAKGSVYLFEIISVVSAASE 1249

Query: 1144 LVTEVYSK-----ELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLY 1197
            L +++  K     + +  ++A++ + G+L+   G K+ +      E L  IAF D P  Y
Sbjct: 1250 LGSDLRLKLVCRDDSRAPVTAISHINGYLISTCGQKLYVRALEKQEWLISIAFLDCP-FY 1308

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD---------FGSLDCFATEFLI 1248
            + S+ +VKN +LLGD  + +   +++E   +   LAK          F   D   +   I
Sbjct: 1309 ITSIEVVKNLVLLGDCKRGLGLWAFQEDPYKFVELAKAEDGCVGVGAFLVRDEKVSMLSI 1368

Query: 1249 DGSTLSLVVSDEQKN--IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLA 1304
             GS L    S E     I+++ YAP ++    G+KL+ R+EF   +     +    + L+
Sbjct: 1369 SGSRLGGDASMEASAGVIRLYEYAPHLAVG--GKKLVLRSEFQTTSEAVARVECSGRWLS 1426

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
             S  R        +T R  ++F   +GS+  +A +DE   +RL  LQ +LV SV H A L
Sbjct: 1427 DSELR------GRETLRNKVVFAKANGSVESVAAVDEKVGKRLHLLQGQLVRSVMHTAAL 1480

Query: 1365 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
            NPRSFR    N    R     ++D  LL  +  L   + LE
Sbjct: 1481 NPRSFRMVR-NDYVPRALVKGVLDARLLDEFMRLSRPKMLE 1520



 Score = 87.4 bits (215), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 95/399 (23%), Positives = 177/399 (44%), Gaps = 54/399 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRLHG 111
            LV    +V+ IY V  Q   S       S+++  +         S  +L +  ++ L G
Sbjct: 46  QLVTARDDVLTIYDVYGQPHASASTIPGISRHTATSSVSSNTSACSHKNLVISRNHTLFG 105

Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHL 170
            V  L  + Q  A +   RD ++++F+DAK+++LE++D+I  L   S+H +E +P+ L+L
Sbjct: 106 AVTGLQRV-QTLASDKDNRDRLLVSFKDAKLALLEWNDAIDDLETISIHTYERAPQLLNL 164

Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD-------EDTFGSG 223
                     P++ VDP  RC  +L+    + IL   +  +    D            + 
Sbjct: 165 A----PHLFHPILNVDPLSRCAALLLPHDSLAILPFYRDAADFDFDLDDHLEIAKDDVAA 220

Query: 224 GGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
              +A ++S     S V+ +R++D  ++++K F F+ G+ +P + +L     TW G +S 
Sbjct: 221 VVAAADLQSLPYSPSFVLTMREVDPKIRNLKHFCFLPGFQKPTVAVLFSHNPTWTGLLSE 280

Query: 277 KHHTCMI--------------------SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           +  T  +                     AL   T    HP++ ++  LP+D   ++A P 
Sbjct: 281 RKDTFSVYLFTLDLSASLDGATFSSSAEALDDGTARSAHPVVTTSTPLPYDCLYMVACPQ 340

Query: 317 PIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSFSVELDAAHAT 370
            +GGV+VV  +++ +  QS    + ALN +     A+  +S  EL   S   +L  +   
Sbjct: 341 TLGGVIVVCMSSLLHVDQSGRVMVTALNQWFKTTSAIEPESILEL---SDIADLQGSQLV 397

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
           +      +L+   G++       DGR V+ + L +   S
Sbjct: 398 FTSKTQGVLTLVNGEIYRFRCQTDGRSVEGIRLERMQES 436


>gi|444313909|ref|XP_004177612.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
 gi|387510651|emb|CCH58093.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
          Length = 1459

 Score =  108 bits (271), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 76/327 (23%), Positives = 149/327 (45%), Gaps = 22/327 (6%)

Query: 1096 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS-------TGRNADNPQNLVTEV 1148
            +V   N+ T+     +  G A +  ED+   G   ++        TG+   N +    E+
Sbjct: 1120 MVIQLNSRTRAKREYIVAGLANIGSEDLPPTGSFYIYDISPVLPETGKPDTNYK--FKEI 1177

Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            ++++++G ++++  + G   I    KI++        +  +AF D P +YV       NF
Sbjct: 1178 FTEDVRGLVTSVCEISGRFTINQSQKIMVRDVQEDNSVVPVAFLDIP-VYVTDTKSFGNF 1236

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
            +L+ D  + + F+ +  +  ++ LL K    L     EF+ +   +    +D    + IF
Sbjct: 1237 LLISDSMQGLQFVGFDAEPFRMILLGKSIPDLKISTVEFIANNGNIYFAATDYDNILHIF 1296

Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
             YAP    S  GQKL+  + F++ +  +  + L      +D        +    F  L G
Sbjct: 1297 KYAPDEPNSLSGQKLVHCSSFNLHSSTSCMIML----PGNDEFSENEQDNFIPSFQTLGG 1352

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPD 1384
             +DGSI  + PL+E  +RRL  +Q+++ D    V GLNP+  R   +++      +P   
Sbjct: 1353 QVDGSIFKVIPLEESPYRRLYVIQQQITDYEVQVGGLNPKMERLSNEYYQKSNMLKP--- 1409

Query: 1385 SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             ++D  ++  + MLP++++   A + G
Sbjct: 1410 -MLDFNIIRRFSMLPIDKRRRTAQKAG 1435



 Score = 45.8 bits (107), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 29/116 (25%), Positives = 58/116 (50%), Gaps = 17/116 (14%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC----F 162
           ++  G +  + ++ Q G++     D ++L   +AKIS+++FD+ ++ L+  S+H     F
Sbjct: 54  FKFSGKITDIVLIPQRGSE----LDCLLLVTPNAKISIIKFDEELNTLKTISLHYYTDEF 109

Query: 163 ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
           E    L L       AR   ++V+P+ +C  VL++  + I +        +  DED
Sbjct: 110 EKLSMLQL-------ARTSQLRVEPKKKC--VLLFNTESIAILPFTQQFNIDNDED 156


>gi|367014525|ref|XP_003681762.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
 gi|359749423|emb|CCE92551.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
          Length = 1327

 Score =  108 bits (271), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 71/321 (22%), Positives = 144/321 (44%), Gaps = 20/321 (6%)

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN-----LVTEVYSKELK 1154
             ++ T+  +  + +G A V  ED+   G   +F        P        ++EV+ +E++
Sbjct: 995  LDSRTRRKKEYVIVGVAVVGTEDLPPSGSFFVFDITEVVPEPGKPDTNFKLSEVFQEEIR 1054

Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
            G +S +  + G  LI    K+++        +  +AF D P ++V       NF+++GD 
Sbjct: 1055 GTVSTVCEISGRFLINQSQKVLVRDVQDDNSVVPVAFLDIP-VFVTDAKSFGNFMIIGDA 1113

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
             +   F+ +  +  ++  L +    ++  + EFL++G  +   ++D    + +F YAP  
Sbjct: 1114 MQGFQFVGFDAEPYRMIPLGRSIAKMETVSVEFLVNGGDIFFAITDTDDILHVFKYAPDE 1173

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
              S  GQ+LL    F++ +  T       +A         P       F  + G +DGS+
Sbjct: 1174 PNSLSGQRLLHCTSFNLHSTNT------CMALLPKNEEFEPAQANMKNFQAIGGQVDGSV 1227

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1390
              + PL E  +RRL  +Q+++ +    + GLNPR  R   + +      RP    ++D  
Sbjct: 1228 FKLLPLREDVYRRLYVVQQQITEKELQLGGLNPRMERLSNEHYKTTHVLRP----MLDFN 1283

Query: 1391 LLSHYEMLPLEEQLEIAHQTG 1411
            ++  ++ L  + + +I+ + G
Sbjct: 1284 VIQRFKRLSTDRRKQISQKVG 1304



 Score = 73.6 bits (179), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 130/638 (20%), Positives = 269/638 (42%), Gaps = 86/638 (13%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           +A L L   ++ HG +  LA++ Q  +      D ++L    AK+S+++FD   + +   
Sbjct: 45  SAKLFLTNEFKFHGKITDLALIPQVNSS----LDCLLLCTSIAKVSIVKFDPLSNSIETA 100

Query: 158 SMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG-LQMIILKASQGGSGLV 214
           S+H +E    +   L+  ++S+ R     +DP  RC  +L    L ++  +A+       
Sbjct: 101 SLHYYEDKFRDLSLLEIAQQSYFR-----LDPSKRCAIILNNDVLALLPFRAA------T 149

Query: 215 GDEDTFGSGGGFSARIES--------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILH 264
            D++   +      R+++        S +   ++L  ++++V D  F++ + +P + IL 
Sbjct: 150 DDDEEADAENNDVKRMKTSSDKVTYPSKIFVAKELHSEIRNVIDVQFLNNFSKPTIAILF 209

Query: 265 ERELTWAG--RVSWKHHTCMISALSISTTLKQHPL----IWSAMNLPHDAYKLLAVPSPI 318
           E  L WAG  +++ +  + MI  L IS+T          I     L  D + L+ + +  
Sbjct: 210 EPTLIWAGNRQLNPQPISYMIFTLEISSTDNTTKFGATTIGKLTGLSWDFHSLVPISN-- 267

Query: 319 GGVLVVGANTIHYHSQSASC--ALALNNYA-VSLDSSQELPRSSFSVELDAAHA-TW--- 371
            G ++VGAN + +   S +    + LN+++  +L   + +  S + + L  + A  W   
Sbjct: 268 -GCMIVGANELAFADNSGALQSVILLNSFSDRNLRQGRIIDNSKYEILLPQSIARCWSPP 326

Query: 372 ----LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFF 424
               + ++  LL     ++  + +  +GR++ + D+ K    N ++  +   T  + L  
Sbjct: 327 TSDKVNDETLLLMDANSNVYYVQLESEGRLLIKFDIIKLPIVNDTLKNNQGCTCMSRLNS 386

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-SSDALQDMVNGEE 483
             S     LL+ F  G    +  + LK      +     ++ +  S   D  +D +  +E
Sbjct: 387 RSSNNNMDLLMGFKSGDALVVRLNNLKSAAESRDEHKIFSEAMESSFDKDEDEDNLYSDE 446

Query: 484 LSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGI--SKQSN 538
            S  G A +N E   +T   F   +  ++ NIGP+   + G   + +    G+    ++ 
Sbjct: 447 ASDAGKADDNKEVIVETVTPFDIELLSTIKNIGPITSLAVGKVCSVEKYVKGLLNPNRNE 506

Query: 539 YELVELP--GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL- 595
           Y +V     G     T    S R     + +  +    ++    + ++ R   L T D  
Sbjct: 507 YSMVATSGNGSGSHLTEIQGSVRPTVEVALKFISVTQIWN----LKIKNRDKYLVTTDSN 562

Query: 596 -----LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
                + E+  +     +GR      T+      G +R++QV      + D ++  + L+
Sbjct: 563 KAKSDIYEIDNNFALHKEGRFRRDATTVCISMFGGDKRIVQVTTNNLILYDTNF--RRLT 620

Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
               + E         V+ VS+ DPY+L+ +S G I++
Sbjct: 621 TMKFDYE---------VVHVSVMDPYILITVSRGDIKI 649


>gi|365984967|ref|XP_003669316.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
 gi|343768084|emb|CCD24073.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
          Length = 1388

 Score =  108 bits (269), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 76/322 (23%), Positives = 148/322 (45%), Gaps = 24/322 (7%)

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKG 1155
            N+  +     +  G  ++  ED+   G   ++        P        + EV+ +E++G
Sbjct: 1055 NSKARRKREYIIAGVTHIGTEDLPPTGAFHIYDITEVVPEPGKPDTNYRLKEVFKEEVRG 1114

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGI---AFYDAPPLYVVSLNIVKNFILLGD 1212
             +S +  + G  L+    K+++      E N +   AF D P +++       +F++LGD
Sbjct: 1115 IVSTVCEISGRFLVNQSQKVMVRD--AQEDNSVVPVAFLDIP-VFINDAKSFGDFLILGD 1171

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
              + ++F+ +  +  ++  L K     +  + EF+++G  L   ++D    + +  YAP 
Sbjct: 1172 AMQGLHFIGFDAEPYRMINLGKSVTKFETVSVEFVVNGGDLYFALTDRNNILHVLKYAPD 1231

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
               S  GQKL+  + F++ +  +  L L       D T  AP +     F  + G +DGS
Sbjct: 1232 ELNSLSGQKLVHCSSFNLFSGNSSLLLLPKNEEFED-TKNAPLT-----FQTIGGQVDGS 1285

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDC 1389
            I  + PL E T+RRL  +Q+ + D  P + GLNPR  R   +++      RP    ++D 
Sbjct: 1286 IFKVIPLREDTYRRLYVIQQHMNDKEPQLGGLNPRMERLSNEYYQLCHVMRP----MLDF 1341

Query: 1390 ELLSHYEMLPLEEQLEIAHQTG 1411
             ++  +  LP++ +  +A + G
Sbjct: 1342 NIIRRFSELPIDRRTRVAKRAG 1363



 Score = 70.5 bits (171), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 148/725 (20%), Positives = 286/725 (39%), Gaps = 145/725 (20%)

Query: 58  LVVTAANVIEIYVVR-VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           L+V   N++ IY +  +    S  S +  ET     +     A L L+  ++L+G V+ +
Sbjct: 29  LLVIRTNILSIYHLETILSPRSNTSSSQLETIEDATVTTSKQAKLFLINEFKLNGKVQDI 88

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +  G   NS   + I+L+   AK+S+L FD SI+     S+H +E            S
Sbjct: 89  ASIPLG---NSSSLECILLSTGTAKLSILNFDPSINSFETLSLHYYEEK---FKDISLVS 142

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL--------------KASQGGSGLVGDEDTFGS 222
            A+   +++DP  RC  +L++   ++ L              +       ++ + +   S
Sbjct: 143 LAKKSQLRMDPLNRC--LLMFNNDVMALLPLHSNNEDEEEEEEDENEEDEVLDNYEANLS 200

Query: 223 GGGFSARIE--------SSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
               + RI+         S + N+  L  D+K++ D  F++ + +P + +L++  LTWAG
Sbjct: 201 KTSPNKRIKYNNNQFEGKSKIFNINKLHEDVKNISDIQFLNNFNKPTIAVLYQPTLTWAG 260

Query: 273 RVSWKHHTC--MISALSI----STTLKQHP-----------LIWSAMNLPHDAYKLLAVP 315
            V         MI  L I    ST    H            +I     L  D +K++ + 
Sbjct: 261 NVQLNPLPTHFMIFTLDILSENSTNNANHTTENNNNDLNLIIIAKLKELAWDWFKIIPIS 320

Query: 316 SPIGGVLVVGANTIHYHSQSA--SCALALNNYA-VSLDSSQELPRSSFSVELD------- 365
           +   G +V+G N I Y   +      + LN++A  +L  ++ +  S F +  +       
Sbjct: 321 N---GCVVIGNNEIAYIDNTGVLQSIILLNSFADKNLKKTRIIDESKFQIFFNENVTHVW 377

Query: 366 ----AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-----------SKTNPSV 410
               + + T   ++  LL     +L  + +  +GR++ + D+              NP+ 
Sbjct: 378 SPSTSKNKTTEDDETLLLMDAQSNLYYVRLEAEGRLLTKFDIINLPIVNDVLRENCNPTC 437

Query: 411 LTSDITTIGNSL--FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
           ++   +   NS    F+G   GDSL+V+           + LK      +  + S +  +
Sbjct: 438 ISRLDSNATNSTMDLFIGFLSGDSLVVRL----------NNLKSAIDTRDEHSESNEHTQ 487

Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQ------------KTFSFAVRDSLVNIGPLK 516
            +  D        +E +LY     + E A+            + F      SL NIGP+ 
Sbjct: 488 LNGFDE------EDEDNLYSDDEVDVEDARSKRDMETIIHTVQPFDIEYLTSLKNIGPIT 541

Query: 517 DFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH-NADSSRMAAYDDEY 575
             + G   + D +  G+   +  E         I T    S+  H N     +    ++ 
Sbjct: 542 SLTVGKVSSLDLNVKGLQNPNKNEF-------SIVTTSGNSTGSHLNVIQQTVQPIVEKA 594

Query: 576 HAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR-----TIAAGNLF 618
             ++ ++      ++ +   L T D       + ++  +     +GR     T     +F
Sbjct: 595 LKFISVTQIWNLKIKNKDKYLVTTDSTKSKSDIYDIDNNFSLHKEGRLRRDATTVYIAMF 654

Query: 619 GR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
           G  +RV+Q+      + D ++  + L+    + E         V+ VS+ DPY+L+ +S 
Sbjct: 655 GDGKRVVQITTNHLYLFDTNF--RRLTAIKFDFE---------VVHVSVMDPYILITVSR 703

Query: 678 GSIRL 682
           G I++
Sbjct: 704 GDIKI 708


>gi|76157351|gb|AAX28300.2| SJCHGC08809 protein [Schistosoma japonicum]
          Length = 225

 Score =  107 bits (268), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 58/154 (37%), Positives = 91/154 (59%), Gaps = 5/154 (3%)

Query: 242 DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
            + +V D  F+HG+ EP +++L+E   TWAGRVS +  TC I ALS +   + +P+IW  
Sbjct: 52  KINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQ 111

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VSLDSSQELPR 357
            +LP D   ++ VP PIGGV+++ AN+I Y  Q+  SC+L LN YA    +    Q++P 
Sbjct: 112 ESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNCYAQISTNFPMRQDVP- 170

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
           S   + +D      L     L+ T++G+L LL++
Sbjct: 171 SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSL 204


>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
            DNA-binding protein 1) (UV-damaged DNA-binding factor)
            (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
            (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
            pigmentosum group E-co... isoform 1 [Ciona intestinalis]
          Length = 1150

 Score =  107 bits (267), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 88/316 (27%), Positives = 148/316 (46%), Gaps = 28/316 (8%)

Query: 1111 LAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              +GTA+V  E+   + GR+L+F      DN   LV E   KE+KGA+  L    GH+L 
Sbjct: 836  FVVGTAFVYMEETEPKHGRILVF---HYIDNKLTLVAE---KEVKGAVFCLCQFNGHVLA 889

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
            A    + +++WT  +       +   +  + L    +F+L+GD+ +S+  L++K     L
Sbjct: 890  AINTSVSIYQWTTEKELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVEGNL 949

Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
            + +AKD+      A E L D + L    ++   N+ I       +   +  KL   A FH
Sbjct: 950  DEIAKDYSPNWMTAVEILDDDNFLG---AENFYNVFICQKDSGATTDEERSKLREAALFH 1006

Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
            VG  +  F    ++  +   T  +      ++  +LFGT+ GSIG I  +DE  +  L S
Sbjct: 1007 VGDSINTFRHGSLVMQNVGETAVS------SKGHILFGTVHGSIGVITTVDEDLYAFLHS 1060

Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
            +Q +L   +  V  ++  S+R F +N K  AHR      VD +L+  +  L  E+  E+A
Sbjct: 1061 IQNRLAKVIKSVGNIDHESWRSFCTNEKTEAHR----GFVDGDLIECFLDLNREKMAEVA 1116

Query: 1408 HQTGTTRSQILSNLND 1423
                  +  ++ N ND
Sbjct: 1117 ------KGLMVKNFND 1126



 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 92/438 (21%), Positives = 167/438 (38%), Gaps = 109/438 (24%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F+ RIE   VI+ +           F+HGY  P +VI+++           +H    I  
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS-------QGRHVKTYIVD 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---- 341
           +     +      W   N+  +A  ++ VP P+ G +++G  +I YH+      +A    
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIAPPQI 253

Query: 342 ---LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYD 394
              +N YA                 +D   + +L  D+A      G L +L +    + D
Sbjct: 254 KDTINCYA----------------PVDKDGSRYLLGDLA------GHLFILLLESDEMMD 291

Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
           G   V+ L +       +   I+ + N + ++GSRLGDS L++                 
Sbjct: 292 GTNTVRDLKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIR----------------- 334

Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVN 511
              +  D+    R + S    L    N G  + +     +     Q  T S A ++    
Sbjct: 335 ---LPTDSSMEGRPKPSLISVLDTYTNLGPIIDMCVVDLDRQGQGQVVTCSGAFKE---- 387

Query: 512 IGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            G L+    G+ I   AS            ++LPG KG+W +          D+SR  +Y
Sbjct: 388 -GSLRIIRNGIGIQEHAS------------IDLPGIKGLWPL-------RVFDTSR--SY 425

Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
           D      L+IS    + +L+ +    E T+   +  + +T    N+    +++Q+ E+  
Sbjct: 426 DT-----LVISFVGHSRILQLSGEEVEETDLPGFDDESQTFYCSNVC-HNQLVQITEKSI 479

Query: 632 RILDGSYMTQDLSFGPSN 649
           R++  +   Q   + P N
Sbjct: 480 RLISHTERRQVHEWKPKN 497


>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
            DNA-binding protein 1) (UV-damaged DNA-binding factor)
            (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
            (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
            pigmentosum group E-co... isoform 2 [Ciona intestinalis]
          Length = 1142

 Score =  107 bits (267), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 89/319 (27%), Positives = 149/319 (46%), Gaps = 27/319 (8%)

Query: 1111 LAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              +GTA+V  E+   + GR+L+F      DN   LV E   KE+KGA+  L    GH+L 
Sbjct: 832  FVVGTAFVYMEETEPKHGRILVF---HYIDNKLTLVAE---KEVKGAVFCLCQFNGHVLA 885

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
            A    + +++WT  +       +   +  + L    +F+L+GD+ +S+  L++K     L
Sbjct: 886  AINTSVSIYQWTTEKELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVEGNL 945

Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
            + +AKD+      A E L D + L    ++   N+ I       +   +  KL   A FH
Sbjct: 946  DEIAKDYSPNWMTAVEILDDDNFLG---AENFYNVFICQKDSGATTDEERSKLREAALFH 1002

Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
            VG  +  F    ++  +   T  +      ++  +LFGT+ GSIG I  +DE  +  L S
Sbjct: 1003 VGDSINTFRHGSLVMQNVGETAVS------SKGHILFGTVHGSIGVITTVDEDLYAFLHS 1056

Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
            +Q +L   +  V  ++  S+R F +N K  AHR      VD +L+  +  L  E+  E+A
Sbjct: 1057 IQNRLAKVIKSVGNIDHESWRSFCTNEKTEAHR----GFVDGDLIECFLDLNREKMAEVA 1112

Query: 1408 -----HQTGTTRSQILSNL 1421
                  + GT R   + +L
Sbjct: 1113 KGLMVKEHGTKREATVDDL 1131



 Score = 48.5 bits (114), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 92/217 (42%), Gaps = 40/217 (18%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F+ RIE   VI+ +           F+HGY  P +VI+++      GR    H    I  
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS---QGR----HVKTYIVD 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           +     +      W   N+  +A  ++ VP P+ G +++G  +I YH+      +A    
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIA---- 249

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDG-RVVQR 400
              L   Q+       V+ D +   +L  D+A      G L +L +    + DG   V+ 
Sbjct: 250 --PLCFFQDTINCYAPVDKDGSR--YLLGDLA------GHLFILLLESDEMMDGTNTVRD 299

Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L +       +   I+ + N + ++GSRLGDS L++ 
Sbjct: 300 LKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIRL 336


>gi|363750592|ref|XP_003645513.1| hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
 gi|356889147|gb|AET38696.1| Hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
          Length = 1318

 Score =  107 bits (266), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 73/321 (22%), Positives = 146/321 (45%), Gaps = 16/321 (4%)

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1154
             N+ TK     L +G  YV+ ED++  G   L+        P    T     E++ ++++
Sbjct: 979  LNSNTKRKREYLVVGNTYVRDEDISGTGSFYLYDITEVVPEPGKPDTNYKFKEIFQEDIR 1038

Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
            G +S +  + G  +I+   K ++        +  +AF D P +++       N +++GD 
Sbjct: 1039 GTVSTVCEISGRFMISQSSKAMVRDIQEDNSVVPVAFLDMP-VFITDAKSFGNLMIIGDA 1097

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
                 F+ +  +  ++  L K    L+  + EFL++   +  +++D  + + +  YAP  
Sbjct: 1098 MHGFTFVGFDAEPYRMITLGKSVTKLETMSLEFLVNNGDMYFIITDRSQVMHVLKYAPDE 1157

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
              S  GQ+L+    F++ + +   +RL              GS  +  F  +   +DGSI
Sbjct: 1158 PNSLSGQRLVYCTSFNLHS-INTCMRLIQKNNEFVDLRRNYGSHMST-FQCIGCHIDGSI 1215

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1390
              + PL E ++RRL  +Q++++D    + GLNPR  R    ++  G   RP    ++D  
Sbjct: 1216 FKVVPLTESSYRRLYLVQQQIIDKEVQLCGLNPRMERLQNPYYQLGHLLRP----MLDFT 1271

Query: 1391 LLSHYEMLPLEEQLEIAHQTG 1411
            +L  +  L + ++  +A + G
Sbjct: 1272 ILKKFSTLSISKRRSMASKAG 1292



 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 135/646 (20%), Positives = 266/646 (41%), Gaps = 105/646 (16%)

Query: 97  SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
           +   L L   ++L G+V S+A++ Q G++       +++     K+S+L+FD     L  
Sbjct: 44  AKGQLVLSYEWKLSGHVHSMALIPQPGSE----LYCLVILTGCGKLSILKFDHMSQSLDT 99

Query: 157 TSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGV----LVYGLQMIILKASQGGS 211
            S+H +E   + L L       +  P + VD   RC  V     +  L + + K  +   
Sbjct: 100 LSLHYYEDKFKELSLLE----ISNTPSLIVDRSFRCLLVRNNDCIAILPLNVTKEEEEEE 155

Query: 212 GLVGDEDTFGSGGGFSAR------------IESSHVINLRDL--DMKHVKDFIFVHGYIE 257
                ++   +GG FS +            + SS ++    L  D+K+V D  F+HG+ +
Sbjct: 156 EDNEKDEDRSNGGRFSFKRHKLNGGSVKQFVNSSTIMPASHLHSDIKNVLDVQFLHGFNK 215

Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
           P + IL++  L W+G    +  T  +  LS+    ++  +I     LP+D + L+ + + 
Sbjct: 216 PTLAILYQPILAWSGNEKLRSQTVKVIILSLDFEDEKSTVINIIQGLPNDLHTLIPLSN- 274

Query: 318 IGGVLVVGANTIHYHSQSASC--ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ-- 373
               +VVG N + Y   + +    ++LN+++ ++ +++    SS     +     +    
Sbjct: 275 --ASIVVGVNELIYIDNTGALQGTVSLNSFSKTVLNTKVKDNSSLQAFFNRPVCQYTTIS 332

Query: 374 --NDVALLSTKTGDLVLLTVVYDGRVVQ-----RL----DLSKTN--PSVLTSDITTIGN 420
              D+ LL  +   +  + +  +GR+V      RL    D+ K N  P+ +  D+     
Sbjct: 333 KGKDIMLLMDEKSQMYNVIIESEGRLVTAFNCVRLPIVNDIFKNNHLPTCICGDVDLETG 392

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
           +L F+G + GD++ V+      +S+ S G   E   +EAD    +               
Sbjct: 393 NL-FIGFKSGDAMRVRLN-NLRSSLASKGNVVE--TMEADEDYDE--------------- 433

Query: 481 GEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
                LYG ++   +    T   F     D+L+NIGPL   + G   + + +   ++  +
Sbjct: 434 -----LYGGSTEVEKKNMDTETPFDIETLDNLINIGPLTSLAVGKVSSIEPTIAKLTNPN 488

Query: 538 NYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD------------DEYHAYLIISLEA 585
             EL  +    G  T  H +   +    +   A               +   YL+ +  +
Sbjct: 489 RCEL-SIVATSGNSTGSHLTVFENTIVPTVEKALKFISVTQIWNLKIKDKDKYLVTTDSS 547

Query: 586 RTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQ 641
           ++   + + D   +  +S D+     T++       +R++QV  +G  + D ++   MT 
Sbjct: 548 QSKSDIYSIDRDFKPFKSFDFKKNDTTVSTAVTGAGKRIVQVTSKGVYLFDINFKRMMTM 607

Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
           +  F               V+ V I DP++LL  S G I++   +P
Sbjct: 608 NFDF--------------EVVHVCINDPFLLLTNSKGDIKIYELEP 639


>gi|323309632|gb|EGA62840.1| Cft1p [Saccharomyces cerevisiae FostersO]
          Length = 1357

 Score =  106 bits (264), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)

Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
            W+   +   P  S  N +   ++ + + T ++ E ++A G A    ED    G   ++  
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
                  P        + E++ +E+ G +S +  + G  +I+   K+++        +  +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF D P ++V       N +++GD  +   F+ +  +  ++  L +        + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G  +    +D  +N+ +  YAP    S  GQ+L+  + F +  H T      ML   ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  GS +   F  + G +DGS+  I PL E  +RRL  +Q++++D    + GLNPR 
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290

Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             R    F+  G + RP    ++D  ++  +  L ++ +  IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|323338222|gb|EGA79455.1| Cft1p [Saccharomyces cerevisiae Vin13]
 gi|365766372|gb|EHN07870.1| Cft1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 1357

 Score =  106 bits (264), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)

Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
            W+   +   P  S  N +   ++ + + T ++ E ++A G A    ED    G   ++  
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
                  P        + E++ +E+ G +S +  + G  +I+   K+++        +  +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF D P ++V       N +++GD  +   F+ +  +  ++  L +        + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G  +    +D  +N+ +  YAP    S  GQ+L+  + F +  H T      ML   ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  GS +   F  + G +DGS+  I PL E  +RRL  +Q++++D    + GLNPR 
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290

Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             R    F+  G + RP    ++D  ++  +  L ++ +  IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|151942273|gb|EDN60629.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
            YJM789]
          Length = 1357

 Score =  106 bits (264), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)

Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
            W+   +   P  S  N +   ++ + + T ++ E ++A G A    ED    G   ++  
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
                  P        + E++ +E+ G +S +  + G  +I+   K+++        +  +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF D P ++V       N +++GD  +   F+ +  +  ++  L +        + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G  +    +D  +N+ +  YAP    S  GQ+L+  + F +  H T      ML   ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  GS +   F  + G +DGS+  I PL E  +RRL  +Q++++D    + GLNPR 
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290

Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             R    F+  G + RP    ++D  ++  +  L ++ +  IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|71413583|ref|XP_808925.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
            strain CL Brener]
 gi|70873226|gb|EAN87074.1| cleavage and polyadenylation specificity factor, putative
            [Trypanosoma cruzi]
          Length = 444

 Score =  106 bits (264), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 104/451 (23%), Positives = 195/451 (43%), Gaps = 51/451 (11%)

Query: 996  QKIPLKATPHQITYFAEKNLYPLIVSV-----PVLKPLNQVLSLLIDQEVG--HQIDNH- 1047
            ++I L  TPH + Y        ++ S      P   P +  L+++ D+E G    I    
Sbjct: 2    RRIHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQRAPFDFQLNIVYDEESGGVQSITTEA 61

Query: 1048 ---NLSSVDLH---RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFN 1101
               N+  +  +   R    + +E+R++    +   W    T+ ++ +E  L  +++ +  
Sbjct: 62   PVCNMPPIAPNAGIRVPMADRFEIRLM----STTDWACTDTLLLEENERVLGAQMMEIHC 117

Query: 1102 TTTKE---NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAIS 1158
                E      +  + TA+  GED+  RGR+LL +T       + L+   +S+ L G  +
Sbjct: 118  EKDAEGLHTAPVCVVSTAFPLGEDITCRGRILLLATMCTKKKRKILL--FHSEPLNGPAT 175

Query: 1159 ALASLQGHLLIASGPKIILHK--WTGTELN-GIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
            A+  ++ H+ +A G  I L +  W   +L  G   Y     YV  ++  +N+++ GD+ +
Sbjct: 176  AVVGIRHHIAVAVGGTIKLFRFDWEKRKLVVGALLYAGT--YVTRMSSFRNYLIYGDLSR 233

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            S     + E+   L++L KD  ++     + +       L+ SD+++N+ +  Y P++ E
Sbjct: 234  SCAIARFNEENHTLSVLGKDRNAVSVVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQE 293

Query: 1276 SWKG--QKLLSR-----AEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
            +  G   K+L        E+ + G  + K LR + LA +S  T              L+ 
Sbjct: 294  TEAGSPNKVLESVLSLDGEYRLSGGCLVKSLRFRSLAGNSSVT--------------LYV 339

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSI 1386
            T  G IG I P+ E   R    L ++L   +PH AGL PR F      + +      + +
Sbjct: 340  TNYGEIGFIVPIGEQANRTASWLMRRLQIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEML 399

Query: 1387 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
            V   LL+ +  L +  +  IA    T   ++
Sbjct: 400  VSASLLNEFFFLDIHSRKTIASAAYTQLERV 430


>gi|207346484|gb|EDZ72967.1| YDR301Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 1357

 Score =  106 bits (264), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)

Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
            W+   +   P  S  N +   ++ + + T ++ E ++A G A    ED    G   ++  
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
                  P        + E++ +E+ G +S +  + G  +I+   K+++        +  +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF D P ++V       N +++GD  +   F+ +  +  ++  L +        + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G  +    +D  +N+ +  YAP    S  GQ+L+  + F +  H T      ML   ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  GS +   F  + G +DGS+  I PL E  +RRL  +Q++++D    + GLNPR 
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290

Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             R    F+  G + RP    ++D  ++  +  L ++ +  IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|6320507|ref|NP_010587.1| Cft1p [Saccharomyces cerevisiae S288c]
 gi|74583567|sp|Q06632.1|CFT1_YEAST RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|849213|gb|AAB64737.1| Ydr301wp [Saccharomyces cerevisiae]
 gi|256271799|gb|EEU06830.1| Cft1p [Saccharomyces cerevisiae JAY291]
 gi|285811316|tpg|DAA12140.1| TPA: Cft1p [Saccharomyces cerevisiae S288c]
 gi|392300415|gb|EIW11506.1| Cft1p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 1357

 Score =  106 bits (264), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)

Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
            W+   +   P  S  N +   ++ + + T ++ E ++A G A    ED    G   ++  
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
                  P        + E++ +E+ G +S +  + G  +I+   K+++        +  +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF D P ++V       N +++GD  +   F+ +  +  ++  L +        + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G  +    +D  +N+ +  YAP    S  GQ+L+  + F +  H T      ML   ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  GS +   F  + G +DGS+  I PL E  +RRL  +Q++++D    + GLNPR 
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290

Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             R    F+  G + RP    ++D  ++  +  L ++ +  IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|190404756|gb|EDV08023.1| 150 kDa protein associated with polyadenylation factor 1
            [Saccharomyces cerevisiae RM11-1a]
 gi|259145538|emb|CAY78802.1| Cft1p [Saccharomyces cerevisiae EC1118]
          Length = 1357

 Score =  106 bits (264), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)

Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
            W+   +   P  S  N +   ++ + + T ++ E ++A G A    ED    G   ++  
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
                  P        + E++ +E+ G +S +  + G  +I+   K+++        +  +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF D P ++V       N +++GD  +   F+ +  +  ++  L +        + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G  +    +D  +N+ +  YAP    S  GQ+L+  + F +  H T      ML   ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  GS +   F  + G +DGS+  I PL E  +RRL  +Q++++D    + GLNPR 
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290

Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             R    F+  G + RP    ++D  ++  +  L ++ +  IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|349577352|dbj|GAA22521.1| K7_Cft1p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 1357

 Score =  106 bits (264), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)

Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
            W+   +   P  S  N +   ++ + + T ++ E ++A G A    ED    G   ++  
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
                  P        + E++ +E+ G +S +  + G  +I+   K+++        +  +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF D P ++V       N +++GD  +   F+ +  +  ++  L +        + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G  +    +D  +N+ +  YAP    S  GQ+L+  + F +  H T      ML   ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  GS +   F  + G +DGS+  I PL E  +RRL  +Q++++D    + GLNPR 
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290

Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             R    F+  G + RP    ++D  ++  +  L ++ +  IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|302652141|ref|XP_003017930.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
 gi|291181516|gb|EFE37285.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
          Length = 844

 Score =  105 bits (262), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 183/741 (24%), Positives = 288/741 (38%), Gaps = 110/741 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS       +  R    D    A L L   Y + G +  L
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSTTGTQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHL 170
             +      NS+   D+I+++  DAK+S++E+D   HG+   S+H +E  E     W+  
Sbjct: 85  QRVR---ISNSKSGGDAILVSSRDAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVP- 140

Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDEDT- 219
                S + G  + VDP G C  +  +G+  + IL   Q G  LV         GD+ T 
Sbjct: 141 --DLGSCSSG--LTVDPNGNC-AIFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTD 195

Query: 220 FGSGGGFSARIESSH--------VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
             S    SA   +SH        V+ +  LD  + H     F+H Y EP   IL+ +   
Sbjct: 196 MVSDAQKSAPGNTSHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVAR 255

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
                  +      S  ++    K    + +   LP D +K++ +P P+GG L++G N  
Sbjct: 256 STSLTIDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNEL 315

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
           +H      + A+ +N +A    +     +S   + L+      L +     LL    G +
Sbjct: 316 VHVDQAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRM 375

Query: 387 VLLTVVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
            +L+   DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+
Sbjct: 376 SILSFKVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLL 431

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE 495
            ++  S T+   S  K   G  E  A           D   D +  ++L     AS   E
Sbjct: 432 GWSRPSSTTKRPS--KSVDGVDENGAADLSDEADQDDDGDDDDMYEDDLYSVNPASTRQE 489

Query: 496 S------AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
                  +   F+F   D L ++GP +D + G    + +     S       +EL   +G
Sbjct: 490 KQVVNGDSPADFTFRAYDRLWSLGPYRDITLGKPSKSKSKDQQDSVPEIAAPLELVAARG 549

Query: 550 I-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-----------MVLETA 593
                  TV  +    +  DS +M   DD Y  + I  ++ ++            +L   
Sbjct: 550 FGKSGGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAK 606

Query: 594 DLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGS 637
               +  ESV Y V                +  TI  G L    RV+QV     R  D  
Sbjct: 607 SKGEDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD-- 664

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQ 695
           Y        P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ
Sbjct: 665 YNLGLAQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQ 722

Query: 696 TPAAIESSKKPVSSCTLYHDK 716
             AA   S K +S C LY DK
Sbjct: 723 GSAA---SGKWISGC-LYEDK 739


>gi|45184764|ref|NP_982482.1| AAL060Wp [Ashbya gossypii ATCC 10895]
 gi|74695871|sp|Q75EY8.1|CFT1_ASHGO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|44980110|gb|AAS50306.1| AAL060Wp [Ashbya gossypii ATCC 10895]
 gi|374105681|gb|AEY94592.1| FAAL060Wp [Ashbya gossypii FDAG1]
          Length = 1305

 Score =  105 bits (262), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 79/343 (23%), Positives = 150/343 (43%), Gaps = 33/343 (9%)

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1154
             N+ TK     L +G  YV+ ED+   G   L+        P    T     +++ ++++
Sbjct: 967  LNSNTKRRREYLVVGNTYVRDEDIGGTGSFYLYDITEVVPEPGKPDTNYKFKDIFQEDIR 1026

Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
            G +S +  + G  +I+   K ++        +  +AF D P +++       N +++GD 
Sbjct: 1027 GTVSTVCEISGRFMISQSSKAMVRDIQEDNSVVPVAFLDMP-VFITDAKSFGNLMIIGDS 1085

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
             +   FL +  +  ++  L K    L+    EFL++   +  +V+D    + +  YAP  
Sbjct: 1086 MQGFSFLGFDAEPYRMLTLGKSVSKLETMCVEFLVNNGDVYFLVTDRNNLMHVLKYAPDE 1145

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR--------FALL 1325
              S  GQ+L+    F++ +  T       L   +D  G      K +R        F  +
Sbjct: 1146 PNSLSGQRLVHCTSFNLHSTNT----CMRLIKKNDEFG------KVSRGFGIYMPSFQCI 1195

Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPG 1382
                DG+I  + PL E ++R L  +Q++L+D    + GLNPR  R    F+  G   RP 
Sbjct: 1196 GSQADGTIFKVVPLSEASYRSLYLIQQQLIDKEVQLCGLNPRMERLENPFYQMGHILRP- 1254

Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTG-TTRSQILSNLNDL 1424
               ++D  +L  +  L +  ++ +A + G    ++I  +L D+
Sbjct: 1255 ---MLDFTVLKRFATLSIPTRMTMASKAGRQAHAEIWRDLIDI 1294



 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 120/624 (19%), Positives = 248/624 (39%), Gaps = 124/624 (19%)

Query: 140 AKISVLEFDDSIHGLRITSMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
            ++S++ FD     L   S+H +++   E   L  G       P ++ +P  RC  +LV+
Sbjct: 82  GRVSIVRFDAENQTLETESLHYYDAKFEELSALTVGA-----APRLEQEPAARC--LLVH 134

Query: 198 GLQMIILKASQGGSGLV-------------GDEDTFGSGGGFSARIESSHVINLRDLDMK 244
               + +   +G                     D  G   G S  + +SH+ +    D+K
Sbjct: 135 NGDCLAVLPLRGHEEEGEEAEEEEEHPAKRARTDADGRLVGASTVMPASHLHS----DIK 190

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
           +VKD  F+ G  +  + +L++ +L+W G       T     LS+    ++  +I     L
Sbjct: 191 NVKDMRFLRGLNKSAVGVLYQPQLSWCGNEKLTRQTMKFIILSLDLDDEKSTVINMLQGL 250

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC--ALALNNYAVS-----------LDS 351
           P+  + ++ + +   G ++ G N + Y   + +   A++LN ++ S           L +
Sbjct: 251 PNTLHTIIPLSN---GCVLAGVNELLYVDNTGALQGAISLNAFSNSGLNTRIQDNSKLQA 307

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL---------D 402
             E P   F+ + +         D+ LL  +   +  + +  +GR++            +
Sbjct: 308 FFEQPLCYFATQSNG-------RDILLLMDEKARMYNVIIEAEGRLLTTFNCVQLPIVNE 360

Query: 403 LSKTN--PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
           + K N  P+ +  ++     SL F+G + GD++ V+       + L S L+         
Sbjct: 361 IFKRNMMPTSICGNMNLETGSL-FIGFQSGDAMHVRL------NNLKSSLEH-------- 405

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGP 514
                  + + S+ L+   + + + LYG   NN E  +K       F     D L+NIGP
Sbjct: 406 -------KGTVSETLE--TDEDYMELYG---NNAEKEKKNLETESPFDIECLDRLLNIGP 453

Query: 515 LKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADSSRMAA 570
           +   + G   + + +   ++  +  EL  +     G     T+   +       + +  +
Sbjct: 454 VTSLAVGKASSIEHTVAKLANPNKDELSIVATSGNGTGSHLTILENTIVPTVQQALKFIS 513

Query: 571 YDDEYH-------AYLIISLEARTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
               ++        YL+ +  ++T   + + D   +  ++ D+     T++     G +R
Sbjct: 514 VTQIWNLKIKGKDKYLVTTDSSQTRSDIYSIDRDFKPFKAADFRKNDTTVSTAVTGGGKR 573

Query: 623 VIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
           ++QV  +G  + D ++   MT +  F               V+ V I DP++LL  S G 
Sbjct: 574 IVQVTSKGVHLFDINFKRMMTMNFDF--------------EVVHVCIKDPFLLLTNSKGD 619

Query: 680 IRLLVGDPSTCTVSVQT--PAAIE 701
           I++   +P      V+T  P A++
Sbjct: 620 IKIYELEPKHKKKFVKTVLPDALK 643


>gi|340059653|emb|CCC54046.1| putative mitochondrial carrier protein [Trypanosoma vivax Y486]
          Length = 1481

 Score =  105 bits (262), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 115/507 (22%), Positives = 214/507 (42%), Gaps = 63/507 (12%)

Query: 899  AYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRER--LRVHPQLCDGSIV 955
            A    ET     C R  I F +++G+ G ++ G  P + +  R    L  +     G + 
Sbjct: 939  AMIESETQLTRHCSRSIIPFDSLAGNVGAYVCGRHPLFLLWDRRTGLLSGYRHQIQGPVR 998

Query: 956  AFTVLH-----NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYF 1010
             F          V C  GF   T   ++     P G    + W  ++I + ATPH I+Y 
Sbjct: 999  GFAPFPLMEGGFVYCGEGF---TDFAVMNTYCRPIG----HGWLGRRIDVGATPHFISYN 1051

Query: 1011 AEKNLYPLIVS-----VPVLKPLNQVLSLLIDQEVG--HQIDNHNLS-------SVDLHR 1056
                   ++ S      P   P +  L +  ++E G    I    L+       S    R
Sbjct: 1052 MPGRGCFVVTSHKQPFRPQRAPFDVQLKISYNEETGAIQSIATEPLTCSMPPIASSAGVR 1111

Query: 1057 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETL--LAI 1113
                + +EVR +    A   W    T  ++ +E  L++++V +  +   K N T+    +
Sbjct: 1112 VPMADWFEVRFMST--AHVDWPCEDTFKLEENERVLSIQMVQIDGDRGMKINGTVPVCVV 1169

Query: 1114 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1173
             TA+  G+DV  RGR+ L +T   +    + +  ++++ L G  +A+A ++ H+ +A G 
Sbjct: 1170 STAFPLGDDVTCRGRIHLLAT--KSLRRGHKIVHLHAEALNGPATAVAEIRHHIAVAVGG 1227

Query: 1174 KIILHKW---TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1230
             I ++++   +G  +  +  Y    +Y   L++++N+I+ GD+  S     + E+   L 
Sbjct: 1228 TIKIYRYDWQSGKLVVSVLLYAG--IYATKLSVIRNYIVYGDLIHSCAMARFNEENHTLT 1285

Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK-------LL 1283
            +L ++  S+       +    +  ++ SD+Q+N+ +  Y P++ E+  G+        L 
Sbjct: 1286 VLGRNRNSISVVDCNMMYHDRSFGILCSDDQRNVLVMGYTPRVQEAGAGRPAKTLESLLT 1345

Query: 1284 SRAEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
               E+ +    + K LR                SD  N   +L+ +  G +G I P+ E 
Sbjct: 1346 LDGEYRLPSGCLAKSLRFS--------------SDFGNSSVMLYTSNYGEVGFIVPIGEQ 1391

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
              R    + ++L   VP  AGL PR F
Sbjct: 1392 ANRTALWVTRRLQTDVPCDAGLTPRMF 1418



 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 61/231 (26%), Positives = 103/231 (44%), Gaps = 41/231 (17%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMISALSISTTL 292
           +++V+D  F+    EP++ IL ER+ TWAGRV    W+         T  ++ + IS ++
Sbjct: 268 IRYVRDLQFIGSSGEPLLAILCERQPTWAGRVKLVEWRTKVVESNTLTMHVTWVQISASM 327

Query: 293 KQHP---LIWSAMNLPHDAYKLLAV---PSPIGGVLVVGANTIHYHSQSASCALALNNY- 345
             HP   LI     +P++   +L V      + GV+  G N I + +         N+  
Sbjct: 328 TAHPKLLLIGEVEGVPYNVTHMLPVEPFSQTMSGVVCFGTNVIMHITTKRGYGAYFNDTG 387

Query: 346 ----------AVS----------LDSSQELPRSSFSVELDAAHATW--LQNDVALLSTKT 383
                     AVS          LD S  L R + S+   AA +    + +++ +L+   
Sbjct: 388 REECINSKFSAVSFGKAVWSDPQLDKSSALARVNMSLANCAATSMVGKMGDELQVLALLE 447

Query: 384 GDLVLLTV--VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
            D V++T+  V  G  V+ + ++        S ++ IG  L FLGS +GDS
Sbjct: 448 EDGVVITLHFVARGSSVEEVRITMLGSGCYCSSVSRIGRQLVFLGSTVGDS 498


>gi|390358537|ref|XP_001201130.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Strongylocentrotus purpuratus]
          Length = 283

 Score =  105 bits (261), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 126/273 (46%), Gaps = 51/273 (18%)

Query: 3   FAAYKMMHWPTGIANC-GSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVT 61
           +A Y+ +H PTG+ +C    F +                      P ++      NLVV 
Sbjct: 2   YAFYREIHPPTGVEHCVYCHFFS----------------------PDQQ------NLVVA 33

Query: 62  AANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ 121
             + + +Y +   +      K+S    +           LE    + + G V S+    Q
Sbjct: 34  KGSELTVYSMITVDSNKPTDKDSKPKNK-----------LEEAATFHIFGKVMSM----Q 78

Query: 122 GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGP 181
                   RD+++L+F +AK+S++E+D ++H L+  SMH FE  E    K G       P
Sbjct: 79  SAQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDE---TKEGVYRNIFHP 135

Query: 182 LVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL 241
           +VKVDP  RC  +L YG ++++L   +   GLV D D   S       +  S+VI L ++
Sbjct: 136 VVKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEM 193

Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
           D  + +V D  F+HGY EP ++IL+E   TWAG
Sbjct: 194 DDPICNVLDIQFLHGYYEPTLLILYEPLRTWAG 226


>gi|348679451|gb|EGZ19267.1| hypothetical protein PHYSODRAFT_492468 [Phytophthora sojae]
          Length = 736

 Score =  104 bits (259), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 111/427 (25%), Positives = 169/427 (39%), Gaps = 102/427 (23%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+L++   V D  F+ GY+EP +++LHE   + +  GR++    T  I+ +SI+  
Sbjct: 261 LLRLRELEITGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 320

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
            + HP IW+  NLP D +KL+   +P+GGV+V+ AN   Y +Q+    LA N     L  
Sbjct: 321 TRLHPKIWTVKNLPSDCFKLIPCRAPLGGVVVLSANAFLYFNQTQFHGLATN----VLRE 376

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL-DLSKTNPSV 410
             +   +  ++ L      +L     LL+   GD  +L++ Y+ R V+    + KT    
Sbjct: 377 QDDHEMAQLNIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDRSVRFWRSIKKT---- 432

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF------TCGSGTSMLSSG----LKEEFGDIEAD 460
                        F+GSR GDS+L         + G   S L       +KEE    E  
Sbjct: 433 ------------LFVGSRSGDSVLYALDQKKLTSAGGEASKLQEDEEMLIKEEVVKEEVT 480

Query: 461 -----------------------APSTKRLRRSSSDALQDMVNGEELSLYGSASN----N 493
                                  AP+ +    S S    + VNG   S      N     
Sbjct: 481 AEVKAEPAEEEEEDEDDLFLCGAAPTKEEPTTSGS---TEAVNGTNGSAVKKEENGHAVE 537

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV----------- 542
            ES    +     D L +IG +      +  NAD      S +   ELV           
Sbjct: 538 EESGPYDYVLHQIDVLPSIGQITSIELSIENNAD------SNEKREELVISGGYEHSGAI 591

Query: 543 ---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                          EL GC+ +WTV         +   R       Y+AYLI+S+  RT
Sbjct: 592 SVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSSDGR------SYNAYLILSVAHRT 645

Query: 588 MVLETAD 594
           MVL T +
Sbjct: 646 MVLRTGE 652


>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
          Length = 1135

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/339 (28%), Positives = 157/339 (46%), Gaps = 25/339 (7%)

Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFST-GRNADNP 1141
            P+ + EN  ++ V  +F    KE    L +GT YV+ ++   A GR+L+FS  G+ A+  
Sbjct: 789  PLDAFENGSSM-VSCVFANDKKE---YLVVGTGYVREDECEPAVGRLLVFSVEGQGAERK 844

Query: 1142 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY---V 1198
             +L  EV   E +GA+  L    G LL     K+ L +W   + +GI        Y   +
Sbjct: 845  VDLAAEV---ETRGAVYVLNGFNGKLLACINSKVQLFRWIEKD-DGIQELQTECGYHGHI 900

Query: 1199 VSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
            ++L++    +FI++GD+ +S+  L +K     +  +A+D+ +    A E L D   +   
Sbjct: 901  LALHMQSRGDFIIVGDLMRSVSLLVYKAVDGAIEEVARDYHANWMTAVEMLNDDVYIG-- 958

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG- 1315
              +   NI         +   +  +L  + EFH+G  V KF R  +L  SS+    +PG 
Sbjct: 959  -GEADCNIFTLRRNADAATEEERARLEIQGEFHLGEFVNKFCRGSLLMQSSEVN--SPGG 1015

Query: 1316 --SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
              S       LLFGT++G +G I  L E   R L  LQ  +   V  V G +   +R F 
Sbjct: 1016 MDSPLVKGQPLLFGTVNGMVGTILTLTEDNHRFLAQLQTAMTKVVKGVGGFSHDEWRSF- 1074

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
            +NG+   P  +  +D +L+  Y  +P   Q E+     T
Sbjct: 1075 TNGRRTSPSSN-FIDGDLVESYLDMPRHNQEEVLRHVDT 1112



 Score = 60.1 bits (144), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 96/418 (22%), Positives = 166/418 (39%), Gaps = 94/418 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
             A+ +     N+R L+   V D  F+ G  +  + +L++ +       + +H    I  
Sbjct: 143 MDAKGQLKDAFNIR-LEELEVLDIQFLSGCPKATIAVLYQDQR------NARH----IKT 191

Query: 286 LSISTTLKQHPL-IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
            +IST  K+     W+ +N+ H+A +L+ VP+P GGVL++G  TI YHS  A   + + N
Sbjct: 192 YTISTRDKEFDTGPWAQLNVEHNASELIPVPAPFGGVLILGHQTICYHSGKAFITIPIQN 251

Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL--VLLTVVYDGRVVQR 400
             +                   A+  W+  D +  L+S  +G L  V+LT       V+ 
Sbjct: 252 TRM------------------CAYG-WVDADGSRLLVSDHSGGLHVVILTPDATNTAVET 292

Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
             +     +   S I+ + N + F+GS  GDS L++                   + E D
Sbjct: 293 AHIEALGETSCASSISYLDNGVVFIGSASGDSQLIKL------------------NPEKD 334

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           A  T      + D L  +++              +    T S   +D     G L+    
Sbjct: 335 AQGTYIQVLETYDNLGPILD----MCVADLDRQGQGQAVTCSGCSKD-----GSLRIIRN 385

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
           G+ IN  A+            +EL G KG+W++     R  N +  +          YL+
Sbjct: 386 GIGINEHAA------------IELAGIKGMWSL-----RPSNTNHDK----------YLV 418

Query: 581 ISLEARTMVL---ETADLLTEVTE-SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
            +  + T VL   E  D   ++ E  +  F +G T+  G   G    +QV +RG  ++
Sbjct: 419 QAFISETRVLAFEEDEDGDHQLAEGEIAGFQEGCTLFCG-CVGGNMAVQVTKRGVVLI 475


>gi|242208344|ref|XP_002470023.1| predicted protein [Postia placenta Mad-698-R]
 gi|220730923|gb|EED84773.1| predicted protein [Postia placenta Mad-698-R]
          Length = 696

 Score =  100 bits (250), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 95/313 (30%), Positives = 141/313 (45%), Gaps = 33/313 (10%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDV 1123
            + ++ P+  G  W T         E    +  VTL  T+T       + +GT     ED+
Sbjct: 396  LELISPEPEG--WVTMDGFESAQKEFVTCLDCVTLETTSTGSGMMDFIIVGTTINCREDL 453

Query: 1124 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
            A +G V +FS      + Q         + KG ++AL  L   L+ + G KI +  +   
Sbjct: 454  AVKGAVYIFSIVEVVPDLQ------CRDDAKGPVAALCGLNNSLVSSMGQKIFVRAFDLN 507

Query: 1184 E-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
            E L G+AF D   +Y+ SL  VKN +++ D  KS       +   +L +L KD   + C 
Sbjct: 508  ERLVGVAFLDVG-VYITSLRAVKNLLVISDAVKS-------KDPYKLVILGKDPYQV-CV 558

Query: 1243 ATE--FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1300
             T   F  DG    L++ DE   I+I+ Y P   ES  GQ LL R EFH           
Sbjct: 559  TTADLFFADGQVF-LLIGDEDGVIRIYEYDPHDPESRGGQHLLRRTEFHG---------- 607

Query: 1301 QMLATSSDRTGAAPGSD-KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP 1359
            QM +  S       G D    +  L+ G+ +GS+     +DE+  +RL  LQ +L  +V 
Sbjct: 608  QMESRMSILIIRRRGKDTDIPQARLISGSTNGSLSMFTYVDEVASKRLHLLQGQLTRNVQ 667

Query: 1360 HVAGLNPRSFRQF 1372
            HV GLNP+ FR +
Sbjct: 668  HVVGLNPKVFRPY 680


>gi|159470707|ref|XP_001693498.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283001|gb|EDP08752.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 366

 Score =  100 bits (249), Expect = 6e-18,   Method: Composition-based stats.
 Identities = 68/216 (31%), Positives = 93/216 (43%), Gaps = 53/216 (24%)

Query: 921  SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILK 979
            + H G F++G+RP W +  R  L  H    +G + A T  HNVNC  GFI   S +G LK
Sbjct: 164  ANHSGVFVAGARPLWLVAGRGGLAAHAMWSEGPVAALTPFHNVNCPLGFITACSARGQLK 223

Query: 980  ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
            +C LP  +  D  W  +++PLK TPH++ +F E      IV+    +P            
Sbjct: 224  VCCLPPHTRLDGAWATRRVPLKVTPHRLAWFREAG----IVAAITSRPAPS--------- 270

Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
                            R    EE           GG             E AL ++ V L
Sbjct: 271  ----------------RPRPAEE----------PGG-------------EQALCLKFVYL 291

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1135
             N TT + +TLLA+GT    GED    GR+LL+S  
Sbjct: 292  RNATTGDTDTLLAVGTGTPLGEDYPCLGRLLLYSVA 327



 Score = 56.6 bits (135), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 7/111 (6%)

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL------SFGPSNSESGSGSE 657
           +Y     TIAAGNLF    ++Q    G R+L+G  + QDL      + G   + S  G  
Sbjct: 4   EYITDQPTIAAGNLFHNAVIVQACPGGVRLLEGMSLVQDLPLSELQALGGVAAASRPGVA 63

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPA-AIESSKKPV 707
             T+  + +ADPYVL+ +S+G+  LL  D  + T+ +   A  +   ++PV
Sbjct: 64  PPTITHMQVADPYVLVSLSNGTACLLEADLLSMTLGLGGCAGGVPDCERPV 114


>gi|403218521|emb|CCK73011.1| hypothetical protein KNAG_0M01580 [Kazachstania naganishii CBS 8797]
          Length = 1345

 Score =  100 bits (248), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 65/321 (20%), Positives = 147/321 (45%), Gaps = 18/321 (5%)

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELK 1154
             N+ TK     + +GT++V  ED+ A G   ++        P        + + + +EL+
Sbjct: 1010 LNSRTKRKIEYVVVGTSFVGTEDLPATGSFQMYDIAEVVPEPGKPDTNYKIKQFFKEELR 1069

Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
             A++++  + G  +I+   K+++        +  +AF D P L+   +    N +++GD 
Sbjct: 1070 SAVTSVCDISGRFVISQSQKLMVRDAQEDNSVVPVAFLDIP-LFTADMKSFGNLLIIGDA 1128

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
             + I  + +  +  ++  L +     +  + EFL++G  L   + D    + +  YAP  
Sbjct: 1129 MQGIQLVGFDAEPYRMIPLGRSVLKFETLSLEFLVNGGDLYFTLIDRNDILHVLKYAPDE 1188

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
              S  GQ+L+  + F++ +  +       L   ++     P +     + ++ G  DGS+
Sbjct: 1189 PNSLSGQRLIHCSSFNMYSTTS----CTRLIPKNELFVDGPLNPAIQSYQVIGGQADGSL 1244

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1390
              + P+ E  +RRL  +Q++++D    +AG+NP+  R    ++      RP    ++D  
Sbjct: 1245 FKVMPVPETVYRRLYVVQQQIIDKETPLAGINPKMERLSNDYYQTSHLLRP----MLDYN 1300

Query: 1391 LLSHYEMLPLEEQLEIAHQTG 1411
            ++  +  + + ++  +AH+ G
Sbjct: 1301 VVKQFCAMSIPKRTTLAHKLG 1321



 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 149/708 (21%), Positives = 285/708 (40%), Gaps = 150/708 (21%)

Query: 61  TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
           T A+ +E+ +VR       +   SG+              L L   ++L   +  LA++ 
Sbjct: 22  TTADYVELLIVRTNLLSIYKVTESGK--------------LLLTHEFKLQARITDLALV- 66

Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF--- 177
            G  +N+   + ++L   + K+S+++F+   + L   S+H +E       K    SF   
Sbjct: 67  -GSVENTGL-NYLLLGIGNCKLSIVKFNSLNNSLETISLHYYEE------KFKANSFIEL 118

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA--------R 229
           A+   +++DPQ RC  +L     ++IL  SQ        E+                  +
Sbjct: 119 AKKTELRIDPQNRCA-LLFNNDNIVILPFSQQQEEEDYGEEEEEEDNYNMEDGPNVKKLK 177

Query: 230 IES--------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH- 278
           +ES        S + + + LD  +++V D  F+  +  P + IL++ +LTWAG +     
Sbjct: 178 LESASTNLTLPSIITDSKKLDSTIENVVDIQFLRNFSRPTLGILYQPKLTWAGNLQLNPL 237

Query: 279 -HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
               ++ +L+I+ +  +  +I     LP D++ L  +P+  G VL+ G+N + Y   +  
Sbjct: 238 PTKFLVISLNIAVSELEGTVITKLEGLPWDSHTL--IPTWNGCVLL-GSNEVSYIDNTGV 294

Query: 337 -SCALALNNYA-VSLDSSQELPRSSFSVEL--DAAHATWLQ--------NDVALLSTKTG 384
              A+ LN+YA  SL   + +  +   + L  D   + W          +++ LL  ++ 
Sbjct: 295 LQSAIFLNSYADASLRKVRVVDHTDQQITLNKDLVKSLWSAPTKESGGADEILLLMDESS 354

Query: 385 DLVLLTVVYDGRVVQRLDL-----------SKTNPSVLT--SDITTIGNSLFFLGSRLGD 431
           +L  + + ++GR++ + D+              +P+ +T   +     N   F+G + GD
Sbjct: 355 NLYYIQLEFEGRLMTKFDMINLPIVNDIFVHNLHPTCITRIDESKHNININLFIGFQTGD 414

Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS---LYG 488
           SL+V+                   +I +   +    +++SS++    V  E+     LYG
Sbjct: 415 SLVVRL-----------------NNIRSAIETRHEYKQTSSESGLGKVEDEDEDEDDLYG 457

Query: 489 -------SASNNTESAQ----KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK-- 535
                  +AS N ++A     + F   +   L NIGP+     G   +      G+    
Sbjct: 458 DDGAHDKNASVNNDNAVVHTVQPFDIEMMSCLRNIGPVTSLVIGEASSVQPVIKGLPNPN 517

Query: 536 QSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--------EART 587
           +  Y LV   G          +  G N    +++   +   A   IS+        + R 
Sbjct: 518 KGEYSLVATCG----------NGTGSNLMVGQISVQPEVELALKFISVTQIWNLKVKNRD 567

Query: 588 MVLETADL------LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILD 635
             L T D       + E+  +   + QGR      T+      G +R++QV      + D
Sbjct: 568 KYLITTDSTKTKSDIYEIENNFALYKQGRLRRDATTVYISMFGGEKRIVQVTTNHLYLYD 627

Query: 636 GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
            ++  + L     N E         V+ VS+ DPY+L+ +S G I + 
Sbjct: 628 TNF--RRLFLNKFNYE---------VVHVSVMDPYLLITLSRGDIMIF 664


>gi|410079681|ref|XP_003957421.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
 gi|372464007|emb|CCF58286.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
          Length = 1350

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 76/340 (22%), Positives = 153/340 (45%), Gaps = 25/340 (7%)

Query: 1085 MQSSENALTVRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1141
            ++  EN+L   + ++    ++ TK     +  G + V  ED+   G   L+        P
Sbjct: 996  VEFEENSLVNDIRSMIIQIDSRTKRKREYIVAGFSAVGTEDLPPSGSFHLYDITAVVPEP 1055

Query: 1142 QNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPP 1195
                T       + +E++G+++++  + G   I+   KI++        +  +AF D P 
Sbjct: 1056 GKPDTNYKFERFFKEEVRGSVTSVCEISGRFAISQSQKIMVRDAQEDGSVVPVAFLDIP- 1114

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            ++V  +    N +++ D      F+ +  +  ++  L K        + EFL++   +  
Sbjct: 1115 IFVTDMKSFGNLMIISDAMHGFQFVGFDAEPYRMIQLGKSVSKFKTMSVEFLVNNGDIYF 1174

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
             V+D    + +  YAP    S+ GQKL+  + F++ A  +      +L   +D       
Sbjct: 1175 AVTDRDNILHVLKYAPDEPNSFSGQKLVHCSSFNLYADNS----CMVLLAKNDEFNKV-- 1228

Query: 1316 SDKTNR-FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---Q 1371
             D TNR + ++ G  DGS+  I PL E ++RRL  +Q++++D    + GLNPR  R   Q
Sbjct: 1229 -DDTNRTYQVVGGQTDGSMFKIVPLSEESYRRLYVIQQQIIDKETQLGGLNPRMERLSNQ 1287

Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
            +       RP    ++D  ++  +  +P+ ++  +A + G
Sbjct: 1288 YLPLCHVMRP----MLDFNVIRKFSAMPISKRQALAQKLG 1323



 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 144/669 (21%), Positives = 270/669 (40%), Gaps = 133/669 (19%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L   ++  G +  +A+L +  A      D ++L    AKIS+++FD   + +   S+H
Sbjct: 48  LFLTNEFKFDGRITDIALLPRQDA----ALDYLLLCTAVAKISIVKFDLESNSIETVSLH 103

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC------GGVLVYGLQMIILKASQGGSGLV 214
            +E  ++  L        R   +++DP  RC        + V    M   +      G  
Sbjct: 104 YYED-KFKDLSLAE--LTRESKLRLDPASRCLVLFNEDNIAVLPFVMKEDEEDDDEEGEE 160

Query: 215 GDEDTFGSG-GGFSARIES-----SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
            DEDT+      F A I       S +++ + +  D++++ D  F++ Y +P + IL++ 
Sbjct: 161 EDEDTYEPRIKRFRANINGRVTFPSTILSAKTIHEDIQNIIDIEFLNNYSKPTVAILYQP 220

Query: 267 ELTWAGRVSWKH--HTCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
           +LTW G +         +I  L  +T    T   H +I     LP D ++L+ V +   G
Sbjct: 221 KLTWVGNLQLHPLPTKLLIVTLECNTNGFETSLSHIVIARLNELPWDWHRLIPVTN---G 277

Query: 321 VLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQND 375
           +++VG N + Y   +      + LN++A      + L +S     S E    + + +++ 
Sbjct: 278 IVIVGINELAYVDNTGVLQTVILLNSFA-----DRNLKKSRIIDHSKEESVFNHSAMKH- 331

Query: 376 VALLSTKTGD---------------LVLLTVVYDGRVVQRLDLSK--------------T 406
           + +L T  G+               L  + ++ +GR++ + D+ K              T
Sbjct: 332 ICILKTTDGNEDDADLLLLMDDRSNLYYVQMISEGRLMTQFDIIKLPIINNIFINNLNPT 391

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
           + S L S  + +   LFF G + GD+    F C       +   ++E  D+  D PS   
Sbjct: 392 SISRLDSSSSRVNLDLFF-GFQSGDA----FVCRLNNIKSAVETRKEHKDV-LDYPS--- 442

Query: 467 LRRSSSDALQDMVNGEEL----SLYGSASNNTESAQ-------------KTFSFAVRDSL 509
               ++D   +  +G +L     LY   + +T+ A              + F  A+  SL
Sbjct: 443 ----NADEYDE--DGADLYGDDDLYSDEATSTQRANSKENGRSNMIETVEPFDIALLSSL 496

Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADS 565
            NIGPL   + G     D +  G+S  +N EL  +     G     T    S R     +
Sbjct: 497 NNIGPLTSLTSGKVSAVDQNNKGLSNPNNNELSIVATSGNGTGSHLTAVLPSVRPEIELA 556

Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADL------LTEVTESVDYFVQGR-----TIAA 614
            +  +    ++    +  + +   L T D       + E+  +     +GR     T  +
Sbjct: 557 LKFISITQIWN----LKFKGKDKFLVTTDSTKSKSDIYEIDNNFALHREGRLRRDATTVS 612

Query: 615 GNLFGR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
             +FG  +R++QV      +LD ++               +   +  V+ VS+ DPY+L+
Sbjct: 613 IAMFGSDKRIVQVTTNHLYLLDTTF-----------RRLNTIKFDYEVVHVSVMDPYILI 661

Query: 674 GMSDGSIRL 682
            +S G I++
Sbjct: 662 TVSRGDIKV 670


>gi|9794906|gb|AAF98387.1| cleavage and polyadenylation specificity factor [Drosophila
           melanogaster]
          Length = 279

 Score = 97.8 bits (242), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 81/300 (27%), Positives = 141/300 (47%), Gaps = 65/300 (21%)

Query: 425 LGSRLGDSLLVQFTCGSGTSMLS------------SGLKEEFGDIEADAPSTKRLRRSSS 472
           LGSRLG+SLL+ FT    +++++              L++E  ++E +     +L  + +
Sbjct: 1   LGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDEDQNLE-EIFDVDQLEMAPT 59

Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD----- 527
            A    +  EEL +YGS +  +    + F F V DSL+N+ P+     G R+  +     
Sbjct: 60  QAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAPINYMCAGERVEFEEDGVT 119

Query: 528 ---------------ASATGISKQS---------NYELV---ELPGCKGIWTVYHKSSRG 560
                           +ATG SK           N +++   EL GC  +WTV+      
Sbjct: 120 LRPHAESLQDLKIELVAATGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVFD----- 174

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              D+++ ++ +D+ H ++++S    T+VL+T   + E+ E+  + V   TI  GNL  +
Sbjct: 175 ---DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQ 229

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           R ++QV  R  R+L G+ + Q++               S V+ VSIADPYV L + +G +
Sbjct: 230 RFIVQVTTRHVRLLQGTRLIQNVPIDVG----------SPVVQVSIADPYVCLRVLNGQV 279


>gi|303391353|ref|XP_003073906.1| pre-mRNA cleavage and polyadenylation specificity factor
            [Encephalitozoon intestinalis ATCC 50506]
 gi|303303055|gb|ADM12546.1| pre-mRNA cleavage and polyadenylation specificity factor
            [Encephalitozoon intestinalis ATCC 50506]
          Length = 601

 Score = 96.3 bits (238), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 106/457 (23%), Positives = 206/457 (45%), Gaps = 61/457 (13%)

Query: 974  SQGILKICQLPS---GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLK--PL 1028
            S+G L +C++PS   G  + + +  +KIP+  TP  I Y    + Y ++ S   ++  P 
Sbjct: 185  SKGYLMVCRVPSIENGYVFGSGFIGKKIPVLRTPKHIEY---ADRYMVVASCEEVEFSPK 241

Query: 1029 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1088
            N        ++ G  ++ +    VDL+     E+YE                +T  ++ +
Sbjct: 242  N-------GKDCGVPVNTYRFY-VDLYS----EKYE--------------HISTYELEEN 275

Query: 1089 ENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1147
            E    ++ + L +      ++  L + T +++GED  A+GR+ +        + ++   +
Sbjct: 276  EYVFDIQYLVLDDMQGNYGKSPFLLVCTTFIEGEDRPAKGRLHVLEIISVVPSLESPFKD 335

Query: 1148 VYSKEL-----KGAISALASLQGHLLIASGPKIILHKW-TGTELNGIAFYDAPPLYVVSL 1201
               K L     KG+I   + ++G +++  G KI+++K   G+ +  I F+D    +  S+
Sbjct: 336  CKLKVLGIEKTKGSIVQCSEVRGKIVLCLGTKIMIYKIDRGSGIIPIGFHDLHT-FTSSI 394

Query: 1202 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1261
            ++VKN+IL  DI++ + F  ++ +  +L+L++      +  +TE LI G+ LS+V  D +
Sbjct: 395  SVVKNYILASDIYRGLSFFFFQSKPIRLHLISSSEPLKNATSTELLITGNELSMVCCDIK 454

Query: 1262 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1321
              I  + Y+P    S  G KL+ R E       T   RL     SS R G    S     
Sbjct: 455  GTIHAYTYSPNNIISMDGAKLVKRVEMK-----TNLGRL-----SSSRVGFRKNS----- 499

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
              ++  +    +  +  +D  ++ +L  +Q  ++     V GLNPR +   +S+   H  
Sbjct: 500  --IMLYSRSNQLVHVNGVDNSSYLKLLGIQTSIMGCFKAVFGLNPRDY--LNSDIHLHSL 555

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
               S +   +L+ +    L  Q  I+     +R +IL
Sbjct: 556  SLKSPIVLHILNLFSYFDLSIQESISSSVVMSRREIL 592


>gi|301093651|ref|XP_002997671.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110061|gb|EEY68113.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 478

 Score = 96.3 bits (238), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 101/387 (26%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 106 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 165

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
            + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A     
Sbjct: 166 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 225

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
           + +L     +V L      +LQ    LL+   G + +L++ Y+    + L          
Sbjct: 226 TAQL-----NVVLYDCQFEYLQEKELLLTMPCGQVYVLSLPYEDTSSRGL---------- 270

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE----------ADA 461
                  G    F+GSR GDS+L           L +  +EE  D E            A
Sbjct: 271 ---YGFGGKQTLFIGSRSGDSVLFVLD----KKKLVTATEEEPKDEEMPIKEVVIKQESA 323

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--------------------- 500
           P  K     S  A ++  + ++L LYG+A    E A  +                     
Sbjct: 324 PEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAP 378

Query: 501 ------FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV---------- 542
                 + + +R  D L +IG +     G+  NAD      S +   ELV          
Sbjct: 379 EQDTGPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGA 432

Query: 543 ----------------ELPGCKGIWTV 553
                           EL GC+ +WTV
Sbjct: 433 ISVLHNGLRPIVGTEAELNGCRAMWTV 459


>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1090

 Score = 95.5 bits (236), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/355 (26%), Positives = 164/355 (46%), Gaps = 35/355 (9%)

Query: 1060 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQ 1119
            +E + VR++E       ++  +  P+   EN  ++   +     T ++     +GTAY  
Sbjct: 743  METHYVRLIEDQ----TFEIISGFPLDPYENGCSIITCSF----TDDSNVYYCVGTAYAL 794

Query: 1120 GEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
             E+   ++GR+L+FS     D    LV E   KE+KGA+  L +  G LL     KI L+
Sbjct: 795  PEESEPSKGRILVFSV---EDGKIQLVAE---KEVKGAVYNLNAFNGKLLAGINQKIALY 848

Query: 1179 KWT----GTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLL 1232
            KWT    GT    + +  +   ++++L +    +FI++GD+ KSI  L +K +   +   
Sbjct: 849  KWTLRDDGTR--ELQYESSHHGHILALYVQSRGDFIVVGDLMKSISLLIYKPEEGAIEER 906

Query: 1233 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1292
            A+D+ +    A E L D + L    ++   N+         +   +  +L    E+H+G 
Sbjct: 907  ARDYNANWMTAVEILDDDTYLG---AENSFNLFTVRKNNDAATDEERGRLEVVGEYHLGE 963

Query: 1293 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
             V +F    ++          P S+ +    ++FGT++G IG IA L +  F  LQ LQ+
Sbjct: 964  FVNRFRHGSLVMR-------LPDSEASQIPTVIFGTVNGVIGVIASLPQDQFLFLQKLQQ 1016

Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
             LV  +  V GL+   +R F +  K       + +D +L+  +  L   +  EIA
Sbjct: 1017 ALVKVIKGVGGLSHEQWRSFSNERKT--VDARNFLDGDLIESFLDLSRNKMEEIA 1069



 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 126/557 (22%), Positives = 220/557 (39%), Gaps = 121/557 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A+ L+ +    ++G + +L +    G      +D + ++FE  K  VL++D 
Sbjct: 39  RIEIHLLTASGLQPMLDVPIYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
              GL +T      S      + GR +   G +  VDP  R  G+ +Y GL  +I   ++
Sbjct: 95  ET-GLLVTRAMGDVSD-----RIGRPT-DNGQIGIVDPDCRLIGLHLYDGLFKVIPIDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCAKPTIAVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H              + P  W   NL + A  L+ VP P+GG +++G  T
Sbjct: 186 D-------ARHVKTYEVQLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQT 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y++ S   A+ +            + ++   V+ D +          LLS   G L L
Sbjct: 237 IVYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + +D   V  L++     +   S ++ + N + F+GS  GDS L++            
Sbjct: 280 LVISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
                  + +ADA      + S  + L+  VN G  + L           Q  T S A +
Sbjct: 329 -------NHQADA------KNSYVEVLESYVNLGPIVDLCVVDLERQGQGQVVTCSGAFK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  ASA            EL G KG+W++   SS        
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSLRASSS-------- 410

Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
                 D Y  +L++S   E R + + T D L E TE   +  + +T+   N     +++
Sbjct: 411 ------DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCYNAV-HDQLV 462

Query: 625 QVFERGARILDGSYMTQ 641
           QV     R++D     Q
Sbjct: 463 QVTAGSLRLVDAKTRRQ 479


>gi|159470705|ref|XP_001693497.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283000|gb|EDP08751.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 461

 Score = 95.5 bits (236), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 58/164 (35%), Positives = 95/164 (57%), Gaps = 7/164 (4%)

Query: 230 IESSHVINLRDL-DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
           + +S+V+NL  +  ++ V+D +F+HGY EPV+++LHE + TWAGR+  +  TC ++A+S+
Sbjct: 128 VGNSYVLNLHKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWAGRLRERKDTCCLTAISV 187

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ-SASCALALNNYAV 347
           S  LK+H ++W A  LP+D Y+LL +P      LV+  + +   SQ S   A ALN+ A+
Sbjct: 188 SLRLKRHTVLWRAAGLPYDCYRLLPLPQR-PAALVLSPSLVMLTSQASQPQAAALNSTAL 246

Query: 348 SLDSSQEL----PRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
             ++   L     R + SV      A +  ND A    ++  LV
Sbjct: 247 PGEAPPPLVFDPAREAPSVTAARMAAEFALNDCAPALGRSAALV 290


>gi|301103688|ref|XP_002900930.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
 gi|262101685|gb|EEY59737.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
          Length = 613

 Score = 94.7 bits (234), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 95/383 (24%), Positives = 159/383 (41%), Gaps = 93/383 (24%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 241 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 300

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
            + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A     
Sbjct: 301 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 360

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
           + +L     +V L      +LQ    LL+  +G + +L++ Y+    + L          
Sbjct: 361 TVQL-----NVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSRGL---------- 405

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI------EADAPSTK 465
                  G    F+GSR GDS+L         +      K+E   I      +  AP  K
Sbjct: 406 ---YGFGGKQTLFIGSRSGDSVLFVLDKKKLVTATEEEPKDEEMPIKEVVIKQESAPEIK 462

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------------------------- 500
                S  A ++  + ++L LYG+A    E A  +                         
Sbjct: 463 -----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAPEQDT 517

Query: 501 --FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV-------------- 542
             + + +R  D L +IG +     G+  NAD      S +   ELV              
Sbjct: 518 GPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGAISVL 571

Query: 543 ------------ELPGCKGIWTV 553
                       EL GC+ +WTV
Sbjct: 572 HNGLRPIVGTEAELNGCRAMWTV 594


>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
 gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
          Length = 1150

 Score = 94.7 bits (234), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 84/318 (26%), Positives = 145/318 (45%), Gaps = 33/318 (10%)

Query: 1107 NETLLAIGTAYVQGEDVAA-RGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            N +   +GTAY+  E+    +GR+L+F+ TG + +    LVTE   KE+KGA+  L S  
Sbjct: 802  NASYFVVGTAYIHEEEAEPHQGRILVFAVTGIHGERKLQLVTE---KEVKGAVYCLNSFN 858

Query: 1165 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLY----VVSLNIVKNFILLGDIHKSIYFL 1220
            G +L     K  L+KW+    N          Y    V+ +    +FI++GD+ KSI  L
Sbjct: 859  GKVLAGVNSKAQLYKWSENTDNEKELVSECGHYGHTLVLYMESRGDFIVVGDLMKSISLL 918

Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
            S+K+    +  +AKD  S +  +   ++D  T   + S+   N+         +   +  
Sbjct: 919  SYKQLDGTIEEIAKDLNS-NWMSAVGIVDDDTY--IGSETDFNLFTVQRNSGAASDEERG 975

Query: 1281 KLLSRAEFHVGAHVTKF----LRLQMLATSS---------------DRTGAAPGSDKTNR 1321
            +L +  EFH+G  V +F    L +Q  +++S               D   +AP +     
Sbjct: 976  RLETVGEFHLGEFVNRFRYGSLVMQNSSSTSQTPSGVVSTGPTAMVDVGESAPAAPVVQN 1035

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
             ++LFGT+ G IG I P+ +  +  L  +Q+ L   V  V G + + +R F +       
Sbjct: 1036 QSMLFGTVSGMIGVILPISKDQYSFLLRVQQALTHVVKGVGGFSHKDWRTFENRRSVSE- 1094

Query: 1382 GPDSIVDCELLSHYEMLP 1399
               + +D +L+  +  LP
Sbjct: 1095 -ARNFIDGDLVESFLDLP 1111



 Score = 74.7 bits (182), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 111/467 (23%), Positives = 183/467 (39%), Gaps = 108/467 (23%)

Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
           R+S  R   +     +DP+GR  G+ +Y     ++    G   L    DTF         
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---RDTF--------- 154

Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
                  N+R LD   V D  F+HGY +P + +L+E +   A  V   H       L   
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYNKPTICVLYE-DYKAARHVKTYH------ILLKE 199

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
               + P  WS  N+   A  L+ VP+P GGVL+V   TI YH+ S   A+ + +  + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPTGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             + +   S F                 LL+ + G L ++ + + G+ V  + L     +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVSGVHLEVLGET 300

Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
            + S ++ + N + F+GS  GDS L++                    + AD   T     
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIK--------------------LNADRDETG---- 336

Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           S  + L   VN   +  +     + +   +  T S A +D     G L+    G+ IN  
Sbjct: 337 SYIEVLDSYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           ASA            ELPG KG+W +               AA  D++     +S E R 
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKFLLQSYVS-EVRI 427

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           + +   D + E  + +  F   +T+   N++G    +QV E   R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDYW-LQVTESEVRLI 471


>gi|401828022|ref|XP_003888303.1| pre-mRNA cleavage and polyadenylation specificity factor
            [Encephalitozoon hellem ATCC 50504]
 gi|392999575|gb|AFM99322.1| pre-mRNA cleavage and polyadenylation specificity factor
            [Encephalitozoon hellem ATCC 50504]
          Length = 1155

 Score = 94.4 bits (233), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 85/346 (24%), Positives = 164/346 (47%), Gaps = 31/346 (8%)

Query: 1082 TIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
            T  ++ +E    V+ + L +      ++  L I T +++GED  A+GR+ +        +
Sbjct: 823  TYELEENEYVFDVKYLILDDMQGNYGKSPFLLICTTFIEGEDKPAKGRLHVLEIISVVPS 882

Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGI---AFYD 1192
            P++   +   K L     KG+I   + ++G + +  G KI+++K   +  NGI    FYD
Sbjct: 883  PESPFKDCKLKVLGIEKTKGSIVQCSEIRGKIALCLGTKIMIYKIDRS--NGIIPIGFYD 940

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
               ++  S+++VKN+IL  DI++ + F  ++ +  +L+L++      +  +TE LI G+ 
Sbjct: 941  LH-IFTSSISVVKNYILASDIYRGLSFFFFQSKPIRLHLISSSEPLKNVTSTELLIAGNE 999

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
            LS+V  D +  I  + Y+P    S  G KL+ RAE               + T+  R  +
Sbjct: 1000 LSMVCCDSKGTIHAYTYSPNNIISMDGAKLVKRAE---------------MKTNLGRLFS 1044

Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
            +    + N   + +   + SI  +A +D+L + +L  +Q  ++  +  V GLN R +   
Sbjct: 1045 SGIGFRKNSI-MFYSKTNLSIH-LAGIDDLNYPKLLEIQTSIMVHLKSVLGLNQRDY--L 1100

Query: 1373 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
            +S+   H     S +   +L+ +    L  Q  I+     +R +IL
Sbjct: 1101 NSDIHLHSLSLKSPIVMHILNLFSYFDLNTQKLISSSVRMSRREIL 1146


>gi|159470709|ref|XP_001693499.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283002|gb|EDP08753.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 279

 Score = 94.0 bits (232), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 78/262 (29%), Positives = 123/262 (46%), Gaps = 28/262 (10%)

Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
            L+  AF+D P L    L  VK+++L  D+H+ ++FL + +    L  ++KDF   D    
Sbjct: 29   LDKRAFFDLPSL-ATGLVTVKDYLLASDVHQGLFFLRYSDASRVLEFMSKDFDGRDVLTC 87

Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMS---ESWKGQKLLSRAEFHVGAHVTKFLRLQ 1301
              +I    L  + +D    +Q+  +  K     E W GQ+L      HV   V     +Q
Sbjct: 88   GVVIAEPKLHFLAADAAGTLQMMEFYGKRDTNPEFWAGQRLAPMGLLHVARRVGVAASVQ 147

Query: 1302 MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVPH 1360
            + +            D  NR ALL G+ +G +  +AP+ D     RL +LQ  +  ++PH
Sbjct: 148  LAS-----------RDGRNRHALLCGSAEGGLSFVAPVPDPQAAARLAALQAHMSATLPH 196

Query: 1361 VAGLNPRSFRQFH-------SNGKAHR-PGP----DSIVDCELLSHYEMLPLEEQLEIAH 1408
            VAGLNPRSFR            G+ HR P P      ++D +LL  +  L  ++Q E A 
Sbjct: 197  VAGLNPRSFRHRFIRIPKALGGGEHHRAPLPPRNNSGLLDGQLLLGFPHLSRQQQAEAAE 256

Query: 1409 QTGTTRSQILSNLNDLALGTSF 1430
              G++  Q+L +L  +A   +F
Sbjct: 257  AVGSSPQQLLEDLRAIAAAATF 278


>gi|298715583|emb|CBJ28136.1| cleavage and polyadenylation specificity factor CG10110-PA
           [Ectocarpus siliculosus]
          Length = 1906

 Score = 94.0 bits (232), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 93/303 (30%), Positives = 136/303 (44%), Gaps = 77/303 (25%)

Query: 215 GDEDTFGSGGGFSARIESSHVINLR-------DLDMKHVKDFI----FVHGYIEPVMVIL 263
           G+ED  G G G +A+ +     NL        DL+   +  FI    F+ G+ EP + +L
Sbjct: 265 GEEDG-GLGNGATAKGDGGAGGNLAVSKPFTIDLEEAGITGFIKAAAFLEGFHEPALALL 323

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
           +E   T AGR++ K  TC ++ LSI+ T  + P+IW   NLPHD++ L+ VPSPIGG+ V
Sbjct: 324 YEPIQTCAGRLASKRSTCRLALLSINLTQGRAPVIWQVENLPHDSWDLVPVPSPIGGLQV 383

Query: 324 VGANTIHYHSQS-ASCALALNNYA-VSLDSS-QELP------------------------ 356
           +  N + + +QS     LA+N YA  ++D +  E P                        
Sbjct: 384 ISTNAVMHVNQSEVRSILAVNGYARATVDPALLECPLRGGDSDWGWTSFRRSHPEREVVD 443

Query: 357 RSSFSV--ELDAAHATWLQNDVALLSTKTGD-----LVLLTVVYD--------------- 394
            SS+ V  ELD     +L     LLS +TG+     L L TV                  
Sbjct: 444 LSSYDVCIELDVVRCAFLTPTSMLLSLRTGEVYALRLHLTTVTAAAADAAGCSRPPGGAA 503

Query: 395 ----GRVVQR--LDLSKTNP-SVLT---------SDITTIGNSLFFLGSRLGDSLLVQFT 438
                RVV +    + + +P SVL             +     L F+GSR+GDSLLV ++
Sbjct: 504 FGTPNRVVGQSMRPVGRASPCSVLAVAASGGSGGDGGSGASKGLVFMGSRVGDSLLVDYS 563

Query: 439 CGS 441
             S
Sbjct: 564 VAS 566



 Score = 45.4 bits (106), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 50/225 (22%), Positives = 91/225 (40%), Gaps = 60/225 (26%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +  Y+ +H PTG+ +   G +T + +                            +LVV  
Sbjct: 7   YTCYRQLHPPTGVDHAVFGSVTAAGSR---------------------------DLVVAK 39

Query: 63  ANVIEIYVVRVQEEGS----------KESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           A+ +E+Y V   +  S          +++ N  E       D  S   LEL   + L GN
Sbjct: 40  ASTLELYRVHRDDHSSTAAAAAAAAARDTSNGDERDD----DDASGYYLELAGTFPLAGN 95

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           + +LA++           D ++++F  AK++++ +D  +  L   S+H F++        
Sbjct: 96  ITALAVIP----------DILVVSFGVAKMALVAYDSVLGRLETISIHNFDAGAIGPGAG 145

Query: 173 GRES-FARGPLVK--------VDPQGRCGGVLVYGLQMIILKASQ 208
           G ES +     +K         DP GRC   +V G Q+++L A +
Sbjct: 146 GVESGYGLAAALKDRPRTISSSDPAGRCLAAVVAGCQLVVLPARR 190


>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
          Length = 1128

 Score = 92.8 bits (229), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 121/550 (22%), Positives = 229/550 (41%), Gaps = 94/550 (17%)

Query: 840  FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
            +  A L DG +L Y+  L +  ++T      +   R +S+    AS           L  
Sbjct: 612  YFLAALGDGRLLTYR--LDKSAKDTDSEKKFLYDQRQMSIGTQPAS-----------LSI 658

Query: 900  YTREETPH-GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
            +  +   H  A C R T+  + SG       G   C  +  RE  RV    C  S  AF 
Sbjct: 659  FETQNALHVFAACDRPTVIHSSSG------GGKLLCSNVNLREVTRV----CSFSSEAFP 708

Query: 959  VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP--VQKIPLKATPHQITYFAEKNLY 1016
                 +C    + + ++G L +       T DN     ++ IPL   P +I +    +++
Sbjct: 709  -----DC----LALVTEGSLLL------GTVDNIQKLHIRTIPLGEQPRRIAHLDTHHVF 753

Query: 1017 PLIVS---VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1073
             ++ +   V + +  N+ LS   ++     ID+   + +++  +Y +E++E         
Sbjct: 754  AVLTTKQVVTISEDGNEALSETTEEGYVRLIDD---TMMEIVHSYKLEQFETPC------ 804

Query: 1074 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG-EDVAARGRVLLF 1132
                   + I +   ++A             K+N+    +GTAY    E   +RGR+L+F
Sbjct: 805  -------SVITVNFGDDA-----------AAKDNQDYFVVGTAYSYADEPEPSRGRMLVF 846

Query: 1133 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1192
            +        +  +T V  +  KGA+ ++ +  G +L +    + L +W+ TE       +
Sbjct: 847  AV------REQRLTLVAERTFKGALYSMDAFNGKILASVNSMLKLVRWSETESGARTLTE 900

Query: 1193 A----PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
                   ++++ +  + +FIL+GD+ +S+  L++K     +  +A+D         E L 
Sbjct: 901  ECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIEDVARDIDPSWITVIEML- 959

Query: 1249 DGSTLSLVVSDEQK-NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
                L   +S E   N+         S   +  +L    E+H+G  V +    +++    
Sbjct: 960  ---DLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRLVL--- 1013

Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                  P S  +   +LL+GT +G++G IA +DE TF+ L SLQ  L + +  V G+   
Sbjct: 1014 ----QIPESGISILKSLLYGTANGALGVIASIDEKTFQFLHSLQTALNEVIKGVGGIQHE 1069

Query: 1368 SFRQFHSNGK 1377
             +R+F S  +
Sbjct: 1070 DWRRFTSERR 1079



 Score = 41.2 bits (95), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           I L +LD   V D  F++G+ +P + +L      +      +H      +L         
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           PL     NL   A  L+ VP+PIGGV+V+G  T+ Y S S      L  Y  S+  S  +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
            R+   ++ D            LL  + G L LL         T       +  L +   
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
             + L S I  + N   ++GS  GDS L++ 
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335


>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
 gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
          Length = 1138

 Score = 92.8 bits (229), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 90/374 (24%), Positives = 167/374 (44%), Gaps = 50/374 (13%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + E G +++ HNL  +D +    +  ++                    MQ+ E AL++  
Sbjct: 778  NTEFGQEVEVHNLLIIDQNTFEVLHAHQF-------------------MQT-EYALSLMS 817

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
              L N    +  T   +GT  V  E+   + GR++++   R ADN   +V++   KE+KG
Sbjct: 818  AKLGN----DPNTYFIVGTGLVNPEEPEPKTGRIIIY---RYADNELKMVSD---KEVKG 867

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
            A  +L    G +L      + L++WT  +   L    F +   LY  +     +FIL+GD
Sbjct: 868  ACYSLVEFNGRVLACINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGD 924

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            + +SI  L +K+       +A+D+      A E L D + L    +D   N+ +      
Sbjct: 925  LMRSITLLQYKQMEGSFEEIARDYQPNWMTAVEILDDDAFLG---ADNSNNLFVCLKDSA 981

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDG 1331
             +   + Q++   A+FH+G  V  F    ++  + S+R+    G        +LFGT+ G
Sbjct: 982  ATTDEERQQMPEVAQFHLGDMVNVFRHGSLVMQNISERSTPTTG-------CVLFGTVSG 1034

Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
            +IG +  +    +  L+ LQ+ L +++  V  ++   +R FH+  K  R   +  +D +L
Sbjct: 1035 AIGLVTQIQSDFYEFLRKLQENLTNTIKSVGKIDHSYWRSFHTETKMER--CEGFIDGDL 1092

Query: 1392 LSHYEMLPLEEQLE 1405
            +  +  L  E+  E
Sbjct: 1093 VESFLDLSREKMRE 1106



 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 117/519 (22%), Positives = 195/519 (37%), Gaps = 126/519 (24%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D+DT                 +L
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DKDT-----------NELKATSL 157

Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
           R  +M HV+D  F++G   P ++++H+        ++ +H    I    IS   K+   I
Sbjct: 158 RMEEM-HVQDVEFLYGTTHPTLIVIHQD-------INGRH----IKTHEISLKDKEFTKI 205

Query: 299 -WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAVSL 349
            W   N+  +A  L+AVP P+GG +V+G  +I YH   +  A+A        +N YA   
Sbjct: 206 AWKQDNVETEATMLIAVPMPLGGAIVIGQESIVYHDGDSYVAVAPAIIKQSTINCYA--- 262

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLS 404
                         +D+    +L  ++A      G+L ++ +  +        V+ + + 
Sbjct: 263 -------------RIDSKGLRYLLGNMA------GNLFMMFLETEENAKGQTTVRDIKVE 303

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
                 +   IT + N + F+GSR GDS LV+    +G +     L E F ++   AP  
Sbjct: 304 LLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVMLMETFTNL---APIV 360

Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI 524
                     L+    G+ ++  GS                       G L+    G+ I
Sbjct: 361 DMCVVD----LERQGQGQMITCSGSFKE--------------------GSLRIIRNGIGI 396

Query: 525 NADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
              A             ++LPG KG+W +             R+   D  Y   LI+S  
Sbjct: 397 QEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLILSFV 431

Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQD 642
             T VL  +    E TE        +T    N+    +++QV    AR++  D   M  +
Sbjct: 432 GHTRVLMLSGDEVEETEIAGILGDQQTFYCANV-SHGQILQVTPSSARLISCDNKAMICE 490

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
               P N   G    N+T +  + A     + + DG + 
Sbjct: 491 WK-PPDNKRIGVVGANTTQIVCASAQDVYYVEIGDGKLE 528


>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1089

 Score = 92.0 bits (227), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 149/329 (45%), Gaps = 28/329 (8%)

Query: 1104 TKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            T ++     +GTAY   E+    +GR+L+F      D    LV E   KE+KGA+  L +
Sbjct: 779  TDDSNVYYCVGTAYALPEESEPTKGRILVFLV---EDGKLQLVAE---KEMKGAVYNLNA 832

Query: 1163 LQGHLLIASGPKIILHKWT---GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
              G LL     KI L+KWT   GT +  I       +  + +    +FI++GD+ KSI  
Sbjct: 833  FNGKLLAGINQKIALYKWTLRDGTRVLEIESSHHGHILALYVQSRGDFIVVGDLMKSISL 892

Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
            L +K +   +   A+D+ +    A E L D + L    ++   N+         +   + 
Sbjct: 893  LIYKPEEGAIEERARDYNANWMTAVEILDDDTYLG---AENSFNLFTVRKNNDAATDEER 949

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
             +L    E+H+G  V +F    ++          P S+ +    ++FGT++G IG IA L
Sbjct: 950  GRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSEASLIPTVIFGTVNGVIGVIASL 1002

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL------LS 1393
             +  F  LQ LQ+ LV  +  V GL+   +R F +  K       + +D +L      LS
Sbjct: 1003 PQDKFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKT--VDARNFLDGDLIESFLDLS 1060

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
              +M  +   LEI+ +    R + L+ L+
Sbjct: 1061 RNKMEEIAAPLEISVEELCKRVEELTRLH 1089



 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 122/556 (21%), Positives = 214/556 (38%), Gaps = 119/556 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A+ L+ +    L+G + +L +    G      +D + ++FE  K  VL++D 
Sbjct: 39  RIEIHLLTASGLQSMLDVPLYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
              G  IT      S      + GR +   G +  VDP  R  G+ +Y     ++     
Sbjct: 95  ET-GSPITRAMGDVSD-----RTGRPT-DNGQIGIVDPDCRLIGLHLYDGMFKVIPIDNK 147

Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
           G               F+ R+E   V++++           F++G   P + +L++    
Sbjct: 148 GQ----------LKEAFNIRLEELQVLDIK-----------FLYGCANPTIAVLYQDNKD 186

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
                    H              + P  W   NL + A  L+ VP P+GG +++G  TI
Sbjct: 187 -------ARHVKTYEVNLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQTI 237

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
            Y++ S   A+ +            + ++   V+ D +          LLS   G L LL
Sbjct: 238 VYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYLL 280

Query: 390 TVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
            + +D   V  L++     +   S ++ + N + F+GS  GDS L++             
Sbjct: 281 VISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL------------ 328

Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRD 507
                 + +AD      ++ S  + L+  VN G  + L           Q  T S A +D
Sbjct: 329 ------NHQAD------VKGSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAFKD 376

Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
                G L+    G+ IN  AS            VEL G KG+W++   SS         
Sbjct: 377 -----GSLRIVRNGIGINEQAS------------VELQGIKGMWSLRASSS--------- 410

Query: 568 MAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
                D Y  +L++S   E R + + T D L E TE   +  + +T+   N     +++Q
Sbjct: 411 -----DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCHNAV-HDQLVQ 463

Query: 626 VFERGARILDGSYMTQ 641
           V     R+++     Q
Sbjct: 464 VTAGSLRLVNAKTRKQ 479


>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
 gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
          Length = 1114

 Score = 90.9 bits (224), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 89/325 (27%), Positives = 147/325 (45%), Gaps = 30/325 (9%)

Query: 1113 IGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GT Y +  E    RGR+L+F   R  D    LV E   KE+KGA+  L +  G LL   
Sbjct: 802  VGTGYSLPEEPEPTRGRILVF---RAEDGKLQLVAE---KEVKGAVYNLNAFNGKLLAGI 855

Query: 1172 GPKIILHKW---TGTELNGIAFYDAPPL-----YVVSLNIV--KNFILLGDIHKSIYFLS 1221
              K+ L +     G +  G + Y+         ++V+L +     FI++GD+ KS+  L+
Sbjct: 856  NSKVELFRGGDPVGADGAGGSTYELAKECSHHGHIVALYVAVRGEFIVVGDLMKSVSLLA 915

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            +K + + +   A+D+ +    A + L D + L    ++   N+         +   +  +
Sbjct: 916  YKPEESVIEERARDYNANWMTAVDILDDDTYLG---AENNFNLFTLRRQSDAATDEERSR 972

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
            L    E+HVG  V +F R  ++    D+  A       +   LLFGT+ G IG +A L  
Sbjct: 973  LEVVGEYHVGEFVNRFRRGSLVMRLPDQENA-------DVPTLLFGTVSGVIGVLATLPR 1025

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP--GPDSIVDCELLSHYEMLP 1399
              F  L +LQ  L  +V  V GL+  ++R F  N   HR   G    VD +L+  +  L 
Sbjct: 1026 EQFEFLSALQAALNKTVSGVGGLSHDAWRSFQ-NEHRHRAKDGARGFVDGDLIESFLDLR 1084

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDL 1424
             E+  E+A     +  ++   + DL
Sbjct: 1085 PEKAREVAAAVKLSVDELTRRVEDL 1109



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 89/212 (41%), Gaps = 33/212 (15%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  R+E  +V++++           F+HG   P + +L+E             H      
Sbjct: 153 FDVRLEELNVVDVK-----------FMHGCATPTICVLYED-------TKEARHVKTYEV 194

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
                TL+  P  WS  ++   +  ++ VP+P+GG +VVG + I Y ++           
Sbjct: 195 DVKEKTLRDGP--WSQSDVEGGSSLIIPVPAPLGGAIVVGESVIVYLNKDGG-------- 244

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                +      ++ SV + A           LLS  TG L LL +V+D R V  L L  
Sbjct: 245 -----NGAGGAIATKSVNVMAHGVVDADGSRYLLSDSTGMLHLLVLVHDRRRVHALKLES 299

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
              + + S ++ + N + ++GS  GDS LV+ 
Sbjct: 300 LGQTSIASTLSYLDNGVVYVGSAYGDSQLVRL 331


>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
          Length = 1137

 Score = 90.9 bits (224), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 81/312 (25%), Positives = 142/312 (45%), Gaps = 22/312 (7%)

Query: 1111 LAIGTAYVQGEDVAA--RGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
              +GTAY+  ED A   +GR+L+F+ TG + +    LVTE   KE+KGA+  L +  G +
Sbjct: 806  FVVGTAYIH-EDEAEPHQGRILVFAVTGIHGERKLQLVTE---KEVKGAVYCLNAFNGKV 861

Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLY----VVSLNIVKNFILLGDIHKSIYFLSWK 1223
            L     K  L+KW+    N          Y    V+ +    +FI++GD+ KS+  LS+K
Sbjct: 862  LAGVNSKAQLYKWSENTDNEKELVSECGHYGHTLVLYMESRGDFIVVGDLMKSVSLLSYK 921

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            +    +  +AKD  S +  +   ++D  T   + S+   N+         +   +  +L 
Sbjct: 922  QLDGTIEEIAKDLNS-NWMSALGIVDDDTY--IGSETDFNLFTVQRNSGAASDEERGRLE 978

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSS------DRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1337
            +  EFH+G  V +F    +   ++      D    AP        ++LFGT+ G IG I 
Sbjct: 979  TVGEFHLGEFVNRFRYGSLTPAAAGPTDMVDVVEQAPIVPAAQNQSMLFGTVSGMIGVIL 1038

Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
            PL +  +  L  +Q+ L   V  V G + + +R F +          + +D +L+  +  
Sbjct: 1039 PLTKDQYSFLLRVQQALTQVVKGVGGFSHKDWRMFENRRSVSE--ARNFIDGDLVESFLD 1096

Query: 1398 LPLEEQLEIAHQ 1409
            LP  +  ++  +
Sbjct: 1097 LPKAQMTKVVDK 1108



 Score = 77.0 bits (188), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 109/467 (23%), Positives = 182/467 (38%), Gaps = 108/467 (23%)

Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
           R+S  R   +     +DP+GR  G+ +Y     ++    G   L   +DTF         
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---KDTF--------- 154

Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
                  N+R LD   V D  F+HGY +P + +L+E             H      L   
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYTKPTICVLYED-------YKAARHIKTYHILLKE 199

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
               + P  WS  N+   A  L+ VP+P+GGVL+V   TI YH+ S   A+ + +  + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPVGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             + +   S F                 LL+ + G L ++ + + G+ V  + L     +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVTGVHLEVLGET 300

Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
            + S ++ + N + F+GS  GDS L++              ++E G              
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIKLNAD----------RDENG-------------- 336

Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           S  + L   VN   +  +     + +   +  T S A +D     G L+    G+ IN  
Sbjct: 337 SYIEVLDTYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           ASA            ELPG KG+W +               AA  D+Y     +S E R 
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKYLLQSYVS-EIRI 427

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           + +   D + E  + +  F   +T+   N++G    +QV E   R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDVW-LQVTESEVRLI 471


>gi|302788810|ref|XP_002976174.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
 gi|300156450|gb|EFJ23079.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
          Length = 1089

 Score = 90.1 bits (222), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 88/304 (28%), Positives = 140/304 (46%), Gaps = 38/304 (12%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQN 1143
            + + EN  T+   +     T +  T   +GTAY +  E+  ++GR+L+F+     D    
Sbjct: 764  LDTFENGCTIITCSF----TDDPATYYCVGTAYALPEENEPSKGRILIFTV---EDGKFQ 816

Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GTELNGIAFYDAP--PLY 1197
            LVTE   KE KGA+  L +  G LL     KI L+KWT      EL     +      LY
Sbjct: 817  LVTE---KETKGAVYNLNAFNGKLLAGINQKIQLYKWTQRDSTRELQSECGHHGHILALY 873

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTL 1253
            V S     +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D    G+  
Sbjct: 874  VQSRG---DFIVVGDLMKSISLLLYKPEEGAIEERARDYNANWMTAVEILDDDIYLGAEN 930

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
            S  +   +KN      +   ++  +G +L    E+H+G  V +F    ++          
Sbjct: 931  SFNLFTVRKN------SDAATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMR-------L 976

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
            P ++ +    ++FGT++G IG +A L +  F  LQ LQ  L   +  V GL+   +R F 
Sbjct: 977  PDNETSQIPTVIFGTVNGVIGVVASLQQEQFNFLQRLQHCLAKVIKGVGGLSHEQWRSFS 1036

Query: 1374 SNGK 1377
            S  K
Sbjct: 1037 SERK 1040



 Score = 67.8 bits (164), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 122/557 (21%), Positives = 217/557 (38%), Gaps = 121/557 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+    ++A  L+ +    ++G + +L +    G      +D + ++ E  K  VL++D 
Sbjct: 39  RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  VDP+ R  G+ +Y GL  +I   ++
Sbjct: 95  ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H              + P  W   NL + A  L+ VP+P+GGV+++G  T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WLQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y+S SA  A+ +            + ++   V+ D +          LLS  TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     +   S ++ + N + ++GS  GDS L++            
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
                  + + D+      R S  + L+  VN G  + L           Q  T S A +
Sbjct: 329 -------NAQVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  ASA            EL G KG+W++             
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
             A   D +  +L++S   E R + +   D L E TE   +  + +T+   N     ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462

Query: 625 QVFERGARILDGSYMTQ 641
           QV     R++D +   Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479


>gi|302769568|ref|XP_002968203.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
 gi|300163847|gb|EFJ30457.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
          Length = 1089

 Score = 90.1 bits (222), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 88/304 (28%), Positives = 140/304 (46%), Gaps = 38/304 (12%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQN 1143
            + + EN  T+   +     T +  T   +GTAY +  E+  ++GR+L+F+     D    
Sbjct: 764  LDTFENGCTIITCSF----TDDPATYYCVGTAYALPEENEPSKGRILIFTV---EDGKFQ 816

Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT----ELNGIAFYDAP--PLY 1197
            LVTE   KE KGA+  L +  G LL     KI L+KWT      EL     +      LY
Sbjct: 817  LVTE---KETKGAVYNLNAFNGKLLAGINQKIQLYKWTQRDSTRELQSECGHHGHILALY 873

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTL 1253
            V S     +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D    G+  
Sbjct: 874  VQSRG---DFIVVGDLMKSISLLLYKPEEGAIEERARDYNANWMTAVEILDDDIYLGAEN 930

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
            S  +   +KN      +   ++  +G +L    E+H+G  V +F    ++          
Sbjct: 931  SFNLFTVRKN------SDAATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMR-------L 976

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
            P ++ +    ++FGT++G IG +A L +  F  LQ LQ  L   +  V GL+   +R F 
Sbjct: 977  PDNETSQIPTVIFGTVNGVIGVVASLQQEQFNFLQRLQHCLAKVIKGVGGLSHEQWRSFS 1036

Query: 1374 SNGK 1377
            S  K
Sbjct: 1037 SERK 1040



 Score = 70.1 bits (170), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 123/557 (22%), Positives = 218/557 (39%), Gaps = 121/557 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+    ++A  L+ +    ++G + +L +    G      +D + ++ E  K  VL++D 
Sbjct: 39  RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  VDP+ R  G+ +Y GL  +I   ++
Sbjct: 95  ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H              + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WSQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y+S SA  A+ +            + ++   V+ D +          LLS  TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     +   S ++ + N + ++GS  GDS L++            
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
                  + + D+      R S  + L+  VN G  + L           Q  T S A +
Sbjct: 329 -------NAQVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  ASA            EL G KG+W++             
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
             A   D +  +L++S   E R + +   D L E TE   +  + +T+   N     ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462

Query: 625 QVFERGARILDGSYMTQ 641
           QV     R++D +   Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479


>gi|218197365|gb|EEC79792.1| hypothetical protein OsI_21216 [Oryza sativa Indica Group]
          Length = 1089

 Score = 90.1 bits (222), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 87/331 (26%), Positives = 149/331 (45%), Gaps = 33/331 (9%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+  ++GR+L+F+     D    L+ E   KE KGA+ +L +
Sbjct: 778  SDDNNVYYCVGTAYVLPEENEPSKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 831

Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW        EL     +    L + +     +FI++GD+ KSI
Sbjct: 832  FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYT-QTRGDFIVVGDLMKSI 890

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY---APKMS 1274
              L +K + + +  LA+D+ +    A E L D      +    + N  IF     +   +
Sbjct: 891  SLLVYKHEESAIEELARDYNANWMSAVEMLDDE-----IYIGAENNYNIFTVRKNSDAAT 945

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSI 1333
            +  +G +L    E+H+G  V +     ++    D   G  P         ++FGT++G I
Sbjct: 946  DEERG-RLEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIP--------TVIFGTINGVI 996

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
            G IA L    +  L+ LQ  LV  +  V  L+   +R FH++ K       + +D +L+ 
Sbjct: 997  GIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSE--ARNFLDGDLIE 1054

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +  L   +  E+A   G    ++   + +L
Sbjct: 1055 SFLDLSRNKMEEVAKGMGVPVEELSKRVEEL 1085



 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +       ++  +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G ++P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y + +++          ++   Q + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N + ++GSR GDS LV+            
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++AD         S  + L+  VN   +  +     + +   +  T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++  KSS        
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
               ++D Y  YL++S  + T  L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429


>gi|396082420|gb|AFN84029.1| pre-mRNA cleavage and polyadenylation [Encephalitozoon romaleae
            SJ-2008]
          Length = 1156

 Score = 90.1 bits (222), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 110/480 (22%), Positives = 209/480 (43%), Gaps = 62/480 (12%)

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPS----GSTYDNYWPVQKIPLKATPHQITYFAEKNL 1015
            L++   N   +   S+G L +C++PS       + +    +KIP+   P  I Y    + 
Sbjct: 725  LNSATVNKNQLIQLSRGHLMVCKVPSVRDEQYVFGDGLVGRKIPILRIPKHIEY---ADR 781

Query: 1016 YPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG 1075
            Y ++ S               D E   + +      V+ +R Y V+ Y  R         
Sbjct: 782  YMVVASCK-------------DVEFSSKDEKDCGIPVNTYRFY-VDLYSER--------- 818

Query: 1076 PWQTRATIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRV---LL 1131
             ++  +T  +  +E    V+ + L +      ++  L + T +++GED  ARGR+    +
Sbjct: 819  -YEHISTYELDENEYIFDVKYLVLDDMQGNYGKSPFLLVCTTFIEGEDRPARGRLHVLEI 877

Query: 1132 FSTGRNADNP-QNLVTEVYSKE-LKGAISALASLQGHLLIASGPKIILHKWT-GTELNGI 1188
             S   + ++P ++   +V   E  KG+I   + ++G + +  G KI+++K    T +  I
Sbjct: 878  ISVVPSLESPFRDCKLKVLGIEKTKGSIVQCSEVRGKIALCLGTKIMIYKIDRSTGIIPI 937

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
             FYD   ++  S++++KN+IL  DI++ + F  ++ +  +L+L++      +  +TE L 
Sbjct: 938  GFYDLH-IFTSSISVMKNYILASDIYRGLSFFFFQSKPIRLHLISSSEPLKNVTSTELLT 996

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
             G+ LS+V  D +  I  + Y+P    S  G KL+ R+E            L  L++S  
Sbjct: 997  AGNELSMVCCDAKGTIHAYTYSPNNIISMDGAKLVKRSEMKTN--------LGRLSSSGI 1048

Query: 1309 --RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1366
              R  +     KTN    L G           +D+  + +L  +Q  ++  +  V GLN 
Sbjct: 1049 GFRKNSIMFYSKTNLLIYLVG-----------MDDSYYLKLLKIQTSIMVHLKSVLGLNQ 1097

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
            R +   +S+   H     S +   +L+ +    L  Q  I+     +R +IL  L  L +
Sbjct: 1098 RDY--LNSDIHLHSLSLKSPIIMHILNLFSYFDLNTQKLISTSVKMSRREILDVLASLNI 1155


>gi|115465791|ref|NP_001056495.1| Os05g0592400 [Oryza sativa Japonica Group]
 gi|48475231|gb|AAT44300.1| putative DNA damage binding protein 1 [Oryza sativa Japonica Group]
 gi|113580046|dbj|BAF18409.1| Os05g0592400 [Oryza sativa Japonica Group]
 gi|215694552|dbj|BAG89545.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222632766|gb|EEE64898.1| hypothetical protein OsJ_19757 [Oryza sativa Japonica Group]
          Length = 1090

 Score = 90.1 bits (222), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 87/331 (26%), Positives = 149/331 (45%), Gaps = 33/331 (9%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+  ++GR+L+F+     D    L+ E   KE KGA+ +L +
Sbjct: 779  SDDNNVYYCVGTAYVLPEENEPSKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 832

Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW        EL     +    L + +     +FI++GD+ KSI
Sbjct: 833  FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYT-QTRGDFIVVGDLMKSI 891

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY---APKMS 1274
              L +K + + +  LA+D+ +    A E L D      +    + N  IF     +   +
Sbjct: 892  SLLVYKHEESAIEELARDYNANWMSAVEMLDDE-----IYIGAENNYNIFTVRKNSDAAT 946

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSI 1333
            +  +G +L    E+H+G  V +     ++    D   G  P         ++FGT++G I
Sbjct: 947  DEERG-RLEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIP--------TVIFGTINGVI 997

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
            G IA L    +  L+ LQ  LV  +  V  L+   +R FH++ K       + +D +L+ 
Sbjct: 998  GIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSE--ARNFLDGDLIE 1055

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +  L   +  E+A   G    ++   + +L
Sbjct: 1056 SFLDLSRNKMEEVAKGMGVPVEELSKRVEEL 1086



 Score = 68.2 bits (165), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 152/367 (41%), Gaps = 83/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G ++P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCVKPTIVVLYQ------DNKDARHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  WS  NL + A  L+ VP+P+GGV+++G  TI Y + +++        
Sbjct: 197 LK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEETIVYCNANSTFR------ 247

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
             ++   Q + R+   V+ D +          LL    G L LL + ++   V  L +  
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHLLVLTHERERVTGLKIEY 297

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N + ++GSR GDS LV+                   +++AD     
Sbjct: 298 LGETSIASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADPNG-- 337

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
               S  + L+  VN   +  +     + +   +  T S A +D     G L+    G+ 
Sbjct: 338 ----SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIG 388

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++  KSS            ++D Y  YL++S 
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------FNDPYDMYLVVSF 422

Query: 584 EARTMVL 590
            + T  L
Sbjct: 423 ISETRFL 429


>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
          Length = 1148

 Score = 89.7 bits (221), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 92/355 (25%), Positives = 161/355 (45%), Gaps = 40/355 (11%)

Query: 1096 VVTLFNTTTKENETLLAIGTAYVQGEDVAA-RGRVLLFS-TGRNADNPQNLVTEVYSKEL 1153
            ++T   T      T   +GTA+V  E+    +GR+L+F+ +G + D    LVTE   KE+
Sbjct: 799  IITCIFTGDSSGGTYYVVGTAFVHEEEAEPHQGRILVFTVSGIHGDRRLQLVTE---KEV 855

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY----VVSLNIVKNFIL 1209
            KG++  L +  G LL     K+ L KW+ +E NG         +    V+ +    +FI+
Sbjct: 856  KGSVYCLNAFNGKLLAGVNSKVYLFKWSESEENGEELVSECGHHGHTLVLYMESRGDFIV 915

Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTLSLVVSDEQKNIQ 1265
            +GD+ KSI  L+ K+    +  +A+D  S    A   + D    GS     +   Q+N  
Sbjct: 916  VGDLMKSISLLNHKQLDGSIEEIARDLNSNWMTAVGIIDDDNYVGSETDFNLFTVQRN-- 973

Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA-APG--------- 1315
                +   S+  +G +L +  E+H+G  V +F    ++   +   GA APG         
Sbjct: 974  ----SGAASDEERG-RLETIGEYHLGEFVNRFRYGSLVMQHNLSIGAEAPGISLSDDRPE 1028

Query: 1316 --SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
              S  + + ++LFGT+ G IG I P+ +     L  +Q  L   +  V G +   +R F 
Sbjct: 1029 SLSPLSVQRSMLFGTVSGMIGVILPISKEKHEFLMRVQSALNQVIQGVGGFSHSEWRTFE 1088

Query: 1374 ---SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               S+ +AH     + +D +L+  +  L  +E  ++  +    + +  + L  LA
Sbjct: 1089 NRRSSIEAH-----NFIDGDLIESFLDLSKDEMKQVVDELNRDQLEGKTTLEALA 1138



 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 114/511 (22%), Positives = 193/511 (37%), Gaps = 115/511 (22%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +D I L  +  +  VL +D ++  +   +           + R  E    G    +DP G
Sbjct: 74  QDWIFLVTQRFQFCVLAYDTTLQQIITKANGSLRDT----IGRNSEILTNG---NIDPDG 126

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDF 249
           R  G+ +Y     ++        L            F+ R++      LR LD+K     
Sbjct: 127 RLIGMNIYEGYFKVIPIDNHSKSL---------KAAFNIRLD-----ELRILDIK----- 167

Query: 250 IFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAY 309
            F++GY +P + +L+E             H      L       + P  WS  N+   A 
Sbjct: 168 -FLYGYNKPTICVLYED-------FKAARHVKTYFILLKEKDFAEGP--WSQSNVEAGAN 217

Query: 310 KLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
            L+ VP P GGVL++   TI YH+ +   A+ + N  + +  +     S F         
Sbjct: 218 LLIPVPMPYGGVLIISNQTIVYHNGTYFHAIPMQNTMIQVYGAVGDDGSRF--------- 268

Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
                   LL+ + G L ++ +  +G+ V  + L     + + S ++ + N + F+GS  
Sbjct: 269 --------LLADQYGALHVVALQTEGKEVLDVYLEVLGQTSIASCVSYLDNGVVFVGSTF 320

Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
           GDS LV+           +  ++E G              S  + L   VN   +  +  
Sbjct: 321 GDSQLVKL----------NSKRDESG--------------SYIEVLDSYVNIGPIIDFCV 356

Query: 490 ASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
              + +   +  T S A +D     G L+    G+ IN  ASA            ELPG 
Sbjct: 357 MDLDRQGQGQIVTCSGADKD-----GSLRVIRNGIGINEQASA------------ELPGI 399

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDY 605
           KG+W +    +               EY  YL+ S   E R M +  +D + EV   ++ 
Sbjct: 400 KGMWALRESLA--------------SEYDKYLVQSYLNEIRIMTIGDSDEMEEV--EIEA 443

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
           F+  +T+   N+      +QV E   RI+D 
Sbjct: 444 FLDAKTLYCRNV-NEDGWLQVTETEVRIIDA 473


>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
 gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
          Length = 1033

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 89/330 (26%), Positives = 145/330 (43%), Gaps = 31/330 (9%)

Query: 1104 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+   +GR+L+F      D    ++TE   KE KGA+ +L S
Sbjct: 722  SDDNNLYYCVGTAYVMPEENEPTKGRILVFLV---EDGKLQVITE---KETKGAVYSLNS 775

Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW        EL     +    L  + +    +FI++GD+ KSI
Sbjct: 776  FNGKLLAAINQKIQLYKWMLRDDGSRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 834

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L +K +   +   A+D+ +    A E L D   L       + N  +F    K SE  
Sbjct: 835  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFT-VRKNSEGA 888

Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              +   +L    E+H+G  V +F    ++          P SD      ++FGT++G IG
Sbjct: 889  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTVNGVIG 941

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             IA L    +  L+ LQ  L   +  V GL+   +R F++  K       + +D +L+  
Sbjct: 942  VIASLPHEQYIFLEKLQSNLRRVIKGVGGLSHEQWRSFNNEKKTVE--AKNFLDGDLIES 999

Query: 1395 YEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +  L      EI+   G +  ++   + +L
Sbjct: 1000 FLDLSRNRMDEISKAIGVSVEELCKRVEEL 1029



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 146/362 (40%), Gaps = 85/362 (23%)

Query: 231 ESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           E+S +I    L+   V D  F++G  +P +V+L++           +H      AL    
Sbjct: 95  ETSELIT--RLEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVALK-DK 145

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
              + P  W+  NL + A  L+ VP P+ GVL++G  TI Y S +A  A+ +        
Sbjct: 146 DFGEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR------- 196

Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV 410
               + R+   V+ D +          LL    G L LL + ++   V  L +     + 
Sbjct: 197 --PSITRAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETS 246

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
           + S I+ + N++ ++GS  GDS LV+                   +++ DA      + S
Sbjct: 247 IASTISYLDNAVVYIGSSYGDSQLVKL------------------NLQPDA------KGS 282

Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADA 528
             + L+  VN   +  +       +   +  T S A +D     G L+    G+ IN  A
Sbjct: 283 YVEVLESYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIGINEQA 337

Query: 529 SATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
           S            VEL G KG+W++               ++ DD +  +L++S  + T 
Sbjct: 338 S------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISETR 371

Query: 589 VL 590
           +L
Sbjct: 372 IL 373


>gi|12082087|dbj|BAB20761.1| UV-damaged DNA binding protein [Oryza sativa Japonica Group]
          Length = 1090

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 146/328 (44%), Gaps = 27/328 (8%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+  ++GR+L+F+     D    L+ E   KE KGA+ +L +
Sbjct: 779  SDDNNVYYCVGTAYVLPEENEPSKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 832

Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW        EL     +    L + +     +FI++GD+ KSI
Sbjct: 833  FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYT-QTRGDFIVVGDLMKSI 891

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L +K + + +  LA+D+ +    A E L D   +    ++   NI         +   
Sbjct: 892  SLLVYKHEESAIEELARDYNANWMSAVEMLDDEIYIG---AENNYNIFTVRKNSDAATDE 948

Query: 1278 KGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
            +  +L    E+H+G    +F    ++    D   G  P         ++FGT++G IG I
Sbjct: 949  ERGRLEVVGEYHLGEFGNRFRHGSLVMRLPDSEMGQIP--------TVIFGTINGVIGII 1000

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
            A L    +  L+ LQ  LV  +  V  L+   +R FH++ K       + +D +L+  + 
Sbjct: 1001 ASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSE--ARNFLDGDLIESFL 1058

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             L   +  E+A   G    ++   + +L
Sbjct: 1059 DLSRNKMEEVAKGMGVPVEELSKRVEEL 1086



 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +       ++  +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G ++P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y + +++          ++   Q + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N + ++GSR GDS LV+            
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++AD         S  + L+  VN   +  +     + +   +  T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++  KSS        
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
               ++D Y  YL++S  + T  L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429


>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
          Length = 1145

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 74/311 (23%), Positives = 146/311 (46%), Gaps = 25/311 (8%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQN 1143
            ++ SE A+++    L N    +++    +GTA +  ++  ++ GR+++F   + ++ P+ 
Sbjct: 805  LEGSEMAMSLASCQLGN----DSQPYFVVGTAVIMSDETESKMGRIMMF---QASEGPER 857

Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNI 1203
            +   VY KE+KGA  ++ S+ G L++A    + L +WT  +   +   D   +  + L  
Sbjct: 858  MRL-VYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKT 916

Query: 1204 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
              + IL+GD+ +S+  LS+K   +    +A+DF +    A E +   S L       + +
Sbjct: 917  KNDLILVGDLMRSLSLLSYKSVESTFEKVARDFMTNWMSACEIIDSDSFLG-----AENS 971

Query: 1264 IQIFYYAPKMSESWK--GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1321
              +F         +K  G +L     F++G  V  F    + AT  D    AP       
Sbjct: 972  YNLFTVVKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP----LYH 1024

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
             ++L+GT DG IG I  +  + +  L  +QK+L D   +   ++   +R F +  ++   
Sbjct: 1025 SSILYGTSDGGIGVIVQMPPVLYTFLHDVQKRLADYTENCMRISHTQYRTFETEKRSEV- 1083

Query: 1382 GPDSIVDCELL 1392
             P+  +D +L+
Sbjct: 1084 -PNGFIDGDLI 1093



 Score = 47.0 bits (110), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
           +W   NL  +A  ++ VP P GG L+ G + I YH +    AL    YA    S      
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
            +    +D     +L  D+A      G+L +L + + G+           V+ + +    
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
            + +   +  + N + F+GSR GDS L++ +      GT  +S  L + + ++   AP  
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362

Query: 465 KRLRRSSSDALQDM----VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                     ++DM     NG++  L  S +    + +   +    + L +         
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
                                VEL G K ++T+  +               D E+  YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
           +S ++ T VL       E T+   + V G T+ AG LF    ++QV      ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483


>gi|413946716|gb|AFW79365.1| hypothetical protein ZEAMMB73_562969 [Zea mays]
          Length = 1089

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/396 (23%), Positives = 170/396 (42%), Gaps = 36/396 (9%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ  +  L+        +VEE E   +R+L+       +++    P+   E   ++
Sbjct: 717  RRICHQEQSRTLAFCSFKYNQSVEESETHLIRLLDHQ----TFESLCVYPLDQYECGCSI 772

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
               +       ++     +GTAYV   E+   +GR+L+F+     D    L+ E   KE 
Sbjct: 773  ISCSF----ADDSNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSLQLIVE---KET 822

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL A   KI L+KW        EL     +    L + +     +FI
Sbjct: 823  KGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYT-QTRGDFI 881

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD+ KSI  L +K + + +   A+D+ +    A E L D      V ++   N+    
Sbjct: 882  VVGDLMKSISLLVYKHEESAIEERARDYNANWMTAVEMLDDE---VYVGAENSYNLFTVR 938

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                 +   +  +L    E+H+G  V +F    ++          P SD      ++FGT
Sbjct: 939  KNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDIGQIPTVIFGT 991

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
            ++G IG IA L    +  L+ LQ  LV  +  V  L+   +R FH++ K       + +D
Sbjct: 992  INGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKKTAE--ARNFLD 1049

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +L+  +  L   +  E++   G    ++   + +L
Sbjct: 1050 GDLIESFLDLSRSKMEEVSKAMGVPVEELSKRVEEL 1085



 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 151/367 (41%), Gaps = 83/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F+HG  +P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLHGCAKPTIVVLYQ------DNKDVRHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  WS  N+ + A  L+ VP+P+GGV+++G   I Y + +++        
Sbjct: 197 LK-DKDFVEGP--WSQNNVDNGAGLLIPVPAPLGGVIIIGEEQIVYCNANSTFK------ 247

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
             ++   Q + R+   V+ D +          LL   TG L LL + ++   V  L +  
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSRY--------LLGDNTGILHLLVLTHERERVTGLKIEY 297

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N + ++GSR GDS LV+                   +++ADA    
Sbjct: 298 LGETSIASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADASG-- 337

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
               S  + L+  VN   +  +     + +   +  T S A +D     G L+    G+ 
Sbjct: 338 ----SFVEILERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIG 388

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++  KSS             +D +  YL++S 
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------INDPFDMYLVVSF 422

Query: 584 EARTMVL 590
            + T  L
Sbjct: 423 ISETRFL 429


>gi|226510488|ref|NP_001145925.1| uncharacterized protein LOC100279448 [Zea mays]
 gi|219884971|gb|ACL52860.1| unknown [Zea mays]
          Length = 416

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 97/407 (23%), Positives = 174/407 (42%), Gaps = 41/407 (10%)

Query: 1027 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATI 1083
            PLN+       + + HQ  +  L+        +VEE E   +R+L+       +++    
Sbjct: 38   PLNEQA-----RRICHQEQSRTLAFCSFKYNQSVEESETHLIRLLDHQ----TFESLCVY 88

Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ 1142
            P+   E   ++   +       ++     +GTAYV   E+   +GR+L+F+     D   
Sbjct: 89   PLDQYECGCSIISCSF----ADDSNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSL 141

Query: 1143 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLY 1197
             L+ E   KE KGA+ +L +  G LL A   KI L+KW        EL     +    L 
Sbjct: 142  QLIVE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILA 198

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            + +     +FI++GD+ KSI  L +K + + +   A+D+ +    A E L D      V 
Sbjct: 199  LYT-QTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNANWMTAVEMLDDE---VYVG 254

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
            ++   N+         +   +  +L    E+H+G  V +F    ++          P SD
Sbjct: 255  AENSYNLFTVRKNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSD 307

Query: 1318 KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
                  ++FGT++G IG IA L    +  L+ LQ  LV  +  V  L+   +R FH++ K
Sbjct: 308  IGQIPTVIFGTINGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKK 367

Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                   + +D +L+  +  L   +  E++   G    ++   + +L
Sbjct: 368  T--AEARNFLDGDLIESFLDLSRSKMEEVSKAMGVPVEELSKRVEEL 412


>gi|402913617|ref|XP_003919276.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Papio anubis]
          Length = 132

 Score = 88.6 bits (218), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 48/121 (39%), Positives = 72/121 (59%), Gaps = 12/121 (9%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LEL   +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 18  LELAASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 73

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVG 215
            FE PE   L+ G       P V+VDP GRC  +LVYG ++++L       ++   GLVG
Sbjct: 74  YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVG 130

Query: 216 D 216
           +
Sbjct: 131 E 131


>gi|413948669|gb|AFW81318.1| hypothetical protein ZEAMMB73_456332 [Zea mays]
          Length = 674

 Score = 88.2 bits (217), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 158/363 (43%), Gaps = 45/363 (12%)

Query: 1027 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATI 1083
            PLN+       + + HQ  +  L+        +VEE E   +R+L+       +++    
Sbjct: 296  PLNEQA-----RRICHQEQSKTLAFCSFKYNQSVEESETHLIRLLDHQ----TFESLCVY 346

Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ 1142
            P+   E   ++   +  +    +N     +GTAYV   E+   +GR+L+F+     D   
Sbjct: 347  PLDQYECGCSIISCSFVD----DNNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSL 399

Query: 1143 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLY 1197
             L+ E   KE KGA+ +L +  G LL A   KI L+KW        EL     +    L 
Sbjct: 400  QLIVE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILA 456

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            + +     +FI++GD+ KSI  L +K + + +   A+D+ +    A E L D      V 
Sbjct: 457  LYT-QTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNANWMTAVEMLDDE---VYVG 512

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGS 1316
            ++   N+         +   +  KL    E+H+G  V +F    ++    D   G  P  
Sbjct: 513  AENGYNLFTVRKNSDAATDDERAKLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGKIP-- 570

Query: 1317 DKTNRFALLFGTLDGSIGCIA--PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
                   ++FGT++G IG IA  P D  TF  L+  Q  LV  +  V  ++   +R FH+
Sbjct: 571  ------TVIFGTINGVIGIIASLPHDHYTF--LEKFQSTLVKYIKGVGNMSHEQWRSFHN 622

Query: 1375 NGK 1377
            + K
Sbjct: 623  DKK 625


>gi|312076590|ref|XP_003140929.1| CPSF A subunit region family protein [Loa loa]
          Length = 655

 Score = 87.8 bits (216), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 149/314 (47%), Gaps = 31/314 (9%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQN 1143
            ++ SE A+++    L N    +++    +GTA +  ++  ++ GR+++F   + ++ P+ 
Sbjct: 306  LEGSEMAMSLASCQLGN----DSQPYFVVGTAVIMSDETESKMGRIMMF---QASEGPER 358

Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVS 1200
            +   VY KE+KGA  ++ S+ G L++A    + L +WT  +   L    F +   LY+ +
Sbjct: 359  MRL-VYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKT 417

Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
             N   + IL+GD+ +S+  LS+K   +    +A+DF +    A E +   S L       
Sbjct: 418  KN---DLILVGDLMRSLSLLSYKSVESTFEKVARDFMTNWMSACEIIDSDSFLG-----A 469

Query: 1261 QKNIQIFYYAPKMSESWK--GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1318
            + +  +F         +K  G +L     F++G  V  F    + AT  D    AP    
Sbjct: 470  ENSYNLFTVVKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP---- 522

Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
                ++L+GT DG IG I  +  + +  L  +QK+L D   +   ++   +R F +  ++
Sbjct: 523  LYHSSILYGTSDGGIGVIVQMPPVLYTFLHDVQKRLADYTENCMRISHTQYRTFETEKRS 582

Query: 1379 HRPGPDSIVDCELL 1392
                P+  +D +L+
Sbjct: 583  EV--PNGFIDGDLI 594


>gi|301093655|ref|XP_002997673.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110063|gb|EEY68115.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 176

 Score = 87.8 bits (216), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 56/175 (32%), Positives = 89/175 (50%), Gaps = 18/175 (10%)

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTNRFA 1323
            +AP+  ES  GQ+LL  ++FH+G  V+   R ++ A+ S     +   AAP S+  N   
Sbjct: 3    FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN--- 59

Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
             + GT +G +G + P+ E  FRRL +LQ  +V+++P    LNPR FR   +N +     P
Sbjct: 60   -VMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRP 118

Query: 1384 DS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            D+          +D  +L  +  L    Q E+A   GTT   ++ NL ++   TS
Sbjct: 119  DAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATS 173


>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName: Full=UV-damaged
            DNA-binding protein 1
 gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
          Length = 1095

 Score = 87.8 bits (216), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 40/356 (11%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1139
            +T P+   E   ++    L  + + ++     IGTAYV  E+    +GR+L+F      D
Sbjct: 764  STYPLDQFEYGCSI----LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIV---ED 816

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP----- 1194
                L+ E   KE KGA+ +L +  G LL A   KI L+KW   E  G            
Sbjct: 817  GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTECGHHG 873

Query: 1195 ---PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
                LYV +     +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D  
Sbjct: 874  HILALYVQTRG---DFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDDDI 930

Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
             L       + N  +F    K SE    +   +L    E+H+G  V +F    ++     
Sbjct: 931  YLG-----AENNFNLFT-VRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMR--- 981

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                 P SD      ++FGT++G IG IA L    +  L+ LQ  L   +  V GL+   
Sbjct: 982  ----LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQ 1037

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +R F++  K       + +D +L+  +  L      EI+        +++  + +L
Sbjct: 1038 WRSFYNEKKT--VDAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEEL 1091



 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 113/502 (22%), Positives = 193/502 (38%), Gaps = 123/502 (24%)

Query: 95  GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
           G+    L+ +    ++G + +L +    G      +D + +A E  K  VL++D     +
Sbjct: 49  GLQCICLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDTEASEV 104

Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGL 213
              +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++G    
Sbjct: 105 ITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK- 156

Query: 214 VGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                       F+ R+E   V++++           F++G  +P +V+L++        
Sbjct: 157 ----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ------DN 189

Query: 274 VSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
              +H        +   +LK    I   W+  NL + A  L+ VP P+ GVL++G  TI 
Sbjct: 190 KDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIGEETIV 243

Query: 331 YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
           Y S SA  A+ +            + R+   V+ D +          LL    G L LL 
Sbjct: 244 YCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGLLHLLV 286

Query: 391 VVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
           + ++   V  L +     + + S I+ + N+  F+GS  GDS LV+              
Sbjct: 287 ITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL----------- 335

Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDS 508
                      P TK    S  + L+  VN   +  +       +   +  T S A +D 
Sbjct: 336 ----------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD- 381

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
               G L+    G+ IN  AS            VEL G KG+W++               
Sbjct: 382 ----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--------------R 411

Query: 569 AAYDDEYHAYLIISLEARTMVL 590
           +A DD Y  +L++S  + T VL
Sbjct: 412 SATDDPYDTFLVVSFISETRVL 433


>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
 gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
            pigmentation protein 1; AltName: Full=UV-damaged
            DNA-binding protein 1
 gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
          Length = 1090

 Score = 87.8 bits (216), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 40/356 (11%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1139
            +T P+   E   ++    L  + + ++     IGTAYV  E+    +GR+L+F      D
Sbjct: 759  STYPLDQFEYGCSI----LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIV---ED 811

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP----- 1194
                L+ E   KE KGA+ +L +  G LL A   KI L+KW   E  G            
Sbjct: 812  GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTECGHHG 868

Query: 1195 ---PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
                LYV +     +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D  
Sbjct: 869  HILALYVQTRG---DFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDDDI 925

Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
             L       + N  +F    K SE    +   +L    E+H+G  V +F    ++     
Sbjct: 926  YLG-----AENNFNLFT-VRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMR--- 976

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                 P SD      ++FGT++G IG IA L    +  L+ LQ  L   +  V GL+   
Sbjct: 977  ----LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQ 1032

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +R F++  K       + +D +L+  +  L      EI+        +++  + +L
Sbjct: 1033 WRSFYNEKKT--VDAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEEL 1086



 Score = 67.0 bits (162), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 113/507 (22%), Positives = 196/507 (38%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               +   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EASEVITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    I   W+  NL + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S SA  A+ +            + R+   V+ D +          LL    G 
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L LL + ++   V  L +     + + S I+ + N+  F+GS  GDS LV+         
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL------ 330

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                           P TK    S  + L+  VN   +  +       +   +  T S 
Sbjct: 331 ---------------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++          
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                +A DD Y  +L++S  + T VL
Sbjct: 406 ----RSATDDPYDTFLVVSFISETRVL 428


>gi|301124072|ref|XP_002909688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262107255|gb|EEY65307.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 176

 Score = 87.4 bits (215), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 56/175 (32%), Positives = 89/175 (50%), Gaps = 18/175 (10%)

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTNRFA 1323
            +AP+  ES  GQ+LL  ++FH+G  V+   R ++ A+ S     +   AAP S+  N   
Sbjct: 3    FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN--- 59

Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
             + GT +G +G + P+ E  FRRL +LQ  +V+++P    LNPR FR   +N +     P
Sbjct: 60   -VMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRP 118

Query: 1384 DS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            D+          +D  +L  +  L    Q E+A   GTT   ++ NL ++   TS
Sbjct: 119  DAWSKKKWKKSFLDAFVLFRFLQLNYVAQKELARCIGTTPEVVMHNLLEVQHATS 173


>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
 gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
          Length = 1088

 Score = 87.4 bits (215), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 104/403 (25%), Positives = 176/403 (43%), Gaps = 48/403 (11%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ  +   S   +      EE E   +R+L+       ++  +T P+ + E   ++
Sbjct: 716  RRICHQEQSRTFSICSMKNQSNAEESEMHFIRLLDDQ----TFEFISTYPLDTFEYGCSI 771

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + + ++     +GTAYV   E+   +GR+L+F      D    L+ E   KE 
Sbjct: 772  ----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGKLQLIAE---KET 821

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL A   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 822  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD+ KSI  L +K +   +   A+D+ +    A E L D   L       + N  +F 
Sbjct: 881  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFT 935

Query: 1269 YAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
               K SE    +   +L    E+H+G  V +F    ++          P SD      ++
Sbjct: 936  -VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVI 987

Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
            FGT++G IG IA L    +  L+ LQ  L   +  V GL+   +R F++  K       +
Sbjct: 988  FGTVNGVIGVIASLPHEQYLFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKT--VDAKN 1045

Query: 1386 IVDCEL------LSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
             +D +L      LS   M  + + +EI+ +    R + L+ L+
Sbjct: 1046 FLDGDLIESFLDLSRSRMDEISKAMEISVEELCKRVEELTRLH 1088



 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 109/507 (21%), Positives = 193/507 (38%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ ++ ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEINLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ETSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+HG  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLHGCSKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +    LK    I   WS  NL + A  L+ VP P  GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVALKDKDFIEGPWSQNNLDNGADLLIPVPPPFCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S +   A+ +            + ++   V+ D +          LL    G 
Sbjct: 234 EETIVYCSANVFRAIPIR---------PSITKAYGRVDADGSR--------YLLGDHAGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L LL + ++   V  L +     + + S I+ + N+  F+GS  GDS LV+         
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA  T        + L   VN   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDAKGT------YVEVLDRYVNLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++          
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                +  DD +  +L++S  + T +L
Sbjct: 406 ----RSLTDDPFDTFLVVSFISETRIL 428


>gi|412992547|emb|CCO18527.1| predicted protein [Bathycoccus prasinos]
          Length = 1275

 Score = 87.0 bits (214), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 97/391 (24%), Positives = 175/391 (44%), Gaps = 52/391 (13%)

Query: 1061 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-- 1118
            EE+ VR+ +       ++T A  P++ +EN  ++   +       +++    +GTA+   
Sbjct: 892  EEFFVRLFD----NKTFETLAKYPLEPNENDASIISCSF----DGDDDIYFVVGTAFADP 943

Query: 1119 QGEDVAARGRVLLF-------STGRNA-----DNP-----------QNLVTEVYSKELKG 1155
              E  ++RGR+L+F       S G NA     D+            Q  +T V  KE +G
Sbjct: 944  HSEPESSRGRILVFKVSNTSSSGGGNAVVNGNDHGDGRASASSSVLQKSLTLVCEKETRG 1003

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAF-YDAPPL-YVVSLNI--VKNFILLG 1211
            A+  L +  G LL      + L  W  ++ N     ++   + ++++L +    N I++G
Sbjct: 1004 AVYNLNAFCGKLLAGINSLVKLFNWGVSKENKRELVHECSHMGHIIALKVETKDNLIVVG 1063

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ KSI  L ++ +  ++  +A DF S    A E L D + L    S     +Q    A 
Sbjct: 1064 DLMKSITLLQYQRESGRIEEVAHDFSSNWMTAVEILDDNTYLGAESSYNLFTVQ--RNAD 1121

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA----LLFG 1327
              +E  +G   L  A FH+G  V +F R  ++    D       SD T+  +     LFG
Sbjct: 1122 ADTEDKRGTLELCGA-FHLGDSVNRFRRGSLVMRMPDL------SDDTSSLSEISTWLFG 1174

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T+ G +G +A L +  F  L  +Q+ +   V  V   +   FR FH+  ++      + +
Sbjct: 1175 TISGGLGVVATLPKRDFMLLNKVQEAMQKVVTGVGNFSHSDFRSFHNVQRSVE--MRNFI 1232

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
            D +L+  +  L  E+Q+ ++  +G + S+ L
Sbjct: 1233 DGDLVEIFLDLSKEDQVAVSELSGVSNSEDL 1263


>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
          Length = 1092

 Score = 87.0 bits (214), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 90/365 (24%), Positives = 156/365 (42%), Gaps = 31/365 (8%)

Query: 1060 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-V 1118
            VEE  +++ +       ++   T  +Q  EN  +V   +  +  T        +GTA  V
Sbjct: 744  VEEQFIKLFDDQ----TFEILDTYQLQEFENTCSVECASFSDDPT----LYYIVGTATAV 795

Query: 1119 QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
              E     GR+L+F      D   +L     SKE+KGA   +    G LL     KI L 
Sbjct: 796  PQESEPKEGRLLVFEV---IDRKLHLKA---SKEIKGAPYQIKPFNGKLLAGINSKIELF 849

Query: 1179 KWTGTELNGIAFYDA----PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
            + + ++   +           + V+ L    +FI+ GD+ +SI  L++K+   Q+  +A+
Sbjct: 850  RLSDSDTGHMELVSECCHRGHILVLYLQTRGDFIVAGDLMRSISLLTYKQVDGQIEEIAR 909

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
            DF +    A + L D + L    ++   N+         +   +  +L    E+H+G  V
Sbjct: 910  DFNANWMTAVDILDDDTFLG---AEGYFNLFTVRKNTDATSDEERARLEVVGEYHLGDMV 966

Query: 1295 TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
             +F R  ++  SSD     P +D      ++FGT++G IG IA L +  +  L  +Q  L
Sbjct: 967  NRFQRGSLVLRSSD----TPTTD-----TIIFGTVNGMIGVIAVLSKEEYEFLLKVQDAL 1017

Query: 1355 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
               +  V GL    +R F +        P   +D +L+  +  L  E+  E+ H  G+  
Sbjct: 1018 NFVIKGVGGLRHEDWRSFENERTQGARAPKGFIDGDLIESFLDLRREKMEEVCHAIGSIT 1077

Query: 1415 SQILS 1419
             + LS
Sbjct: 1078 VEELS 1082



 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 128/601 (21%), Positives = 236/601 (39%), Gaps = 127/601 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+++  ++   L+ V    ++G + ++ + +  GA+    R+S+ +  E  K  ++E+D 
Sbjct: 37  RLVIYTLTPEGLQPVLDTGIYGRIAAIELYTVAGAE----RESLYILTERLKFCIVEYDS 92

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY-GLQMIIL 204
           S   L   +M   +           +S  R    GP+  +DP+ R  G L+Y GL  +I 
Sbjct: 93  STGELITKAMGDVQ-----------DSVGRPVDGGPIAHIDPERRMIGFLLYDGLFKVIP 141

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
             ++ G               F+ R+E   V++++           F++GY +P +V+L 
Sbjct: 142 IDTRNGQ----------LREAFNIRLEELQVLDVQ-----------FLYGYAQPTIVLL- 179

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGV 321
                      ++    M    +   +++    I   WS   +   A  ++ VP+PIGG 
Sbjct: 180 -----------YQDPKEMRHLKTYQVSIRDKDFIAGPWSQTGVEIGATMIIPVPTPIGGC 228

Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
           +++G  TI Y +         +   + +D +  + R+   ++ D            LL  
Sbjct: 229 ILLGEQTISYLNGDKG-----DTKTIHMDMT--VIRAWGKIDEDGRR--------YLLGD 273

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
             G L +L + +DG  V  L L     +     IT + + + F+GS  GDS L++     
Sbjct: 274 HLGQLYVLVLEFDGNKVLGLKLDTLGETSSAKTITYLDSGVVFIGSCFGDSQLIRL---- 329

Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
                                  K    S+ + L+   N   +  +       +   +  
Sbjct: 330 --------------------HPDKDENDSNIEVLESFTNLGPIQDFCVVDLERQGQGQVV 369

Query: 502 SFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH 561
           + +        G LKD S  LR+  +    GI++Q+    VELPG KG+W++        
Sbjct: 370 TCS--------GTLKDGS--LRVVRN--GIGINEQAA---VELPGIKGLWSLRE------ 408

Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
                   + D +Y  YLI S    T VLE AD     TE   +    +TI   N+ G  
Sbjct: 409 --------SIDAQYDKYLIQSFVNETRVLEIADEELSETEIDGFDHNAQTIFCSNVLG-D 459

Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSE--SGSGSENSTVLSVSIADPYVLLGMSDGS 679
            ++Q+ E   R++          + P N E  + +G     V+  S     + L +S+G 
Sbjct: 460 CLLQITEVSLRLVSTKSKQLLKEWFPPNGERITVAGGNVQQVVLTSGKRTLIYLDVSNGD 519

Query: 680 I 680
           +
Sbjct: 520 V 520


>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1138

 Score = 86.7 bits (213), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 75/304 (24%), Positives = 138/304 (45%), Gaps = 26/304 (8%)

Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            T   +GTA V  E+   + GR++++     A      +T+V  KE+KGA  +L    G +
Sbjct: 826  TYYIVGTAMVNPEEREPKVGRIIIYHYADGA------LTQVSEKEIKGACYSLVEFNGRV 879

Query: 1168 LIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
            L      + L++WT  +   L    F +   LY  +     +FIL+GD+ +SI  L +K+
Sbjct: 880  LATINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGDLMRSITLLQYKQ 936

Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
                   +A+D+      A E L D + L    ++   N+ +       +   + Q++  
Sbjct: 937  MEGSFEEIARDYQPKWMTAVEILDDDAFLG---AENSNNLFVCLKDSAATTDDERQQMPE 993

Query: 1285 RAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
             A+FH+G  V  F    ++  +  +RT    G        +LFGT+ G+IG +  +    
Sbjct: 994  VAQFHLGDMVNVFRHGSLVMQNIGERTTPTSG-------CVLFGTVSGAIGLVTQIPPDY 1046

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            +  L+ LQ+ L +++  V  ++   +R FH+  K      +  +D +L+  +  L  E+ 
Sbjct: 1047 YEFLRKLQENLTNTIKSVGRIDHTYWRSFHTEMKTE--NSEGFIDGDLVESFLDLTREKM 1104

Query: 1404 LEIA 1407
             E A
Sbjct: 1105 HEAA 1108



 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 105/473 (22%), Positives = 175/473 (36%), Gaps = 129/473 (27%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D DT              H +  
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154

Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
             L M+  HV+D  F++G   P ++++H+        ++ +H    I    I+   K   
Sbjct: 155 TSLRMEEMHVQDVEFLYGTAHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
            I W   N+  +A  L+ VP+P+GG +V+G  ++ YH   +  A+A        +N YA 
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYA- 262

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                           +D+    +      LL    G L ++ +  +     +L +    
Sbjct: 263 ---------------RVDSRGFRY------LLGNMIGHLFMMFLETEENTRGQLTVKDIK 301

Query: 408 PSVL-----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
             +L        IT + N + F+GSR GDS LV+    +  S     + E F ++   AP
Sbjct: 302 VELLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAAASGAYVTVMETFTNL---AP 358

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
                       L+    G+ ++  GS                       G L+    G+
Sbjct: 359 IIDMCIVD----LERQGQGQMITCSGSYKE--------------------GSLRIIRNGI 394

Query: 523 RINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
            I   A             ++LPG KG+W +             R+   D  Y   L++S
Sbjct: 395 GIQEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLS 429

Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
               T +L  +    E TE   +    +T    N+ FG  ++IQV    AR++
Sbjct: 430 FVGHTRILMLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPMTARLI 480


>gi|170589357|ref|XP_001899440.1| CPSF A subunit region family protein [Brugia malayi]
 gi|158593653|gb|EDP32248.1| CPSF A subunit region family protein [Brugia malayi]
          Length = 655

 Score = 86.3 bits (212), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 71/293 (24%), Positives = 140/293 (47%), Gaps = 27/293 (9%)

Query: 1106 ENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            +++    +GTA +  ++  ++ GR+++F   + ++ P+ +   VY KE+KGA  ++ S+ 
Sbjct: 331  DSQPYFVVGTAVIMSDETESKMGRIMMF---QASEGPERMRL-VYEKEIKGAAYSIQSMD 386

Query: 1165 GHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
            G L++A    + L +WT  +   L    F +   LY+ + N   + IL+GD+ +S+  LS
Sbjct: 387  GKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKTKN---DLILVGDLMRSLSLLS 443

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK--G 1279
            +K   +    +A+DF +    A E +   + L       + +  +F         +K  G
Sbjct: 444  YKSMESTFEKVARDFMTNWMSACEIIDSDNFLG-----AENSYNLFTVMKDSFTVFKEEG 498

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
             +L     F++G  V  F    + AT  D    AP        ++L+GT DG IG I  +
Sbjct: 499  TRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP----LYHSSILYGTSDGGIGVIVQM 551

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
              + +  LQ +QK+L +   +   ++   +R F +  ++    P+  +D +L+
Sbjct: 552  PPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEKRSE--APNGFIDGDLI 602


>gi|157128864|ref|XP_001655231.1| DNA repair protein xp-e [Aedes aegypti]
 gi|108882186|gb|EAT46411.1| AAEL002407-PB [Aedes aegypti]
          Length = 1138

 Score = 86.3 bits (212), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 79/304 (25%), Positives = 141/304 (46%), Gaps = 26/304 (8%)

Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            T   +GTA V  E+   + GR++++     AD   NL T+V  KE+KG+  +L    G +
Sbjct: 826  TYYIVGTALVNPEEPEPKVGRIIIY---HYADG--NL-TQVSEKEIKGSCYSLVEFNGRV 879

Query: 1168 LIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
            L +    + L++WT  +   L    F +   LY  +     +FIL+GD+ +SI  L +K+
Sbjct: 880  LASINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGDLMRSITLLQYKQ 936

Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
                   +A+D+      A E L D + L    +D   N+ +       +   + Q++  
Sbjct: 937  MEGSFEEIARDYQPNWMTAVEILDDDAFLG---ADNSNNLFVCLKDGAATTDDERQQMPE 993

Query: 1285 RAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
             A+ H+G  V  F    ++  +  +RT    G        +LFGT+ G+IG +  +    
Sbjct: 994  VAQVHLGDMVNVFRHGSLVMENIGERTTPTSG-------CVLFGTVSGAIGLVTQIPADY 1046

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            +  L+ LQ+ L D++  V  ++   +R FH+  K  R   +  +D +L+  +  L  E+ 
Sbjct: 1047 YEFLRKLQENLTDTIKSVGKIDHAYWRSFHTEMKTER--CEGFIDGDLVESFLDLSREKM 1104

Query: 1404 LEIA 1407
             E A
Sbjct: 1105 HEAA 1108



 Score = 64.7 bits (156), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D DT              H +  
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154

Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
             L M+  HV+D  F++G   P ++++H+        ++ +H    I    I+   K   
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
            I W   N+  +A  L+ VP+P+GG +V+G  ++ YH   +  A+A        +N YA 
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                      + S  L        +N   LLS K   + LL  +             T 
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
           P      IT + N + F+GSR GDS LV+    +G +     + E F ++   AP     
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
                  L+    G+ ++  GS                       G L+    G+ I   
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           A             ++LPG KG+W +             R+   D  Y   L++S    T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
            +L  +    E TE   +    +T    N+ FG  ++IQV    AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480


>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
          Length = 1068

 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 87/306 (28%), Positives = 139/306 (45%), Gaps = 33/306 (10%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNAD 1139
            +T P+ + E   ++    L  + + ++     +GTAYV   E+   +GR+L+F      D
Sbjct: 738  STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 790

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP 1194
                L+ E   KE KGA+ +L +  G LL A   KI L+KW     GT EL   + +   
Sbjct: 791  GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGH 847

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
             L  + +    +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D   L 
Sbjct: 848  IL-ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 906

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1311
                  + N  IF    K SE    +   +L    E+H+G  V +F    ++        
Sbjct: 907  -----AENNFNIFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------ 954

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
              P SD      ++FGT++G IG IA L    +  L+ LQ  L   +  V GL+   +R 
Sbjct: 955  -LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEKLQANLRKVIKGVGGLSHEQWRS 1013

Query: 1372 FHSNGK 1377
            F++  K
Sbjct: 1014 FNNEKK 1019



 Score = 57.4 bits (137), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 88/391 (22%), Positives = 153/391 (39%), Gaps = 84/391 (21%)

Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMV 261
           +I +A    S  +G     G    F  + +     N+R L+   V D  F++G  +P +V
Sbjct: 99  VITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIV 157

Query: 262 ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
           +L++           +H      AL       + P  W+  NL + A  L+ VP P+ GV
Sbjct: 158 VLYQ------DNKDARHVKTYEVALK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGV 208

Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
           L++G  TI Y S SA  A+ +            + ++   V+ D +          LL  
Sbjct: 209 LIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDADGSR--------YLLGD 251

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
             G L LL + ++   V  L +     + + S I+ + N+  ++GS  GDS L++     
Sbjct: 252 HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKI---- 307

Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK-- 499
                          ++ DA      + S  + L+  VN   +  +       +   +  
Sbjct: 308 --------------HLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVV 347

Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSR 559
           T S A +D     G L+    G+ IN  AS            VEL G KG+W++      
Sbjct: 348 TCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL------ 384

Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
                    ++ DD +  +L++S  + T +L
Sbjct: 385 --------RSSTDDPHDTFLVVSFISETRIL 407


>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
          Length = 1089

 Score = 85.9 bits (211), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 87/306 (28%), Positives = 139/306 (45%), Gaps = 33/306 (10%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNAD 1139
            +T P+ + E   ++    L  + + ++     +GTAYV   E+   +GR+L+F      D
Sbjct: 759  STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 811

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP 1194
                L+ E   KE KGA+ +L +  G LL A   KI L+KW     GT EL   + +   
Sbjct: 812  GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGH 868

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
             L  + +    +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D   L 
Sbjct: 869  IL-ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 927

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1311
                  + N  IF    K SE    +   +L    E+H+G  V +F    ++        
Sbjct: 928  -----AENNFNIFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------ 975

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
              P SD      ++FGT++G IG IA L    +  L+ LQ  L   +  V GL+   +R 
Sbjct: 976  -LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEKLQANLRKVIKGVGGLSHEQWRS 1034

Query: 1372 FHSNGK 1377
            F++  K
Sbjct: 1035 FNNEKK 1040



 Score = 57.0 bits (136), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  W+  NL + A  L+ VP P+ GVL++G  TI Y S SA  A+ +   
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL    G L LL + ++   V  L +  
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                    ++ DA    
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421

Query: 584 EARTMVL 590
            + T +L
Sbjct: 422 ISETRIL 428


>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine max]
          Length = 1068

 Score = 85.9 bits (211), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 81/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+   +GR+L+F+     D    L+ E   KE KGA+  L +
Sbjct: 757  SDDNNVYYCVGTAYVLPEENEPTKGRILVFAV---EDGKLQLIAE---KETKGAVYCLNA 810

Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW     GT EL     +    L  + +    +FI++GD+ KSI
Sbjct: 811  FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 869

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L +K +   +   A+D+ +    A E + D   L        +N    +   K SE  
Sbjct: 870  SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 923

Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              +   +L    E+H+G  V +F    ++          P SD      ++FGT++G IG
Sbjct: 924  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 976

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             IA L    +  L+ LQ  L   +  V GL+   +R F++  K
Sbjct: 977  VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1019



 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 85/367 (23%), Positives = 148/367 (40%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L     L + P  WS  NL + A  L+ VP P+ GVL++G  TI Y S +A  A+ +   
Sbjct: 176 LKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL   TG L LL + ++   V  L +  
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLLSLLVITHEKEKVTGLKIEP 275

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                   +++ DA    
Sbjct: 276 LGETSIASTISYLDNAFVYIGSSYGDSQLIKL------------------NLQPDA---- 313

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 314 --KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400

Query: 584 EARTMVL 590
            + T +L
Sbjct: 401 ISETRIL 407


>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
 gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
          Length = 1171

 Score = 85.5 bits (210), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+   +GR+L+FS     +    LV E   KE KGA+  L +
Sbjct: 860  SDDNNVYYCVGTAYVLPEENEPTKGRILVFSV---EEGKLQLVAE---KETKGAVYCLNA 913

Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW     GT EL     +    L  + +    +FI++GD+ KSI
Sbjct: 914  FNGKLLAAINQKIQLYKWVLREDGTRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 972

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L +K +   +   A+D+ +    A E L D   L        +N    +   K SE  
Sbjct: 973  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDVYLG------AENSFNLFTVRKNSEGA 1026

Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              +   +L    E+H+G  + +F    ++          P SD      ++FGT++G IG
Sbjct: 1027 TDEERGRLEVAGEYHLGEFINRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 1079

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             IA L    +  L+ LQ  L   +  V GL+   +R F++  K
Sbjct: 1080 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1122



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 79/349 (22%), Positives = 145/349 (41%), Gaps = 60/349 (17%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A  L+ +    L+G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTAQGLQSILDVPLYGRIATLELFRPHG----ETQDFLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   SM           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELVTRSMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  +L + A  L+ VP P+ GVL++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNSLDNGADLLIPVPPPLCGVLIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +   A+ +            + ++   V+ D +          LL   TG L L
Sbjct: 237 IVYCSANGFKAIPIR---------AAITKAYGRVDPDGSRY--------LLGDHTGLLSL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS L++ 
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL 328


>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
            castaneum]
 gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
          Length = 1149

 Score = 85.5 bits (210), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 98/401 (24%), Positives = 167/401 (41%), Gaps = 73/401 (18%)

Query: 1041 GHQIDNHNLSSVDLHRTYTV--------EEYEVRILEPDRAGGPWQTRATIPMQSSENAL 1092
            G +++ HNL  +D H T+ V        +EY + I+  +R GG                 
Sbjct: 791  GQEVEVHNLLIIDQH-TFEVLHAHQLMQQEYAMSIISTNRLGGDM--------------- 834

Query: 1093 TVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSK 1151
                          NE  + +GTA V  E+   + GR+L+F    N       +T+V  K
Sbjct: 835  --------------NEYYI-VGTATVNPEESEPKQGRILIFQWNDNK------LTQVSEK 873

Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            E+KGA  +LA   G LL +    + L +WT  +   +       +  + L    +FILLG
Sbjct: 874  EIKGACYSLAEFNGKLLASINSTVRLFEWTVEKELRLECSHFNNILTLFLKTKGDFILLG 933

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ +S+  L +K        +A+D+      A E L D   L    ++   NI +     
Sbjct: 934  DLMRSMTLLQYKTMEGSFEEIARDYNPNWMTAVEILDDDIFLG---AENSFNIFVCQKDS 990

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLA-TSSDRTGAAPGSDKTNRFALLF 1326
              +   +  ++     FHVG  +  F    L +Q L  TS+  TG            +LF
Sbjct: 991  AATTDEERSQMHEVGRFHVGDMINVFRHGSLVMQNLGETSTPTTG-----------CVLF 1039

Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1386
            GT+ G+IG +  + +  +  L  LQ KL   +  V  ++   +R F+++ K      +  
Sbjct: 1040 GTVSGAIGLVTQITQDFYDFLLELQNKLSTVIKSVGKIDHSQWRAFNTDIKTEP--SEGF 1097

Query: 1387 VDCEL------LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +D +L      LSH +M  + + L+I  + G  +   + +L
Sbjct: 1098 IDGDLIESFLDLSHDKMKEVADGLQITGEGGMKQDCTVDDL 1138



 Score = 68.6 bits (166), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 104/469 (22%), Positives = 175/469 (37%), Gaps = 110/469 (23%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G L  +DP+ R  G+ +Y     I+   +  S L                       N+R
Sbjct: 119 GILAVIDPKARVIGLRLYDGLFKIIPLEKDNSELKAS--------------------NIR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG   P ++++H+        V+ +H    +    IS   K+   + 
Sbjct: 159 -IDELQVHDVEFLHGCANPTLILIHQD-------VNGRH----VKTHEISLREKEFVKVP 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VPSP+GG +++G   I YH       +A             + +S
Sbjct: 207 WRQDNVETEASMIIPVPSPLGGAIIIGQENILYHDGITPVVVA----------PAVIKQS 256

Query: 359 SFS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVL 411
           +     ++D     +L  D+A      G L +L +  D R     VV+ L +        
Sbjct: 257 TIVCYAKVDPGGLRYLLGDMA------GHLFMLFLEVDNRGDGNDVVKDLKVELLGEIAT 310

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
              IT + N + F+GSRLGDS LV+ T     S     + E F ++   AP    L    
Sbjct: 311 PECITYLDNGVLFIGSRLGDSQLVKLTTKPNESGSYVTVMESFTNL---API---LDMCV 364

Query: 472 SDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
            D L+    G+ ++  G+                       G L+    G+ I   AS  
Sbjct: 365 VD-LERQGQGQLVTCSGAFKE--------------------GSLRIIRNGIGIQEHAS-- 401

Query: 532 GISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE 591
                     ++LPG KG+W +                A D  Y   L+++   +T VL 
Sbjct: 402 ----------IDLPGIKGMWAL--------------QVASDGRYDNTLVLAFVGQTRVLS 437

Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
                 E T+   +    +T   GN+    +++Q+    AR++     T
Sbjct: 438 LNGEEVEETDIAGFASDQQTFFCGNVI-HEQIVQITPISARLISAQNKT 485


>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1120

 Score = 85.5 bits (210), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 93/394 (23%), Positives = 175/394 (44%), Gaps = 31/394 (7%)

Query: 1038 QEVGHQIDNHNLS-SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + + HQ+D +  + +V+   +   +E  +R+++     G + T     ++  E A ++  
Sbjct: 746  RRIAHQVDTNTFAVAVEHLMSKGDQELFIRLIDD----GSFDTLHQFRLEEHELASSLMS 801

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
             + F   ++E   ++  G AY Q ED  +RGR+L+     +A     LV+E   KE++GA
Sbjct: 802  CS-FAGDSREY-YVVGTGFAYEQ-EDEPSRGRILVLRVEADA---LELVSE---KEVRGA 852

Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA----PPLYVVSLNIVKNFILLGD 1212
            +  L + +G LL     K+ L KWT  E +             +   S+    ++IL+GD
Sbjct: 853  VYNLNAFKGKLLAGINSKLELFKWTPREDDAHELVSECSHHGQIITFSVKTRGDWILVGD 912

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            + KS+  L +K +   ++ +A+DF +    A   L D  T        + ++ +F  A  
Sbjct: 913  LLKSMSLLQYKPEEGAIDEIARDFNANWMTAVAMLDDDETY----LGAENSLNLFTVARN 968

Query: 1273 MSESWKGQ--KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1330
            M+     +  +L    E+H+G  V  F    ++ +  D        D      LLFGT +
Sbjct: 969  MNAMTDEERSRLEITGEYHLGEFVNVFSPGSLVMSLKD-------GDSLEVPTLLFGTGN 1021

Query: 1331 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
            G IG +A L +  +   + LQ  +   +  V GL    +R F    +       + VD +
Sbjct: 1022 GVIGVLASLPKDAYDFAERLQTSMNKHIQGVGGLKHAEWRSFRHTLRRKSDPSRNFVDGD 1081

Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            L+  +  L +E+   +A      R++I+  + +L
Sbjct: 1082 LVESFLDLKVEQADVVAADMKCDRAEIIRRVEEL 1115



 Score = 74.3 bits (181), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 127/549 (23%), Positives = 224/549 (40%), Gaps = 113/549 (20%)

Query: 96  ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
           + A  L+ V    ++G + ++++   G  D   R   + L  E    +VL +D++   L+
Sbjct: 58  LHAEGLKPVLDVPINGRIATMSLCQTGSGDGKAR---LYLTTERYGFTVLSYDEANEELK 114

Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLV 214
             +    +         GR +   G +  VD   R  G+ +Y GL  +I    +GG    
Sbjct: 115 TEAFGDVQD------NIGRPA-DDGQIGIVDDTCRAIGLRLYDGLFKVIPCDEKGG---- 163

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
                          ++ +  I L +L    V+D  F+HG  +P + +L+ R+   A  V
Sbjct: 164 ---------------VKEAFNIRLEEL---RVEDIKFLHGTPKPTIAVLY-RDTKDA--V 202

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
             K +   I      ++    P  W+  +L   + K++ VP+PIGGV+V+G   I Y   
Sbjct: 203 HIKTYEIGIREKEFVSS----P--WAQNDLEGGSNKIIPVPAPIGGVVVLGQEIIVY--- 253

Query: 335 SASCALALNNYAVSLD---SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                  LN +    D    +  +P       +    A        LL    G L LL +
Sbjct: 254 -------LNKFEDDADVFLKAINIPNIPDRTNITCYGAIDPDGSRYLLGDADGMLYLLVI 306

Query: 392 VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
           ++DG+ V+ L + +   + + S ++ + N + F+GS  GDS L++             L 
Sbjct: 307 LHDGKRVRELKIERLGDTSIASTLSYLDNGVVFVGSTYGDSQLIK-------------LH 353

Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGE--ELSLYGSASNNTESAQKTFSFAVRDSL 509
            E   I+ D   T          L  +V+    +L  +G     T S             
Sbjct: 354 AEKTSIDKDGNPTYVQILEEFTNLGPIVDFAFVDLERHGQGQVVTCS------------- 400

Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMA 569
              G LKD S  LR+  +    GI +Q+   +++LPG KG++++        ++D S+M 
Sbjct: 401 ---GALKDGS--LRVVRN--GIGIDEQA---VIQLPGVKGLFSL-------RDSDDSQM- 442

Query: 570 AYDDEYHAYLIISLEARTMVL----ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
                   YL+++    T +L    +  D L E TE   +  + +T+  GN+ G    +Q
Sbjct: 443 ------DKYLVVTFINETRILGFVGDEGDTLDE-TEIAGFDAEAQTLCCGNMQG-NVFLQ 494

Query: 626 VFERGARIL 634
           V  RG R++
Sbjct: 495 VTHRGVRLV 503


>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1088

 Score = 85.5 bits (210), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 99/397 (24%), Positives = 170/397 (42%), Gaps = 38/397 (9%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ      +   L    + EE E   VR+L+       ++  +T P+ + E   ++
Sbjct: 716  RRICHQEQTRTFAICCLRNQPSAEESEMHFVRLLDAQ----SFEFLSTYPLDAFEYGCSI 771

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + T +      +GTAYV   E+   +GR+L+F      +    L+TE   KE 
Sbjct: 772  ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL A   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 822  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD+ KSI  L +K +   +   A+D+ +    A E L D   L    +D   N+    
Sbjct: 881  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMAAVEILDDDIYLG---ADNCFNLFTVK 937

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
               + +   +  ++    E+H+G  V +F    ++    D   G  P         ++FG
Sbjct: 938  KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T+ G IG IA L +  +  L+ LQ  L   +  V GL+   +R F  N +       S +
Sbjct: 990  TVSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSF--NNEKRTAEAKSYL 1047

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            D +L+  +  L   +  EI+        ++   + +L
Sbjct: 1048 DGDLIESFLDLSRGKMEEISKGMDVQVEELCKRVEEL 1084



 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 112/513 (21%), Positives = 203/513 (39%), Gaps = 135/513 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    L+G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQTILDVPLYGRIATLELFRPHG----EAQDFLFVATERYKFCVLQWD- 93

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
                       +ES E +    G  S         G +  +DP  R  G+ +Y GL  +
Sbjct: 94  ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKV 141

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I   ++G                F+ R+E   V++++           F++G  +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK+   +   WS  NL + A  L+ VPSP+ 
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKEKDFVEGPWSQNNLDNGADLLIPVPSPLC 227

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S +A  A+ +            + ++   V+LD +          LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
              +G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++   
Sbjct: 271 GDHSGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           +++ DA        S  + L+  VN   +  +       +   +
Sbjct: 329 ----------------NLQPDATG------SYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428


>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine max]
          Length = 1089

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 81/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+   +GR+L+F+     D    L+ E   KE KGA+  L +
Sbjct: 778  SDDNNVYYCVGTAYVLPEENEPTKGRILVFAV---EDGKLQLIAE---KETKGAVYCLNA 831

Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW     GT EL     +    L  + +    +FI++GD+ KSI
Sbjct: 832  FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 890

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L +K +   +   A+D+ +    A E + D   L        +N    +   K SE  
Sbjct: 891  SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 944

Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              +   +L    E+H+G  V +F    ++          P SD      ++FGT++G IG
Sbjct: 945  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 997

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             IA L    +  L+ LQ  L   +  V GL+   +R F++  K
Sbjct: 998  VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1040



 Score = 64.7 bits (156), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 110/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ETAELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL     L + P  WS  NL + A  L+ VP P+ GVL++G  T
Sbjct: 183 ---DNKDARHVKTYEVALKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + ++   V+ D +          LL   TG L L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLLSL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS L++            
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NLQPDA------KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|440492924|gb|ELQ75450.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
            (CPSF subunit) [Trachipleistophora hominis]
          Length = 1254

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 62/231 (26%), Positives = 123/231 (53%), Gaps = 25/231 (10%)

Query: 1094 VRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST----GRNADNPQNLVTEVY 1149
            +++++L ++    N+ ++ + +     E    RGR+L+F         AD       ++ 
Sbjct: 946  LKIMSLCDSNGDHNDFVVVVLSQRTSNE--ILRGRILVFEVIDVISDTADRKTKKALKLL 1003

Query: 1150 -SKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLNIVKNF 1207
             S+  KG IS  A+++G + ++   +++++++   T +  IAFYD   +Y VSL ++KN+
Sbjct: 1004 GSERTKGPISCCAAVRGRIAVSLATRLMVYEFDRNTGIVAIAFYDLY-MYAVSLAVIKNY 1062

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAK-----DFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
            I++GDI   ++F+ ++ +  +L+LL+K     + GSLD F       G  L +   D+  
Sbjct: 1063 IVVGDIMMGLHFVYFQSEPVKLHLLSKSGRVANLGSLDFFNA-----GDRLFITGIDKTG 1117

Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
             +QIF ++P    S +G+KL+ R +F   AH       Q + T++ R+ A+
Sbjct: 1118 EVQIFSFSPGNLYSNEGEKLVKRQQFETYAH------FQSIRTNTYRSYAS 1162


>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1067

 Score = 85.1 bits (209), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ          L      EE E   VR+L+       ++  +T P+ S E   ++
Sbjct: 695  RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 750

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + T++      +GTAYV   E+   +GR+L+F      D    L+ E   KE 
Sbjct: 751  ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 800

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL A   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 801  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 859

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD+ KSI  L +K +   +   A+D+ +    A E L D   L    ++   N+    
Sbjct: 860  VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 916

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
               + +   +  +L    E+H+G  V +F    ++    D   G  P         ++FG
Sbjct: 917  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 968

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            T++G IG IA L +  +  L+ LQ  L   +  V GL+   +R F++  +
Sbjct: 969  TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1018



 Score = 57.4 bits (137), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 100/453 (22%), Positives = 178/453 (39%), Gaps = 108/453 (23%)

Query: 147 FDDSIHGLRITSMHCF----ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            D  I+G RI ++  F    E+ ++L +   R  F    +++ DP+             +
Sbjct: 54  LDVPIYG-RIATLELFRPHGEAQDFLFIATERYKFC---VLQWDPES----------SEL 99

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I +A    S  +G     G    F  + +     N+R L+   V D  F+ G  +P + +
Sbjct: 100 ITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLFGCAKPTIAV 158

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK    +   WS  +L + A  L+ VP P+ 
Sbjct: 159 LYQ------DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLC 206

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S SA  A+ +            + ++   V++D +          LL
Sbjct: 207 GVLIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLL 249

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
               G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+   
Sbjct: 250 GDHAGMIHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-- 307

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           ++  DA      + S  + L+  +N   +  +       +   +
Sbjct: 308 ----------------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQ 345

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 346 VVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KS 386

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 387 S------------IDEAFDTFLVVSFISETRIL 407


>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
 gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
          Length = 1088

 Score = 85.1 bits (209), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 92/350 (26%), Positives = 152/350 (43%), Gaps = 36/350 (10%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ          L      EE E   VR+L+       ++  +T P+ S E   ++
Sbjct: 716  RRICHQEQTRTFGICSLGNQSNAEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 771

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + T +      +GTAYV   E+   +GR+L+F      D    L+ E   KE 
Sbjct: 772  ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 821

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL A   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 822  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD+ KSI  L +K +   +   A+D+ +    A E L D   L    ++   N+    
Sbjct: 881  VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLVTVK 937

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
               + +   +  +L    E+H+G  V +F    ++    D   G  P         ++FG
Sbjct: 938  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            T++G IG IA L +  +  L+ LQ  L   +  V GL+   +R F++  +
Sbjct: 990  TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039



 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+ G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    +   WS  NL + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S +A  A+ +            + ++   V++D +          LL    G 
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+         
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA      + S  + L+  +N   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++  KSS     
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                   D+ +  +L++S  + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428


>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName: Full=UV-damaged
            DNA-binding protein 1a; Short=DDB1a
 gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
 gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
 gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1088

 Score = 85.1 bits (209), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ          L      EE E   VR+L+       ++  +T P+ S E   ++
Sbjct: 716  RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 771

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + T++      +GTAYV   E+   +GR+L+F      D    L+ E   KE 
Sbjct: 772  ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 821

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL A   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 822  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD+ KSI  L +K +   +   A+D+ +    A E L D   L    ++   N+    
Sbjct: 881  VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 937

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
               + +   +  +L    E+H+G  V +F    ++    D   G  P         ++FG
Sbjct: 938  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            T++G IG IA L +  +  L+ LQ  L   +  V GL+   +R F++  +
Sbjct: 990  TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039



 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDP 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+ G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    +   WS  +L + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S SA  A+ +            + ++   V++D +          LL    G 
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGM 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+         
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA      + S  + L+  +N   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++  KSS     
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                   D+ +  +L++S  + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428


>gi|110741229|dbj|BAF02165.1| UV-damaged DNA binding factor - like protein [Arabidopsis thaliana]
          Length = 727

 Score = 85.1 bits (209), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ          L      EE E   VR+L+       ++  +T P+ S E   ++
Sbjct: 355  RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 410

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + T++      +GTAYV   E+   +GR+L+F      D    L+ E   KE 
Sbjct: 411  ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 460

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL A   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 461  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 519

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD+ KSI  L +K +   +   A+D+ +    A E L D   L    ++   N+    
Sbjct: 520  VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 576

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
               + +   +  +L    E+H+G  V +F    ++    D   G  P         ++FG
Sbjct: 577  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 628

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            T++G IG IA L +  +  L+ LQ  L   +  V GL+   +R F++  +
Sbjct: 629  TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 678


>gi|449519304|ref|XP_004166675.1| PREDICTED: DNA damage-binding protein 1a-like [Cucumis sativus]
          Length = 596

 Score = 84.7 bits (208), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 86/330 (26%), Positives = 145/330 (43%), Gaps = 31/330 (9%)

Query: 1104 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+   +GR+L+F      +    L+ E   KE KG++ +L +
Sbjct: 285  SDDNNVYYCVGTAYVMPEENEPTKGRILVFVV---EEGKLQLIAE---KETKGSVYSLNA 338

Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KWT    GT EL     +    L  + +    +FI++GD+ KSI
Sbjct: 339  FNGKLLAAINQKIQLYKWTLRDDGTRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 397

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L +K +   +   A+D+ +    A E L D   L        +N    +   K SE  
Sbjct: 398  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG------AENYFNLFTVRKNSEGA 451

Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              +   +L    E+H+G  V +F    ++          P SD      ++FG+++G IG
Sbjct: 452  TDEERSRLEVVGEYHLGEFVNRFQHGSLVMR-------LPDSDVGQIPTVIFGSVNGVIG 504

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             IA L    +  L+ LQ  L   +  V GL+   +R F  N +       + +D +L+  
Sbjct: 505  VIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSF--NNEKRTAEAKNFLDGDLIES 562

Query: 1395 YEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +  L   +  EI+     +  ++   + +L
Sbjct: 563  FLDLNRSKMEEISRAMSVSAEELCKRVEEL 592


>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine max]
          Length = 1068

 Score = 84.7 bits (208), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 80/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+   +GR+++F+     D    L+ E   KE KGA+  L +
Sbjct: 757  SDDNNVYYCVGTAYVLPEENEPTKGRIIVFAV---EDGKLQLIAE---KETKGAVYCLNA 810

Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW     GT EL     +    L  + +    +FI++GD+ KSI
Sbjct: 811  FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 869

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L +K +   +   A+D+ +    A E + D   L        +N    +   K SE  
Sbjct: 870  SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 923

Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              +   +L    E+H+G  V +F    ++          P SD      ++FGT++G IG
Sbjct: 924  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 976

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             IA L    +  L+ LQ  L   +  V GL+   +R F++  K
Sbjct: 977  VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1019



 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 83/367 (22%), Positives = 148/367 (40%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  WS  NL + A  L+ VP P+ GVL++G  TI Y S +A  A+ +   
Sbjct: 176 LK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL   TG + LL ++++   V  L +  
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLVSLLVIIHEKEKVTGLKIEP 275

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                   +++ DA    
Sbjct: 276 LGETSIASTISYLDNAFVYVGSSYGDSQLIKL------------------NLQPDA---- 313

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 314 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400

Query: 584 EARTMVL 590
            + T +L
Sbjct: 401 ISETRIL 407


>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine max]
          Length = 1089

 Score = 84.7 bits (208), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 80/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+   +GR+++F+     D    L+ E   KE KGA+  L +
Sbjct: 778  SDDNNVYYCVGTAYVLPEENEPTKGRIIVFAV---EDGKLQLIAE---KETKGAVYCLNA 831

Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW     GT EL     +    L  + +    +FI++GD+ KSI
Sbjct: 832  FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 890

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L +K +   +   A+D+ +    A E + D   L        +N    +   K SE  
Sbjct: 891  SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 944

Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              +   +L    E+H+G  V +F    ++          P SD      ++FGT++G IG
Sbjct: 945  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 997

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             IA L    +  L+ LQ  L   +  V GL+   +R F++  K
Sbjct: 998  VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1040



 Score = 63.9 bits (154), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 108/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ETGELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP P+ GVL++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + ++   V+ D +          LL   TG + L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLVSL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L ++++   V  L +     + + S I+ + N+  ++GS  GDS L++            
Sbjct: 280 LVIIHEKEKVTGLKIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
          Length = 1093

 Score = 84.3 bits (207), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 86/330 (26%), Positives = 145/330 (43%), Gaps = 31/330 (9%)

Query: 1104 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+   +GR+L+F      +    L+ E   KE KG++ +L +
Sbjct: 782  SDDNNVYYCVGTAYVMPEENEPTKGRILVFVV---EEGKLQLIAE---KETKGSVYSLNA 835

Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KWT    GT EL     +    L  + +    +FI++GD+ KSI
Sbjct: 836  FNGKLLAAINQKIQLYKWTLRDDGTRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 894

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L +K +   +   A+D+ +    A E L D   L        +N    +   K SE  
Sbjct: 895  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG------AENYFNLFTVRKNSEGA 948

Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              +   +L    E+H+G  V +F    ++          P SD      ++FG+++G IG
Sbjct: 949  TDEERSRLEVVGEYHLGEFVNRFQHGSLVMR-------LPDSDVGQIPTVIFGSVNGVIG 1001

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             IA L    +  L+ LQ  L   +  V GL+   +R F  N +       + +D +L+  
Sbjct: 1002 VIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSF--NNEKRTAEAKNFLDGDLIES 1059

Query: 1395 YEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +  L   +  EI+     +  ++   + +L
Sbjct: 1060 FLDLNRSKMEEISRAMSVSAEELCKRVEEL 1089



 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 107/504 (21%), Positives = 197/504 (39%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A  L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR + + G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G   P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSRPTIVVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H      +       + P  WS  NL + A  L+ VP P+ GV+++G  T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS LV+            
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|385304556|gb|EIF48568.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
            [Dekkera bruxellensis AWRI1499]
          Length = 289

 Score = 84.3 bits (207), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 62/272 (22%), Positives = 126/272 (46%), Gaps = 14/272 (5%)

Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISAL 1160
            + +  + +G+   + ED+A +G  +++       +P     +N +  + S+  +G+I   
Sbjct: 3    DTKNYVIVGSGKYRVEDLATKGSWMVYEIIDVVPDPNHPEAKNRLKLIKSESSRGSILGS 62

Query: 1161 ASLQGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             ++ G   +    ++++   K  G  +  +AF D   LY   +   ++ +++GD    + 
Sbjct: 63   CNISGRFSLVQAQRMLVRTIKKDGNAV-PVAFXDTS-LYTKDVKSFEDMMIIGDAFDGLS 120

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
               +  +  ++  L K+  +L   A +F++    L ++ +DE   + +  Y P   ES K
Sbjct: 121  LYGFDAEPYRMLKLGKETQNLSLTACDFIVXEGGLYIIAADEDSVLHLLEYDPYDPESMK 180

Query: 1279 GQKLLSRAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
            G KLL+R+ F    + T      R   + +  D     PG+D    F ++   ++GS   
Sbjct: 181  GXKLLTRSVFRFNGYTTAMRLCDRKNSIFSMLDTLAIPPGADLG--FEVIGCNIEGSFYK 238

Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
            + P +E T+RRL +LQ  + D   H  GLNP+
Sbjct: 239  VTPANEYTYRRLYALQNHISDKESHWLGLNPK 270


>gi|154421858|ref|XP_001583942.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
 gi|121918186|gb|EAY22956.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
          Length = 1297

 Score = 83.6 bits (205), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 153/368 (41%), Gaps = 49/368 (13%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV   +  G +  +     GG       DSII+  + +K+ VL+  D+   L+ T  H
Sbjct: 48  LRLVWEKKFWGEIFGVYRHKSGG-----EYDSIIVGCDTSKVIVLQVIDN--DLKETEYH 100

Query: 161 CFESPEWLHLKRGRES--------FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
            F  P        +               ++  DP G C  +L+    + +L  +     
Sbjct: 101 EFNRPGPPEPDPPKPERPFDISTRLRNKTIMDADPTGTCLALLLAQNILYVLPLANK--- 157

Query: 213 LVGDEDTFGSGGGFSAR---IESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHE 265
            +  E T  +G  + +    I+ +   ++   D K     ++D +F+ GY  P + I+HE
Sbjct: 158 -IKIESTEKAGDEYHSSWKVIKDAFAYDVH-TDFKSPLYRIRDMVFLDGYKNPTLAIIHE 215

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLI---------WSAMNLPHDAYKLLAVPS 316
              TW+ R+  +  T  +S +S     K+  LI         W++  LPH+++ L+ VP 
Sbjct: 216 LIPTWSVRLPLQKSTVAVSIVSPPLKKKETVLISASIDKVTMWTSRALPHNSFGLVHVPD 275

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYA-----VSLDSSQELPRSSFSVELDAAHATW 371
           PIGG LV+  N I Y   +   ALALN  A     V +D +   P      EL +   T 
Sbjct: 276 PIGGFLVLSKNAIIYMDHTNIVALALNKLAYLDDEVPVDITANGPGCH---ELYSKVGTA 332

Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVV-----QRLDLSKTNPSVLTSDITTIGNSLFFLG 426
           +     LL+     L +LT+ Y+G  V           + +PS   S   T   SL F+G
Sbjct: 333 IDKSHILLTVDQHYLSILTLHYNGVKVTNLSLNVNLNLEFHPSCFLSLNYTNNRSLVFMG 392

Query: 427 SRLGDSLL 434
           S   DS L
Sbjct: 393 STTHDSTL 400



 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 98/463 (21%), Positives = 175/463 (37%), Gaps = 50/463 (10%)

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWP-----VQKIPLKATPHQITYFAEKNLYPLIV 1020
            NH  I    Q  +++C L +     N++      V++IP+  T  +I Y    N    I 
Sbjct: 810  NHFLIADEDQ--IRLCNLENIKPEHNFFIIDGCIVERIPVGMTVRRIAYCQNPNCVAFIA 867

Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE-VRILEPDRAGGPWQT 1079
            S P  +P        ID EV   +  H        + Y  E+YE +     +R      T
Sbjct: 868  SHP--EPFTTENEKKIDVEVYENLQVHYQEPPSPAKVYPDEDYETIPKWNEERYSLFLYT 925

Query: 1080 RATIP-MQSSENALTVRVVTLFNTTTKENE------TLLAIGTAYVQGEDVAARGRVLLF 1132
            +  +  M    N   V  V   +TT    +      T LA+G+ ++   +   RG + ++
Sbjct: 926  KDGLQQMVDYANHEIVNTVQFVHTTPMPEDGITLLNTYLAVGSGFLSQPEKMMRGVLYIY 985

Query: 1133 ST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1185
                     G N    + L  E  +K  K  I  +    G++ I  G  + L ++     
Sbjct: 986  QIRYMQNDEGFNEITLRPLYNET-NKIYKNPIIEITDNSGYMAIFCGNLLYLMRFFNENT 1044

Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
              I  +     +  S+  +KN++L  D ++      W++ G +L  +A+D  +    +  
Sbjct: 1045 VKIEAFLVGRFFASSIVSLKNYLLYADSYEGFEVARWRKYGKKLISMARDTMTKLPLSAA 1104

Query: 1246 FLIDGSTLSLVVSDEQKNIQIF----YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1301
            FL     L  VV D+  N  IF    Y  P          ++ ++ F++G       +  
Sbjct: 1105 FLQYEDCLGGVVFDDDGNAHIFDVDEYAIP-------ADAVVRKSIFYIGGRAISSGQFP 1157

Query: 1302 MLATSSDRTGAAPGSDKTNRFALL----------FGTLDGSIGCIAPLDELTFRRLQSLQ 1351
            + A +   T   P  +       L          + T  G IG   P+DE    +L  +Q
Sbjct: 1158 IKAVTQ-ATQQNPNEEIDEELLQLQTKIGGHIAWYVTTHGKIGAFTPIDENDRHKLVGVQ 1216

Query: 1352 KKLVDSVPHVAGLNPRS--FRQFHSNGKAHRPGPDSIVDCELL 1392
                 S+  ++ L  RS  F+        ++  P +++DC++L
Sbjct: 1217 SAYEKSLCGLSHLEYRSGKFKNMIEQDIFNQ-SPKNVIDCDML 1258


>gi|307186138|gb|EFN71863.1| DNA damage-binding protein 1 [Camponotus floridanus]
          Length = 1136

 Score = 83.6 bits (205), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 165/380 (43%), Gaps = 62/380 (16%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            E+G +I+ HNL  +D H   T E      L P                 +E AL+     
Sbjct: 777  EIGQEIEVHNLLIIDQH---TFEVLHAHTLMP-----------------TEYALS----- 811

Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
            L +T   E+ T    +GTA++  ++   + GR+LLF  S G+        +++V  KE+K
Sbjct: 812  LISTRLGEDSTSYYVVGTAFINPDETEPKMGRILLFHWSDGK--------LSQVAEKEIK 863

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            G+  +L    G LL +    + L +WT  +   L    F +   LY   L    +F+L+G
Sbjct: 864  GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKSDFVLVG 920

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ +S+  L +K        +A+D+      + E L D + L    ++   N+ I     
Sbjct: 921  DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFICQKDS 977

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
              +   + Q++    +FH+G  V  F    L +Q L  SS  T             +LFG
Sbjct: 978  AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1027

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T+ G+IG +  +    +  L++L+ KL   +  V  +    +R F ++ K  +   +  +
Sbjct: 1028 TVSGAIGLVTQIPFGFYEFLRNLEDKLTSVIKSVGKIEHNFWRSFKTDLKIEQ--CEGFI 1085

Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
            D +L+  +  L  ++  E+A
Sbjct: 1086 DGDLIESFLDLSHDKMAEVA 1105



 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    I+   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEINLREKEFSKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +  + +     VV+ L +       + 
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLELEKKPDGTQVVKDLKVELLGEISIP 311

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + ++GSRLGDS L++ 
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336


>gi|255316764|gb|ACU01763.1| putative DNA damage binding protein [Brachypodium distachyon]
          Length = 384

 Score = 83.6 bits (205), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 149/332 (44%), Gaps = 35/332 (10%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+   +GR+L+F+     D    L+ E   KE KGA+ +L +
Sbjct: 73   SDDNNFYYCVGTAYVLPEENEPTKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 126

Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW        EL     +    L + +     +FI++GD+ KSI
Sbjct: 127  FNGKLLAAINQKIQLYKWMTREDGSHELQSECGHHGHILALFT-QTRGDFIVVGDLMKSI 185

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTLSLVVSDEQKNIQIFYYAPKM 1273
              L +K + + +  LA+D+ +    A E + D    G+  S  +   +KN      +   
Sbjct: 186  SLLVYKHEESAIEELARDYNANWMTAVEMIDDDIYVGAENSYNLFTVRKN------SDAA 239

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGS 1332
            ++  +G +L    E+H+G  V +F    ++    D   G  P         ++FGT++G 
Sbjct: 240  TDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDTEMGQIP--------TVIFGTINGV 290

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            IG IA L    +  L+ LQ  L   +  V  L+   +R FH+  K       + +D +L+
Sbjct: 291  IGIIASLPHDQYVFLEKLQSILGKFIKGVGSLSHDQWRSFHNEKKT--AEARNFLDGDLI 348

Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              +  L   +  E++   G +   +   + +L
Sbjct: 349  ESFLDLNRSKMEEVSKGMGVSVENLSKRVEEL 380


>gi|452824086|gb|EME31091.1| DNA damage-binding protein 1 isoform 2 [Galdieria sulphuraria]
          Length = 1150

 Score = 83.2 bits (204), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 123/565 (21%), Positives = 230/565 (40%), Gaps = 102/565 (18%)

Query: 840  FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
            +  A L DG +L Y+  L +  ++T      +   R +S+    AS           L  
Sbjct: 612  YFLAALGDGRLLTYR--LDKSAKDTDSEKKFLYDQRQMSIGTQPAS-----------LSI 658

Query: 900  YTREETPH-GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
            +  +   H  A C R T+  + SG       G   C  +  RE  RV    C  S  AF 
Sbjct: 659  FETQNALHVFAACDRPTVIHSSSG------GGKLLCSNVNLREVTRV----CSFSSEAFP 708

Query: 959  VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP--VQKIPLKATPHQITYFAEKNLY 1016
                 +C    + + ++G L +       T DN     ++ IPL   P +I +    +++
Sbjct: 709  -----DC----LALVTEGSLLL------GTVDNIQKLHIRTIPLGEQPRRIAHLDTHHVF 753

Query: 1017 PLIVS---VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1073
             ++ +   V + +  N+ LS   ++     ID+   + +++  +Y +E++E         
Sbjct: 754  AVLTTKQVVTISEDGNEALSETTEEGYVRLIDD---TMMEIVHSYKLEQFETPC------ 804

Query: 1074 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG-EDVAARGRVLLF 1132
                   + I +   ++A             K+N+    +GTAY    E   +RGR+L+F
Sbjct: 805  -------SVITVNFGDDA-----------AAKDNQDYFVVGTAYSYADEPEPSRGRMLVF 846

Query: 1133 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1192
            +        +  +T V  +  KGA+ ++ +  G +L +    + L +W+ TE       +
Sbjct: 847  AV------REQRLTLVAERTFKGALYSMDAFNGKILASVNSMLKLVRWSETESGARTLTE 900

Query: 1193 A----PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
                   ++++ +  + +FIL+GD+ +S+  L++K     +  +A+D         E L 
Sbjct: 901  ECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIEDVARDIDPSWITVIEML- 959

Query: 1249 DGSTLSLVVSDEQK-NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF----LRLQML 1303
                L   +S E   N+         S   +  +L    E+H+G  V +     L LQ+ 
Sbjct: 960  ---DLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRLVLQIP 1016

Query: 1304 AT-----SSDRTGAAPGSDKT------NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
             +      S   G     D        +++    GT +G++G IA +DE TF+ L SLQ 
Sbjct: 1017 ESGISILKSLLYGMYICFDDNLKELFMHKYRFNLGTANGALGVIASIDEKTFQFLHSLQT 1076

Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGK 1377
             L + +  V G+    +R+F S  +
Sbjct: 1077 ALNEVIKGVGGIQHEDWRRFTSERR 1101



 Score = 40.8 bits (94), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           I L +LD   V D  F++G+ +P + +L      +      +H      +L         
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           PL     NL   A  L+ VP+PIGGV+V+G  T+ Y S S      L  Y  S+  S  +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
            R+   ++ D            LL  + G L LL         T       +  L +   
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
             + L S I  + N   ++GS  GDS L++ 
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335


>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
          Length = 1156

 Score = 83.2 bits (204), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 85/339 (25%), Positives = 149/339 (43%), Gaps = 42/339 (12%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQN 1143
            + S+E AL++    L +    +  T   +GTA V  ED   + GR+L+F    + D+  +
Sbjct: 816  LNSNEYALSIISAKLGD----DPATYYILGTAVVNPEDQDPKLGRILIF----HWDDSSS 867

Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVS 1200
             +T +  KE+KGA   +A   G LL A    + L +WT  +   L    F +   L+V +
Sbjct: 868  KLTPITEKEVKGACYGMAEFNGKLLAAVNCTVRLFEWTAEKELRLECSHFNNIVALFVKT 927

Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
                 +FI+ GD+ +S+  L +K        +A+D+      A E + D   L    ++ 
Sbjct: 928  KG---DFIVCGDLMRSLTLLQYKTMEGSFEEIARDYNPKWSTAIEIIDDDVFLG---AEN 981

Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
             KN+ I +    ++      +L    +FH G  +  F           R G+      T+
Sbjct: 982  DKNLFIIHKDSTLTSDEARHQLQEIGQFHCGDLINVF-----------RHGSLVMQHFTD 1030

Query: 1321 RFA-----LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
             +      +L+GT  G++G +  L    F  L  L+K L   V  V  +N + +R +H+ 
Sbjct: 1031 TYVSVQGGILYGTCSGALGLVTQLTPKMFDFLSDLEKSLATVVKGVGKINHQFWRSYHTE 1090

Query: 1376 GKAHRPGPDSIVDCEL------LSHYEMLPLEEQLEIAH 1408
             +      +S VD +L      LS  EM+ + + L+ A+
Sbjct: 1091 IRTE--PSESFVDGDLIESFLDLSKREMIAVVDALQGAY 1127



 Score = 47.8 bits (112), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 58/267 (21%), Positives = 109/267 (40%), Gaps = 59/267 (22%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G +  +DP  R  G+ +Y GL  II              D  G    +  R+E    + +
Sbjct: 122 GAMAVIDPSARVIGLKLYDGLFKII------------PLDKEGELKAYCLRMEE---VEV 166

Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH-PL 297
           +D+D        F++G   P ++I+H+  +   GR         I A  +S   K+    
Sbjct: 167 QDID--------FLYGCANPTIIIIHQDTM---GR--------HIKAKELSIKDKEFVKT 207

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            W   N+  +A  ++ VP P+ G +++G  ++ YH+ S+  A+          S   + +
Sbjct: 208 PWKQENVETEASMIIPVPEPLCGAIIIGRESVLYHNGSSFIAI----------SPPVIKQ 257

Query: 358 SSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL---- 411
           S+      +D     +L  D+A      G L +L + Y+        +      +L    
Sbjct: 258 STIVCYARIDPEGTRYLLGDMA------GHLFMLLLNYEKNPDGTFKIKDPKVDLLGEIS 311

Query: 412 -TSDITTIGNSLFFLGSRLGDSLLVQF 437
               +T + N + ++ SR+GDS L++ 
Sbjct: 312 IPESLTYLDNKIIYVASRVGDSQLIKL 338


>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
 gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName: Full=UV-damaged
            DNA-binding protein 1b; Short=DDB1b
 gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
 gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
          Length = 1088

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ      +   L    + EE E   VR+L+       ++  ++ P+ + E   ++
Sbjct: 716  RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 771

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + T +      +GTAYV   E+   +GR+L+F      +    L+TE   KE 
Sbjct: 772  ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL +   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 822  KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
             +GD+ KSI  L +K +   +   A+D+ +    A E L D   L    +D   NI    
Sbjct: 881  AVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLG---TDNCFNIFTVK 937

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
               + +   +  ++    E+H+G  V +F    ++          P SD      ++FGT
Sbjct: 938  KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 990

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            + G IG IA L +  +  L+ LQ  L   +  V GL+   +R F++  +
Sbjct: 991  VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1039



 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    L+G + ++ +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
                       +ES E +    G  S         G +  +DP  R  G+ +Y GL  +
Sbjct: 94  ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I   ++G                F+ R+E   V++++           F++G  +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK    +   WS  NL + A  L+ VPSP+ 
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S +A  A+ +            + ++   V+LD +          LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
               G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++   
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           +++ DA      + S  + L+  VN   +  +       +   +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428


>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
          Length = 1088

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ      +   L    + EE E   VR+L+       ++  ++ P+ + E   ++
Sbjct: 716  RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 771

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + T +      +GTAYV   E+   +GR+L+F      +    L+TE   KE 
Sbjct: 772  ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL +   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 822  KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
             +GD+ KSI  L +K +   +   A+D+ +    A E L D   L    +D   NI    
Sbjct: 881  AVGDLMKSISLLIYKHEEGAIEERARDYNANWMAAVEILNDDIYLG---TDNCFNIFTVK 937

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
               + +   +  ++    E+H+G  V +F    ++          P SD      ++FGT
Sbjct: 938  KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 990

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            + G IG IA L +  +  L+ LQ  L   +  V GL+   +R F++  +
Sbjct: 991  VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1039



 Score = 64.7 bits (156), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    L+G + ++ +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
                       +ES E +    G  S         G +  +DP  R  G+ +Y GL  +
Sbjct: 94  ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I   ++G                F+ R+E   V++++           F++G  +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK    +   WS  NL + A  L+ VPSP+ 
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S +A  A+ +            + ++   V+LD +          LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
               G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++   
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           +++ DA      + S  + L+  VN   +  +       +   +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428


>gi|345328202|ref|XP_003431248.1| PREDICTED: DNA damage-binding protein 1-like [Ornithorhynchus
            anatinus]
          Length = 1045

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 145/333 (43%), Gaps = 42/333 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 728  KDANTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 781

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 782  NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 836

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 837  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 893

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 894  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 943

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K   P     +D +L+  
Sbjct: 944  LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKT-EPAT-GFIDGDLIES 1001

Query: 1395 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +      +M  +   L+I   +G  R   + +L
Sbjct: 1002 FLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1034


>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
 gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
          Length = 1103

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 83/346 (23%), Positives = 159/346 (45%), Gaps = 44/346 (12%)

Query: 1103 TTKENETLLAIGTAYVQG-EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALA 1161
            T  +NE  + +GTA  +G E+  ++GR+L+       D+   L  E   K++KGA+  L 
Sbjct: 775  TDDDNEYFI-VGTAITEGDEEEPSKGRILVLQV---QDDKLVLKAE---KDVKGAVMVLH 827

Query: 1162 SLQGHLLIASGPKIILHKWTGTEL--NGIAFYDAP---PLYVVSLNIVKNFILLGDIHKS 1216
            S  G LL     +++L KW  ++   N     +      +Y++ ++   +FIL+GD+ KS
Sbjct: 828  SFNGKLLAGVSGRLMLFKWAESDDGDNKDLVQECSCSGGIYILDIDSHGDFILIGDMMKS 887

Query: 1217 IYFLSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            ++   ++    Q     L L++KD+      +   +++ S    V  D+Q N+       
Sbjct: 888  VHLFVYENPEEQHVSGNLRLISKDY-QYSWLSCSLMLNES--EYVAVDQQGNMITLKKND 944

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTK----FLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
            + +   + ++L+   +++    V +    F+ ++   +SSD     P          LFG
Sbjct: 945  EAASEEERKQLVRVGKYYCSDRVNRIQPGFIGMRFANSSSD-INTQPVK------TALFG 997

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI- 1386
            T+ G IG +A L   TF  +  +QK +   V  +A ++  ++RQ+ S     R   DS+ 
Sbjct: 998  TISGGIGVLAQLPPETFAFVTKIQKAMSSVVTGLANISRETYRQYRS----ERTREDSVG 1053

Query: 1387 -VDCELLSHYEMLPLE------EQLEIAHQTGTTRSQILSNLNDLA 1425
             +D + +  +     E      E+L   HQ   T  +++ N+ DL+
Sbjct: 1054 FIDGDFVESFLEFDFETQQRVIEELSNNHQEQITLEELVKNIEDLS 1099



 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 77/358 (21%), Positives = 151/358 (42%), Gaps = 60/358 (16%)

Query: 82  KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAK 141
           KN      R+  +G+S+     V  +   G ++++++    G     ++D + +  ED  
Sbjct: 35  KNQYLQVNRLSEEGVSS-----VVEFEAPGRIDTMSLFRPSG----EKQDLLFITIEDTF 85

Query: 142 ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQ 200
            ++   D  I  L   S+   + P       GR S   G +  +DP  R   + +Y GL 
Sbjct: 86  FTLGFIDGKIETLSSGSI---DDP------VGRRS-ESGSITTIDPLCRAVALSIYEGLL 135

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
            II            ++  F     F+ R+E  +VI++  L+    K          P  
Sbjct: 136 KII--------PFENNKHQFKEA--FNVRLEELNVIDIAFLESLGSK------SKSGPTF 179

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            +L++  +          H       ++   +++  L  + +N+ H A  L+ VP+P+GG
Sbjct: 180 ALLYQDHV-------GSRHVKTYEVKTLDKDMEESSL--NQLNVDHGANILIPVPAPLGG 230

Query: 321 VLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
           V+ VG   + Y ++S        N++V+  ++  +   S+  +LD  +  W   D     
Sbjct: 231 VICVGEAQVSYINESN------KNHSVASPANSRMAIRSYG-KLD--NTRWFLGD----- 276

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            ++G L LL++      V  L L +   + ++S I+ + N   F+GS  GDS +++ +
Sbjct: 277 -QSGQLYLLSLQVSDSEVTGLTLKELGVTSISSCISYLDNGYVFIGSNYGDSQVIRIS 333


>gi|357132340|ref|XP_003567788.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
            distachyon]
          Length = 1090

 Score = 82.4 bits (202), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 86/332 (25%), Positives = 149/332 (44%), Gaps = 35/332 (10%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            + +N     +GTAYV   E+   +GR+L+F+     D    L+ E   KE KGA+ +L +
Sbjct: 779  SDDNNFYYCVGTAYVLPEENEPTKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 832

Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL A   KI L+KW        EL     +    L + +     +FI++GD+ KSI
Sbjct: 833  FNGKLLAAINQKIQLYKWMTREDGSHELQSECGHHGHILALFT-QTRGDFIVVGDLMKSI 891

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTLSLVVSDEQKNIQIFYYAPKM 1273
              L +K + + +  LA+D+ +    A E + D    G+  S  +   +KN      +   
Sbjct: 892  SLLVYKHEESAIEELARDYNANWMTAVEMIDDDIYVGAENSYNLFTVRKN------SDAA 945

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGS 1332
            ++  +G +L    E+H+G  V +F    ++    D   G  P         ++FGT++G 
Sbjct: 946  TDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDTEMGQIP--------TVIFGTINGV 996

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            IG IA L    +  L+ LQ  L   +  V  L+   +R FH+  K       + +D +L+
Sbjct: 997  IGIIASLPHDQYVFLEKLQSILGKFIKGVGSLSHDQWRSFHNEKKTAE--ARNFLDGDLI 1054

Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              +  L   +  E++   G +   +   + +L
Sbjct: 1055 ESFLDLNRSKMEEVSKGMGVSVENLSKRVEEL 1086



 Score = 67.8 bits (164), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 152/367 (41%), Gaps = 83/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G + P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCLRPTIVVLYQ------DNKDARHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  WS  NL + A  L+ VP+P+GGV+++G  TI Y + +++        
Sbjct: 197 LK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEETIVYCNANSTFK------ 247

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
             ++   Q + R+   V+ D +          LL   TG L LL +  +   V  L +  
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSR--------YLLGDNTGILHLLVLTQERERVTGLKIEH 297

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N + ++GSR GDS LV+                   +++ADA    
Sbjct: 298 LGETSVASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADATG-- 337

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
               S  + L+  VN   +  +     + +   +  T S A +D     G ++    G+ 
Sbjct: 338 ----SFVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSIRVVRNGIG 388

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++  KSS            ++D Y  +L++S 
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------FNDPYDTFLVVSF 422

Query: 584 EARTMVL 590
            + T  L
Sbjct: 423 ISETRFL 429


>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
 gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
          Length = 1102

 Score = 82.4 bits (202), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ      +   L    + EE E   VR+L+       ++  ++ P+ + E   ++
Sbjct: 730  RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 785

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + T +      +GTAYV   E+   +GR+L+F      +    L+TE   KE 
Sbjct: 786  ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 835

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL +   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 836  KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 894

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
             +GD+ KSI  L +K +   +   A+D+ +    A E L D   L    +D   NI    
Sbjct: 895  AVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLG---TDNCFNIFTVK 951

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
               + +   +  ++    E+H+G  V +F    ++          P SD      ++FGT
Sbjct: 952  KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 1004

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            + G IG IA L +  +  L+ LQ  L   +  V GL+   +R F++  +
Sbjct: 1005 VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1053



 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 85/370 (22%), Positives = 150/370 (40%), Gaps = 90/370 (24%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P + +L++           +H       
Sbjct: 158 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCTKPTIAVLYQ------DNKDARH------V 204

Query: 286 LSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
            +   +LK    +   WS  NL + A  L+ VPSP+ GVL++G  TI Y S +A  A+ +
Sbjct: 205 KTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSANAFKAIPI 264

Query: 343 NNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
                       + ++   V+LD +          LL    G + LL + ++   V  L 
Sbjct: 265 R---------PSITKAYGRVDLDGSR--------YLLGDHAGLIHLLVITHEKEKVTGLK 307

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
           +     + + S I+ + N++ F+GS  GDS L++                   +++ DA 
Sbjct: 308 IELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL------------------NLQPDA- 348

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSY 520
                + S  + L+  VN   +  +       +   +  T S A +D     G L+    
Sbjct: 349 -----KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRN 398

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
           G+ IN  AS            VEL G KG+W++  KSS             D+ +  +L+
Sbjct: 399 GIGINEQAS------------VELQGIKGMWSL--KSS------------IDEAFDTFLV 432

Query: 581 ISLEARTMVL 590
           +S  + T +L
Sbjct: 433 VSFISETRIL 442


>gi|348560393|ref|XP_003465998.1| PREDICTED: DNA damage-binding protein 1-like [Cavia porcellus]
          Length = 1140

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 127/286 (44%), Gaps = 34/286 (11%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K  +
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ 1084


>gi|403255013|ref|XP_003920244.1| PREDICTED: DNA damage-binding protein 1 [Saimiri boliviensis
            boliviensis]
          Length = 1140

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|395544366|ref|XP_003774082.1| PREDICTED: DNA damage-binding protein 1 [Sarcophilus harrisii]
          Length = 1239

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 922  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 975

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 976  NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 1030

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 1031 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1087

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 1088 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1137

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1138 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1180


>gi|159155577|gb|AAI54419.1| Cpsf1 protein [Danio rerio]
          Length = 400

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 68/240 (28%), Positives = 110/240 (45%), Gaps = 51/240 (21%)

Query: 483 ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINADAS---- 529
           E+ +YGS A + T+ A  T+SF V DS++NIGP    S G         + N +      
Sbjct: 54  EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVV 111

Query: 530 -ATGISKQSNYELV------------ELPGCKGIWTVYHKSSR---------GHNADSSR 567
             +G  K     ++            ELPGC  +WTV +   +         G + +  +
Sbjct: 112 VCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEK 171

Query: 568 MAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
                  D + H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   + +I
Sbjct: 172 REPTIEDDKKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYII 230

Query: 625 QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
           QV   G R+L+G      L F P +         S ++  S+ADPYV++  ++G + + V
Sbjct: 231 QVSPMGIRLLEG---VNQLHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 280


>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
            C-169]
          Length = 1101

 Score = 81.6 bits (200), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 83/321 (25%), Positives = 143/321 (44%), Gaps = 27/321 (8%)

Query: 1113 IGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTA  V  E    +GR+L+F          +LV E   KE+KGA   L   QG L+   
Sbjct: 796  VGTAITVAEEPEPTKGRILVFGA---KGGKLSLVCE---KEVKGAAYNLHPFQGKLIAGI 849

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPL--YVVSLNIVK--NFILLGDIHKSIYFLSWKEQGA 1227
              ++ L KWT +E       +      +V++L IV   +F+++GD+ +S+  L ++    
Sbjct: 850  NSRVQLFKWTQSEDGSRELTNECSHVGHVLALYIVTRGDFVIVGDLMRSLQLLIYRADEG 909

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
             L + A+D+ +    A E L D + L    ++   NI         +      +L +  +
Sbjct: 910  ILEVRARDYKTHWMTAVEVLDDDTYLG---AENSNNIFTLRKNTDAAADEDRNRLETVGQ 966

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
            +H+G  V +F    ++          P S+      +LF T++GSIG IA L +  F+ L
Sbjct: 967  YHLGVFVNRFRHGSLVMK-------LPDSEAAKIPTVLFVTINGSIGVIASLPQQQFQFL 1019

Query: 1348 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHYEMLPLEEQLEI 1406
              LQ  L   +  V GL+  ++R F      H   P  + VD +L+  +  L  +    +
Sbjct: 1020 SRLQDCLRKVIKGVGGLSHVAWRTFQDE---HTKMPSQNFVDGDLIEQFLDLKRDSMERV 1076

Query: 1407 AHQT--GTTRSQILSNLNDLA 1425
            A +   G T   +L  + +L+
Sbjct: 1077 AREMGEGVTSEDLLRMVEELS 1097



 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 125/551 (22%), Positives = 206/551 (37%), Gaps = 113/551 (20%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ V    ++G V ++ +    G      +D + L+ E  K  VLE+D 
Sbjct: 44  RIEIHTLTPEGLKGVADVAIYGRVATMELFRPVG----ESKDLLFLSTERYKFCVLEYDS 99

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
               L   +    E       + GR     G +  VDP          G +MI L    G
Sbjct: 100 ETGELVTRANGDIED------QVGRPC-DNGQIGIVDP----------GCRMIGLHLYDG 142

Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
              ++  +D       F+ RI+  +VI           D IF+ G  +P + +L++    
Sbjct: 143 LFKVIPIDDKGQLHEAFNMRIDELNVI-----------DMIFLEGCAKPTIAVLYQDN-- 189

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
                    H      +     L + P  W   NL   A +++AVP P+GG LVVG + I
Sbjct: 190 -----KDARHIKTYEVVLKEKDLTEGP--WRQSNLDAGASRVIAVPEPLGGALVVGESVI 242

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVL 388
            Y  Q                  Q +  +     +  AH    ++    LL    G+L L
Sbjct: 243 AYMGQ-----------------GQAMKCTPIKATIIRAHGRVDEDGSRYLLGDYVGNLYL 285

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + +DG  V  L +     +   S +T + N + F+GS  GDS LV+            
Sbjct: 286 LVLQHDGEHVAGLKVEPLGRTSAPSTLTYLDNGVVFVGSSGGDSQLVRL----------- 334

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDS 508
                        P T +   +  + L+ M N   +  +       +   +    +    
Sbjct: 335 ----------HPTPVTPQEPSNFVEVLETMTNLGPIIDFVVVDLERQGQGQVVMCS---- 380

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
               G + D S  LRI  +    G+ +Q+    VELPG KG+W +           +S M
Sbjct: 381 ----GIMADGS--LRIVRN--GIGMIEQAT---VELPGIKGMWALR----------ASHM 419

Query: 569 AAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQV 626
            A+D     +L+IS   E R + +   D L E  E   +    +T+  GN      ++QV
Sbjct: 420 DAFD----TFLVISFVGETRILAINADDELDE-AELPGFSADAQTLCCGNTVS-DHLVQV 473

Query: 627 FERGARILDGS 637
                R++D S
Sbjct: 474 AGADVRLVDAS 484


>gi|119594340|gb|EAW73934.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_b [Homo
            sapiens]
          Length = 923

 Score = 81.6 bits (200), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 606  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 659

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 660  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 714

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 715  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 771

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 772  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 821

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 822  LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 864


>gi|307111604|gb|EFN59838.1| hypothetical protein CHLNCDRAFT_29381 [Chlorella variabilis]
          Length = 1108

 Score = 81.3 bits (199), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 125/292 (42%), Gaps = 22/292 (7%)

Query: 1109 TLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            T   +GTA+    E    +GR+ + +           +  V  KE +GA+ +LA  QG L
Sbjct: 798  TYYVVGTAFAPPNEPEPTKGRIFVLAAAGGK------LCVVCEKETRGAVYSLAEFQGRL 851

Query: 1168 LIASGPKIILHKWTGTELNGIAFYD----APPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
            L     ++ ++KW      G A       A  +  + L    + +++GD+ KSI  L+W 
Sbjct: 852  LAGINSRVQMYKWLEQGEGGRALVPECSHAGHVLALYLATRGDLVVVGDLMKSIQLLAWG 911

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L L A+DF      A   L D + +    ++   N+         +   +  +L 
Sbjct: 912  EEEGALELRARDFHPNWMSAVTVLDDDTYMG---AENSYNLFTVRRNADAATDEERSRLE 968

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
            +   +H+G  V +F    ++          P S+ +    +LFGT++G IG +A L    
Sbjct: 969  TVGRYHLGEFVNRFQPGSLVMR-------LPDSELSQIPTVLFGTINGVIGVVASLPHAQ 1021

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
            ++ L+SLQ+ +   V  V G +   +R F +      P     VD +L+  +
Sbjct: 1022 YQLLESLQEAMRKVVKGVGGFDHAQWRAFSNQHMPATPA-RQFVDGDLIEQF 1072


>gi|328770638|gb|EGF80679.1| hypothetical protein BATDEDRAFT_11194 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1098

 Score = 81.3 bits (199), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/333 (24%), Positives = 148/333 (44%), Gaps = 27/333 (8%)

Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ 1142
            P + + + +T+R        T ++     +GT +    ED   RGR+L+F       N  
Sbjct: 765  PFEIASSLITIRF-------TDDDTLYYTVGTGFAFPHEDEPVRGRILVFKV-----NDM 812

Query: 1143 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW-TGTELNGIAFYDAPPLYVVSL 1201
             L+  V+  +++G+  +  S+ G L+      +++ +W + T L  +   +   +  +SL
Sbjct: 813  RLLQLVHEYDIRGSAYSFVSVHGRLVAGVNSNVMVLRWNSDTSLLELQSMNHGHVLALSL 872

Query: 1202 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1261
             +  +FIL+ D+ KSI  L +      L  LA D  S    A E + D + L    +D  
Sbjct: 873  AVRGDFILVADLIKSITLLQFDLATDSLKELAYDADSNWMTAAELIDDDTFLG---ADSS 929

Query: 1262 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1321
             NI             + Q+L  +  FH G  + +F +  +   ++D T A P   +   
Sbjct: 930  MNIFALSKQGDQVSEEERQRLRPKGWFHTGELINRFRKGSLTLHATDETLALPAIPE--- 986

Query: 1322 FALLFGTLDGSIGCIA--PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
              +L+ T+ G+IG +A  P DE T + L +LQ+ L   V  V GL    +R++ +  ++ 
Sbjct: 987  --ILYCTVHGAIGVVARIPSDE-TAKILSTLQEALKSVVQGVGGLIHSDWRRYRTERRSI 1043

Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
            +     I+D +L+  +  L    Q  +  Q  T
Sbjct: 1044 KSA--GIIDGDLIESFLELDRSMQDHVFTQVAT 1074


>gi|395852550|ref|XP_003798801.1| PREDICTED: DNA damage-binding protein 1 [Otolemur garnettii]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|300707023|ref|XP_002995737.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
 gi|239604943|gb|EEQ82066.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
          Length = 1155

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/236 (25%), Positives = 116/236 (49%), Gaps = 12/236 (5%)

Query: 1071 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN--ETLLAIGTAYVQGEDVAARGR 1128
            D     ++  +T  ++S E  L ++ ++L N +   N     + I    V+GED  +RGR
Sbjct: 820  DLYTSAYKFISTFDLESDEYVLDIKELSL-NDSIGINGKNNFIVICVTKVEGEDKHSRGR 878

Query: 1129 VLLFSTGRNADNPQNL-----VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
            +++F       +  N+     +  + S+ +KG I+    ++G+L++A G K +++K   +
Sbjct: 879  IIVFELIDIIVDKANVHKDKKLKVLASENIKGCITKCDEIKGNLIVALGIKTMIYKIDRS 938

Query: 1184 E-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
            E L  I  +D   L   S+  +KNF+L  DI++ + F  ++ +  +LNL+       +  
Sbjct: 939  EGLIPIGIHDLYTL-TTSMITIKNFVLFSDIYRGLSFFYYQNKPVRLNLVCTSESIKNAV 997

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE--FHVGAHVTK 1296
              +F++    L ++ +D   NI  + Y+P    S  G K + R E  F++G  V K
Sbjct: 998  HVDFIVKEPALGIICTDFAGNIHTYTYSPVNILSCNGTKFVKRCETNFNLGKLVIK 1053


>gi|410974071|ref|XP_003993471.1| PREDICTED: DNA damage-binding protein 1 [Felis catus]
          Length = 1193

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 876  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 929

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 930  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 984

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 985  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1041

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 1042 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1091

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1092 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1134


>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
          Length = 1088

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 89/350 (25%), Positives = 151/350 (43%), Gaps = 36/350 (10%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ          L      EE E   VR+L+       ++  +T P+ + E   ++
Sbjct: 716  RRICHQEQTRTFGICSLGNQTNAEESEMHFVRLLDDQ----SFEFVSTYPLDAFEYGCSI 771

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  +   +      +GTAYV   E+   +GR+L+F      D    L+ E   KE 
Sbjct: 772  ----LSCSFADDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGKLQLIAE---KET 821

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KG++ +L +  G LL A   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 822  KGSVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD+ KSI  L +K +   +   A+D+ +    A E L D   L    ++   N+    
Sbjct: 881  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 937

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
               + +   +  +L    E+H+G  V +F    ++    D   G  P         ++FG
Sbjct: 938  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            T++G IG IA L +  +  L+ LQ  L   +  V GL+   +R F++  +
Sbjct: 990  TVNGVIGVIASLPQEQYMFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039



 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 109/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPMYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+ G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    +   WS  NL + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S +A  A+ +            + ++   V++D +          LL    G 
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+         
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA      + S  + L+  VN   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++  KSS     
Sbjct: 373 AFKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                   D+ +  +L++S  + T VL
Sbjct: 409 -------IDEAFDTFLVVSFISETRVL 428


>gi|413081953|ref|NP_741992.2| DNA damage-binding protein 1 [Rattus norvegicus]
 gi|293344614|ref|XP_002725831.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
 gi|293356422|ref|XP_002728912.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
 gi|149062405|gb|EDM12828.1| damage-specific DNA binding protein 1 [Rattus norvegicus]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|119594343|gb|EAW73937.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_e [Homo
            sapiens]
          Length = 896

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/284 (26%), Positives = 126/284 (44%), Gaps = 34/284 (11%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 579  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 632

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 633  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 687

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 688  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 744

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 745  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 794

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K 
Sbjct: 795  LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKT 838


>gi|354504619|ref|XP_003514371.1| PREDICTED: DNA damage-binding protein 1-like [Cricetulus griseus]
 gi|344258340|gb|EGW14444.1| DNA damage-binding protein 1 [Cricetulus griseus]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|270346571|pdb|3I7H|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Hbx
 gi|270346573|pdb|3I7K|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Whx
 gi|270346575|pdb|3I7L|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Ddb2
 gi|270346577|pdb|3I7N|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdtc1
 gi|270346579|pdb|3I7O|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Iqwd1
 gi|270346581|pdb|3I7P|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr40a
 gi|270346583|pdb|3I89|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr22
 gi|270346585|pdb|3I8C|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr21a
 gi|270346587|pdb|3I8E|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr42a
 gi|270346588|pdb|3I8E|B Chain B, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr42a
          Length = 1143

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 826  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 879

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 880  NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 934

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 935  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 991

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 992  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1041

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1042 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1084


>gi|259155222|ref|NP_001158852.1| DNA damage-binding protein 1 [Salmo salar]
 gi|223647700|gb|ACN10608.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 1139

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)

Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTA V  E+   + GR+++F      D     V E   KE+KGA+ ++    G LL + 
Sbjct: 830  VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEFNGKLLASI 883

Query: 1172 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
               + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+  L++K   
Sbjct: 884  NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 938

Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
                 +A+DF      A E L D + L    ++   N+ +       +   + Q L    
Sbjct: 939  GNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 995

Query: 1287 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
             FH+G  V  F    L LQ L  SS  T  +          +LFGT++G IG +  L E 
Sbjct: 996  VFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 1045

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
             +  L  LQ +L   +  V  +    +R FH+  K  +      +D +L+  +
Sbjct: 1046 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 1096



 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 77/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG VV + L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S  S     + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+   G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479


>gi|311247551|ref|XP_003122699.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Sus scrofa]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|384941436|gb|AFI34323.1| DNA damage-binding protein 1 [Macaca mulatta]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|221046721|pdb|3EI4|A Chain A, Structure Of The Hsddb1-Hsddb2 Complex
 gi|221046723|pdb|3EI4|C Chain C, Structure Of The Hsddb1-Hsddb2 Complex
 gi|221046725|pdb|3EI4|E Chain E, Structure Of The Hsddb1-Hsddb2 Complex
          Length = 1158

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 841  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 894

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 895  NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 949

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 950  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1006

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 1007 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1056

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1057 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1099


>gi|122692537|ref|NP_001073731.1| DNA damage-binding protein 1 [Bos taurus]
 gi|426251842|ref|XP_004019630.1| PREDICTED: DNA damage-binding protein 1 [Ovis aries]
 gi|134034086|sp|A1A4K3.1|DDB1_BOVIN RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|119223918|gb|AAI26630.1| Damage-specific DNA binding protein 1, 127kDa [Bos taurus]
 gi|296471644|tpg|DAA13759.1| TPA: DNA damage-binding protein 1 [Bos taurus]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|90108797|pdb|2B5L|A Chain A, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
            Protein
 gi|90108798|pdb|2B5L|B Chain B, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
            Protein
 gi|90108801|pdb|2B5M|A Chain A, Crystal Structure Of Ddb1
 gi|116667897|pdb|2HYE|A Chain A, Crystal Structure Of The Ddb1-cul4a-rbx1-sv5v Complex
 gi|1136228|gb|AAA88883.1| UV-damaged DNA binding factor [Homo sapiens]
 gi|1588524|prf||2208446A xeroderma pigmentosum group E-binding factor
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|355683071|gb|AER97036.1| damage-specific DNA binding protein 1, 127kDa [Mustela putorius furo]
          Length = 1122

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|361132523|pdb|4A0L|A Chain A, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
 gi|361132525|pdb|4A0L|C Chain C, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
          Length = 1144

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 827  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 880

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 881  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 935

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 936  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 992

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 993  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1042

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1043 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1085


>gi|73983859|ref|XP_533275.2| PREDICTED: DNA damage-binding protein 1 [Canis lupus familiaris]
 gi|291409601|ref|XP_002721069.1| PREDICTED: damage-specific DNA binding protein 1 [Oryctolagus
            cuniculus]
 gi|301781686|ref|XP_002926259.1| PREDICTED: DNA damage-binding protein 1-like [Ailuropoda melanoleuca]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|418316|sp|P33194.1|DDB1_CERAE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=DDBa; AltName:
            Full=Damage-specific DNA-binding protein 1; AltName:
            Full=UV-damaged DNA-binding protein 1; Short=UV-DDB 1
 gi|304026|gb|AAA03021.1| UV-damaged DNA-binding protein [Chlorocebus aethiops]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|221046711|pdb|3EI1|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Photoproduct
            Containing Dna-Duplex
 gi|221046715|pdb|3EI2|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Abasic Site
            Containing Dna-Duplex
 gi|221046719|pdb|3EI3|A Chain A, Structure Of The Hsddb1-Drddb2 Complex
          Length = 1158

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 841  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 894

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 895  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 949

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 950  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1006

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 1007 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1056

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1057 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1099


>gi|194381178|dbj|BAG64157.1| unnamed protein product [Homo sapiens]
          Length = 826

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 509  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 562

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 563  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 617

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 618  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 674

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 675  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 724

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 725  LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 767


>gi|344295432|ref|XP_003419416.1| PREDICTED: DNA damage-binding protein 1 [Loxodonta africana]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/285 (26%), Positives = 126/285 (44%), Gaps = 34/285 (11%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K  
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE 1083


>gi|149725200|ref|XP_001502072.1| PREDICTED: DNA damage-binding protein 1 [Equus caballus]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|441604084|ref|XP_004087862.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
            [Nomascus leucogenys]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
 gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
          Length = 1159

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 842  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 896  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 951  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100



 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 96/461 (20%), Positives = 172/461 (37%), Gaps = 116/461 (25%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP+ R  G+ +Y     ++   +    L            F+ R+E  HVI+++     
Sbjct: 143 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 187

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
                 F++G   P +  +++      GR     H              + P  W   N+
Sbjct: 188 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 231

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
             +A  ++AVPSP GG +++G  +I YH+     A+A             + + S  V  
Sbjct: 232 EAEASMVIAVPSPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 280

Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
             +D   + +L  D+       G L +L +      DG V ++ L +     + +   +T
Sbjct: 281 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 334

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            + N + F+GSRLGDS LV+    S   G+ +++       G I  D       R+    
Sbjct: 335 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 393

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            +                        T S A ++     G L+    G+ I+  AS    
Sbjct: 394 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 420

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +    +R              E    L++S   +T VL   
Sbjct: 421 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 458

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE + +    +T   GN+   +++IQ+     R++
Sbjct: 459 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 498


>gi|148529014|ref|NP_001914.3| DNA damage-binding protein 1 [Homo sapiens]
 gi|296218432|ref|XP_002807395.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
            [Callithrix jacchus]
 gi|397516558|ref|XP_003828491.1| PREDICTED: DNA damage-binding protein 1 [Pan paniscus]
 gi|402893195|ref|XP_003909786.1| PREDICTED: DNA damage-binding protein 1 [Papio anubis]
 gi|426368721|ref|XP_004051351.1| PREDICTED: DNA damage-binding protein 1 [Gorilla gorilla gorilla]
 gi|12643730|sp|Q16531.1|DDB1_HUMAN RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=DNA damage-binding protein a;
            Short=DDBa; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=HBV X-associated protein 1;
            Short=XAP-1; AltName: Full=UV-damaged DNA-binding factor;
            AltName: Full=UV-damaged DNA-binding protein 1;
            Short=UV-DDB 1; AltName: Full=XPE-binding factor;
            Short=XPE-BF; AltName: Full=Xeroderma pigmentosum group
            E-complementing protein; Short=XPCe
 gi|203282525|pdb|3E0C|A Chain A, Crystal Structure Of Dna Damage-Binding Protein 1(Ddb1)
 gi|695362|gb|AAA62838.1| X-associated protein 1, partial [Homo sapiens]
 gi|1052865|gb|AAC50349.1| DDBa p127 [Homo sapiens]
 gi|15079750|gb|AAH11686.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|29792243|gb|AAH50530.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|30354567|gb|AAH51764.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|61354161|gb|AAX44048.1| damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|119594341|gb|EAW73935.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_c [Homo
            sapiens]
 gi|168275638|dbj|BAG10539.1| DNA damage-binding protein 1 [synthetic construct]
 gi|189065506|dbj|BAG35345.1| unnamed protein product [Homo sapiens]
 gi|355566436|gb|EHH22815.1| Damage-specific DNA-binding protein 1 [Macaca mulatta]
 gi|380784123|gb|AFE63937.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|380808126|gb|AFE75938.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|380810144|gb|AFE76947.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|383408123|gb|AFH27275.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|410305600|gb|JAA31400.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
 gi|410352015|gb|JAA42611.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|400260815|pdb|4E54|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
            Dimerization And Its Roles In Chromatinized Dna Repair
 gi|401871507|pdb|4E5Z|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
            Dimerization And Its Roles In Chromatinized Dna Repair
          Length = 1150

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 833  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 886

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 887  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 941

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 942  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 998

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 999  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1048

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1049 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1091


>gi|5353754|gb|AAD42230.1|AF159853_1 damage-specific DNA binding protein 1 [Mus musculus]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/285 (26%), Positives = 126/285 (44%), Gaps = 34/285 (11%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K  
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE 1083


>gi|359546285|pdb|4A11|A Chain A, Structure Of The Hsddb1-Hscsa Complex
 gi|361132519|pdb|4A0K|C Chain C, Structure Of Ddb1-Ddb2-Cul4a-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
          Length = 1159

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 842  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 896  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 951  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100


>gi|358440066|pdb|4A0A|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.6 A Resolution (Cpd 3)
          Length = 1159

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 842  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 896  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 951  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100


>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis niloticus]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 94/365 (25%), Positives = 151/365 (41%), Gaps = 60/365 (16%)

Query: 1041 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1100
            G +++ HNL  VD H       +EV                 +P   SE AL++    L 
Sbjct: 783  GEEVEVHNLLVVDQHT------FEV-----------LHAHQFLP---SEYALSLVSCRL- 821

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1159
                K+      +GTA V  E+   + GR+++F      D     V E   KE+KGA+ +
Sbjct: 822  ---GKDPSVYFIVGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYS 872

Query: 1160 LASLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            +    G  L +    + L++WT      TE N   + +   LY   L    +FIL+GD+ 
Sbjct: 873  MVEFNGKFLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLM 927

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +S+  L++K        +A+DF      A E L D + L    ++   N+ +       +
Sbjct: 928  RSVLLLAYKSMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAAT 984

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1330
               + Q L     FH+G  V  F    L LQ L  SS  T  +          +LFGT++
Sbjct: 985  TDEERQHLQEVGLFHLGEFVNVFCHGSLVLQNLGESSTPTQGS----------VLFGTVN 1034

Query: 1331 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
            G IG +  L E  +  L  LQ +L   +  V  +    +R FH+  K  +      +D +
Sbjct: 1035 GMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGD 1092

Query: 1391 LLSHY 1395
            L+  +
Sbjct: 1093 LIESF 1097



 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 74/341 (21%), Positives = 129/341 (37%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S        + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVAVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+     R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479


>gi|358440058|pdb|4A08|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Duplex (
            Purine At D-1 Position) At 3.0 A Resolution (Cpd 1)
 gi|358440062|pdb|4A09|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 15 Bp Cpd-Duplex
            (Purine At D-1 Position) At 3.1 A Resolution (Cpd 2)
          Length = 1159

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 842  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 896  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 951  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100


>gi|224587439|gb|ACN58665.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 444

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)

Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTA V  E+   + GR+++F      D     V E   KE+KGA+ ++    G LL + 
Sbjct: 135  VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMMEFNGKLLASI 188

Query: 1172 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
               + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+  L++K   
Sbjct: 189  NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 243

Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
                 +A+DF      A E L D + L    ++   N+ +       +   + Q L    
Sbjct: 244  GNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 300

Query: 1287 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
             FH+G  V  F    L LQ L  SS  T  +          +LFGT++G IG +  L E 
Sbjct: 301  VFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 350

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
             +  L  LQ +L   +  V  +    +R FH+  K  +      +D +L+  +
Sbjct: 351  WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 401


>gi|74215029|dbj|BAE33503.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQRDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|7657011|ref|NP_056550.1| DNA damage-binding protein 1 [Mus musculus]
 gi|134034087|sp|Q3U1J4.2|DDB1_MOUSE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=UV-damaged DNA-binding factor
 gi|5931596|dbj|BAA84699.1| XPE UV-damaged DNA binding factor [Mus musculus]
 gi|16307148|gb|AAH09661.1| Damage specific DNA binding protein 1 [Mus musculus]
 gi|74182145|dbj|BAE34102.1| unnamed protein product [Mus musculus]
 gi|74196166|dbj|BAE32993.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 80.9 bits (198), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|74138855|dbj|BAE27231.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 80.9 bits (198), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|385865228|gb|AFI92852.1| DNA damage-binding protein 1 [Danio rerio]
          Length = 1140

 Score = 80.9 bits (198), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)

Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTA V  E+   + GR+++F      D     V E   KE+KGA+ ++    G LL + 
Sbjct: 831  VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEFNGKLLASI 884

Query: 1172 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
               + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+  L++K   
Sbjct: 885  NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 939

Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
                 +A+DF      A E L D + L    ++   N+ +       +   + Q L    
Sbjct: 940  GSFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 996

Query: 1287 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
             FH+G  V  F    L LQ L  SS  T  +          +LFGT++G IG +  L E 
Sbjct: 997  LFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 1046

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
             +  L  LQ +L   +  V  +    +R FH+  K  +      +D +L+  +
Sbjct: 1047 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 1097


>gi|74178494|dbj|BAE32502.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 80.9 bits (198), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKLGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|355752055|gb|EHH56175.1| Damage-specific DNA-binding protein 1, partial [Macaca fascicularis]
          Length = 1125

 Score = 80.9 bits (198), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 808  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 861

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 862  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 916

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 917  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 973

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 974  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1023

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1024 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1066


>gi|383863765|ref|XP_003707350.1| PREDICTED: DNA damage-binding protein 1-like [Megachile rotundata]
          Length = 1138

 Score = 80.9 bits (198), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 90/378 (23%), Positives = 161/378 (42%), Gaps = 58/378 (15%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            ++  +I+ HNL  +D H   T E     +L P+                 E AL+     
Sbjct: 779  DISQEIEVHNLLIIDQH---TFEVLHAHMLMPN-----------------EYALS----- 813

Query: 1099 LFNTTTKENETLL-AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
            L +T   E+ T    +GTA V  ++   + GR+LL+            +T+V  KE+KG+
Sbjct: 814  LISTKLGEDPTFYYVVGTALVNPDETEPKMGRILLYHWNDGK------LTQVAEKEIKGS 867

Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
              +L    G LL +    + L +WT  +   L    F +   LY   L    +F+L+GD+
Sbjct: 868  CYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVGDL 924

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
             +S+  L +K        +A+D+      A E L D + L    ++   N+ +       
Sbjct: 925  MRSLTLLQYKTMEGSFEEIARDYNPNWMTAVEILDDDTFLG---AENCFNLFVCQKDSAA 981

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
            +   + Q++    +FH+G  V  F    L +Q L  SS  T             +LFGT+
Sbjct: 982  TSEDERQQMQEIGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFGTV 1031

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
             G+IG +  +    +  L+ L+ +L + +  V  +  R +R F++  K      +  +D 
Sbjct: 1032 SGAIGLVTQIPFTFYEFLRHLEYRLTEVIKSVGKIEHRFWRSFNTELKVE--NCEGFIDG 1089

Query: 1390 ELLSHYEMLPLEEQLEIA 1407
            +L+  +  L  ++  E+A
Sbjct: 1090 DLIESFLDLSPDKMAEVA 1107



 Score = 57.4 bits (137), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 92/422 (21%), Positives = 166/422 (39%), Gaps = 90/422 (21%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 211 RQDNVEREATMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 260

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DG-RVVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +      DG +VV+ L +       + 
Sbjct: 261 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKNPDGTQVVKDLKVELLGEISIP 314

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
             IT + N + F+GSRLGDS L++                     +AD   +  +   + 
Sbjct: 315 ECITYLDNGVIFVGSRLGDSQLIKLIT------------------KADENGSYCVPMETF 356

Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATG 532
             L  +V+   + L        +    T S A ++     G L+    G+ I   AS   
Sbjct: 357 TNLAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS--- 404

Query: 533 ISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
                    ++LPG KG+W +      G N D++            L++S   +T +L  
Sbjct: 405 ---------IDLPGIKGMWAL---RIGGGNFDNT------------LVLSFVGQTRILTL 440

Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
                E T+   +    +T   GN+      IQ+    AR++     T    + P N  +
Sbjct: 441 NGEEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRT 499

Query: 653 GS 654
            S
Sbjct: 500 IS 501


>gi|223647932|gb|ACN10724.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 1139

 Score = 80.9 bits (198), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)

Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTA V  E+   + GR+++F      D     V E   KE+KGA+ ++    G LL + 
Sbjct: 830  VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMMEFNGKLLASI 883

Query: 1172 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
               + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+  L++K   
Sbjct: 884  NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 938

Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
                 +A+DF      A E L D + L    ++   N+ +       +   + Q L    
Sbjct: 939  GNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 995

Query: 1287 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
             FH+G  V  F    L LQ L  SS  T  +          +LFGT++G IG +  L E 
Sbjct: 996  VFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 1045

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
             +  L  LQ +L   +  V  +    +R FH+  K  +      +D +L+  +
Sbjct: 1046 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 1096



 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 76/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG VV + L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S  S     + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  ++ R              E    L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPLRSEAGR--------------ETDDMLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+   G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479


>gi|19074861|ref|NP_586367.1| CLEAVAGE AND POLYADENYLATION SPECIFIC FACTOR [Encephalitozoon
            cuniculi GB-M1]
 gi|19069586|emb|CAD25971.1| CLEAVAGE AND POLYADENYLATION SPECIFIC FACTOR [Encephalitozoon
            cuniculi GB-M1]
          Length = 1156

 Score = 80.9 bits (198), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 72/323 (22%), Positives = 148/323 (45%), Gaps = 26/323 (8%)

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL-----KGAISALASLQ 1164
             L + T +++GED  ARGR+ +        + ++   +   K L     KG+I     ++
Sbjct: 853  FLLVCTTFIEGEDRPARGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVRCEEVR 912

Query: 1165 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
            G + +  G KI+++K   +  +  I FYD   ++  S+++VKN+IL  DI++ + F  ++
Sbjct: 913  GKIALCLGTKIMIYKIDRSSGIIPIGFYDLH-IFTSSISVVKNYILASDIYRGLSFFFFQ 971

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
             +  +L+L++      +  +TE L  G+ LS++  D +  I  + Y+P    S  G +L+
Sbjct: 972  SKPIRLHLISSSEPLRNATSTELLSTGNELSMLCCDAKGTIHGYTYSPNNIISMDGARLV 1031

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
             RAE               + T+  R  +     K N  +++F +    +  ++ +D+  
Sbjct: 1032 KRAE---------------IKTNLGRLSSFGAGFKKN--SIMFYSRSNMLIHVSGIDDAH 1074

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            + +L  +Q  ++  +  V GLN R +   +S+   H     S +   +L+ +    +  Q
Sbjct: 1075 YLKLLGVQTAIMAHLKSVFGLNQRDY--LNSDIHLHSLSLKSPIVLHILNLFSYFDMSTQ 1132

Query: 1404 LEIAHQTGTTRSQILSNLNDLAL 1426
              ++      R +I   +  L L
Sbjct: 1133 ESVSSSARIDRKEISDMIASLNL 1155


>gi|197097564|ref|NP_001126613.1| DNA damage-binding protein 1 [Pongo abelii]
 gi|75041202|sp|Q5R649.1|DDB1_PONAB RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|55732122|emb|CAH92767.1| hypothetical protein [Pongo abelii]
          Length = 1140

 Score = 80.5 bits (197), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+  +   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYPMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|429965418|gb|ELA47415.1| hypothetical protein VCUG_01066 [Vavraia culicis 'floridensis']
          Length = 1176

 Score = 80.5 bits (197), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 57/181 (31%), Positives = 99/181 (54%), Gaps = 18/181 (9%)

Query: 1150 SKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLNIVKNFI 1208
            S+  KG IS  A+++G + ++   K+++++    + +  IAFYD   +Y VSL ++KN+I
Sbjct: 927  SERTKGPISCCAAVRGKIAVSLATKLMVYECDRNSGIVAIAFYDLY-MYAVSLAVIKNYI 985

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAK-----DFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
            ++GDI   ++F+ ++ +  +L+LL+K     + GSLD F       G +L +   D+   
Sbjct: 986  IVGDIMMGLHFVYFQSEPVKLHLLSKSDRIANLGSLDFFNA-----GESLFITGIDKTGK 1040

Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
            +QIF ++P    S  G+KL+ R EF   A        Q + TSS R+ A+  S +    A
Sbjct: 1041 VQIFSFSPSNLYSNGGEKLVKRQEFETYAC------FQSIKTSSYRSYASFFSSQNFLIA 1094

Query: 1324 L 1324
            L
Sbjct: 1095 L 1095


>gi|345498295|ref|XP_001607743.2| PREDICTED: DNA damage-binding protein 1-like [Nasonia vitripennis]
          Length = 1140

 Score = 80.5 bits (197), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 89/380 (23%), Positives = 163/380 (42%), Gaps = 62/380 (16%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            E+G +++ HNL  VD H   T E      L P                      T   ++
Sbjct: 781  EIGQEVEIHNLLIVDQH---TFEVLHAHTLVP----------------------TEYAMS 815

Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
            L +T   E+ T    +GTA +  ++   + GR+LL+  + G+        +T+V  KE+K
Sbjct: 816  LISTKLGEDPTPYYIVGTAMINPDESEPKSGRILLYHWNDGK--------LTQVAEKEIK 867

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            G+  +L    G LL +    + L +WT  +   L    F +   LY   L    +F+L+G
Sbjct: 868  GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 924

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ +S+  L +K        +A+D+      + E L D + L    ++   N+ +     
Sbjct: 925  DLMRSVTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFVCQKDS 981

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
              +   + Q++    +FH+G  V  F    L +Q L  SS  T             +LFG
Sbjct: 982  AATSEEERQQMQEVGQFHLGDMVNVFRHGSLVMQHLGESSTPTHGC----------VLFG 1031

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T+ G+IG +  +    +  L++L+ +L   +  V  +    +R F+++ K  +   +  +
Sbjct: 1032 TVCGAIGLVTQIPSTFYEFLRNLEDRLTSVIKSVGKIEHNFWRSFNTDLKIEQ--CEGFI 1089

Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
            D +L+  +  L  E+  E+A
Sbjct: 1090 DGDLIESFLDLSHEKMAEVA 1109



 Score = 60.8 bits (146), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 90/420 (21%), Positives = 160/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D ++V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MDEQNVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +         Y   +    +    S
Sbjct: 211 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTT--------YVTVVPPIIKQSTIS 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L +  D +     V++ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDLA------GHLFMLFLEQDKKADGSMVIKDLKVELLGEVSIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS L++       +       E F ++   AP           A
Sbjct: 317 ITYLDNGVIFIGSRLGDSQLIKLNTKPDENGSYCSTMETFTNL---APIVDM-------A 366

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + D+                +    T S A ++     G L+    G+ I   AS     
Sbjct: 367 VVDL------------ERQGQGQIVTCSGAFKE-----GSLRIIRNGIGIQEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +   S    N                L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWALKVDSVNFDNT---------------LVLSFVGQTRILMLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E TE   +    +T   GN+     +IQ+    AR++     +    + P N  + S
Sbjct: 443 EEVEETEIPGFVADEQTFHTGNV-TNDVIIQITPTSARLISNKSSSVISEWEPDNKRTIS 501


>gi|194377326|dbj|BAG57611.1| unnamed protein product [Homo sapiens]
          Length = 451

 Score = 80.5 bits (197), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 134  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 187

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 188  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 242

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 243  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 299

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 300  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 349

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 350  LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 392


>gi|147906138|ref|NP_001083624.1| DNA damage-binding protein 1 [Xenopus laevis]
 gi|82186503|sp|Q6P6Z0.1|DDB1_XENLA RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|38303806|gb|AAH61946.1| Ddb1 protein [Xenopus laevis]
          Length = 1140

 Score = 80.5 bits (197), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 144/333 (43%), Gaps = 42/333 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  ++   + GR+++F       N   L T V  KE+KGA+ ++   
Sbjct: 823  KDPTTYFVVGTAMVYPDEAEPKQGRIVVFQY-----NDGKLQT-VAEKEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSPPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K   P     +D +L+  
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIES 1096

Query: 1395 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +      +M  +   L+I   +G  R   + +L
Sbjct: 1097 FLDISRPKMQEVIANLQIDDGSGMKRETTVDDL 1129


>gi|301616502|ref|XP_002937687.1| PREDICTED: DNA damage-binding protein 1-like [Xenopus (Silurana)
            tropicalis]
          Length = 1140

 Score = 80.1 bits (196), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 144/333 (43%), Gaps = 42/333 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  ++   + GR+++F       N   L T V  KE+KGA+ ++   
Sbjct: 823  KDPTTYFVVGTAMVYPDEAEPKQGRIVVFQY-----NDGKLQT-VAEKEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSPPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K   P     +D +L+  
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIES 1096

Query: 1395 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +      +M  +   L+I   +G  R   + +L
Sbjct: 1097 FLDISRPKMQEVIANLQIDDGSGMKRETTVDDL 1129


>gi|327278830|ref|XP_003224163.1| PREDICTED: DNA damage-binding protein 1-like [Anolis carolinensis]
          Length = 1140

 Score = 80.1 bits (196), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 145/333 (43%), Gaps = 42/333 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  ++   + GR+++F     +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPDEAEPKQGRIVVF---HYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LYV +     +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALYVKTKG---DFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEFGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K   P     +D +L+  
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKT-EPAT-GFIDGDLIES 1096

Query: 1395 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +      +M  +   L+I   +G  R   + +L
Sbjct: 1097 FLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129


>gi|320163506|gb|EFW40405.1| UV-damaged DNA binding protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1123

 Score = 80.1 bits (196), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 76/302 (25%), Positives = 142/302 (47%), Gaps = 31/302 (10%)

Query: 1074 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLF 1132
            G  ++ R +  + S+E  ++  +   F   + ++   L +GTA+V   ED   RGR+L+F
Sbjct: 840  GQTFEIRDSFQLPSTETIMSF-ISCSFANDSSDSTVYLVVGTAFVIPSEDEPKRGRILVF 898

Query: 1133 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW--TGTELNGIAF 1190
                 A    +LVT   +K++KG + +L +  G LL     K+ L KW  TG  +  +  
Sbjct: 899  DVAGGA---LHLVT---AKDVKGCVYSLNAFNGKLLAGINSKVNLFKWNLTGDGIRELVS 952

Query: 1191 YDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKD-----FGSLDCFA 1243
              +   ++++L +    +FI++GD+ +SI  L +K   + +  +A+D       ++D   
Sbjct: 953  ECSHHGHILTLYLKSRGDFIIVGDLMRSISLLMYKSGTSSIEEIAQDTCPNWVTAVDMLD 1012

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
             +  I G + S  +   ++N++        S   + ++L    EFHVG  + +F    ++
Sbjct: 1013 DDVFIGGES-SFNIFTCRRNLE-------ASTDEERKRLEVVGEFHVGEFINQFRAGSLV 1064

Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
                D        ++  + + LFGT +G IG IA L    +  LQ +Q  +   +  V G
Sbjct: 1065 MKLPDE------QEQPIQPSTLFGTGNGVIGVIARLTRSQYEFLQLVQAAMAKVIKGVGG 1118

Query: 1364 LN 1365
            LN
Sbjct: 1119 LN 1120



 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 75/318 (23%), Positives = 127/318 (39%), Gaps = 69/318 (21%)

Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
           N+R L+   V D  F+ GY  P +++L++             H      L       + P
Sbjct: 208 NIR-LEELQVFDIKFLRGYDRPTILVLYQD-------TKETRHVKTYQVLLKEKEFAEGP 259

Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
             W+  N+   A  L+ V  P+GGVL+VG  TI YHS SA  ++A+    +         
Sbjct: 260 --WAQNNVEGGASLLIPVLMPLGGVLIVGEQTITYHSGSAFRSVAMRPAII--------- 308

Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
              +SV         +  +  LL+   G+L+ + + +D +  V  + + +   + + S +
Sbjct: 309 -KCYSV---------IDTNRFLLADSEGNLLSVLLTHDRQDKVTAIKIDRLGVTSILSCL 358

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
           T + N + F GS+ GDS L++              ++E G       S   L      A+
Sbjct: 359 TYLDNGVVFGGSQFGDSQLLRLATE----------RDETGSFVRVLESFSNLGPICDMAV 408

Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
            D+                +    T S A +D     G L+    G+         GI +
Sbjct: 409 VDL------------ERQGQCQVVTCSGAFKD-----GSLRVVRNGV---------GIEE 442

Query: 536 QSNYELVELPGCKGIWTV 553
           Q+    +ELPG KGIW++
Sbjct: 443 QAT---IELPGIKGIWSL 457


>gi|449328561|gb|AGE94838.1| cleavage and polyadenylation specific factor [Encephalitozoon
            cuniculi]
          Length = 1156

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 73/325 (22%), Positives = 149/325 (45%), Gaps = 30/325 (9%)

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL-----KGAISALASLQ 1164
             L + T +++GED  ARGR+ +        + ++   +   K L     KG+I     ++
Sbjct: 853  FLLVCTTFIEGEDRPARGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVRCEEVR 912

Query: 1165 GHLLIASGPKIILHKWTGTELNGI---AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
            G + +  G KI+++K   +  NGI    FYD   ++  S+++VKN+IL  DI++ + F  
Sbjct: 913  GKIALCLGTKIMIYKIDRS--NGIIPIGFYDLH-IFTSSISVVKNYILASDIYRGLSFFF 969

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            ++ +  +L+L++      +  +TE L  G+ LS++  D +  I  + Y+P    S  G +
Sbjct: 970  FQSKPIRLHLISSSEPLRNATSTELLSTGNELSMLCCDAKGTIHGYTYSPNNIISMDGAR 1029

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
            L+ RAE               + T+  R  +     K N  +++F +    +  ++ +D+
Sbjct: 1030 LVKRAE---------------IKTNLGRLSSFGAGFKKN--SIMFYSRSNMLIHVSGIDD 1072

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
              + +L  +Q  ++  +  V GLN R +   +S+   H     + +   +L+ +    + 
Sbjct: 1073 AHYLKLLGVQTAIMAHLKSVFGLNQRDY--LNSDIHLHSLSLKNPIVLHILNLFSYFDMS 1130

Query: 1402 EQLEIAHQTGTTRSQILSNLNDLAL 1426
             Q  ++      R +I   +  L L
Sbjct: 1131 TQESVSSSARIDRKEISDMIASLNL 1155


>gi|74208347|dbj|BAE26370.1| unnamed protein product [Mus musculus]
          Length = 599

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+ +F   + +D     V E   KE+KGA+ ++   
Sbjct: 282  KDPNTYFIVGTAMVYPEEAEPKQGRIAVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 335

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 336  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 390

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 391  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 447

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 448  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 497

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 498  LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 540


>gi|444513057|gb|ELV10249.1| DNA damage-binding protein 1 [Tupaia chinensis]
          Length = 1146

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 77/291 (26%), Positives = 128/291 (43%), Gaps = 40/291 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T            ++LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS------FRQFHSNGKAH 1379
             +  L E  +  L  +Q +L   +     ++P S      +R FH+  K  
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKRCFQISPNSLTDMSTWRSFHTERKTE 1089


>gi|328788389|ref|XP_396048.3| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Apis
            mellifera]
          Length = 1141

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 92/380 (24%), Positives = 166/380 (43%), Gaps = 62/380 (16%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            ++  +I+ HNL  +D H   T E     +L P                 +E AL+     
Sbjct: 782  DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816

Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
            L +T   E+ T    +GTA V  ++   + GR+LL+  S G+        +T+V  KE+K
Sbjct: 817  LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 868

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            G+  +L    G LL +    + L +WT  +   L    F +   LY+ S     +FIL+G
Sbjct: 869  GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKSKG---DFILVG 925

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ +S+  L +K        +A+D+      A E L D + L    ++   N+ +     
Sbjct: 926  DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
              +   + Q++    +FH+G  V  F    L +Q L  SS  T             +LFG
Sbjct: 983  AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1032

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T+ G+IG +  +  + +  L++L+ +L   +  V  +    +R F++  K  +   +  +
Sbjct: 1033 TVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090

Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
            D +L+  +  L  ++  E+A
Sbjct: 1091 DGDLIESFLDLSPDKMAEVA 1110



 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKADGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501


>gi|195996153|ref|XP_002107945.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
 gi|190588721|gb|EDV28743.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
          Length = 1134

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 79/336 (23%), Positives = 152/336 (45%), Gaps = 35/336 (10%)

Query: 1080 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED---VAARGRVLLFSTGR 1136
            +  + +Q  E  +++   T  N    + E    +GTA+V  ED        R+L +  G+
Sbjct: 798  QCALQLQDCEWGMSLISCTFEN----DPEAYYCVGTAFVNLEDKEPTKGNIRILKYFEGK 853

Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE--LNGIAFYDAP 1194
                    + +V+SKE+ GA+  + +  G LL +    + +++WT  +  +   +F++  
Sbjct: 854  --------IQQVHSKEVSGAVYCMVAFNGRLLASVNSTVSVYEWTSNKELVEETSFHNN- 904

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
             +  + L    +FIL+GD+ +SI   +++    ++ L+ K+       A E + D S L 
Sbjct: 905  -VLALYLKTKGDFILIGDLMRSISLCAYRPMNNEIELICKNNDPNWMTAVEIIDDDSYLG 963

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQK-LLSRAEFHVGAHVTKFLRLQMLATSS-DRTGA 1312
                  + +  +F      S S + QK L +   +HVG  V  F +  ++  ++ D   +
Sbjct: 964  -----GENSHNLFTCQKNSSSSEEEQKHLPTVGVYHVGEFVNVFRQGSLVMQNTVDIPDS 1018

Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
              GS       +LFGT+ G++G +  L    F  + ++  KL   V  V  +  + +R F
Sbjct: 1019 VQGS-------ILFGTVSGAVGVVVTLAPAMFEFVSAIANKLSTVVKGVGKIEHQFWRSF 1071

Query: 1373 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1408
             SN +   P   S VD +L+  +  L  E+   +A+
Sbjct: 1072 -SNDRKTEPC-QSFVDGDLVESFLDLSPEDMQRVAN 1105



 Score = 47.4 bits (111), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 49/201 (24%), Positives = 91/201 (45%), Gaps = 31/201 (15%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT-LKQHPLIW 299
           L+   V D  F++G+ EP + +++E         S ++       +S+    + + P  W
Sbjct: 158 LEELQVLDVKFLYGFTEPTIALIYE---------SGQNRYLKTYEISLQNADIHRQP--W 206

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
           +   +  +A+ +L VP P  G++V+GA +I Y+    S    L+    SL       R +
Sbjct: 207 NIGKVEEEAFMILPVPPPSCGMVVIGAGSISYYKGQDS----LHITPASLKD-----RIT 257

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----GRVVQRLDLSKTNPSVLTSDI 415
               +D+    +L  D +      G L +L +V +    G  V+ L L     + + S I
Sbjct: 258 CFGRVDSNGCRYLLGDYS------GRLFMLILVQEHSQSGIKVKDLCLEYLGETSIPSCI 311

Query: 416 TTIGNSLFFLGSRLGDSLLVQ 436
           T + N+  ++GS  GDS L++
Sbjct: 312 TYLDNAFAYIGSSCGDSQLIK 332


>gi|194389106|dbj|BAG61570.1| unnamed protein product [Homo sapiens]
          Length = 1009

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 692  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 745

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 746  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 800

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N  +       +   +
Sbjct: 801  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNSFVCQKDSAATTDEE 857

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 858  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 907

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 908  LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 950


>gi|2632123|emb|CAA05770.1| Xeroderma Pigmentosum Group E Complementing protein [Homo sapiens]
          Length = 1140

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 75/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KG + ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGDVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|307205760|gb|EFN83990.1| DNA damage-binding protein 1 [Harpegnathos saltator]
          Length = 1138

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 91/380 (23%), Positives = 164/380 (43%), Gaps = 62/380 (16%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            E+G +I+ HNL  +D H   T E      L P                 +E AL+     
Sbjct: 779  EIGQEIEVHNLLIIDQH---TFEVLHAHTLMP-----------------TEYALS----- 813

Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
            L +T   E+ T    +GTA +  ++   + GR+LL+  S G+        +T+V  KE+K
Sbjct: 814  LISTRLGEDPTSYFVVGTALINPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 865

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            G+  +L    G LL +    + L +WT  +   L    F +   LY   L    +F+L+G
Sbjct: 866  GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 922

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ +S+  L +K        +A+D+      + E L D + L    ++   N+ +     
Sbjct: 923  DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFVCQKDS 979

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
              +   + Q++    +FH+G  V  F    L +Q L  SS  T             +LFG
Sbjct: 980  AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTLG----------CVLFG 1029

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T+ G+IG +  +    +  L++L+ +L   +  V  +    +R F++  K  +   +  +
Sbjct: 1030 TVSGAIGLVTQIPFAFYEFLRNLEDRLNSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1087

Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
            D +L+  +  L  ++  E+A
Sbjct: 1088 DGDLIESFLDLNHDKMAEVA 1107



 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 50/205 (24%), Positives = 94/205 (45%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYIAVV----------PPIIKQST 257

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +  + +     VV+ L +       + 
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTQVVKDLKVELLGEISIP 311

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + F+GSRLGDS L++ 
Sbjct: 312 ECITYLDNGVIFVGSRLGDSQLIKL 336


>gi|224135035|ref|XP_002321967.1| predicted protein [Populus trichocarpa]
 gi|222868963|gb|EEF06094.1| predicted protein [Populus trichocarpa]
          Length = 60

 Score = 78.6 bits (192), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 36/48 (75%), Positives = 41/48 (85%)

Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
           + DPSTC VSV TP+A +SSKK VS+CTLYHDKGPEP LRKTS +AWL
Sbjct: 1   MTDPSTCMVSVNTPSAFQSSKKSVSACTLYHDKGPEPLLRKTSPNAWL 48


>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
 gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
          Length = 1135

 Score = 78.2 bits (191), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 143/308 (46%), Gaps = 26/308 (8%)

Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            K++E  + +GTA    E     GR+ +FS  +     +  +T V +K++ GA+ ++ +L 
Sbjct: 821  KKDEQFIVVGTAITADEQECKNGRICVFSYSK-----EEKLTLVSTKQVNGAVYSVKALN 875

Query: 1165 GHLLIASGPKIILHKWTGTELNGIAFY--DAP---PLYVVSLNIVKN-FILLGDIHKSIY 1218
            G+ +I +    I  +    E+N       +AP    +  V++++ KN FIL  D+ +SI 
Sbjct: 876  GNKIICA----INQQLKVFEMNEQTTLQSEAPIANHITCVAVDVSKNGFILSADLMRSIS 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
              S+K     L  +A+D+      A + + D + +    ++  +NI I     +  +   
Sbjct: 932  VFSYKPLEGALEEIARDYHPNWMTAIKMIDDDNYIG---AENSENIFICTRNTEAPDEED 988

Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
             Q+LL    +HVG H+   +   ++      +   P    T  F  L G++ G +G +A 
Sbjct: 989  RQQLLPTGYYHVGEHINTIVEGNLVMDVHVESSITP----TRTF--LMGSVSGYVGLLAI 1042

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
              E  ++ L  L+ K+   +  V  ++  S+R+F S+ +         VD +L+  ++ L
Sbjct: 1043 FPEKQWQFLSKLEAKMRKVIRGVGKIDHESWRRFESDSRME--DCKGFVDGDLIEMFQDL 1100

Query: 1399 PLEEQLEI 1406
              E+Q E+
Sbjct: 1101 RPEKQKEV 1108



 Score = 49.3 bits (116), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 130/626 (20%), Positives = 228/626 (36%), Gaps = 173/626 (27%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ ++  +   L+ V  + L+G +  + +        + ++D + +  E     +LE+ D
Sbjct: 43  RIEVNLSTQTGLKPVTEFNLYGRIAVIEVFRY----KNEKKDCLFILTESCYACILEYVD 98

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
              G  IT         +  ++    S ++ G    VDP+ RC  + +Y   + I+  + 
Sbjct: 99  ---GKIITRA-------YGDMRDKNYSVSQSGMHACVDPEARCIALRLYDGVLKIINLNS 148

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
               L   E           RIE   V+           D  F+H   +P + +L++   
Sbjct: 149 SSKHLTSAEQ----------RIEEILVV-----------DMCFLHTANKPTLALLYDDN- 186

Query: 269 TWAGRVSWKH-HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
                 S +H  T  I+  +  +    H   +    +  D   ++AVP P+ G+L++G  
Sbjct: 187 ------SSRHLSTIAITLDNSGSGASIHKGPFRHTQVEQDTILIVAVPEPLAGILLLGHV 240

Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
            I YH        ++ N                 V+      T +     L     G+L 
Sbjct: 241 NITYHDSKNRSTCSIENI----------------VKRTIECVTPIDKHRYLCGDSNGELF 284

Query: 388 LLTVVYDGRVV----QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           LL + Y+   +     RL       + L + ++ I N + F+GS  GDS L++       
Sbjct: 285 LLLLDYNENRIPEERMRLATKYLGRTTLPNTLSYIDNYVVFVGSTFGDSELIR------- 337

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
                        IE                               + NN  S Q   S 
Sbjct: 338 -------------IEV------------------------------SDNN--SGQHFTSL 352

Query: 504 AVRDSLVNIGPLKDFSY--------GLRINADASATGISKQ--------SNYELVELPGC 547
              D+L   GP+KD           G  + A    TG S +          Y  ++L G 
Sbjct: 353 HQYDNL---GPIKDMCIVDFEKQGQGQLVTASGVGTGGSLRIIRNGVGIHEYASIDLEGV 409

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMV--LETADLLTEVTESVDY 605
           KG+W + + SS      S++  +        L++S   +T+   LE  D +TEV E +  
Sbjct: 410 KGLWALKYLSS------STKQDS--------LLLSFVGQTIFLRLEGQD-VTEV-EEIPG 453

Query: 606 FVQG-RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS----ENST 660
           F  G +T+ AGN+   ++ +Q+ E+  R++                ES  GS    EN+ 
Sbjct: 454 FTNGEQTMYAGNV-TDQQFLQITEKQVRLI--------------ADESLKGSWEPEENTQ 498

Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGD 686
           +   S+    VLLG+   +I L + D
Sbjct: 499 INLCSVNKNQVLLGVGSTAIYLEIND 524


>gi|81868411|sp|Q9ESW0.1|DDB1_RAT RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|9843869|emb|CAB89874.2| damage-specific DNA binding protein 1 [Rattus norvegicus]
          Length = 1140

 Score = 78.2 bits (191), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 125/285 (43%), Gaps = 38/285 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
            K+  T   +GTA V  E+   + GR+++F  S G+        +  V  KE+KGA+ ++ 
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSGGK--------LQTVAEKEVKGAVYSMV 874

Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
               G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S
Sbjct: 875  EFNGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  L++K        +A+DF      A E L D + L    ++   N+ +       +  
Sbjct: 930  VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986

Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             + Q L     FH+G  V  F    L +Q L  +S  T            ++L GT++G 
Sbjct: 987  EERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLLGTVNGM 1036

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            IG +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081


>gi|332030156|gb|EGI69950.1| DNA damage-binding protein 1 [Acromyrmex echinatior]
          Length = 1138

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 86/380 (22%), Positives = 164/380 (43%), Gaps = 62/380 (16%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            E+G +I+ HNL  +D H    +  +                     + ++E AL+     
Sbjct: 779  EIGQEIEVHNLLIIDQHTFEVLHAH--------------------TLMATEYALS----- 813

Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
            L +T   E+ T    +GTA++  ++   + GR+LL+  S G+         T+V  KE+K
Sbjct: 814  LISTKLGEDPTSYFVVGTAFINPDETEPKMGRILLYHWSEGK--------FTQVAEKEIK 865

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            G+  +L    G LL +    + L +WT  +   L    F +   LY   L    +F+L+G
Sbjct: 866  GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 922

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ +S+  L +K        +A+D+      + E L D + L    ++   N+ +     
Sbjct: 923  DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFVCQKDS 979

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
              +   + Q++    +FH+G  V  F    L +Q L  SS  T             +LFG
Sbjct: 980  AATSEDERQQMQEIGQFHLGDMVNVFRHGSLVMQNLGESSTPTLG----------CVLFG 1029

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T+ G+IG +  +    +  L++++ +L   +  V  +    +R F++  K  +   +  +
Sbjct: 1030 TVSGAIGLVTQIPVTFYEFLRNMEDRLNSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1087

Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
            D +L+  +  L  ++  E+A
Sbjct: 1088 DGDLIESFLDLNHDKMAEVA 1107



 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    I+   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFAKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +  + +     VV+ L +       + 
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGSQVVKDLKVELLGEISIP 311

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + ++GSRLGDS L++ 
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336


>gi|326919947|ref|XP_003206238.1| PREDICTED: DNA damage-binding protein 1-like [Meleagris gallopavo]
          Length = 1079

 Score = 77.4 bits (189), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
            K+  T   +GTA V  E+   + GR+++F  S G+        +  +  KE+KGA+ ++ 
Sbjct: 762  KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 813

Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
               G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S
Sbjct: 814  EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 868

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  L++K        +A+DF      A E L D + L    ++   N+ +       +  
Sbjct: 869  VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 925

Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             + Q L      H+G  V  F    L +Q L  +S  T  +          +LFGT++G 
Sbjct: 926  EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 975

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            IG +  L E  +  L  +Q +L   +  V  +    +R FH+  K   P     +D +L+
Sbjct: 976  IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKT-EPAT-GFIDGDLI 1033

Query: 1393 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
              +      +M  +   L+I   +G  R   + +L
Sbjct: 1034 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1068


>gi|224050582|ref|XP_002191856.1| PREDICTED: DNA damage-binding protein 1 [Taeniopygia guttata]
          Length = 1140

 Score = 77.4 bits (189), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
            K+  T   +GTA V  E+   + GR+++F  S G+        +  +  KE+KGA+ ++ 
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 874

Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
               G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S
Sbjct: 875  EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  L++K        +A+DF      A E L D + L    ++   N+ +       +  
Sbjct: 930  VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986

Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             + Q L      H+G  V  F    L +Q L  +S  T  +          +LFGT++G 
Sbjct: 987  EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 1036

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            IG +  L E  +  L  +Q +L   +  V  +    +R FH+  K   P     +D +L+
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLI 1094

Query: 1393 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
              +      +M  +   L+I   +G  R   + +L
Sbjct: 1095 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129


>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
          Length = 1140

 Score = 77.4 bits (189), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 74/296 (25%), Positives = 123/296 (41%), Gaps = 26/296 (8%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+      +GTA V  E+   + GR+++F      D     V E   KE+KGA+ ++   
Sbjct: 823  KDPSVYFVVGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             G LL +    + L++WT  +           +  + L    +FIL+GD+ +S+  L++K
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECSHYNNIMALYLKTKGDFILVGDLMRSVLLLAYK 936

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
                    +A+DF      A E L D + L    ++   N+ +       +     Q L 
Sbjct: 937  PMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEDRQHLQ 993

Query: 1284 SRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
                FH+G  V  F    L LQ L  +S  T  +          +LFGT+ G IG +  L
Sbjct: 994  EVGVFHLGEFVNVFCHGSLVLQNLGETSTPTQGS----------VLFGTVTGMIGLVTSL 1043

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
             E     L  LQ +L   +  V  +    +R FH+  K  +      +D +L+  +
Sbjct: 1044 SEGWHSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--AKGFIDGDLIESF 1097



 Score = 47.4 bits (111), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 73/341 (21%), Positives = 127/341 (37%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGD  LV+    S        + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDPQLVKLNVDSNDQGSFVTVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+    ++IQ+     R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 479


>gi|119594342|gb|EAW73936.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_d [Homo
            sapiens]
          Length = 1146

 Score = 77.0 bits (188), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 76/289 (26%), Positives = 128/289 (44%), Gaps = 40/289 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T            ++LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGL------NPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +     +      +P ++R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKRCFLISTCSLTHPSTWRSFHTERK 1087


>gi|45383688|ref|NP_989547.1| DNA damage-binding protein 1 [Gallus gallus]
 gi|82098863|sp|Q805F9.1|DDB1_CHICK RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=UV-damaged DNA-binding factor
 gi|28375613|dbj|BAC56999.1| damaged-DNA binding protein DDB p127 subunit [Gallus gallus]
 gi|53130071|emb|CAG31438.1| hypothetical protein RCJMB04_6h2 [Gallus gallus]
          Length = 1140

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
            K+  T   +GTA V  E+   + GR+++F  S G+        +  +  KE+KGA+ ++ 
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 874

Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
               G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S
Sbjct: 875  EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  L++K        +A+DF      A E L D + L    ++   N+ +       +  
Sbjct: 930  VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986

Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             + Q L      H+G  V  F    L +Q L  +S  T  +          +LFGT++G 
Sbjct: 987  EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 1036

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            IG +  L E  +  L  +Q +L   +  V  +    +R FH+  K   P     +D +L+
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLI 1094

Query: 1393 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
              +      +M  +   L+I   +G  R   + +L
Sbjct: 1095 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129


>gi|260790329|ref|XP_002590195.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
 gi|229275385|gb|EEN46206.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
          Length = 1152

 Score = 77.0 bits (188), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 74/305 (24%), Positives = 129/305 (42%), Gaps = 26/305 (8%)

Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            T   IGTA V  E+   + GR+++F            + +V  KE+KGA+ +L      L
Sbjct: 834  TYFIIGTAMVYPEESEPKSGRIIVFQYTDGK------LQQVAEKEVKGAVYSLVQFNNKL 887

Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
            L +    + L +WT  +   +       +  + L    +FIL+GD+ +S+  L++K    
Sbjct: 888  LASINSTVRLFEWTAEKELRVECNHYNNILALYLKTKGDFILVGDLMRSVTLLAYKPMEG 947

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW---KGQKLLS 1284
                +A+DF      A E L D + L        +N   F+   K S +    + Q L  
Sbjct: 948  CFEEIARDFNPNWMSAVEILDDDNFLG------AENSFNFFTCQKDSAATTDEERQHLQE 1001

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-NRFALLFGTLDGSIGCIAPLDELT 1343
               FH+G  V  F    ++          PG   T  + ++LFGT++G++G +  L    
Sbjct: 1002 VGHFHLGEFVNVFRHGSLVMQH-------PGETSTPTQGSVLFGTVNGAVGLVTQLPADF 1054

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            F  LQ +Q KL   +  V  +    +R F++  K         +D +L+  +  L  ++ 
Sbjct: 1055 FNFLQEVQSKLTRVIKSVGKIEHSFWRSFNTERKTE--ACQGFIDGDLIESFLDLSRDKM 1112

Query: 1404 LEIAH 1408
             E+  
Sbjct: 1113 QEVVQ 1117



 Score = 48.5 bits (114), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 53/228 (23%), Positives = 98/228 (42%), Gaps = 46/228 (20%)

Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
            F+ R+E  +VI+++           F++G   P +V +++             H   + 
Sbjct: 154 AFNIRLEELNVIDVK-----------FLYGCQVPTVVFVYQ-----------DPHGRHVK 191

Query: 285 ALSISTTLKQHPL-IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALN 343
              IS   K+     W   N+  +A  ++AVP P  G L++G  +I YH+     A+A  
Sbjct: 192 TYEISVRDKEFSKGPWKQDNVETEASMVIAVPEPFCGSLIIGQESITYHNGDKYVAVA-- 249

Query: 344 NYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV 397
                      + +S+      +DA  + +L  D++      G L +L +    + DG V
Sbjct: 250 --------PPAIKQSTLICHGRVDANGSRYLLGDMS------GRLFMLLLEKEELIDGSV 295

Query: 398 -VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
            V+ L +     + +   +T + N + +LGSRLGDS L++    +  S
Sbjct: 296 TVKDLKVELLGETSIAECLTYLDNGVVYLGSRLGDSQLIKLNVDADDS 343


>gi|432851195|ref|XP_004066902.1| PREDICTED: DNA damage-binding protein 1-like [Oryzias latipes]
          Length = 1140

 Score = 77.0 bits (188), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 79/301 (26%), Positives = 129/301 (42%), Gaps = 36/301 (11%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+      +GTA V  E+   + GR+++F      D     V E   KE+KGA+ ++   
Sbjct: 823  KDPSVYFIVGTAMVYPEEPEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L LQ L  SS  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGVFHLGEFVNVFCHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  L E     L  LQ +L   +  V  +    +R F++  K  +      +D +L+  
Sbjct: 1039 LVTSLSEGWHSLLLDLQNRLNKVIKSVGKIEHSFWRSFYTERKTEQ--ATGFIDGDLIES 1096

Query: 1395 Y 1395
            +
Sbjct: 1097 F 1097



 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 74/341 (21%), Positives = 130/341 (38%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S        + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVTVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PILDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G  +D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRESDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+     R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479


>gi|402592185|gb|EJW86114.1| CPSF A subunit region family protein [Wuchereria bancrofti]
          Length = 278

 Score = 77.0 bits (188), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 64/250 (25%), Positives = 116/250 (46%), Gaps = 22/250 (8%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIV 1204
            VY KE+KGA  ++ S+ G L++A    + L +WT  +   L    F +   LY+ + N  
Sbjct: 15   VYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKTKN-- 72

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
             + IL+GD+ +S+  LS+K   +    +A+DF +    A E +   + L       + + 
Sbjct: 73   -DLILVGDLMRSLSLLSYKSMESTFEKVARDFMTNWMSACEIIDSDNFLG-----AENSY 126

Query: 1265 QIFYYAPKMSESWK--GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
             +F         +K  G +L     F++G  V  F    + AT  D    AP        
Sbjct: 127  NLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP----LYHS 179

Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
            ++L+GT DG IG I  +  + +  LQ +QK+L +   +   ++   +R F +  ++    
Sbjct: 180  SILYGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEKRSE--A 237

Query: 1383 PDSIVDCELL 1392
            P+  +D +L+
Sbjct: 238  PNGFIDGDLI 247


>gi|357623954|gb|EHJ74904.1| putative DNA repair protein xp-e [Danaus plexippus]
          Length = 1128

 Score = 76.6 bits (187), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 78/299 (26%), Positives = 136/299 (45%), Gaps = 27/299 (9%)

Query: 1112 AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
            A+GTA +  E+   + GR+LLF            +T+V  KE+KG    L    G LL +
Sbjct: 819  AVGTAILNPEESEPKQGRILLFHWCEGK------LTQVAEKEIKGGCYTLVEFNGKLLAS 872

Query: 1171 SGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
                + L +WT  +   L    F +   LY   L +  +FIL+GD+ +S+  L +K+   
Sbjct: 873  INSTVRLFEWTSEKELRLECSHFNNIVALY---LKVKGDFILVGDLMRSMSLLQYKQMEG 929

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
                +A+D+      A E L D + L    ++   N+ +       +   + Q++    +
Sbjct: 930  SFEEIARDYSPNWMTAVEILDDDTFLG---AENSFNLFVCQKDSAATTDEERQQMGYMGQ 986

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
            FHVG  V    R  ++A  +D   AAP +       +L  T+ G+I  +  L +  F  L
Sbjct: 987  FHVGDMVNVMRRGALVAQLADT--AAPVAR-----PVLLATVSGAICLVVQLSQELFDFL 1039

Query: 1348 QSLQKKLVDSVPHVAGLNPRSF-RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
              L+++L  ++  V  + P SF R F+++ K   P  +  +D +L+  +  L  + Q E
Sbjct: 1040 HQLEERLTHTIKSVGKI-PHSFWRSFNTDIKTE-PA-EGFIDGDLIESFLDLSRDMQQE 1095



 Score = 63.5 bits (153), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 102/460 (22%), Positives = 176/460 (38%), Gaps = 107/460 (23%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G L  +DPQ R  G+ +Y     I+   +  + L             S R+E    +N+ 
Sbjct: 120 GILAVIDPQARVIGLRLYDGLFKIIPLDKDSTEL----------KAASLRLEE---LNVY 166

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
           DL+        F+HG   P ++++H+        ++ +H    I    I+   K+   I 
Sbjct: 167 DLE--------FLHGCSNPTLILIHQD-------LNGRH----IKTHEINLRDKEFMKIP 207

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSP+GG +V+G  +I YH   +  A+A       ++        
Sbjct: 208 WKQDNVETEASILIPVPSPLGGAIVIGQESIVYHDGQSYVAVAPPQIKTPINC------- 260

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----VVQRLDLSKTNPSVLTSD 414
                +D     +L  D+A      G L +L +    R     V+ L +       +   
Sbjct: 261 --YCRVDVRGLRYLLGDIA------GRLFMLLLELSERDGTASVRDLKVELLGDIPIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           +T + N + F+GSRLGDS LV+                    +  DA    +   + + +
Sbjct: 313 MTYLDNGVVFVGSRLGDSALVRLAA-----------------VRDDASQYVQPMETFT-S 354

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L     N   +    F          +G L+    G+ I   AS     
Sbjct: 355 LAPIVDMCVVDLERQGQNQLITCSGAF---------KMGSLRIIRNGIGIQEQAS----- 400

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +    + G              +H  L++S   +T VL    
Sbjct: 401 -------IDLPGIKGMWAL----TLGQGP-----------HHDTLVLSFVGQTRVLTLNG 438

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
              E TE   +    +T   GN+    ++IQV + G R++
Sbjct: 439 EEVEETEIKGFVSDRQTFFTGNVC-HDQLIQVTDEGIRLI 477


>gi|431910407|gb|ELK13480.1| DNA damage-binding protein 1 [Pteropus alecto]
          Length = 1143

 Score = 76.6 bits (187), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 37/286 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T            ++LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF---RQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +   R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSQRSFHTERK 1084


>gi|440893607|gb|ELR46310.1| DNA damage-binding protein 1 [Bos grunniens mutus]
          Length = 1143

 Score = 76.6 bits (187), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 37/286 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T            ++LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF---RQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +   R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSQRSFHTERK 1084


>gi|387593561|gb|EIJ88585.1| hypothetical protein NEQG_01275 [Nematocida parisii ERTm3]
 gi|387597215|gb|EIJ94835.1| hypothetical protein NEPG_00359 [Nematocida parisii ERTm1]
          Length = 1261

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/251 (23%), Positives = 126/251 (50%), Gaps = 26/251 (10%)

Query: 1054 LHRTYTVEEY---EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ET 1109
            L R Y+++ Y   E+++    R  G         ++++E     ++VTL +    E    
Sbjct: 894  LTRAYSLKIYSHEEMKVC--SRTEGVLMAVDEYRLENNEYIAYHKIVTLPDKQNTEGFSE 951

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQ 1164
             + + T Y+  ED+ ARGR+++         R+    ++ +  + +++ KGA +    ++
Sbjct: 952  FVIVCTTYITDEDLMARGRLIVLEIASVVPQRDRIETRHKLKALAAEKTKGATTCCDIVK 1011

Query: 1165 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
            G++++  G K++++ +   E L  +AF+D   +++ S  +++N I+ GD +K  + L ++
Sbjct: 1012 GNIVVCVGTKLMIYMFDRNEGLRAVAFHDIH-VFLTSCMVMRNIIVCGDAYKGTFLLFYQ 1070

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDG-------STLSLVVSDEQKNIQIFYYAPKMSES 1276
                 L++L++  G +      +L+ G       + LSL+  D  K + I+ Y+P+   S
Sbjct: 1071 SDPPLLHMLSQSSGGV------YLLKGIGMTLHDTALSLISYDSLKTVCIYTYSPQHILS 1124

Query: 1277 WKGQKLLSRAE 1287
              G +L+SR E
Sbjct: 1125 QDGSRLISRGE 1135


>gi|390366809|ref|XP_780126.3| PREDICTED: DNA damage-binding protein 1-like isoform 1
            [Strongylocentrotus purpuratus]
          Length = 630

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/303 (24%), Positives = 132/303 (43%), Gaps = 30/303 (9%)

Query: 1113 IGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
            +G A V  ++   + GR+++F  S G+        + E+  KE+KGA  +L    G LL 
Sbjct: 322  VGLANVHPDEAEPKSGRIVVFQYSDGK--------LQEIAEKEIKGAPYSLVEFNGKLLA 373

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
            +    + L +WT      +       +  + L    +FI++GD+ +SI  L++K     L
Sbjct: 374  SVNSVVRLFEWTPEHSLRVECSHYNNVLALYLKTKGDFIVVGDLMRSITLLAYKPMEGCL 433

Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
              +A+D+      A E L D + L    ++   N+         +   + + L     FH
Sbjct: 434  EEIARDYSPNWMSAVEILDDDTFLG---AENSSNLFTCQKDSAATTDEERRHLQEVGLFH 490

Query: 1290 VGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1345
            +G  V  F    L +Q +  S+  T  +          +LFGT+ GS+G +  L+E  +R
Sbjct: 491  LGEFVNVFRHGSLVMQNIGESTIPTTGS----------VLFGTVSGSVGLVTQLNEEFYR 540

Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
             L  +Q KL   +  V  +    +R F+S  K      D+ +D +LL  +  L  +   E
Sbjct: 541  FLLEVQNKLTKVIKSVGKIKHSFWRSFYSERKTE--PMDNFIDGDLLESFLDLSRDTMDE 598

Query: 1406 IAH 1408
            +A 
Sbjct: 599  VAQ 601


>gi|281345356|gb|EFB20940.1| hypothetical protein PANDA_015888 [Ailuropoda melanoleuca]
          Length = 1124

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 128/285 (44%), Gaps = 36/285 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 805  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 858

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 859  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 913

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 914  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 970

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T            ++LFGT++G IG
Sbjct: 971  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1020

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAG--LNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   + ++     +  ++R FH+  K
Sbjct: 1021 LVTSLSESWYNLLLDMQNRLNKVIKNITHSLTHLSTWRSFHTERK 1065


>gi|167384458|ref|XP_001736962.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165900458|gb|EDR26769.1| hypothetical protein EDI_171140 [Entamoeba dispar SAW760]
          Length = 836

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 83/326 (25%), Positives = 141/326 (43%), Gaps = 50/326 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+S+L +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSILRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTNPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++                    GF    ++S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIPL------------------GFD---KTSYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL---KQHPLIWSAMNL 304
            I + GY  P +  LH +  TW GR V+    T  I  LS+   +   KQ  +   +   
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIVLSLEPDVIHEKQDIVATVSYQF 228

Query: 305 PHDAYKLLAVPS--PIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF 360
            +  Y  L +    P  G+L++  N+I Y S ++  S  L    + V +  +   P SSF
Sbjct: 229 SYVPYNALQIVDCYPTNGILILTINSIIYLSTTSFESFILPFGKFFV-IPKNNNRPLSSF 287

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSVL 411
            +       T + N V  +   T  L ++         V+   +  R+ D+  TN P   
Sbjct: 288 QI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYKY 344

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
                TI ++  F+GS + DS +  +
Sbjct: 345 HPTTFTISSNHLFIGSTVHDSYIYNY 370



 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 69/311 (22%), Positives = 136/311 (43%), Gaps = 36/311 (11%)

Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            K+ +  L +G      ED   +G+  +F    N +N   L+ ++   + K ++ A+  + 
Sbjct: 519  KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 572

Query: 1165 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1212
            G L +ASG ++ L       +W     + I+      +  PL V+     K    ILL D
Sbjct: 573  GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMEKGNEKECYLILLSD 632

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
             ++S+  L +K     +  L KD  ++ C  + F+I     S++  D ++N+ +  Y+  
Sbjct: 633  FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 692

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             +E     ++   A F++G ++ KF RL          G          +  ++ T++GS
Sbjct: 693  ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 734

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +G I+ ++E  ++ L+ +  K+     H AG N   +R     G          +D ++L
Sbjct: 735  VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 794

Query: 1393 SHYEMLPLEEQ 1403
              + +L  E+Q
Sbjct: 795  KQFRLLNEEQQ 805


>gi|350410909|ref|XP_003489174.1| PREDICTED: DNA damage-binding protein 1-like [Bombus impatiens]
          Length = 1141

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 159/368 (43%), Gaps = 62/368 (16%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            ++  +I+ HNL  +D H   T E     +L P                 +E AL+     
Sbjct: 782  DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816

Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
            L +T   E+ T    +GTA V  ++   + GR+LL+  S G+        +T+V  KE+K
Sbjct: 817  LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 868

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            G+  +L    G LL +    + L +WT  +   L    F +   LY   L    +FIL+G
Sbjct: 869  GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFILVG 925

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ +S+  L +K        +A+D+      A E L D + L    ++   N+ +     
Sbjct: 926  DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
              +   + Q++    +FH+G  V  F    L +Q L  SS  T             +LFG
Sbjct: 983  AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1032

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T+ G+IG +  +    +  L++L+++L   +  V  +    +R F++  K  +   +  +
Sbjct: 1033 TVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090

Query: 1388 DCELLSHY 1395
            D +L+  +
Sbjct: 1091 DGDLIESF 1098



 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 90/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501


>gi|166158025|ref|NP_001107422.1| damage-specific DNA binding protein 1, 127kDa [Xenopus (Silurana)
            tropicalis]
 gi|157422734|gb|AAI53474.1| Zgc:63840 protein [Danio rerio]
 gi|163916541|gb|AAI57552.1| LOC100135265 protein [Xenopus (Silurana) tropicalis]
          Length = 306

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 74/279 (26%), Positives = 121/279 (43%), Gaps = 35/279 (12%)

Query: 1126 RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG--- 1182
            +GR+++F      D     V E   KE+KGA+ ++    G LL +    + L++WT    
Sbjct: 11   QGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKE 64

Query: 1183 --TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
              TE N   + +   LY   L    +FIL+GD+ +S+  L++K        +A+DF    
Sbjct: 65   LRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNPNW 119

Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF--- 1297
              A E L D + L    ++   N+ +       +   + Q L     FH+G  V  F   
Sbjct: 120  MSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFSHG 176

Query: 1298 -LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
             L LQ L  SS  T  +          +LFGT++G IG +  L E  +  L  LQ +L  
Sbjct: 177  SLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNK 226

Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
             +  V  +    +R FH+  K  +      +D +L+  +
Sbjct: 227  VIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 263


>gi|119580419|gb|EAW60015.1| hCG2010549, isoform CRA_a [Homo sapiens]
          Length = 323

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 46/73 (63%)

Query: 943  LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKA 1002
            LR+HP   +G + +F + HNVNC  GF+Y   QG L+I  LP+  +YD+ WPV+KIPL  
Sbjct: 184  LRLHPVGINGPVNSFALFHNVNCPRGFLYFNRQGKLRISVLPAYLSYDSPWPVRKIPLCC 243

Query: 1003 TPHQITYFAEKNL 1015
            T H + Y  E  +
Sbjct: 244  TVHCVAYHVESKI 256


>gi|297740793|emb|CBI30975.3| unnamed protein product [Vitis vinifera]
          Length = 1043

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 81/283 (28%), Positives = 128/283 (45%), Gaps = 33/283 (11%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNAD 1139
            +T P+ + E   ++    L  + + ++     +GTAYV   E+   +GR+L+F      D
Sbjct: 759  STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 811

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP 1194
                L+ E   KE KGA+ +L +  G LL A   KI L+KW     GT EL   + +   
Sbjct: 812  GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGH 868

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
             L  + +    +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D   L 
Sbjct: 869  IL-ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 927

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1311
                  + N  IF    K SE    +   +L    E+H+G  V +F    ++        
Sbjct: 928  -----AENNFNIF-TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------ 975

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
              P SD      ++FGT++G IG IA L    +  L+ LQ  L
Sbjct: 976  -LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEKLQANL 1017



 Score = 56.6 bits (135), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  W+  NL + A  L+ VP P+ GVL++G  TI Y S SA  A+ +   
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL    G L LL + ++   V  L +  
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                    ++ DA    
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421

Query: 584 EARTMVL 590
            + T +L
Sbjct: 422 ISETRIL 428


>gi|340714589|ref|XP_003395809.1| PREDICTED: DNA damage-binding protein 1-like [Bombus terrestris]
          Length = 1141

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 159/368 (43%), Gaps = 62/368 (16%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            ++  +I+ HNL  +D H   T E     +L P                 +E AL+     
Sbjct: 782  DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816

Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
            L +T   E+ T    +GTA V  ++   + GR+LL+  S G+        +T+V  KE+K
Sbjct: 817  LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 868

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            G+  +L    G LL +    + L +WT  +   L    F +   LY   L    +FIL+G
Sbjct: 869  GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFILVG 925

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ +S+  L +K        +A+D+      A E L D + L    ++   N+ +     
Sbjct: 926  DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
              +   + Q++    +FH+G  V  F    L +Q L  SS  T             +LFG
Sbjct: 983  AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1032

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T+ G+IG +  +    +  L++L+++L   +  V  +    +R F++  K  +   +  +
Sbjct: 1033 TVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090

Query: 1388 DCELLSHY 1395
            D +L+  +
Sbjct: 1091 DGDLIESF 1098



 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 88/420 (20%), Positives = 160/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSR GDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRFGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +++   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIIDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501


>gi|119594339|gb|EAW73933.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_a [Homo
            sapiens]
          Length = 1094

 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 79/296 (26%), Positives = 132/296 (44%), Gaps = 38/296 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T            ++LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKL---VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
             +  L E  +  L  +Q +L   + SV  +   +P     F  +G+ ++P   S V
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSHPPG-DPFTPSGRQNQPQVSSTV 1093


>gi|297267724|ref|XP_001082958.2| PREDICTED: DNA damage-binding protein 1 [Macaca mulatta]
          Length = 1092

 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 125/286 (43%), Gaps = 38/286 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T            ++LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
             +  L E  +  L  +Q +L   +  V  +       FH    +HR
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIE----HSFHLEILSHR 1080


>gi|221040048|dbj|BAH11787.1| unnamed protein product [Homo sapiens]
          Length = 1092

 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 125/286 (43%), Gaps = 38/286 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T            ++LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
             +  L E  +  L  +Q +L   +  V  +       FH    +HR
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIE----HSFHLEILSHR 1080


>gi|380025901|ref|XP_003696702.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
            [Apis florea]
          Length = 1141

 Score = 75.1 bits (183), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 91/380 (23%), Positives = 164/380 (43%), Gaps = 62/380 (16%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            ++  +I+ HNL  +D H   T E     +L P                 +E AL+     
Sbjct: 782  DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816

Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
            L +T   E+ T    +GTA V  ++   + GR+LL+  S G+        +T+V  KE K
Sbjct: 817  LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEXK 868

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            G+  +L    G LL +    + L +WT  +   L    F +   LY+ S     +FIL+G
Sbjct: 869  GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKSKG---DFILVG 925

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ +S+  L +K        +A+D+      A E L D + L    ++   N+ +     
Sbjct: 926  DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
              +   + Q++    +FH+G  V  F    L +Q L  SS  T             +L G
Sbjct: 983  AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLXG 1032

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            T+ G+IG +  +  + +  L++L+ +L   +  V  +    +R F++  K  +   +  +
Sbjct: 1033 TVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090

Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
            D +L+  +  L  ++  E+A
Sbjct: 1091 DGDLIESFLDLSPDKMAEVA 1110



 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKTDGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501


>gi|440302955|gb|ELP95261.1| hypothetical protein EIN_430670 [Entamoeba invadens IP1]
          Length = 1175

 Score = 74.7 bits (182), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 96/469 (20%), Positives = 191/469 (40%), Gaps = 60/469 (12%)

Query: 983  LPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGH 1042
            LP   T++   P+Q+I +  TPH I       +Y +  +  +  P           ++ H
Sbjct: 743  LPKFETFNYGIPIQEIIVGGTPHNIISSPIGLVYTISTTTTLESP-------DCPPQLPH 795

Query: 1043 QIDNHN-------LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1095
                 N       L++  L     +E Y++   + +++        T       N+L  R
Sbjct: 796  SSTQQNALEKPPELTATPLRAELFLEHYKLVYFDQNQS-------FTFDKGMFVNSL--R 846

Query: 1096 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
            V+ L  T       L+  G      ED   +G V LFS    ++     ++ V     K 
Sbjct: 847  VLEL--TINSVKRVLVGCGVNTQTTEDDPVKGNVFLFSLESTSEGTIRHISTVCDG--KK 902

Query: 1156 AISALASLQGHLLIASGPKIILHK------WTGTELNGIAFYDAPPLYVVSLNIVKN--- 1206
            A+ A+ S+ G+L +A G ++ + K      W     + I+      +  + + + KN   
Sbjct: 903  AVHAINSIGGYLAVAEGNELQILKGKTESLWVKKCFSDISIL-INTITFLPMTLSKNKVD 961

Query: 1207 ----FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
                 ILL D+++S+  L ++ Q   +  L KD   +      F++D     ++  D ++
Sbjct: 962  EMCYLILLNDMYRSVILLLFQPQKKSVIPLGKDGRDIHAIDAAFVLDKDYFHVLEIDYER 1021

Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
            N+ +  Y    +E+ +       A F+VG  + +  RL++                 N +
Sbjct: 1022 NLSVMNYL--RTETERISIFEVAATFNVGVDILRLTRLRL----------------GNGY 1063

Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
              ++ +  GS+G +  ++E +++ L+ +  K+     H AG NP  FR     G  +   
Sbjct: 1064 VFVYLSAQGSVGYLTVVNERSYQTLRQINAKMNREPWHFAGTNPEEFRMEKGYGVGYGRR 1123

Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
               I+D ++L  +  L  E+Q  +  +  T+ S +++ L++    +S L
Sbjct: 1124 KQVILDGDILKEFHFLTQEQQKRVCLR-NTSISDVVNILDNALQRSSLL 1171



 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 82/363 (22%), Positives = 153/363 (42%), Gaps = 48/363 (13%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCG 192
           +IL F+ A++SV+ ++   +   + S+HCFE PE    ++   +    P + +D +GRC 
Sbjct: 74  LILLFKQARLSVMRYNTETNRFVVHSLHCFEYPELRIREKCTPTAYDDPRMFIDKKGRCI 133

Query: 193 GVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFV 252
            +L Y   + ++                GS         SS+ ++L    +  + D I +
Sbjct: 134 SLLCYDRLLWVIP--------------LGSN-------RSSYRVDLEKFGVSRIVDVISL 172

Query: 253 HGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL--KQHPLIWSAMN----LP 305
            GY  P +  LH    TW  R V+    T  I+ ++++  +  ++     + +N    LP
Sbjct: 173 SGYETPTLAFLHMTVPTWDARTVNTGEATNEIAIINVNPGVVGEEEQECANVVNRISRLP 232

Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE----------L 355
           ++  K++    P+ G+L++ + ++ Y S ++S +  L  +    +  +           L
Sbjct: 233 YNTLKMVEC-YPLPGILLLASVSVLYISTTSSESFIL-PFGTYFNPPEVWKGVVPFLKLL 290

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
           P     ++L  +     QN + L  T  GD   + +     +VQ + LS   P     + 
Sbjct: 291 PMKIRIIQLVKSIHQLSQN-LYLTFTDKGDSYYIHLNCVEGIVQEIVLSNA-PYKFIPNT 348

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
            ++ +   FLGS   DS L  +T C  G      G  + FG    DA   K L+  S   
Sbjct: 349 VSLYDDYIFLGSVFHDSYLFNYTICEYG-----KGDIKPFGIHCGDAVRIKNLQERSGQM 403

Query: 475 LQD 477
            +D
Sbjct: 404 EED 406


>gi|324501533|gb|ADY40680.1| Splicing factor 3B subunit 3 [Ascaris suum]
          Length = 1214

 Score = 74.7 bits (182), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 69/290 (23%), Positives = 129/290 (44%), Gaps = 37/290 (12%)

Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
            ++A+   +G  L+  G KI L+     +L         P+ VV +  +   I++ D  +S
Sbjct: 946  VNAVHDFRGMALVGVGKKIRLYDLGKKKLLAKCENKQLPVQVVDIRSMGQRIVVSDSQES 1005

Query: 1217 IYFLSWKEQGAQLNLLAKD----FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            ++F+ +K+Q  QL++   D    F +  C     ++D  T++  V D   +I +      
Sbjct: 1006 LHFMRYKKQDNQLSIFCDDTSPRFVTCIC-----ILDYDTVA--VGDRFGSIAVLRLPKG 1058

Query: 1273 MSESWKGQKLLSRAEFHVG---------AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
            ++E  +      RA +  G          H+ +F  +    TS  +T   PG++      
Sbjct: 1059 VTEEVQEDPTGVRALWDRGNLNGASQKVEHIGQFY-VGDTVTSMQKTSLVPGAND----C 1113

Query: 1324 LLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
            L++ T+ G+IG + P    DE  F   Q+L+  L    P + G +  +FR F+   K   
Sbjct: 1114 LVYTTISGTIGMLVPFVSRDEFDF--FQNLEMHLRVEFPPLCGRDHLAFRSFYFPVKC-- 1169

Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
                 I+D +L   Y ++PL++Q  +A + G   ++I   L D+    +F
Sbjct: 1170 -----IIDGDLCEQYALMPLDKQKAVAEELGRKPAEIHKKLEDIRTRYAF 1214


>gi|378755148|gb|EHY65175.1| hypothetical protein NERG_01621 [Nematocida sp. 1 ERTm2]
          Length = 822

 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 53/212 (25%), Positives = 114/212 (53%), Gaps = 9/212 (4%)

Query: 1084 PMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
            P++ +E     ++VTL +    E  +  + + T Y+  ED+ ARGR+++         R+
Sbjct: 486  PLEDNEYIAHHKIVTLPDKQNTEGVSEFVIVCTTYITDEDLMARGRLIVLEIASVVPQRD 545

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPL 1196
                ++ +  + +++ KGA +    ++G++++  G K++++ +   E L  +AF+D   +
Sbjct: 546  RIETRHKLKALAAEKTKGATTCCDIVKGNIVVCVGTKLMIYMFDRNEGLRAVAFHDIH-V 604

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF-ATEFLIDGSTLSL 1255
            ++ S  +++N I+ GD +K  + L ++ + + L+LL++  G +         + GS LSL
Sbjct: 605  FLTSCMVMRNIIVCGDAYKGTFLLFYQSEPSLLHLLSQSSGGVYLLKGIGMTLYGSVLSL 664

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
            +  D  K + I+ Y+P+   S  G +L+SR E
Sbjct: 665  LSYDSAKTVCIYSYSPQHILSQGGTRLISRGE 696


>gi|242010743|ref|XP_002426118.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
 gi|212510165|gb|EEB13380.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
          Length = 1148

 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 95/398 (23%), Positives = 166/398 (41%), Gaps = 64/398 (16%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            E G +++ HNL  +D H    +  ++                    MQS E AL++    
Sbjct: 789  EFGQEVEVHNLLVIDQHTFEVLHAHQF-------------------MQS-EYALSLISTK 828

Query: 1099 LFNTTTKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAI 1157
            L +    +  T   +GTA V   E  + +GR+L+F         +  + +V  KE+KGA 
Sbjct: 829  LGD----DPNTYYIVGTAMVNPDESESKQGRILIFQF------QEGKLYQVAEKEIKGAA 878

Query: 1158 SALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
             +L    G LL +    + L +WT  +   L    F +   LY   L    +FIL+GD+ 
Sbjct: 879  YSLVEFNGKLLASINSTVRLFEWTAEQELRLECSHFNNIISLY---LKTKGDFILVGDLI 935

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +S+  L +K        +A+D       A E + D + L    ++   N+ +       +
Sbjct: 936  RSMTLLQYKTMEGCFEEMARDHNPNWMTAVEIIDDDTFLG---AENSFNLFVCQKDSAAA 992

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKF----LRLQMLA-TSSDRTGAAPGSDKTNRFALLFGTL 1329
               + Q++ +   FH+G  V  F    L +Q +  TS+  TG            +LFGT+
Sbjct: 993  TDEERQQMHAVGMFHLGDMVNVFRHGSLVMQNVGETSTPTTGC-----------ILFGTV 1041

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
             G+IG +  +    +  L  L+ KL + +  V  +    +R F +  K      D  +D 
Sbjct: 1042 SGAIGLVTQISANFYNFLHELECKLTEVIKSVGKIKHSFWRSFTTEIKTEP--CDGFIDG 1099

Query: 1390 EL------LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +L      LSH +M  +   L+I + +G  +   + +L
Sbjct: 1100 DLIESFLDLSHEKMKEVAAGLQIDNGSGMKQEATVDDL 1137



 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 60/276 (21%), Positives = 116/276 (42%), Gaps = 61/276 (22%)

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
           G++S   G +  +DP+ R  G+ +Y   + I+   +  S L             S R+E 
Sbjct: 113 GKQS-ETGIIAVIDPEARVIGLRLYDGLLKIIPLGKDNSELKAS----------SIRMEE 161

Query: 233 SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
             V           +D  F+HG   P ++++H+        ++ +H    +    IS   
Sbjct: 162 VEV-----------QDLNFLHGCQNPTIILIHQD-------INGRH----VKTHEISLRD 199

Query: 293 KQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---LNNYAVS 348
           K+   + W   N+  DA  ++ VP P+ G +++G  +I YH+ +   A+A   +N   ++
Sbjct: 200 KEFVKMPWKQDNVEPDASIVIPVPEPLCGAIIIGQESILYHNGAGYVAVAPPVINQSTIT 259

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-------VQRL 401
             +           ++D+  + +L  D+A      G L +L +  + ++          L
Sbjct: 260 CYT-----------QVDSNGSRYLLGDMA------GHLFMLLLETEEKIDGTPCVKENGL 302

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
            +       +   IT + N + F+GSR GDS LV+ 
Sbjct: 303 KVELLGEISIPEAITYLDNGVLFIGSRCGDSQLVKL 338


>gi|297820284|ref|XP_002878025.1| hypothetical protein ARALYDRAFT_906938 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297323863|gb|EFH54284.1| hypothetical protein ARALYDRAFT_906938 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 454

 Score = 73.6 bits (179), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 88/381 (23%), Positives = 161/381 (42%), Gaps = 57/381 (14%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RIL+P  A     T   + +Q +E A +V  V   N   KE  TLLA+GT  V+G    
Sbjct: 103  IRILDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 153

Query: 1125 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
             +  ++       R  +  ++L   ++  +++G   AL   QG LL   GP + L+    
Sbjct: 154  PKKNLVAGFIHIYRFVEEGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 212

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD------- 1235
              L         P  ++S+   ++ I +GDI +S ++  ++    QL + A D       
Sbjct: 213  KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 272

Query: 1236 ---------FGSLDCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
                         D F   + +     LS  + ++    +I +   K++ +    K+   
Sbjct: 273  ASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGA--PNKVDEI 330

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1342
             +FHVG  VT   +  M+   S+              ++++GT+ GSIG +      D++
Sbjct: 331  VQFHVGDVVTCLQKASMIPGGSE--------------SIMYGTVMGSIGALHAFTSRDDV 376

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
             F     L+  +    P + G +  ++R       A+ P  D ++D +L   +  LP++ 
Sbjct: 377  DF--FSHLEMHMRQEYPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 427

Query: 1403 QLEIAHQTGTTRSQILSNLND 1423
            Q +IA +   T ++IL  L D
Sbjct: 428  QRKIADELDRTPAEILKKLED 448


>gi|255588145|ref|XP_002534515.1| spliceosomal protein sap, putative [Ricinus communis]
 gi|223525135|gb|EEF27867.1| spliceosomal protein sap, putative [Ricinus communis]
          Length = 1214

 Score = 73.6 bits (179), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 93/387 (24%), Positives = 168/387 (43%), Gaps = 67/387 (17%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA----YVQG 1120
            +R+L+P  A     T   + +Q +E A +V  V   N   KE+ TLLA+GTA    +   
Sbjct: 863  IRVLDPRTAA----TTCLLELQDNEAAFSVCTV---NFHDKEHGTLLAVGTAKGLQFWPK 915

Query: 1121 EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
              ++A G + ++   +  D+ + L   ++  +++G   AL+  QG LL   GP + L+  
Sbjct: 916  RSLSA-GFIHIY---KFVDDGRALEL-LHKTQVEGVPLALSQFQGRLLAGIGPVLRLYDL 970

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----- 1235
                L         P  +VS+   ++ I +GDI +S +F  ++    QL + A D     
Sbjct: 971  GKKRLLRKCENKLFPNSIVSIQTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDCVPRW 1030

Query: 1236 -----------FGSLDCFATEFLIDGSTLSLVVSDEQKNI----QIFYYAPKMSESWKGQ 1280
                           D F   + +    L   VSDE +      +I +   K++ +    
Sbjct: 1031 LTASHHVDFDTMAGADKFGNIYFV---RLPQDVSDEIEEDPTGGKIKWEQGKLNGA--PN 1085

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL- 1339
            K+    +FH+G  VT   +  ++          PG  +     +++GT+ GS+G + P  
Sbjct: 1086 KVEEIVQFHIGDVVTSLSKASLI----------PGGGE----CIIYGTVMGSVGALLPFT 1131

Query: 1340 --DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
              D++ F     L+  L    P + G +  ++R       A+ P  D ++D +L   +  
Sbjct: 1132 SRDDVDF--FSHLEMHLRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPT 1182

Query: 1398 LPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            LPL+ Q +IA +   T  +IL  L ++
Sbjct: 1183 LPLDAQRKIADELDRTPGEILKKLEEV 1209


>gi|443922899|gb|ELU42250.1| splicing factor 3B subunit 3 [Rhizoctonia solani AG-1 IA]
          Length = 1212

 Score = 73.6 bits (179), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 117/486 (24%), Positives = 212/486 (43%), Gaps = 49/486 (10%)

Query: 965  CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPV 1024
            C  G I +    +L+I Q+P  S  D    V  +PL  TP +I    E  L+ +I S   
Sbjct: 753  CPEGLIGIAGS-VLRIFQIPKLS--DKLKQV-TMPLSYTPRKIAVHPEHQLFYVIESDHR 808

Query: 1025 L---KPLNQVLSLLIDQEVGHQIDNH--NLSSVD--LHRTYTVEEYE-VRILEPDRAGGP 1076
                +  N+ L+ L+    G QID    +L + D  L R    +    +RI++P      
Sbjct: 809  TWGSEAKNKRLAELV--RAGRQIDQELVDLPAEDFGLPRAGAGQWASCIRIIDPTEVFCS 866

Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
              T   I + ++E+A +V VV       +ENE  L +GTA  +  +V  R  V   ++G 
Sbjct: 867  -ATLFKIELDNNESAFSVAVVPF---AARENELFLVVGTA--KDTNVLPRQCVGAVTSGS 920

Query: 1137 --NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
                    +++T     E+     AL  ++G L    G  + +++    +L   +     
Sbjct: 921  LVKLGWSTHILTRPIQTEVDDVPLALLGIKGRLCAGVGKALRIYEMGKKKLLRKSENKGF 980

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
               +V+L    + I++G++ +S+++ ++K +  +L + A D  S     +  L+D  T++
Sbjct: 981  ATAIVTLTSQGSRIIVGEMQESVHYATYKPESNRLLVFADD-TSARWVTSAALVDYDTVA 1039

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEF-HVGAHVTKFL---RLQMLATS 1306
              V D+  NI +      +S+       G  ++   EF H   H TK L    +  + TS
Sbjct: 1040 --VGDKFGNIFVNRLPANISQQVDDDPTGAGIMHEREFLHGAPHKTKLLAHYNVGDIVTS 1097

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              R    PG     R  + +  L G+IG + PL   +++ F  + +L++ +      + G
Sbjct: 1098 VHRAALVPG----GRDVVAYTGLHGTIGVLIPLASKEDVDF--ITTLEQHMRSEHSSLVG 1151

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
             +  ++R ++   KA       +VD +L   + MLP  +Q  IA +   T  ++L  L  
Sbjct: 1152 RDHLAYRGYYVPVKA-------VVDGDLCERFAMLPSTKQKSIAGELDRTVGEVLKKLEG 1204

Query: 1424 LALGTS 1429
            L +  S
Sbjct: 1205 LRVAGS 1210


>gi|18410222|ref|NP_567015.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
 gi|18410226|ref|NP_567016.1| putative splicing factor [Arabidopsis thaliana]
 gi|7019653|emb|CAB75754.1| spliceosomal-like protein [Arabidopsis thaliana]
 gi|7019655|emb|CAB75756.1| spliceosomal-like protein [Arabidopsis thaliana]
 gi|332645831|gb|AEE79352.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
 gi|332645833|gb|AEE79354.1| putative splicing factor [Arabidopsis thaliana]
          Length = 1214

 Score = 73.2 bits (178), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 90/382 (23%), Positives = 163/382 (42%), Gaps = 59/382 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +R+L+P  A     T   + +Q +E A +V  V   N   KE  TLLA+GT  V+G    
Sbjct: 863  IRVLDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 913

Query: 1125 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
             +  ++       R  ++ ++L   ++  +++G   AL   QG LL   GP + L+    
Sbjct: 914  PKKNLVAGFIHIYRFVEDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 972

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
              L         P  ++S+   ++ I +GDI +S ++  ++    QL + A D       
Sbjct: 973  KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 1032

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR------ 1285
            A+   +D  T++   +D+  N+        +SE            W+  KL         
Sbjct: 1033 ASHH-VDFDTMA--GADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGAPNKVDE 1089

Query: 1286 -AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1341
              +FHVG  VT   +  M+   S+              ++++GT+ GSIG +      D+
Sbjct: 1090 IVQFHVGDVVTCLQKASMIPGGSE--------------SIMYGTVMGSIGALHAFTSRDD 1135

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
            + F     L+  +    P + G +  ++R       A+ P  D ++D +L   +  LP++
Sbjct: 1136 VDF--FSHLEMHMRQEYPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPTLPMD 1186

Query: 1402 EQLEIAHQTGTTRSQILSNLND 1423
             Q +IA +   T ++IL  L D
Sbjct: 1187 LQRKIADELDRTPAEILKKLED 1208


>gi|242089089|ref|XP_002440377.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
 gi|241945662|gb|EES18807.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
          Length = 1783

 Score = 73.2 bits (178), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 90/373 (24%), Positives = 151/373 (40%), Gaps = 59/373 (15%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ  +  L+        +VEE E   +R+L+       +++    P+   E   ++
Sbjct: 717  RRICHQEQSKTLAFCSFKYNQSVEESETHLIRLLDHQT----FESLCVYPLDQYEFGCSI 772

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
               +       +N     +GTAYV   E+   +GR+L+F+     D    L+ E   KE 
Sbjct: 773  ISCSF----ADDNNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSLQLIVE---KET 822

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL A   KI L+KW        EL     +    L + +     +FI
Sbjct: 823  KGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYT-QTRGDFI 881

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLL-----------------------AKDFGSLDCFATE 1245
            ++GD+ KSI  L +K     + L                        A+D+ +    A E
Sbjct: 882  VVGDLMKSISLLVYKVVPLTVCLTHIVLSVIFFVSLFVVLESAIEERARDYNANWMTAVE 941

Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
             L D      V ++   N+         +   +  +L    E+H+G  V +F    ++  
Sbjct: 942  MLDDE---VYVGAENGYNLFTVRKNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMR 998

Query: 1306 SSD-RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
              D   G  P         ++FGT++G IG IA L    +  L+ LQ  LV  +  V  L
Sbjct: 999  LPDSEIGQIP--------TVIFGTINGVIGIIASLPHDQYVFLEKLQSTLVKYIKGVGNL 1050

Query: 1365 NPRSFRQFHSNGK 1377
            +   +R FH++ K
Sbjct: 1051 SHEQWRSFHNDKK 1063



 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 108/464 (23%), Positives = 189/464 (40%), Gaps = 112/464 (24%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +D + +A E  K  VL++D     L   +M           + GR +   G +  +DP  
Sbjct: 75  QDFLFIATERYKFCVLQWDAEKSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDC 127

Query: 190 RCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           R  G+ +Y GL  +I   ++G                F+ R+E   V++++         
Sbjct: 128 RLIGLHLYDGLFKVIPFDNKGQLK-----------EAFNIRLEELQVLDIK--------- 167

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
             F+HG ++P +V+L++           +H      AL       + P  WS  N+ + A
Sbjct: 168 --FLHGCVKPTIVVLYQ------DNKDVRHVKTYEVALK-DKDFVEGP--WSQNNVDNGA 216

Query: 309 YKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAH 368
             L+ VP+P+GGV+++G   I Y + +++          ++   Q + R+   V+ D + 
Sbjct: 217 GLLIPVPAPLGGVIIIGEEQIVYCNANSTFK--------AIPIKQSIIRAYGRVDPDGSR 268

Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSR 428
                    LL   TG L LL + ++   V  L +     + + S I+ + N + ++GSR
Sbjct: 269 Y--------LLGDNTGILHLLVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSR 320

Query: 429 LGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
            GDS LV+                   +++ADA        S  + L+  VN   +  + 
Sbjct: 321 FGDSQLVKL------------------NLQADASG------SFVEILERYVNLGPIVDFC 356

Query: 489 SASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
               + +   +  T S A +D     G L+    G+ IN  AS            VEL G
Sbjct: 357 VVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQG 399

Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
            KG+W++  KSS            ++D Y  YL++S  + T  L
Sbjct: 400 IKGLWSL--KSS------------FNDPYDMYLVVSFISETRFL 429


>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1223

 Score = 73.2 bits (178), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 87/360 (24%), Positives = 147/360 (40%), Gaps = 62/360 (17%)

Query: 1113 IGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTA+   E+V  +RGR+L+    R  +   +LV E   KE+KGA+  L +  G LL   
Sbjct: 873  VGTAFSLPEEVEPSRGRILVL---RADEGRLSLVAE---KEVKGAVYNLNAFNGKLLAGI 926

Query: 1172 GPKIILHKW---------------------------------------TGTELNGIAFYD 1192
              K+ L KW                                       T  EL     + 
Sbjct: 927  NSKVQLFKWVSRGAGAGAGAGGGAEGGAVAMADGGGGGGGGGGAPAAATTCELASECSHH 986

Query: 1193 APPLYVVSL--NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDG 1250
                ++V+L  ++  +FI++GD+ KSI  L +K     +   A+DF      A   L D 
Sbjct: 987  G---HIVALYVDVRGDFIVVGDLMKSISLLVYKPDEGVIEERARDFNPNWMTAVCALDDE 1043

Query: 1251 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF------LRLQMLA 1304
            + L    ++   N+         +   +  +L    E+H+G  V +F      +RL    
Sbjct: 1044 TYLG---AENSFNLFTVRKNSDAAADEERSRLDVIGEYHLGEFVNRFRAGSLVMRLPGDG 1100

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
              +        S++      LFGT++G+IG +A L E T   L +LQK +   V  V G 
Sbjct: 1101 DGAGLGLGLDASNEAP--TQLFGTVNGAIGVVASLPESTHTFLAALQKAMNKVVSGVGGF 1158

Query: 1365 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +  ++R FH+  ++        VD +L+  +  L  E+  E+A   G    ++   + +L
Sbjct: 1159 SHDAWRSFHNEHRSRLVEARGFVDGDLIESFLDLRPEKASEVASVVGVGVEELTKRIEEL 1218



 Score = 49.3 bits (116), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 62/262 (23%), Positives = 113/262 (43%), Gaps = 35/262 (13%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           GP+  VDP+ R  G+ +Y     ++   Q G               FS R+E   V +++
Sbjct: 130 GPIGAVDPECRMYGLHLYDGLFKVIPMDQTGQ----------LREAFSVRLEELQVFDVK 179

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
                      F+ G  +P + +L++   T  GR    +  C+            +P  W
Sbjct: 180 -----------FLAGTPKPTIAVLYQD--TKEGRHIKTYEVCLKDK-------DFNPGPW 219

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
           +  ++   +  L+AVP+P+GGV+VVG   I Y ++  +  +              + ++ 
Sbjct: 220 AQNDVESGSRFLIAVPAPLGGVVVVGEKVIAYLNKETTHGVGDGGGGGGGGGGGMIVKA- 278

Query: 360 FSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
            +++ DA   T+   D      LLS   G L LL +++D   V+ L L     + + S +
Sbjct: 279 IAMQSDATIMTYGAVDKDGSRYLLSDSAGRLHLLVLMHDKTRVRALKLESLGQTSIASSL 338

Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
           + + N + ++GS  GDS LV+ 
Sbjct: 339 SYLDNGVVYVGSAYGDSQLVRL 360


>gi|297816810|ref|XP_002876288.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322126|gb|EFH52547.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score = 73.2 bits (178), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 88/381 (23%), Positives = 162/381 (42%), Gaps = 57/381 (14%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +R+L+P  A     T   + +Q +E A +V  V   N   KE  TLLA+GT  V+G    
Sbjct: 282  IRVLDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 332

Query: 1125 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
             +  ++       R  +  ++L   ++  +++G   AL   QG LL   GP + L+    
Sbjct: 333  PKKNLVAGFIHIYRFVEEGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 391

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD------- 1235
              L         P  ++S+   ++ I +GDI +S ++  ++    QL + A D       
Sbjct: 392  KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 451

Query: 1236 ---------FGSLDCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
                         D F   + +     LS  + ++    +I +   K++ +    K+   
Sbjct: 452  ASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGA--PNKVDEI 509

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1342
             +FHVG  VT   +  M+          PG  ++    +++GT+ GSIG +      D++
Sbjct: 510  VQFHVGDVVTCLQKASMI----------PGGSES----IMYGTVMGSIGALHAFTSRDDV 555

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
             F     L+  +    P + G +  ++R       A+ P  D ++D +L   +  LP++ 
Sbjct: 556  DF--FSHLEMHMRQEYPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 606

Query: 1403 QLEIAHQTGTTRSQILSNLND 1423
            Q +IA +   T ++IL  L D
Sbjct: 607  QRKIADELDRTPAEILKKLED 627


>gi|430810873|emb|CCJ31593.1| unnamed protein product, partial [Pneumocystis jirovecii]
          Length = 301

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 128/292 (43%), Gaps = 41/292 (14%)

Query: 245 HVKDFIFV-HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
           H+ D  F+ + Y EP + IL+    T  G + ++  T   +A            I++   
Sbjct: 6   HIVDLWFIFYDYREPTLAILYSAFQTSTGLLPYRQDTMTSTA------------IYTVDK 53

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFSV 362
           LP+D + +L +P+PIGG L++G N + Y  Q+A   A+++N++A        +     ++
Sbjct: 54  LPYDLFSVLPLPNPIGGTLLIGNNELVYVDQAARVKAVSVNSFARKCTHLDFIEDYDLNL 113

Query: 363 ELDAAHATWL-----QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV----LTS 413
            L+ A   +L     Q    LL  + G  V +    DGRVV  L +   + SV    L S
Sbjct: 114 RLNGAVGVYLELLDDQPGAVLLVIEDGRFVQVGFKLDGRVVSSLSVKILDQSVKNDFLKS 173

Query: 414 D---ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
           +   I  + N   F+GS++ +S+L+++   S  +             E      + +   
Sbjct: 174 EASCIVLLNNEQLFIGSKVSNSVLLEWKRQSEIA-------------EKLLSEPRVIFDE 220

Query: 471 SSDALQDMVNGEELSLYGSASN-NTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             + L D+  GE+  +  ++S            F + D+L + GP+ D + G
Sbjct: 221 DREVLNDLY-GEDFDIVDTSSILQRNGVFGDIQFRLFDTLYSCGPIVDMTIG 271


>gi|357135348|ref|XP_003569272.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
            distachyon]
          Length = 1074

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/323 (24%), Positives = 143/323 (44%), Gaps = 44/323 (13%)

Query: 1112 AIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
             +GTAY+   ++   +GR+L+F      +    LV E   +E KGA+ +L +L G LL A
Sbjct: 782  CVGTAYILPYEIEPTKGRILIFLV---EERKLRLVAE---RETKGAVYSLNALTGKLLAA 835

Query: 1171 SGPKIILHKWTGT----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
               KII++KW       +L     Y    L  +      +FI++GD+ +S+  L +K + 
Sbjct: 836  VNQKIIVYKWVRRDNRHQLQSECSYRGCVL-ALHTQTHGHFIVVGDMVRSVSLLRYKYEE 894

Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA-PKMSESWKGQKLLSR 1285
              + ++ +DF +    A   L D   +    +D   N+   +   P +            
Sbjct: 895  GLIEVVTRDFNTKWITAVAMLDDDIYIG---ADNCCNLFTLHSGRPGVV----------- 940

Query: 1286 AEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSIGCIA--PLDEL 1342
             E+H+G  V +     ++   +D   G  P         ++FGT+ G+IG IA  P D+ 
Sbjct: 941  GEYHLGDLVNRMHHGSLVMHHTDSEIGQIP--------TVIFGTISGAIGVIASFPYDQY 992

Query: 1343 TF-RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
             F  +LQS+  K + SV +++ +  RSF       +A      + VD +L+  +  L   
Sbjct: 993  VFLEKLQSVLVKFIKSVGNLSHVEWRSFYNVSRTAEAR-----NFVDGDLIESFLSLSPS 1047

Query: 1402 EQLEIAHQTGTTRSQILSNLNDL 1424
            +  E++   G    ++   + +L
Sbjct: 1048 KMEEVSQVMGLRADELCKIVEEL 1070



 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 135/605 (22%), Positives = 240/605 (39%), Gaps = 141/605 (23%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   N +EIY++  Q                          L+L+    L+G + +L
Sbjct: 31  NLIVAKCNRMEIYLLTPQ-------------------------GLQLMVDVPLYGTIATL 65

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            +        S  +D + ++ E  +  VL +D     L   S    +  +++    GR +
Sbjct: 66  ELFRS----RSETQDFLFISMERYRCIVLHWDGRNSELITRSGG--DVSDFI----GRPT 115

Query: 177 FARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
              G +  +DPQ R  G+ +Y GL  +I                F + G           
Sbjct: 116 -DNGQIGVIDPQNRLIGLSLYDGLFKVI---------------PFDNKGNLK------EA 153

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           +N+R L    V D  F++G   P +V+LH+           +H      AL     ++  
Sbjct: 154 LNIR-LQEFLVLDIKFLYGCARPTVVVLHQ------DNKDSRHVKTYEVALEDKDFVEGS 206

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
              WS  NL + A+  L +P P+GGV+++G +TI Y S +   AL++          Q +
Sbjct: 207 ---WSQSNLDNSAH--LLIPVPLGGVIIIGEHTIVYCSATTFKALSIK---------QSI 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
            R+   V+ D +   +  N        TG L L+ + ++   V  L       + + S I
Sbjct: 253 IRAVGRVDPDGSRYLYGDN--------TGALHLIVITHEWGRVTDLKTHYMGETSIASTI 304

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
           + + + L ++GSR GDS L++                   +I+ADA +      S  + L
Sbjct: 305 SYLDSGLVYIGSRFGDSQLIKL------------------NIQADASA------SFVEIL 340

Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
           +  +N   +  +           +  + +        G  KD S    I A  +   I+ 
Sbjct: 341 EQFMNTGPIVDFCVVDTERRGQGQVITCS--------GAYKDGS----IRAVRNGVVITD 388

Query: 536 QSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
           Q++   VEL G KG+W++  KSS     D+  +  + +E H +L +++E     LE  D+
Sbjct: 389 QAS---VELRGMKGLWSM--KSSLNDPYDTFLVVTFINETH-FLAMNMENE---LEEVDI 439

Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGS 654
               +E+       +T+A G+     ++IQV  R  R++   S    D  F P+      
Sbjct: 440 KGFDSET-------QTLACGSAI-HNQLIQVTSRSVRLVSSVSLELLDQWFAPARFSVNV 491

Query: 655 GSENS 659
            + N+
Sbjct: 492 AAANA 496


>gi|308808936|ref|XP_003081778.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
 gi|116060244|emb|CAL56303.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
          Length = 1282

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 80/343 (23%), Positives = 157/343 (45%), Gaps = 33/343 (9%)

Query: 1038 QEVGHQIDNHNLSSVDLHR-TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + + HQ + +  + V  H  + + ++  VR+++     G ++T +   ++  E   ++  
Sbjct: 907  RRIAHQPETNTFAVVVEHLWSKSSQDCFVRLVD----DGSFETLSQFQLEDQELTSSLTS 962

Query: 1097 VTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
             T    +T    T   +GT   ++ ED  +RGR+L+F      D+   LV+E   KE++G
Sbjct: 963  CTFAGDST----TYYVVGTGIALETEDEPSRGRILVFKVD---DDQLVLVSE---KEVRG 1012

Query: 1156 AISALASLQGHLLIASGPKIILHKWT--GTELNGIAFYDAPPLYVVSLNIVK--NFILLG 1211
            A+  L + +G LL     K+ L KWT    E++ +    +    +V+  +    ++IL+G
Sbjct: 1013 AVYNLNAFKGKLLAGINSKLELFKWTPREDEVHELVSECSHHGQIVTFAVKTRGDWILVG 1072

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ KS+  L +K +   ++ +A+DF +    A   L D  T        + ++ +F  + 
Sbjct: 1073 DLMKSMSLLLYKPEEGAIDEVARDFNANWMTAVAMLDDDETY----LGAENSLNLFTVSR 1128

Query: 1272 KMSE--SWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
             ++     +  +L    E+H+G  V  F    ++ +  D        +  +   LLFGT 
Sbjct: 1129 NVNAVTDEERSRLEITGEYHLGELVNAFAPGSLVMSLRD-------GESLSVPTLLFGTA 1181

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
            +G IG +A L +  +   + LQ  +   +  V GL    +R F
Sbjct: 1182 NGVIGVLASLPKDVYEFTERLQASINKHIQGVGGLKHADWRSF 1224



 Score = 44.7 bits (104), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 101/206 (49%), Gaps = 24/206 (11%)

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
              N+R L+   V+D  F+HG  +P + +L+ R++  A  V  K +   +      ++  
Sbjct: 376 EAFNIR-LEELRVEDIQFLHGTAKPTIAVLY-RDMKEA--VHIKTYEIGVREKEFVSS-- 429

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
                W+  +L   + K++ VP+P+GGV+V+G  TI Y ++++      ++  V L +  
Sbjct: 430 ----PWAQNDLEGGSSKIIPVPAPVGGVVVLGEETIVYLNKTS------DDTDVFLKAIN 479

Query: 354 ELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
              RSS      +D   + +L  D        G L LL +V+DG+ V  L + +   + +
Sbjct: 480 IPERSSIVCYGAIDPDGSRYLLGD------HDGTLYLLVLVHDGKRVNELKIERLGETSI 533

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
            S ++ + N + F+GS  GDS L++ 
Sbjct: 534 PSTVSYLDNGVVFVGSAYGDSQLIKL 559


>gi|397627714|gb|EJK68584.1| hypothetical protein THAOC_10223, partial [Thalassiosira oceanica]
          Length = 456

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/289 (23%), Positives = 123/289 (42%), Gaps = 47/289 (16%)

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            G + AL   QG LL+  G  + L++    +L         P  V +L    +   +GD+ 
Sbjct: 189  GPVLALVHFQGRLLVGIGKSLRLYEMGKRQLLKKCELRGLPTMVKTLQAAGDRAFVGDMM 248

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +S+ F+ +     +L L+A+D  +      E L+D +T++  V D+  N+      P+ +
Sbjct: 249  QSMQFVRYDATANRLVLVARDRSARPITCQE-LLDVNTVA--VGDKFGNVTTLRL-PRGA 304

Query: 1275 ES-----------WKGQ------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
            ++           W         KL +   +HVG  VT   R  ++A  ++         
Sbjct: 305  DTGAVDVSGTRALWDSSREDATPKLETLCTYHVGEVVTSLTRASLVAGGAE--------- 355

Query: 1318 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
                 +L++ T+ G IG + P    +++ F    SL+  +   VP   G +P+S+R F+ 
Sbjct: 356  -----SLIYVTVTGRIGALVPFTSREDVEF--YTSLESHVRSEVPRPTGRDPQSYRSFYC 408

Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
              K        ++D +L   Y  LP E +  IA Q   +  +++  L D
Sbjct: 409  PVK-------HVIDGDLCEAYGGLPYEARERIADQMERSTGEVMKKLED 450


>gi|384253371|gb|EIE26846.1| hypothetical protein COCSUDRAFT_52476 [Coccomyxa subellipsoidea
            C-169]
          Length = 1205

 Score = 72.0 bits (175), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 118/495 (23%), Positives = 201/495 (40%), Gaps = 75/495 (15%)

Query: 964  NCNHGFIYVTSQGILKICQLPS-GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
             C  GF  V S+ +L+I  L   G  ++     Q   L+ TP +     E N+  +  + 
Sbjct: 747  QCPEGFCAV-SKSMLRILTLERLGEAFNQ----QVTRLRYTPRKFVVHPESNMLIVAEAD 801

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--------------VRIL 1068
                PL +  ++    E G ++D      ++       EE +              +R+L
Sbjct: 802  HAAVPLAERRAV----EDGMEMDAALTEGIEFDEERAAEEEQHGAPKNSTGRWASCIRVL 857

Query: 1069 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG----EDVA 1124
            +P       QT + + +  +E A+++ ++   N    E   +LA+GT  VQG       A
Sbjct: 858  DPTS----LQTSSVLELDGNEAAVSLCLLRFSNW--PEEGMVLAVGT--VQGLAFYPRTA 909

Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAI-SALASLQGHLLIASGPKIILHKWTGT 1183
              G + L+   R  D+ + L  E+  K   G I  ALA+ +G LL   GP + +++    
Sbjct: 910  DEGYIRLY---RFRDSGRQL--ELIHKTPTGGIPGALAAFKGRLLAGVGPTLRIYEAGKK 964

Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
            +L     +   P ++ +L    + I +GD+ +S+++  +K     L   A D       A
Sbjct: 965  KLLRKCEHRKLPTHIATLATSGDRIFVGDLQESMHYFRYKANENALYEYADDIAPRHLTA 1024

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFY------YAPKMSESWKGQKLLSRAEFHVGA----- 1292
                +D  T    V+   K   IF        + ++ E   G K    A    GA     
Sbjct: 1025 A-LPLDYDT----VAGADKFCNIFVTRLPRDVSTQVEEDPTGGKFAGAAGLLNGAPHKLE 1079

Query: 1293 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQS 1349
             V  F  +  L TS  R    PG     R  LL+ T+ G+IG + P    +++ F     
Sbjct: 1080 DVVNF-HVGDLVTSLQRAVLQPG----GREVLLYATVMGAIGAMLPFPSREDVDF--FSH 1132

Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
            L+  L    P + G +  S+R       ++ P  D ++D +L  H+  LP  +Q  IA +
Sbjct: 1133 LEMHLRQEHPPMGGRDHMSYR------GSYFPVKD-VIDGDLCEHFSQLPAAKQKSIADE 1185

Query: 1410 TGTTRSQILSNLNDL 1424
               T  +IL  L D+
Sbjct: 1186 LERTPGEILKKLEDI 1200


>gi|405970039|gb|EKC34976.1| DNA damage-binding protein 1 [Crassostrea gigas]
          Length = 1160

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 74/319 (23%), Positives = 131/319 (41%), Gaps = 30/319 (9%)

Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTA V  E+   + GR+++F       N      ++  KE+KGA   L    G LL + 
Sbjct: 851  VGTALVHPEEAEPKQGRIVIFHFHEGKLN------QIAEKEIKGAAYTLVEFNGKLLASI 904

Query: 1172 GPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
               + L +WT  +   L    F     LY   L    +FIL+GD+ +SI  L +K     
Sbjct: 905  NSTVRLFEWTTDKELRLECNYFNSIVALY---LKTKGDFILVGDLMRSITLLLYKPMEGT 961

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
               +A+D       A E L D + L    ++   N+         +     Q L     F
Sbjct: 962  FEEIARDCNPNWTTAVEILDDDNFLG---AENSFNLFTCQKDSASTTDEDRQNLQEVGMF 1018

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1348
            H+G  V  F    ++   S  T          + ++L+GT++G++G +  + +  +  LQ
Sbjct: 1019 HLGEFVNVFRHGSLVMQHSGETSTP------TQGSVLYGTVNGAVGLVTQVPQEFYSFLQ 1072

Query: 1349 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY------EMLPLEE 1402
             +Q +L   +  V  +    +R FH+  K      +  +D +L+  +      +M    +
Sbjct: 1073 DIQSRLAKVIKSVGKIEHSFWRSFHTERKTE--ACEGFIDGDLIESFLDLNRDKMQETVK 1130

Query: 1403 QLEIAHQTGTTRSQILSNL 1421
             L+I   +G  R   + +L
Sbjct: 1131 GLQIDDGSGMKREATVDDL 1149



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 98/219 (44%), Gaps = 35/219 (15%)

Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW--------AGRVSWKH--HTCMISAL 286
           N+R L+   V D  F+HG   P ++++H+  L             +S+K   H   +   
Sbjct: 156 NIR-LEELTVIDIQFLHGCTTPTLILIHQANLNCYHLMTLCITNLLSFKQDQHGRHVKTY 214

Query: 287 SISTTLKQ-HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
            IS   K+     W   N+  +A  L+AVP P GG L++G  +I YH       +A    
Sbjct: 215 EISLRDKEFQKGPWKQDNVETEACMLIAVPEPFGGALIIGQESITYHKGDNFIPIA---- 270

Query: 346 AVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----V 398
                    + +S+ +   ++DA  + +L  D+       G L +L +  + ++     V
Sbjct: 271 ------PPAIKQSTLTCYGKVDANGSRYLLGDMM------GRLFMLMLEKEEKMDSTVTV 318

Query: 399 QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           + L +     + +   IT + N++ ++GSRLGDS LV+ 
Sbjct: 319 KDLKVELLGETTIAECITYLDNAVVYIGSRLGDSQLVKL 357


>gi|407035910|gb|EKE37921.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
          Length = 836

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+SVL +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++         +G + T             S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
            I + GY  P +  LH +  TW GR+     T    +I +L      ++  ++    +  
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
             +P++A +++    P  G+L++  N+I Y S ++  S  L    + V +  +   P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTVNSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
           F +       T + N V  +   T  L ++         V+   +  R+ D+  TN P  
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
                 TI ++  F+GS + DS +  +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 69/311 (22%), Positives = 136/311 (43%), Gaps = 36/311 (11%)

Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            K+ +  L +G      ED   +G+  +F    N +N   L+ ++   + K ++ A+  + 
Sbjct: 519  KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 572

Query: 1165 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1212
            G L +ASG ++ L       +W     + I+      +  PL V+     K    ILL D
Sbjct: 573  GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLILLSD 632

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
             ++S+  L +K     +  L KD  ++ C  + F+I     S++  D ++N+ +  Y+  
Sbjct: 633  FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 692

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             +E     ++   A F++G ++ KF RL          G          +  ++ T++GS
Sbjct: 693  ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 734

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +G I+ ++E  ++ L+ +  K+     H AG N   +R     G          +D ++L
Sbjct: 735  VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 794

Query: 1393 SHYEMLPLEEQ 1403
              + +L  E+Q
Sbjct: 795  KQFRLLNEEQQ 805


>gi|443707495|gb|ELU03057.1| hypothetical protein CAPTEDRAFT_148808 [Capitella teleta]
          Length = 1084

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 67/270 (24%), Positives = 118/270 (43%), Gaps = 16/270 (5%)

Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            T   +GTA V  E+   + GR+++F   R  D     +T+V  KE+KGA   L    G L
Sbjct: 771  TYYVVGTAMVYPEEAEPKQGRIIVF---RFHDGK---LTQVAEKEIKGAAYTLTEFNGKL 824

Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
            L +    + L +WT  +   +       +  + L    +FIL+GD+ +S+  LS+K    
Sbjct: 825  LASINSTVRLFEWTAEKELRVECSYFNNIIALYLKTKGDFILVGDLMRSVTLLSYKPMEG 884

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
                +A+D+      + + L D + L    ++   NI         +   + Q L     
Sbjct: 885  CFEEIARDYNPNWMTSIDVLDDDTFLG---AENSFNIFTCQKDSAATTDEERQHLQEVGL 941

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
            +H+G  V  F R   L         +P      + ++LFGT++G++G +  L +  +  L
Sbjct: 942  YHLGEFVNVF-RHGSLVMQHPGECTSP-----TQGSVLFGTVNGALGLVTQLPQEFYLFL 995

Query: 1348 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
              +Q KL  ++  V  +    +R FH+  K
Sbjct: 996  LEVQNKLAKTIKSVGKVEHAFWRSFHTERK 1025


>gi|357478323|ref|XP_003609447.1| Splicing factor 3B subunit [Medicago truncatula]
 gi|355510502|gb|AES91644.1| Splicing factor 3B subunit [Medicago truncatula]
          Length = 1225

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 94/379 (24%), Positives = 165/379 (43%), Gaps = 51/379 (13%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +R+L+P R G    T   + +Q +E A ++  V   N   KE  TLLA+GTA        
Sbjct: 874  IRVLDP-RTG---NTTCLLELQENEAAFSICTV---NFHDKEYGTLLAVGTAKGLQFTPK 926

Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
                V      R  D+ ++L   ++  +++G   AL   QG LL   GP + L+      
Sbjct: 927  RSLTVGFIHIYRFLDDGRSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGKRR 985

Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
            L       + P+ +VS++  ++ I +GDI +S ++  ++    QL + A D  S+  + T
Sbjct: 986  LLRKCENKSFPISIVSIHAYRDRIYVGDIQESFHYCKYRRDENQLYIFADD--SVPRWLT 1043

Query: 1245 -EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKL---LSRAEFH 1289
              + ID  T++   +D+  NI        +S+            W+  KL   L++ E  
Sbjct: 1044 ASYHIDFDTMA--GADKFGNIFFARLPQDVSDEVEEDPTSGKIKWEQGKLNGALNKVEEI 1101

Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR--- 1346
            V  HV   +      TS  +    PG  +     +++GT+   +GC+  L   T R    
Sbjct: 1102 VQFHVGDVI------TSLQKAALVPGGGE----CIVYGTV---MGCVGALHAFTSRDDVD 1148

Query: 1347 -LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
                L+  +    P + G +  ++R       A+ P  D ++D +L   +  LP++ Q +
Sbjct: 1149 FFSHLEMHMRQDNPPLCGRDHMAYR------SAYFPVKD-VLDGDLCEQFPTLPMDLQRK 1201

Query: 1406 IAHQTGTTRSQILSNLNDL 1424
            IA +   T  +IL  L +L
Sbjct: 1202 IADELDRTPGEILKKLEEL 1220


>gi|351699158|gb|EHB02077.1| DNA damage-binding protein 1 [Heterocephalus glaber]
          Length = 1144

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 79/306 (25%), Positives = 133/306 (43%), Gaps = 42/306 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKEL-KGAISALAS 1162
            K+  T   +GTA V  E+   + GR+++F   + +D  +    EV S+ L KGA+ ++  
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDEER----EVSSRGLVKGAVYSMVE 875

Query: 1163 LQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
              G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+
Sbjct: 876  FNGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSV 930

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L++K        +A+DF      A E L D + L    ++   N+ +       +   
Sbjct: 931  LLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDE 987

Query: 1278 KGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
            + Q L     FH+G  V  F    L +Q L  +S  T            ++LFGT++G I
Sbjct: 988  ERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMI 1037

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVA----GLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
            G +  L E  +  L  +Q +L   +  V      L P     FH+  K  +      +D 
Sbjct: 1038 GLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSRAVSFHTERKTEQ--ATGFIDG 1095

Query: 1390 ELLSHY 1395
            +L+  +
Sbjct: 1096 DLIESF 1101


>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
          Length = 1158

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 70/297 (23%), Positives = 128/297 (43%), Gaps = 17/297 (5%)

Query: 1112 AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
             +GTA V  E+   + GR++LF   + AD     +T V  KE+KGA  +L      +L A
Sbjct: 847  VVGTALVVPEESEPKQGRIVLF---QWADGK---LTTVAEKEVKGACYSLVDFNSKILAA 900

Query: 1171 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1230
                + L++WT  +   +   +   +  + L    +FIL+GD+ +SI  L +K       
Sbjct: 901  INNVVRLYEWTAEKELRLECSNFNHIIALYLKRKGDFILVGDLMRSITLLQYKTMEGSFE 960

Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1290
             +A+D       A E L D + L    ++   N+ +       +   + Q+L     FH+
Sbjct: 961  EMARDSNPNWMSAVEILDDDTFLG---AENSFNLFVCQKDSAATTEEERQQLTEVGRFHL 1017

Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1350
            G  V  F    ++   +  T   P      +  +LFGT+ G+IG +  L    +  L  +
Sbjct: 1018 GDMVNVFRHGSLVMDHAAETLTTP-----TQGCVLFGTVHGAIGVVTQLPSEFYHFLSEV 1072

Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
            Q ++   +  V  +    +R F +  K      +  +D +L+  +  L  ++  E+A
Sbjct: 1073 QTRMARVIKPVGKIEHSFWRSFATERKVEP--CEGFIDGDLIESFLDLSSDKMKEVA 1127



 Score = 56.6 bits (135), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 89/395 (22%), Positives = 149/395 (37%), Gaps = 83/395 (21%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-WSAMNL 304
           ++D  F++G   P +VI+H+             H   +    IS   K+     W   N+
Sbjct: 164 IQDIAFLYGCANPTVVIIHQ-----------DAHGRHVKTREISLRDKEFAKTSWKQDNV 212

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
             +A  LL VP P GG L++G  +I YH+          NY        +    +   ++
Sbjct: 213 ETEAAMLLPVPEPYGGALIIGQESITYHNG--------QNYVTIAPPIIKQSTVTCYGKV 264

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDITTIG 419
           D   + +L  D+A      G L +L +      DG V V+ + +       +   +T + 
Sbjct: 265 DPNGSRYLLGDLA------GHLFMLVLEKEEKMDGTVTVRDIKIELLGEVSIPECLTYLD 318

Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
           N + F+GSR GDS LV+       +     + E F ++   AP            + DM 
Sbjct: 319 NGVVFIGSRFGDSQLVKLNVTPDDNNSYVTVMETFTNL---AP------------IVDMT 363

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
                          +    T S A ++     G L+    G+ I+  AS          
Sbjct: 364 -------IVDLDRQGQGQLVTCSGAYKE-----GSLRIIRNGIGIHEQAS---------- 401

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
             ++LPG KGIW +   SS   + D +            +++S   +T VL       E 
Sbjct: 402 --IDLPGIKGIWALKMGSSGNPSVDDT------------VVLSFVGQTRVLMLNGEEMEE 447

Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           TE        +T   GN+ G+  V+Q+     R++
Sbjct: 448 TEIPGLTADQQTFFCGNV-GKDSVLQITTGSVRLI 481


>gi|67463896|ref|XP_648489.1| cleavage and polyadenylation specificity factor subunit [Entamoeba
           histolytica HM-1:IMSS]
 gi|56464653|gb|EAL43100.1| cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba histolytica HM-1:IMSS]
          Length = 1150

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+SVL +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++         +G + T             S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
            I + GY  P +  LH +  TW GR+     T    +I +L      ++  ++    +  
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
             +P++A +++    P  G+L++  N+I Y S ++  S  L    + V +  +   P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
           F +       T + N V  +   T  L ++         V+   +  R+ D+  TN P  
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
                 TI ++  F+GS + DS +  +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 69/315 (21%), Positives = 137/315 (43%), Gaps = 36/315 (11%)

Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            K+ +  L +G      ED   +G+  +F    N +N   L+ ++   + K ++ A+  + 
Sbjct: 833  KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 886

Query: 1165 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1212
            G L +ASG ++ L       +W     + I+      +  PL V+     K    ILL D
Sbjct: 887  GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLILLSD 946

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
             ++S+  L +K     +  L KD  ++ C  + F+I     S++  D ++N+ +  Y+  
Sbjct: 947  FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 1006

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             +E     ++   A F++G ++ KF RL          G          +  ++ T++GS
Sbjct: 1007 ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 1048

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +G I+ ++E  ++ L+ +  K+     H AG N   +R     G          +D ++L
Sbjct: 1049 VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 1108

Query: 1393 SHYEMLPLEEQLEIA 1407
              + +L  E+Q  + 
Sbjct: 1109 KQFRLLNEEQQKRVC 1123


>gi|449704103|gb|EMD44407.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
          Length = 1088

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 86/351 (24%), Positives = 161/351 (45%), Gaps = 48/351 (13%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLL--FSTGRNADNP 1141
            ++ +E+AL++  + +      ENE +  IGTA+ +  +V  + GR+L+     GR     
Sbjct: 756  LKENEHALSIEQIVI-----DENE-MFVIGTAFAKPNEVEPSSGRILIVQIKDGR----- 804

Query: 1142 QNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP-- 1195
               +  ++ K++ GA+ ++ +L + +L ++   K+++ ++     NG   +   +     
Sbjct: 805  ---LEIIFEKDVNGAVYSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVKLQEKGSCN 861

Query: 1196 -----LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL---AKDFGSLDCFATEFL 1247
                 LYV +L    N IL+GD+ KSI   S+   G   N L   ++DF +    A EF+
Sbjct: 862  VKLIGLYVKTLG---NKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFV 918

Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
             +   LS   SD   NI IF      +ES +  +L + A  HVG  +    +  +  T S
Sbjct: 919  DEDCYLS---SDSNSNILIFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHS 974

Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-P 1366
                     +   +  +LFG + G IG I  +    +  L  +Q +++  +  +     P
Sbjct: 975  TY-------ETVQKKCILFGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECTTP 1027

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
             ++++   + K  R    +I+D  ++  Y  +  E+Q EIAH +G    QI
Sbjct: 1028 DNWKKVIDDWK--RMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQI 1076


>gi|449710759|gb|EMD49776.1| cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba histolytica KU27]
          Length = 836

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+SVL +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++         +G + T             S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
            I + GY  P +  LH +  TW GR+     T    +I +L      ++  ++    +  
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
             +P++A +++    P  G+L++  N+I Y S ++  S  L    + V +  +   P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
           F +       T + N V  +   T  L ++         V+   +  R+ D+  TN P  
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
                 TI ++  F+GS + DS +  +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 69/311 (22%), Positives = 136/311 (43%), Gaps = 36/311 (11%)

Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            K+ +  L +G      ED   +G+  +F    N +N   L+ ++   + K ++ A+  + 
Sbjct: 519  KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 572

Query: 1165 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1212
            G L +ASG ++ L       +W     + I+      +  PL V+     K    ILL D
Sbjct: 573  GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLILLSD 632

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
             ++S+  L +K     +  L KD  ++ C  + F+I     S++  D ++N+ +  Y+  
Sbjct: 633  FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 692

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             +E     ++   A F++G ++ KF RL          G          +  ++ T++GS
Sbjct: 693  ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 734

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +G I+ ++E  ++ L+ +  K+     H AG N   +R     G          +D ++L
Sbjct: 735  VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 794

Query: 1393 SHYEMLPLEEQ 1403
              + +L  E+Q
Sbjct: 795  KQFRLLNEEQQ 805


>gi|323447810|gb|EGB03719.1| hypothetical protein AURANDRAFT_72671 [Aureococcus anophagefferens]
          Length = 760

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 95/403 (23%), Positives = 168/403 (41%), Gaps = 65/403 (16%)

Query: 1019 IVSVPVL-KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
            +VSVP+  +PL           + H + +H  +   +       +  +R L   R   P+
Sbjct: 374  VVSVPLAEQPLC----------ICHDLQSHLFAVCTIDHREGDNQGVIRFL---RDEAPY 420

Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTT-KENETLLAIGTAYVQGED--VAARGRVLLFST 1134
                   ++  E  L   +++L + +T K+      +GTA+   E+      GR+++F +
Sbjct: 421  NDVHREALEPLEIPLCCSIISLDSISTYKDQRAHFVVGTAFAAQENDFEPCSGRMIIFRS 480

Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-------ELNG 1187
            G+    P  L    +  E  GA+  +A+++  LL+ +    I H +           L  
Sbjct: 481  GQANVAPSVL----FFVEANGAVYDVAAMRASLLVCAVNHAI-HIYDPVVRDNRRGHLKP 535

Query: 1188 IAFYDAPPLYVVSLNI--VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
             A YD     VV+L +    N I++GD+ +S+  L+   Q   +  +A D+ +    A E
Sbjct: 536  RASYDG---LVVALKVQCYGNLIVVGDMMRSVTLLNLIRQKMIIVEVACDYNTNWVCALE 592

Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
             + DGS +    S     ++  Y       S KG  L SRA+ H+G  VT F R  ++  
Sbjct: 593  VIGDGSFIIADASGSLVALESLY-----GNSDKGYFLESRAKMHLGDVVTCFARGSIMTQ 647

Query: 1306 SSDRTGAAPGSDKTNRFA--LLFGTL-----------------DGSIGCIAPLDELTFRR 1346
               R      + K ++ A  L+FG +                  G+IGCI  +D++T   
Sbjct: 648  DDWR------NPKVSKVATPLIFGCVTSSRVLVLTPLIARYQVSGAIGCIVSIDDVTHSL 701

Query: 1347 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVD 1388
            L+ L K L++    V   +  +F+  H+N       P D  +D
Sbjct: 702  LERLSKVLLEFHSGVGDFDHETFQALHNNVATCNAAPMDDFID 744


>gi|183232997|ref|XP_653855.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
 gi|169801778|gb|EAL48469.2| damaged DNA binding protein, putative [Entamoeba histolytica
            HM-1:IMSS]
          Length = 1088

 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 86/351 (24%), Positives = 160/351 (45%), Gaps = 48/351 (13%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLL--FSTGRNADNP 1141
            ++ +E+AL++  + +      ENE +  IGTA+ +  +V  + GR+L+     GR     
Sbjct: 756  LKENEHALSIEQIVI-----DENE-MFVIGTAFAKPNEVEPSSGRILIVQIKDGR----- 804

Query: 1142 QNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP-- 1195
               +  ++ K++ GA+ ++ +L + +L ++   K+++ ++     NG   +   +     
Sbjct: 805  ---LEIIFEKDVNGAVYSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVKLQEKGSCN 861

Query: 1196 -----LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL---AKDFGSLDCFATEFL 1247
                 LYV +L    N IL+GD+ KSI   S+   G   N L   ++DF +    A EF+
Sbjct: 862  VKLIGLYVKTLG---NKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFV 918

Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
             +   LS   SD   NI IF      +ES +  +L + A  HVG  +    +  +  T S
Sbjct: 919  DEDCYLS---SDSNSNILIFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHS 974

Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-P 1366
                     +   +  +LFG + G IG I  +    +  L  +Q +++  +  +     P
Sbjct: 975  TY-------ETVQKKCILFGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECTTP 1027

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
              +++   + K  R    +I+D  ++  Y  +  E+Q EIAH +G    QI
Sbjct: 1028 DDWKKVIDDWK--RMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQI 1076


>gi|224100909|ref|XP_002312063.1| predicted protein [Populus trichocarpa]
 gi|222851883|gb|EEE89430.1| predicted protein [Populus trichocarpa]
          Length = 1213

 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 91/384 (23%), Positives = 167/384 (43%), Gaps = 61/384 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +R+L+P  A     T   + +Q +E A +V  V   N   KE+ TLLA+GTA  +G    
Sbjct: 862  IRVLDPRSA----TTTCLLELQDNEAAFSVCTV---NFHDKEHGTLLAVGTA--KGLQFW 912

Query: 1125 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
             +  ++       +  D+ ++L   ++  +++G   AL   QG LL   G  + L+    
Sbjct: 913  PKRSLIAGFIHIYKFVDDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGSVLRLYDLGK 971

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
              L         P  +VS++  ++ I +GDI +S +F  ++    QL + A D  S+  +
Sbjct: 972  KRLLRKCENKLFPNSIVSIHTYRDRIYVGDIQESFHFCKYRRDENQLYIFADD--SVPRW 1029

Query: 1243 AT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR----- 1285
             T  + +D  T++   +D+  NI        +S+            W+  KL        
Sbjct: 1030 LTASYHVDFDTMA--GADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVE 1087

Query: 1286 --AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
               +FH+G  V    +  ++          PG  +     +++GT+ GS+G + P    D
Sbjct: 1088 EIVQFHIGDVVNSLQKASLI----------PGGGE----CIMYGTVMGSVGALLPFTSRD 1133

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            ++ F     L+  L    P + G +  ++R       A+ P  D ++D +L   +  LPL
Sbjct: 1134 DVDF--FSHLEMHLRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPTLPL 1184

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDL 1424
            + Q +IA +   T  +IL  L ++
Sbjct: 1185 DAQRKIADELDRTPGEILKKLEEV 1208


>gi|407044103|gb|EKE42371.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
          Length = 1088

 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 87/351 (24%), Positives = 160/351 (45%), Gaps = 48/351 (13%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLL--FSTGRNADNP 1141
            ++ +E+AL++  + +      ENE +  IGTA+ +  +V  + GR+L+     GR     
Sbjct: 756  LKENEHALSIEQIVI-----DENE-MFVIGTAFAKPNEVEPSSGRILIVQIKDGR----- 804

Query: 1142 QNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP-- 1195
               +  V+ K++ GA+ ++ +L + +L I+   K+++ ++     NG   +   +     
Sbjct: 805  ---LEIVFEKDVNGAVYSMKTLLKKYLAISIEKKLVVFEYQRVITNGEFEVKLQEKGSCN 861

Query: 1196 -----LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL---AKDFGSLDCFATEFL 1247
                 LYV +L    N IL+GD+ KSI   S+   G   N L   ++DF +    A EF+
Sbjct: 862  VKLIGLYVKTLG---NKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFV 918

Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
             +   LS   SD   NI IF      +ES +  +L + A  HVG  +    +  +  T S
Sbjct: 919  DEDCYLS---SDSNSNILIFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHS 974

Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-P 1366
                     +   +  +LFG + G IG I  +    +  L  +Q +++  +  +     P
Sbjct: 975  TY-------ETVQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTP 1027

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
              +++   + K  R    +I+D  ++  Y  +  E+Q EIAH +G    +I
Sbjct: 1028 DDWKKVIDDWK--RMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEKI 1076


>gi|18377609|gb|AAL66955.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
          Length = 270

 Score = 70.9 bits (172), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 69/253 (27%), Positives = 113/253 (44%), Gaps = 24/253 (9%)

Query: 1151 KELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP--PLYVVSLNI 1203
            KE KGA+ +L +  G LL A   KI L+KW     GT EL     +      LYV +   
Sbjct: 1    KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG- 59

Query: 1204 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
              +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D   L    ++   N
Sbjct: 60   --DFIVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFN 114

Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRF 1322
            +       + +   +  +L    E+H+G  V +F    ++    D   G  P        
Sbjct: 115  LLTVKKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP-------- 166

Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
             ++FGT++G IG IA L +  +  L+ LQ  L   +  V GL+   +R F  N +     
Sbjct: 167  TVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSF--NNEKRTAE 224

Query: 1383 PDSIVDCELLSHY 1395
              + +D +L+  +
Sbjct: 225  ARNFLDGDLIESF 237


>gi|397615212|gb|EJK63291.1| hypothetical protein THAOC_16062, partial [Thalassiosira oceanica]
          Length = 322

 Score = 70.9 bits (172), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 67/289 (23%), Positives = 123/289 (42%), Gaps = 47/289 (16%)

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            G + AL   QG LL+  G  + L++    +L         P  V +L    +   +GD+ 
Sbjct: 55   GPVLALVHFQGRLLVGIGKSLRLYEMGKRQLLKKCELRGLPTMVKTLQAAGDRAFVGDMM 114

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +S+ F+ +     +L L+A+D  +      E L+D +T++  V D+  N+      P+ +
Sbjct: 115  QSMQFVRYDATANRLVLVARDRSARPITCQE-LLDVNTVA--VGDKFGNVTTLRL-PRGA 170

Query: 1275 ES-----------WKGQ------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
            ++           W         KL +   +HVG  VT   R  ++A  ++         
Sbjct: 171  DTGAVDVSGTRALWDSSREDATPKLETLCTYHVGEVVTSLTRASLVAGGAE--------- 221

Query: 1318 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
                 +L++ T+ G IG + P    +++ F    SL+  +   VP   G +P+S+R F+ 
Sbjct: 222  -----SLIYVTVTGRIGALVPFTSREDVEF--YTSLESHVRSEVPRPTGRDPQSYRSFYC 274

Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
              K        ++D +L   Y  LP E +  IA Q   +  +++  L D
Sbjct: 275  PVK-------HVIDGDLCEAYGGLPYEARERIADQMERSTGEVMKKLED 316


>gi|385304555|gb|EIF48567.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
           [Dekkera bruxellensis AWRI1499]
          Length = 353

 Score = 70.5 bits (171), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 67/293 (22%), Positives = 129/293 (44%), Gaps = 39/293 (13%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERE-LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
           +K++ D+ F++ Y EP + IL+  E L+WAG +        +  LS++    +   I   
Sbjct: 69  VKNIMDYQFLYSYREPTIAILYAPEGLSWAGYLXKLKDNMKVVVLSLNLDTHKADSIMVL 128

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            NLP+D   +  +PSPI G L++G+N I H +S  +   +  N Y       +    S  
Sbjct: 129 PNLPYDLNSIYPLPSPINGFLLIGSNEILHVNSLGSIKGVYTNKYFPETSDMKLRDESDL 188

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG------RVVQRLDLSKTNPSVLTSD 414
           ++E +    +++ +D  LL ++ G   +L+    G      ++++  + +  N SV  ++
Sbjct: 189 NLECEGCSVSFVGDDQVLLISQIGKFYVLSFNESGGISNLNKIIEIPEANYCNVSV--NN 246

Query: 415 ITTIGN----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
           +  I N    +  FL  +  DS+L+ +                      + P+   + +S
Sbjct: 247 VLQITNIEDCNSAFLCCQGSDSILLHWN--------------------YNVPTRGTVSKS 286

Query: 471 SSDALQDMVNGEELSLY--GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           ++   ++    E+  LY     S  +     + +F   D LVN GP  DF+ G
Sbjct: 287 NAGIEKE---DEDSWLYHEDETSQTSNRPLTSCTFTXIDKLVNCGPTSDFTIG 336


>gi|324502823|gb|ADY41238.1| DNA damage-binding protein 1, partial [Ascaris suum]
          Length = 1129

 Score = 70.1 bits (170), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 65/275 (23%), Positives = 127/275 (46%), Gaps = 23/275 (8%)

Query: 1112 AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
            A+GTA V  ++  ++ GR+L+F    +++  +  +  V+ KE+KGA  ++  L G L++A
Sbjct: 815  AVGTAVVLTDETESKSGRLLIFQVAPSSEGGRMRL--VHDKEIKGAAYSIQVLMGKLVVA 872

Query: 1171 SGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
                + L +WT  +   L    F +   LY+ + N   + +L+GD+ +S+  L++K   +
Sbjct: 873  INSCVRLFEWTAEKELRLECSDFDNVTALYLRTKN---DVVLVGDLMRSLSVLAYKPMES 929

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR-- 1285
                +A+DF +    A E +ID  T        +    +F      S   +G +L  +  
Sbjct: 930  SFEKIARDFVTNWMTACE-IIDMETF----LGAEIMFNLFTVVKDCSSKDEGIRLQLQET 984

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1345
              +++G  V  F    ++AT  D T        +    +L+GT DG +G I  L    + 
Sbjct: 985  GMYYLGESVNAFCHGSLIATHIDLT-------PSFTTPILYGTSDGGLGVIVQLTPQFYD 1037

Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
             +  L+ ++     +   +    +R F S+G+  +
Sbjct: 1038 FVHELETRIAAVTKNCMRIEHGQYRTFESDGRTEQ 1072



 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 65/144 (45%), Gaps = 12/144 (8%)

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           P +W   N+  +A  ++ +P P GGV+VVG   I YH  +       N Y+         
Sbjct: 200 PPLWKQDNIEAEACMVIPIPQPYGGVIVVGHEAISYHKDA-------NAYSAIAPPLIHQ 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVL-LTVVYDGRV-VQRLDLSKTNPSVLTS 413
            + S   ++D     +L  D   LS +   L+L L V  DG   V+ L +     + +  
Sbjct: 253 SQISCYGKIDRDGQRYLLGD---LSGRIFMLLLDLDVATDGTASVKDLKVELLGETSIPE 309

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            +  + N + F+GSR GDS LV+ 
Sbjct: 310 CVVYLDNGVVFIGSRFGDSQLVRL 333


>gi|156389050|ref|XP_001634805.1| predicted protein [Nematostella vectensis]
 gi|156221892|gb|EDO42742.1| predicted protein [Nematostella vectensis]
          Length = 1157

 Score = 70.1 bits (170), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 76/316 (24%), Positives = 135/316 (42%), Gaps = 31/316 (9%)

Query: 1102 TTTKENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAIS 1158
            T + +  T   +GTAYV  E+   + GR+LLF  S G+        + +V  KE+KGA+ 
Sbjct: 833  TLSDDPHTYYCVGTAYVFPEEPEPKAGRLLLFHLSEGK--------LVQVAEKEVKGAVY 884

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTE--LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
            +L    G +L      + + +WT  +      ++YD   +  + L    +FIL+GD+ +S
Sbjct: 885  SLVEFNGKVLAGINSTVSIFEWTADKEFRYECSYYDN--ILALYLKTKGDFILVGDLMRS 942

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  L +         +A DF      A E L D + L    ++   N+         +  
Sbjct: 943  MTLLVYLPLEGSFQEIAHDFSPKWMTAIEILDDDTFLG---AENSYNLFTCTKDSGATTD 999

Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF--ALLFGTLDGSIG 1334
             +   L    ++H+G  V  F    ++          PG D +  F   +LFGT++G IG
Sbjct: 1000 EERYHLQDAGQYHLGEFVNVFRHGSLVMEH-------PG-DASTPFQGCVLFGTVNGRIG 1051

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD---SIVDCEL 1391
             +A + +  F  L  +QKKL   +  V  ++   +   H +  +H    +     +D +L
Sbjct: 1052 IVAQIAQDLFNFLIQVQKKLNKVIKSVGKIDHSLYPFPHCSNLSHSRKMEPAHGFIDGDL 1111

Query: 1392 LSHYEMLPLEEQLEIA 1407
            +  +  LP     E+ 
Sbjct: 1112 IESFLDLPRARMEEVV 1127



 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 95/212 (44%), Gaps = 40/212 (18%)

Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
           N+R L+  HV D  F++G   P +V +++             H   +    I+  L+ H 
Sbjct: 155 NIR-LEELHVVDIQFLYGCANPTIVFIYQDP-----------HGRHVKTYEIN--LRDHE 200

Query: 297 LI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
                W   N+  +A +++AVP+P+GG L++G  +I YH  S   A+A            
Sbjct: 201 FAKGPWKQDNVEVEACRVIAVPNPLGGALIIGQESITYHKGSNYHAIA----------PP 250

Query: 354 ELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKT 406
            L +SS +   ++D   + +L  D+       G L +L +    + DG   V+ L L   
Sbjct: 251 ALKQSSLTCHGKIDTNGSRYLLGDM------NGRLYMLLLERQELIDGTYEVKDLKLEML 304

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
             + +   +  + N + F+GS LGDS L + +
Sbjct: 305 GETSIAHCLVYLDNGVVFIGSMLGDSQLAKLS 336


>gi|145348011|ref|XP_001418451.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578680|gb|ABO96744.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1196

 Score = 70.1 bits (170), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 88/377 (23%), Positives = 158/377 (41%), Gaps = 49/377 (12%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            VRI++P  A    ++   + +  SE AL++  V L    T  NE LLA+GTA       A
Sbjct: 847  VRIVDPKEA----KSTFVLELHKSEAALSLCHVFL----TGPNELLLAVGTAV--NLTFA 896

Query: 1125 AR----GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
             R    G + L+  G    N    +  V+S    G + AL   +GHLL      + ++ +
Sbjct: 897  PRNCDGGFIHLYRYG----NDGRTLNLVHSTPTDGPVGALCGYKGHLLAGVNNSLRIYDY 952

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
               +L         P ++ +L+   + I +GD+ +SI+++ +K     + + A D     
Sbjct: 953  GKKKLLRKVENRNFPNFITTLHAAGDRIYVGDVQESIHYVKYKADEGSIYIFADDTKPRY 1012

Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----------GQKLLSRAEFHV 1290
              AT   +D  TL+   +D+  NI +      +SE              Q +L+ A    
Sbjct: 1013 ITAT-LPLDYDTLA--GADKFGNIFVNRLPKDVSEDMDDDPTGGKNIYSQGVLNGAPNKS 1069

Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRL 1347
                  ++   + A +  +    PG  +     +++GT  G IGC+ P     E+ F   
Sbjct: 1070 ETSAQTYIGETVCALT--KGALQPGGIEI----IMYGTFMGGIGCLLPFSSRSEIEF--F 1121

Query: 1348 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
              L+  +    P + G +  +FR +++  K       +++D +L   +  LP + Q  IA
Sbjct: 1122 THLEMHMRQEAPSIVGRDHMAFRSYYAPVK-------NVIDGDLCEQFGALPADVQRRIA 1174

Query: 1408 HQTGTTRSQILSNLNDL 1424
             +   T  +IL  L  +
Sbjct: 1175 EEMDRTPGEILKKLEQV 1191


>gi|312072035|ref|XP_003138882.1| hypothetical protein LOAG_03297 [Loa loa]
 gi|307765956|gb|EFO25190.1| hypothetical protein LOAG_03297 [Loa loa]
          Length = 1197

 Score = 70.1 bits (170), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 81/364 (22%), Positives = 148/364 (40%), Gaps = 32/364 (8%)

Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1137
            +T +  P    E A  + +V  F   +     L+  G         A  G +  F    N
Sbjct: 855  ETLSHFPFAEDEAAFAIAMVQ-FQNQSDTQFVLVGCGCELQLKPRKANGGCIYTFLLAAN 913

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
                Q L+    + E+   ++A+   +G  L   G K+ L+     +L         P  
Sbjct: 914  GTTLQ-LLHRTATDEV---VNAIHDFRGMALAGVGKKVRLYDLGKRKLLAKCENRQIPTQ 969

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            VV +  +   I++ D  +S++F+ +K+Q  QL++   D  S        L+D  T++  V
Sbjct: 970  VVDIRSMGQRIVVSDSQESVHFMRYKKQDGQLSIFC-DETSPRYVTCVCLLDYDTVA--V 1026

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA--------TSSDR 1309
             D   NI +      ++E  +      RA +  G       +L+ +A        TS  +
Sbjct: 1027 GDRFGNIAVLRLPKGVTEEVQEDPTGVRALWDRGNLNGASQKLEAIAHLYIGDAITSMQK 1086

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1366
            T   PG++      L + T+ G IG + P    DE  F   Q+L+  +    P + G + 
Sbjct: 1087 TSLVPGAND----CLCYTTISGIIGILVPFMSRDEFEF--FQNLEMHMRVEYPPLCGRDH 1140

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
             ++R ++   K       SI+D +L   Y ++PL++Q  +  + G   ++I   L D+  
Sbjct: 1141 LAYRSYYFPVK-------SIIDGDLCEQYSLMPLDKQKSVGEELGRKSTEIHKKLEDIRT 1193

Query: 1427 GTSF 1430
              +F
Sbjct: 1194 RYAF 1197


>gi|168064351|ref|XP_001784126.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664326|gb|EDQ51050.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1214

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 93/385 (24%), Positives = 167/385 (43%), Gaps = 62/385 (16%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1121
            +R+L+P  +     T   + +Q +E A ++  V   +   KE  TL+A+GTA       +
Sbjct: 862  IRVLDPKTS----TTTCLLELQENEAAFSLCAVNFHDN--KELGTLIAVGTAKDLQFMPK 915

Query: 1122 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
              A+ G + ++   R AD  + ++  V+   + G  +AL   QG LL+  G  + ++   
Sbjct: 916  KEASGGFIHIY---RFADEGK-VLELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIYDLG 971

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
              +L         P  +++++   + I +GDI +S +++ ++    QL   A D  S   
Sbjct: 972  KRKLLRKCENKNFPNTIIAIHTYGDRIYVGDIQESFHYVKYRRDENQLYTFADD--SCPR 1029

Query: 1242 FATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKG-------QKL 1282
            + T  L ID  T++   +D+  NI +      +SE            W+         K+
Sbjct: 1030 WLTASLHIDFDTMA--GADKFGNIYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAPNKV 1087

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1339
                +FHVG  VT   +  ++          PG  ++    +L+GT+ GS+G + P    
Sbjct: 1088 EEIIQFHVGEVVTSLQKASLI----------PGGGES----VLYGTIMGSMGALLPFSSR 1133

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
            +++ F     L+  L    P + G +   FR       A+ P  D ++D +L   Y ML 
Sbjct: 1134 EDVDF--FSHLEMHLRQENPPLCGRDHMGFR------SAYFPVKD-VIDGDLCEQYPMLT 1184

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDL 1424
             E Q +IA     T  +IL  L D+
Sbjct: 1185 SELQKKIADDLDRTPGEILKKLEDI 1209


>gi|449283451|gb|EMC90093.1| DNA damage-binding protein 1 [Columba livia]
          Length = 1140

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 85/346 (24%), Positives = 147/346 (42%), Gaps = 57/346 (16%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
            K+  T   +GTA V  E+   + GR+++F  S G+        +  +  KE+KGA+ ++ 
Sbjct: 812  KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 863

Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
               G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S
Sbjct: 864  EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 918

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  L++K        +A+DF      A E L D + L    ++   N+ +       +  
Sbjct: 919  VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 975

Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             + Q L      H+G  V  F    L +Q L  +S  T            ++LFGT++G 
Sbjct: 976  EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGM 1025

Query: 1333 IGCIAPLDELTFRRLQSLQKKL---VDSV--------PHVAGLNPRSFRQFHSNGKAHRP 1381
            IG +  L E  +  L  +Q +L   + SV        P +  L   + + FH+  K   P
Sbjct: 1026 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSLVQLRAWASQSFHTERKT-EP 1084

Query: 1382 GPDSIVDCELLSHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
                 +D +L+  +      +M  +   L+I   +G  R   + +L
Sbjct: 1085 AT-GFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129


>gi|224109600|ref|XP_002315251.1| predicted protein [Populus trichocarpa]
 gi|222864291|gb|EEF01422.1| predicted protein [Populus trichocarpa]
          Length = 1213

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 92/385 (23%), Positives = 167/385 (43%), Gaps = 63/385 (16%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +R+L+P  A     T   + +Q +E A ++  V   N   KE+ TLLA+GTA  +G    
Sbjct: 862  IRVLDPRSAA----TTCLLELQDNEAAFSLCTV---NFHDKEHGTLLAVGTA--KGLQFW 912

Query: 1125 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
             +  ++       +  D+ ++L   ++  +++G   AL   QG LL   G  + L+    
Sbjct: 913  PKRSLVTGFIHIYKFVDDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGSVLRLYDLGK 971

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD------- 1235
              L         P  +VS++  ++ I +GDI +S +F  ++    QL + A D       
Sbjct: 972  KRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLT 1031

Query: 1236 ------FGSL---DCFATEFLIDGSTLSLVVSDEQKNI----QIFYYAPKMSESWKGQKL 1282
                  F S+   D F   +    + L   VSDE +      +I +   K++ +    K+
Sbjct: 1032 SSYHVDFDSMAGADKFGNIYF---ARLPQDVSDEIEEDPTGGKIKWEQGKLNGA--PNKV 1086

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1339
                +FH+G  V    +  ++          PG  +     +++GT+ GS+G + P    
Sbjct: 1087 EEIVQFHIGDVVNSLQKASLI----------PGGGE----CIIYGTVMGSVGALLPFTSR 1132

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
            D++ F     L+  L    P + G +  S+R       A+ P  D ++D +L   +  LP
Sbjct: 1133 DDVDF--FSHLEMHLRQDHPPLCGRDHMSYR------SAYFPVKD-VIDGDLCEQFPTLP 1183

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDL 1424
            L+ Q +IA +   T  +IL  L ++
Sbjct: 1184 LDAQRKIADELDRTPGEILKKLEEV 1208


>gi|224004656|ref|XP_002295979.1| spliceosome associated factor 3b, subunit 3; 130kD spliceosome
            associated protein [Thalassiosira pseudonana CCMP1335]
 gi|209586011|gb|ACI64696.1| spliceosome associated factor 3b, subunit 3; 130kD spliceosome
            associated protein [Thalassiosira pseudonana CCMP1335]
          Length = 1212

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 67/289 (23%), Positives = 122/289 (42%), Gaps = 47/289 (16%)

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            G + +L   QG LL+  G  + L++    +L         P  V +L    +   +GD+ 
Sbjct: 945  GPVLSLVHFQGRLLVGVGKTVRLYEMGKRQLLKKCELRGMPTMVKTLQAAGDRAFVGDMM 1004

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +S+ F+ +     +L L+AKD         E L+D +T++  V D+  N+ I    P+ +
Sbjct: 1005 QSMQFIRYDSTANRLVLVAKDRNPRPITCQE-LLDINTVA--VGDKFGNVTILRL-PRGA 1060

Query: 1275 ES-----------WKGQ------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
            ++           W         KL +   +HVG  VT   R  ++A  ++         
Sbjct: 1061 DAGAIDVTGTRALWDSARDDATPKLETLCTYHVGEVVTSMTRASLVAGGAE--------- 1111

Query: 1318 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
                 +L++ T+ G +G   P    D++ F    SL+  L    P   G +P+S+R +++
Sbjct: 1112 -----SLIYVTVTGRVGAFVPFTSRDDVEF--YTSLEGFLRTETPRPTGRDPQSYRSYYA 1164

Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
              K        IVD +L   +  LP E + +IA     +  +++  L D
Sbjct: 1165 PMK-------HIVDGDLCDAFAQLPYETKQKIAESLDRSVGEVMKKLED 1206


>gi|356576847|ref|XP_003556541.1| PREDICTED: splicing factor 3B subunit 3-like [Glycine max]
          Length = 1214

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 90/389 (23%), Positives = 168/389 (43%), Gaps = 71/389 (18%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA----YVQG 1120
            +R+L+P R G    T   + +Q +E A ++  +   N   KE  TLLA+GTA    ++  
Sbjct: 863  IRVLDP-RTG---NTTCLLELQENEAAFSICTI---NFHDKEYGTLLAVGTAKGLQFLPK 915

Query: 1121 EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
              + A G + ++   R  ++ ++L   ++  +++G   AL   QG LL   GP + L+  
Sbjct: 916  RTITA-GFIHIY---RFVEDGRSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDL 970

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
                L         P  +VS++  ++ I +GD+ +S ++  ++    QL + A D     
Sbjct: 971  GKRRLLRKCENKLFPNTIVSIHAYRDRIYVGDVQESFHYCKYRRDENQLYIFADD----- 1025

Query: 1241 C----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR 1285
            C        + ID  T++   +D+  NI        +S+            W+  KL   
Sbjct: 1026 CVPRWLTASYHIDFDTMA--GADKFGNIYFVRLPQDVSDEIEEDPTGGRIKWEQGKLNGA 1083

Query: 1286 -------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
                    +FH+G  VT   +  ++          PG  +     ++FGT+ GS+G +  
Sbjct: 1084 PNKVEEIVQFHIGDVVTCLQKASLI----------PGGGE----CIVFGTVMGSVGALHA 1129

Query: 1339 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
                D++ F     L+  +    P + G +  ++R       A+ P  D ++D +L   Y
Sbjct: 1130 FTSRDDVDF--FSHLEMHMRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQY 1180

Query: 1396 EMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              LP++ Q +IA +   T  +IL  L ++
Sbjct: 1181 PTLPMDLQRKIADELDRTPGEILKKLEEV 1209


>gi|356536504|ref|XP_003536777.1| PREDICTED: splicing factor 3B subunit 3-like [Glycine max]
          Length = 1214

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 90/389 (23%), Positives = 167/389 (42%), Gaps = 71/389 (18%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA----YVQG 1120
            +R+L+P  +     T   + +Q +E A ++  V   N   KE  TLLA+GTA    ++  
Sbjct: 863  IRVLDPRTS----NTTCLLELQENEAAFSICTV---NFHDKEYGTLLAVGTAKGLQFLPK 915

Query: 1121 EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
              V A G + ++   R  ++ ++L   ++  +++G   AL   QG LL   GP + L+  
Sbjct: 916  RTVTA-GFIHIY---RFVEDGRSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDL 970

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
                L         P  ++S++  ++ I +GD+ +S ++  ++    QL + A D     
Sbjct: 971  GKKRLLRKCENKLFPNTIISIHAYRDRIYVGDVQESFHYCKYRRDENQLYIFADD----- 1025

Query: 1241 C----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR 1285
            C        + ID  T++   +D+  NI        +S+            W+  KL   
Sbjct: 1026 CVPRWLTASYHIDFDTMA--GTDKFGNIYFVRLPQDVSDEIEEDPTGGRIKWEQGKLNGA 1083

Query: 1286 -------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
                    +FHVG  VT   +  ++          PG  +     ++FGT+ GS+G +  
Sbjct: 1084 PNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CIVFGTVMGSVGALHA 1129

Query: 1339 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
                D++ F     L+  +    P + G +  ++R       A+ P  D ++D +L   Y
Sbjct: 1130 FTSRDDVDF--FSHLEMHMRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQY 1180

Query: 1396 EMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              LP++ Q +IA +   T  +IL  L ++
Sbjct: 1181 PTLPMDLQRKIADELDRTPGEILKKLEEV 1209


>gi|340520436|gb|EGR50672.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1212

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 104/461 (22%), Positives = 200/461 (43%), Gaps = 67/461 (14%)

Query: 998  IPLKATPHQITYFAEKNLY-----------PLIVSVPVLKP--LNQVLSLLIDQEVGHQI 1044
            IPL  TP ++    ++ L+           P + +  +  P  +N    +L  +E GH  
Sbjct: 791  IPLTYTPKKMVKHPDQPLFYVIEADNHTLSPALCAQLLADPARVNGDSKVLPPEEFGHPR 850

Query: 1045 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1104
             N   +S             + +++P    G  Q    I ++ +E A+++ +VT     +
Sbjct: 851  GNRRWASC------------ISVVDPLAEDG--QVLQRIDLEENEAAVSLAIVTF---AS 893

Query: 1105 KENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALA 1161
            +ENET L +GT    G+D+    R          R   + + LV  ++  +++    A+ 
Sbjct: 894  QENETFLVVGT----GKDMVLNPRSFSDAFVHIYRFERDGRGLVF-IHKTKVEEPPMAMI 948

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
              QG +L+  G  + ++     +L   +  +  P  +VSLN   + I++GD+ + I ++ 
Sbjct: 949  PFQGRVLVGIGKMLRIYDLGMRQLLRKSQAEVAPQQIVSLNAQGSRIVVGDVQQGITYVV 1008

Query: 1222 WKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK-- 1278
            +K+Q  +L     D  ++  + T   ++D  T +    D+  NI I     K SE     
Sbjct: 1009 FKQQTNKLIPFVDD--TVARWTTCSTMVDYETTA--GGDKFGNIFIVRAPQKASEEADEE 1064

Query: 1279 --GQKLL-SRAEFHVGAHVTKF---LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
              G  LL +R+  H  +H       L  Q + TS  +T    G  +     LL+  L G+
Sbjct: 1065 PAGLHLLNARSYLHGTSHRLDLMCHLYTQDIPTSITKTSLVVGGQE----VLLWSGLMGT 1120

Query: 1333 IGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
            IG + P    ++  F   QSL++ L    P +AG +   +R +++  K        ++D 
Sbjct: 1121 IGVLIPFVTREDADF--FQSLEQHLRAEDPPLAGRDHLMYRSYYAPMKG-------VIDG 1171

Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            +L   Y +LP +++  IA +   +  +I   ++D+   ++F
Sbjct: 1172 DLCERYALLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1212


>gi|358056808|dbj|GAA97158.1| hypothetical protein E5Q_03834 [Mixia osmundae IAM 14324]
          Length = 1243

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 83/321 (25%), Positives = 139/321 (43%), Gaps = 35/321 (10%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR----GRVLLFSTGRNADN 1140
            +QS E+   +  V+L            A+GTA+     V AR    GRVL F   R+ D 
Sbjct: 914  LQSDEHGTALETVSLHGAAH------FAVGTAF-SDRTVDAREPKKGRVLTFM--RDGDK 964

Query: 1141 PQNLVTEVYSKELKGAISALASLQGHLL--IASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
             +  V  V    L+G +  L  L    L  IA+    + H      ++ +  + A     
Sbjct: 965  FEQHVHAV----LEGGVFGLCQLPNSFLAAIANAQVKVFHVTEQAHIDQMTCW-AGTFLA 1019

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
             S++   + I++GD+++S+  L W E    L+ +A++       A EFL  G T    + 
Sbjct: 1020 QSISSRDSQIIVGDLYRSVVLLQWDEAKDTLSEVAREHHVNGMSAVEFL--GFTDDRYIG 1077

Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRT-GAAPGSD 1317
             EQ+ + IF    K     +   L +   FH+G +VT+  +  ++   +D + GAAP   
Sbjct: 1078 TEQE-LNIFTLT-KTKTRERIDILETEGMFHIGEYVTRIRKGALVPGYTDTSFGAAP--- 1132

Query: 1318 KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
                  LLFGT DGS+G I         +L +L++ +   +    GL    +R F +  +
Sbjct: 1133 -----QLLFGTSDGSLGVIVNCTPEVSLKLFALERNMRAVIRAFGGLEQVDWRAFRAPHR 1187

Query: 1378 AHRPGPDSIVDCELLSHYEML 1398
             H P     VD +++  +  L
Sbjct: 1188 VHEPV--GFVDGDMIGRFAEL 1206


>gi|268568396|ref|XP_002640241.1| C. briggsae CBR-TAG-203 protein [Caenorhabditis briggsae]
          Length = 1218

 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 68/320 (21%), Positives = 145/320 (45%), Gaps = 39/320 (12%)

Query: 1126 RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1185
            RG V  F    N D    L    +  E    + A+   +G  L+  G  + ++     +L
Sbjct: 923  RGCVYTFHLSPNGDRFDFL----HRTETPLPVGAIHDFRGMALVGFGKFLRMYDIGQKKL 978

Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----FGSLDC 1241
                     P+ +V++      I++ D  +S++FL +++   QL + A D    + S  C
Sbjct: 979  LAKCENKNFPVNIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVSCVC 1038

Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1301
                 ++D  T++  ++D+  N+ +     +++E  +    +S++ +  G       +++
Sbjct: 1039 -----VLDYHTVA--IADKFGNLSVVRLPERVNEDVQDDPTVSKSVWDRGWLNGASQKVE 1091

Query: 1302 MLA--------TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSL 1350
            ++A        TS  +T   PG+++    AL++ T+ G+IGC+      DE+ F    +L
Sbjct: 1092 LVANFFIGDTITSLQKTSLMPGANE----ALVYTTIGGAIGCLVSFMSKDEVDF--FTNL 1145

Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
            +  +    P + G +  S+R +++  K       S++D ++   + ++ L +Q E+A + 
Sbjct: 1146 EMHVRSEYPPLCGRDHLSYRSYYAPCK-------SVIDGDICEQFSLMELSKQKEVAEEL 1198

Query: 1411 GTTRSQILSNLNDLALGTSF 1430
            G T S+I   L D+    +F
Sbjct: 1199 GKTVSEISKKLEDIRTRYAF 1218


>gi|324518783|gb|ADY47203.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
            suum]
          Length = 108

 Score = 68.9 bits (167), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 37/100 (37%), Positives = 60/100 (60%), Gaps = 5/100 (5%)

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
            A +FLID   ++ ++SDE  NI +F Y P+  ES  G++L+ R+E ++G +V  F+R++ 
Sbjct: 10   AAQFLIDNRQMAFIMSDEAANIAVFNYLPEALESSGGERLILRSEINIGTNVNSFMRVKG 69

Query: 1303 LATSSDRTGAAPGSDKT-NRFALLFGTLDGSIGCIAPLDE 1341
              +S    G       + NR ++LF +LDGS G + PL E
Sbjct: 70   HISS----GFVENEHYSLNRQSVLFCSLDGSFGFVRPLSE 105


>gi|68471462|ref|XP_720279.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           fragment [Candida albicans SC5314]
 gi|46442139|gb|EAL01431.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           fragment [Candida albicans SC5314]
          Length = 423

 Score = 68.6 bits (166), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 45/199 (22%), Positives = 93/199 (46%), Gaps = 15/199 (7%)

Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
           F+H Y EP + +L  ++  WAG +           L++   LK    ++   NLP++  +
Sbjct: 3   FLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTLDLNLKSTISVFKIDNLPYEIDR 62

Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELD 365
           ++ +PSP+ G L+VG N  IH  +      +A+N +      S  S Q+  +S  +++L+
Sbjct: 63  VIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLE 120

Query: 366 AAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITT 417
                 + +D   LL  +TG+   +    DG+ ++R+ +   +             ++  
Sbjct: 121 NCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAI 180

Query: 418 IGNSLFFLGSRLGDSLLVQ 436
           +  ++ F+ +  G+S L+Q
Sbjct: 181 LDKNMLFIANSNGNSPLIQ 199


>gi|358378986|gb|EHK16667.1| hypothetical protein TRIVIDRAFT_40938 [Trichoderma virens Gv29-8]
          Length = 1212

 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 104/460 (22%), Positives = 195/460 (42%), Gaps = 65/460 (14%)

Query: 998  IPLKATP--------HQITYFAEKN---LYPLIVSVPVLKP--LNQVLSLLIDQEVGHQI 1044
            IPL  TP        H + Y  E +   L P + +  +  P  +N    +L  +E GH  
Sbjct: 791  IPLTYTPKKMVKHPDHPLFYVIEADNHTLAPELCAKLLADPARVNGDTKILPAEEFGHPR 850

Query: 1045 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1104
             N   +S             + +++P    G  Q    I ++ +E A++V +VT     +
Sbjct: 851  GNRRWASC------------ISVVDPLAEDG--QVLQRIDLEENEAAVSVAIVTF---AS 893

Query: 1105 KENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALA 1161
            +ENET L +GT    G+D+    R          R   + + LV  ++  +++    A+ 
Sbjct: 894  QENETFLVVGT----GKDMVVNPRSFSDAFVHIYRFERDGRGLVF-IHKTKVEEPPMAMI 948

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
              QG +L+  G  + ++     +L   A  +  P  ++SL+   + I++GD+ + I +  
Sbjct: 949  PFQGRVLVGIGKTLRIYDLGMRQLLRKAQAEVAPQQIISLSTQGSRIVVGDVQQGITYAV 1008

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            +K+   +L     D  +     T  ++D  T++    D+  NI I     K SE    ++
Sbjct: 1009 YKQSTNKLIPFVDDTVARWTTCTT-MVDYETVA--GGDKFGNIFIVRSPQKASEEADEEQ 1065

Query: 1282 -----LLSRAEFHVGAHVTKF---LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
                 L +R   H  +H       L  Q + TS  +T    G        LL+  L G+I
Sbjct: 1066 AGLHLLNARDYLHGTSHRLDLMCHLFTQDIPTSIAKTSLVVGGQD----VLLWSGLMGTI 1121

Query: 1334 GCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
            G + P    ++  F   QSL++ +    P +AG +   +R +++  K        I+D +
Sbjct: 1122 GVLIPFITREDTDF--FQSLEQHMRAEDPPLAGRDHLMYRSYYAPMKG-------IIDGD 1172

Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            L   Y +LP +++  IA +   +  +I   ++D+   ++F
Sbjct: 1173 LCERYALLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1212


>gi|168045572|ref|XP_001775251.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673464|gb|EDQ59987.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1201

 Score = 68.2 bits (165), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 88/385 (22%), Positives = 163/385 (42%), Gaps = 62/385 (16%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1121
            +R+L+P  +     T   + +Q +E A ++  V   +   KE  TL+A+GTA        
Sbjct: 849  IRVLDPKTS----TTTCLLELQENEAAFSICTVNFHDN--KELGTLIAVGTAKDLQFMPR 902

Query: 1122 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
              A+ G + ++           ++  V+   + G  +AL   QG LL+  G  + ++   
Sbjct: 903  KEASGGFIHIYRFAEEG----RVLELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIYDLG 958

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
              +L         P  +++++   + I +GDI +S +++ ++    QL   A D  S   
Sbjct: 959  KRKLLRKCENKNFPNTIIAIHTYGDRIYVGDIQESFHYVKYRRDENQLYTFADD--SCPR 1016

Query: 1242 FATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSRA--- 1286
            + T  L ID  T++   +D+  N+ +      +SE            W+  +L       
Sbjct: 1017 WLTASLHIDFDTMA--GADKFGNVYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAPNKV 1074

Query: 1287 ----EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1339
                +FHVG  VT   +  ++          PG  ++    +L+GT+ GS+G + P    
Sbjct: 1075 DEIIQFHVGEVVTSLQKASLI----------PGGGES----MLYGTVMGSMGALLPFSSR 1120

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
            +++ F     L+  L    P + G +  +FR       A+ P  D ++D +L   Y ML 
Sbjct: 1121 EDVDF--FSHLEMHLRQENPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYSMLT 1171

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDL 1424
             E Q +IA     T  +I+  L D+
Sbjct: 1172 SELQKKIADDLDRTPGEIVKKLEDI 1196


>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
            queenslandica]
          Length = 1142

 Score = 68.2 bits (165), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 137/318 (43%), Gaps = 27/318 (8%)

Query: 1102 TTTKENETLLAIGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1160
            TT  E  ++  +GTA V+ E+  ++ GR+L+F+           +  ++ K   GA+  +
Sbjct: 821  TTNDEERSVYVVGTALVKPEEKESSTGRILVFAVNSGK------LELLHEKLENGAVFQV 874

Query: 1161 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
                G +L +    + ++      L     Y    L  + L    +FIL+GDI +S+  L
Sbjct: 875  LGFNGKILNSVNSGVFVNALVDGALKEECAYKNNIL-ALYLKTKGDFILVGDILRSLKLL 933

Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
             +KE+   L  +  D     CF T   +ID       +  + ++I   +   K +E+   
Sbjct: 934  VYKEE-LGLEEIGVDHNISPCFCTAIEMIDDENY---LGADGRHI---FICQKNTEATSE 986

Query: 1280 QKLLSRAE---FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
              LL   +    + G +V  F R   +    D  GA   S    +  +LFGT+ G+IG I
Sbjct: 987  ADLLYMVQPSRMYFGDNVNVFSRGSFVM---DHPGAGASSLLQGK-PILFGTVHGAIGLI 1042

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHY 1395
              L+  T+  L  LQ+K+  ++  V  +    +R F +    HR  P    +D +L+  +
Sbjct: 1043 GTLNMDTYTLLSKLQQKMAANIKSVGNIEHEIYRSFSNE---HRSKPFAGFIDGDLVEKF 1099

Query: 1396 EMLPLEEQLEIAHQTGTT 1413
              LP  +  +I     TT
Sbjct: 1100 LELPRPQMSQIVQGIKTT 1117



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 104/447 (23%), Positives = 165/447 (36%), Gaps = 92/447 (20%)

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           I L DL   ++ D  F+HG   P +  + E      GRV        +    IS   K+ 
Sbjct: 156 IRLEDL---YITDIQFLHGTENPTIAYISEEPSVATGRV--------LKTFVISQRDKEL 204

Query: 296 -PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
            P  W    +   A  L +VPSP  G++VVGA+++ Y           N+ + ++D    
Sbjct: 205 LPGPWKPNTIEGQASLLCSVPSPYNGLIVVGADSVAY----------FNDTSHTVDPIV- 253

Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV------VQRLDLSKTNP 408
           +  S  S      H+ +L  D        G L+ L + +   +      +  + L     
Sbjct: 254 IKESVISCIEPLDHSRYLLGDFR------GRLLTLFLEFSEEMESGMTNIVNMKLEVLGE 307

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
             +   ++ + N + F+GS  GDS LV+         LSS   E  G I           
Sbjct: 308 ISIPHTLSYLDNGVVFVGSTKGDSQLVK---------LSSSPLENGGYI----------- 347

Query: 469 RSSSDALQDMVN-GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
               D L+ M N G  L +    S      Q          L   G L+    G+ IN  
Sbjct: 348 ----DVLESMTNIGPILDM----SVVDLDKQGRDVLVCCSGLGKDGALRIVKSGIGINEA 399

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           AS            ++LPG KGIW++             + A  +DE    ++++   +T
Sbjct: 400 AS------------IDLPGIKGIWSL-------------KCAGREDELDDTVVLTFVGQT 434

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
           M L  A    E TE        +T    N+ G   +IQ+  +  R++D   M     + P
Sbjct: 435 MALRLAGEEVEETELPALVTDQQTFYCSNVTG-NAIIQITTKSVRLMDDKAMELICDWSP 493

Query: 648 SNSE--SGSGSENSTVLSVSIADPYVL 672
            +    S +   +S V+     D Y L
Sbjct: 494 PDGRGISTAACNSSQVMVAVGCDLYYL 520


>gi|339259094|ref|XP_003369733.1| splicing factor 3B subunit 3 [Trichinella spiralis]
 gi|316965959|gb|EFV50595.1| splicing factor 3B subunit 3 [Trichinella spiralis]
          Length = 1241

 Score = 68.2 bits (165), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 67/276 (24%), Positives = 125/276 (45%), Gaps = 29/276 (10%)

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
            A++ALAS +G LL ++G  + ++     +L         P  +  +  + + I +GD+ +
Sbjct: 982  AVTALASFRGRLLASAGKMLRIYDLGKKKLLRKCENKHMPNLITHILTMGHRIFVGDVQE 1041

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL-VVSDEQKNIQ-----IFYY 1269
            S++F  +K    QL + A D     C A   ++D  T++L + SD   ++Q     I   
Sbjct: 1042 SVFFYRYKPIENQLVVFADDTHQRFCSAM-CILDYDTVALRLPSDCTDDVQEDPTGIRAL 1100

Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
              K   +   QK    A F+VG  VT   +  ++          PGS ++    L++ T+
Sbjct: 1101 WDKGILNGASQKCEMVATFYVGECVTCLQKAMLI----------PGSSES----LVYSTM 1146

Query: 1330 DGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
             G IG + P   +  +   Q L+  L    P + G +  ++R F++  K        ++D
Sbjct: 1147 SGMIGALVPFSSKEDYEFFQHLEMHLRTEYPPLCGRDHLAYRSFYAPVKG-------VID 1199

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +L   Y +L   +Q EI+++     S+I+  L D+
Sbjct: 1200 GDLCEQYCLLEYGKQKEISNELDRVPSEIMKKLEDI 1235


>gi|223994993|ref|XP_002287180.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976296|gb|EED94623.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1517

 Score = 67.8 bits (164), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 74/314 (23%), Positives = 137/314 (43%), Gaps = 42/314 (13%)

Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ----NLVTEVYSKELK---- 1154
            T E +  + IGTAY    ED   +GR+L+         P     + + + YS+ ++    
Sbjct: 1124 TSEYKPYILIGTAYAYPDEDEPTQGRILVVECNSGEAEPHLKSDDDMEDTYSRYVRHVTQ 1183

Query: 1155 ----GAISALASLQGHLLIAS-GPKIILHKWT--GTELNGIAFYDAP------PLYVVSL 1201
                G + +++   G  ++A+   K  L + +    ++  + F  A        L+V SL
Sbjct: 1184 MPTRGGVYSISPFYGGTVLATVNSKTHLCRLSIGCDQIGELKFVGAGHHGHMLSLFVKSL 1243

Query: 1202 -------------NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
                            K   ++GD+ +SI  + ++ +   +  LA+D+ +  C A E L 
Sbjct: 1244 AGSESESESSGTNRQAKQLAIVGDLMRSISLVEYQPKHNVIEELARDYNANFCTAVEMLT 1303

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G+ L    S+   N+ +  +    S      +L +  E+H+G    KF+   ++  S+ 
Sbjct: 1304 NGTYLG---SEGFNNLFVLRHNANASSEEARVRLDTVGEYHLGEMTNKFMGGSLIMPSN- 1359

Query: 1309 RTGAAPGSDKTNRFA-LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
             +G   G+      +  LFGT+DGSIG +  LD  TF  L  LQ+ ++  V  V  ++  
Sbjct: 1360 -SGGIMGAQNAYVGSQTLFGTVDGSIGSVLGLDGPTFAFLACLQRAILSIVKTVGDISHE 1418

Query: 1368 SFRQFHSNGKAHRP 1381
             +R F +  +  RP
Sbjct: 1419 EYRAFRAERQV-RP 1431


>gi|156097003|ref|XP_001614535.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148803409|gb|EDL44808.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 2558

 Score = 67.8 bits (164), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 53/215 (24%), Positives = 105/215 (48%), Gaps = 12/215 (5%)

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
            P  +++SL++V+N+I++GDI  S+  LS+  + A LN + +D+ ++ C +   L   S  
Sbjct: 2320 PSSWIMSLDVVENYIVVGDIMTSVTLLSYDFENAILNEVCRDYANIWCTSVSAL---SEN 2376

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL--RLQMLATSSDRTG 1311
              +VSD + N  +   +       +  KL   ++F+ G+ V K     L+ L    +R  
Sbjct: 2377 HFLVSDMESNFLVLQKSNIKFNDEESFKLSLVSQFNHGSVVNKMFSTSLRNLVDDEERRN 2436

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPLDE-LTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
                 +++    +L  + +GSI  + P    L F+R   ++  + D++  +  L+  S+R
Sbjct: 2437 EILQKEQS----ILCASSEGSISALIPFSNFLQFKRALCIEIAINDNISSLGNLSHSSYR 2492

Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
            ++  +  +       +VD EL   +  LP E QL+
Sbjct: 2493 EYKVSLAS--KNCKGVVDGELFKMFFYLPFERQLK 2525


>gi|357478269|ref|XP_003609420.1| Splicing factor 3B subunit [Medicago truncatula]
 gi|355510475|gb|AES91617.1| Splicing factor 3B subunit [Medicago truncatula]
          Length = 1225

 Score = 67.8 bits (164), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 96/416 (23%), Positives = 180/416 (43%), Gaps = 68/416 (16%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYE-----VRILEPDRAGGPWQTRATIPMQSSENALT 1093
            E G + ++++ S  D H  Y   E +     +R+L+P R G    T   + +Q +E A +
Sbjct: 843  ENGGEDEDNDDSLSDEHYGYPKSESDKWVSCIRVLDP-RTG---NTTCLLELQENEAAFS 898

Query: 1094 VRVVTLFNTTTKENETLLAIGTA----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVY 1149
            +  V   N   KE  TLLA+GTA    +     + A G + ++   R  D+ ++L   ++
Sbjct: 899  ICTV---NFHDKEYGTLLAVGTAKGLQFTPKRSLTA-GFIHIY---RFLDDGRSLEL-LH 950

Query: 1150 SKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1209
              +++G   AL   QG LL   GP + L+      L       + P  +VS++  ++ I 
Sbjct: 951  KTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGKRRLLRKCENKSFPSSIVSIHAYRDRIY 1010

Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
            +G I +S ++  ++    QL + A D        + + ID  T++   +D+  NI     
Sbjct: 1011 VGGIQESFHYCKYRRDENQLYIFADD-SVPRWLTSSYHIDFDTMA--GADKFGNIFFARL 1067

Query: 1270 APKMSE-----------SWKGQKLLSR-------AEFHVGAHVTKFLRLQMLATSSDRTG 1311
               +S+            W+  KL           +FHVG  +T   +  ++        
Sbjct: 1068 PQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVITSLQKASLV-------- 1119

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
              PG  +     +++GT+ GS+G +      D++ F     L+  +    P + G +  +
Sbjct: 1120 --PGGGE----CIVYGTVMGSVGALHAFTSRDDVDF--FSHLEMHMRQDNPPLCGRDHMA 1171

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +R       A+ P  D ++D +L   +  LP++ Q +IA +   T  +IL  L ++
Sbjct: 1172 YRS------AYFPVKD-VIDGDLCEQFPTLPMDLQRKIADELDRTPGEILKKLEEV 1220


>gi|428180158|gb|EKX49026.1| hypothetical protein GUITHDRAFT_68305 [Guillardia theta CCMP2712]
          Length = 1202

 Score = 67.8 bits (164), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 89/391 (22%), Positives = 157/391 (40%), Gaps = 71/391 (18%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            VR+++P+      +T+  I +  +E AL+V V T ++   ++  T L  GTA   G  V 
Sbjct: 847  VRVIDPNE----RETKQIIELDPNEAALSVCVATFYD---RKGHTFLCFGTAV--GHKVG 897

Query: 1125 ARGRVLLFSTGRNADNPQNLV----TEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
            +R       TG    +  ++V    T V+   + G   AL S QG LL+  G  + L++ 
Sbjct: 898  SR-------TGSGFLHTYSVVGSQLTFVHKTPIDGVPRALCSFQGRLLVGVGSALRLYEM 950

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----- 1235
               +L         P  VV+++ + + I +GD+ +SI FL +     +L + A D     
Sbjct: 951  GKRKLLRKCENRNIPNLVVTISTMGDRIYVGDVAESISFLKYNRILNELVIFADDTHPRW 1010

Query: 1236 -----------FGSLDCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
                           D F   FL      +S  +S+E   + +F         +K ++++
Sbjct: 1011 MTAACPVDYDTVAGADKFGNIFLTRLPDNVSDEISEEPGAVGMFEGNDLQGAHYKAEEIV 1070

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---- 1339
               ++HVG  V    +  +    SD              A+++GT+ G IG + P     
Sbjct: 1071 ---QYHVGETVCSLQKATLSPGGSD--------------AIIYGTMYGGIGALQPFVSRE 1113

Query: 1340 DELTFRRLQSLQKKLVDSVPH------VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
            D   F  L+   +    +  H      + G +  SFR ++   K        +VD +L  
Sbjct: 1114 DVDFFLHLEMHLRGAAGAREHKPAGEGICGRDQLSFRSYYFPVK-------DVVDGDLCE 1166

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +  L    Q +IA     T  ++   L D+
Sbjct: 1167 TFNYLSPSRQKQIAEDLDRTPGEVAKKLEDM 1197



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 110/554 (19%), Positives = 203/554 (36%), Gaps = 97/554 (17%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           ++ +C     G + S+A     G++    +D ++L  +  +ISVLEF    +      + 
Sbjct: 49  IQSICQMECFGLIRSMASFRLPGSN----KDYLVLGADSGRISVLEFSKERNQFERVHLE 104

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            +        K G      G  +  DP+GR   +     Q ++          V + D  
Sbjct: 105 TYG-------KSGCRRIVPGQFLASDPKGRAVMISAIEKQKLVY---------VFNRDA- 147

Query: 221 GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSW 276
                 S+++  S  +        H        G+  P+   L     + +    G+ S 
Sbjct: 148 ------SSKLTISSPLEAHKASTIHFSIVGVDVGFDNPIFAALEMDYSDADADETGQ-SA 200

Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP-----IGGVLVVGANTIHY 331
           +    +++   +   L     +    + P DA   + +P P       GVLV   N I Y
Sbjct: 201 EEFNKVLTFYELDLGLNH---VVRKASEPIDAASNMLIPVPGDTDGPSGVLVCAENKIAY 257

Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                   +AL      +   Q L  + +S      H         LL ++ GDL  LT+
Sbjct: 258 KKPDHEDVVALIPRRQGMPLDQPLLITGYS------HLKQKDGFFFLLQSEIGDLYRLTL 311

Query: 392 VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---MLSS 448
            Y+   V  ++++  +   +   IT +     F+ S  G+  L QF    G+    M+  
Sbjct: 312 TYNDEEVSEINITYFDTVPVAQSITILKTGFLFVASEFGNHALYQFLSIKGSDESDMMPV 371

Query: 449 GLKEEFGDIE----ADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
            ++ E   IE    A  P    L     ++L  +++   L L G      E   + ++  
Sbjct: 372 EVEIEGETIEIPHFAPRPLKNLLLVDEMESLSPILDMRVLDLAG------EETPQIYA-- 423

Query: 505 VRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHKSSRG 560
               L   GP   L+   +GL +            +   + ELP     +WTV     +G
Sbjct: 424 ----LCGKGPRSTLRTLRHGLAV------------AEMAVSELPSNPLAVWTV-----KG 462

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
            + D++           Y++++    T+VL   D + EVT+S  +    +T++  +L G 
Sbjct: 463 SSKDAA---------DKYIVVTFANATIVLSIGDTVEEVTDS-GFLATNKTLSV-SLLGD 511

Query: 621 RRVIQVFERGARIL 634
             ++QV   G R +
Sbjct: 512 DSLLQVHPNGLRTV 525


>gi|300176205|emb|CBK23516.2| unnamed protein product [Blastocystis hominis]
          Length = 702

 Score = 67.8 bits (164), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 83/358 (23%), Positives = 159/358 (44%), Gaps = 25/358 (6%)

Query: 1077 WQTRATIPMQSSENALTVRVVTLFN-TTTKENETLLAIGTAYV-QGEDVAARGRVLLFST 1134
            +  R  +P++ SE AL V   ++F  +   E   +  +GTA+V   E+  ++GR+L+   
Sbjct: 317  YAIRDELPLKPSEIALCVASGSIFPLSNAPERNEVFVVGTAFVLPEENEPSQGRLLVL-- 374

Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLL--IASGPKIILHKWTGTELNGIAFYD 1192
             R  ++   LV E     L G   ++   +G ++  + S  ++       + ++ +A  +
Sbjct: 375  -RAVEHRLELVAETM---LSGGCLSICLFKGKVVCGVNSELQVFDVDEKTSTISKLA-SE 429

Query: 1193 APPLYVVSL--NIVKNFILLGDI------HKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
               + V SL  N     I LGDI      +K +  +    Q AQL  +A +    D  A 
Sbjct: 430  VACISVTSLSPNEADETIALGDILYSVVVYKLVLEVVRGRQLAQLECIASERRRRDVTAL 489

Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPK--MSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
            E L +  +  +VV D   N+ +     +  +  S   + ++++  FH+   + +F+ +Q+
Sbjct: 490  ERLPEAQS-EMVVGDAYGNLMVMQVVEEADLDRSNPQKIVVTKESFHLDDQINRFVPVQL 548

Query: 1303 LAT-SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHV 1361
              + + D+       +    F L F T+ G IG I  L++  FR L++++  + + +  V
Sbjct: 549  FRSGAEDKKKEKRAEESEIAFNLAFATVSGRIGMIGALNDREFRMLRAIETAMENVITPV 608

Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
             GL+ + +R   SN           +D +L+  +  L  E Q +IA    T     LS
Sbjct: 609  GGLDHKQWR--CSNTPFGIKNLAYCIDGDLVEMFLELDDESQAKIADSVSTELRSALS 664


>gi|339235331|ref|XP_003379220.1| DNA damage-binding protein 1 [Trichinella spiralis]
 gi|316978142|gb|EFV61158.1| DNA damage-binding protein 1 [Trichinella spiralis]
          Length = 1329

 Score = 67.4 bits (163), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 75/322 (23%), Positives = 142/322 (44%), Gaps = 35/322 (10%)

Query: 1098 TLFNTTTKENETLLAIGTAYVQGEDVAA--RGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
            ++ + T  E++    I +A V   D     +GR+L+    R+     +L   V+ KE+ G
Sbjct: 995  SILSCTMGEDQNPFFILSAAVITADETEPLQGRLLMLRYERDGQGNSSL-NLVHEKEVNG 1053

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS--LNIVKNFILLGDI 1213
             + A+AS +  LL+A    ++L +W  +++ G+    +  L+V +  L +    IL+GDI
Sbjct: 1054 CVYAMASFKSKLLVAMNSSVLLFEW--SDVTGLQLVSSCSLFVTAMHLKVRDEVILVGDI 1111

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDF-----GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
             +SI  L +    +     A+D+      +++    ++ +     SL ++  QK++Q   
Sbjct: 1112 QRSIAVLRYVPSESSFVEEARDYHPNWISAIEVIDNDYFMAAEN-SLNITVSQKDLQ--- 1167

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                +SES   Q + S    H+G ++  F         S  + A   S  +N   ++ GT
Sbjct: 1168 -QQPVSES---QVVKSAGRLHLGEYINVFKH----GALSMYSYAGISSLVSN--PIMIGT 1217

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL----NPRSFRQFHSNGKAHRPGPD 1384
             +GSI     + +  FR L  LQ+   D VP   G     + R +  +  N  A      
Sbjct: 1218 AEGSILIYCQIHDSHFRVLNDLQRCFSDIVPDNVGCIAYDSYRRYVVYEKNAPAF----- 1272

Query: 1385 SIVDCELLSHYEMLPLEEQLEI 1406
              +D +L+     +P +E + +
Sbjct: 1273 GFIDGDLIEQLLEMPRQEAIRL 1294



 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 81/359 (22%), Positives = 152/359 (42%), Gaps = 74/359 (20%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R  +  +SA  L+ V   ++ G + +  + +  G + +     +++      ++++E+D+
Sbjct: 205 RFEVHSVSAEGLQYVTEGKMFGRIGAAKLFTPKGENKAL----MVIVTLKQDVAIVEYDN 260

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVYGLQMIILK 205
                RI +         L  +   E+F R    G L+ V P G   G+ +       + 
Sbjct: 261 G----RIKT---------LASRNISENFGRPASNGILLSVHPDGEVIGLRIMSSTFKCIT 307

Query: 206 ASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHE 265
            ++  S L                  S++ +N     + H+ DF+F+HG+  PV+ +++ 
Sbjct: 308 WNRATSKL------------------STYSLNY---SLTHLSDFVFLHGFQFPVIALIY- 345

Query: 266 RELTWAGRVSWKHH-TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
                 G +  +H  TC IS        +  P  WS  ++  +A+ L+AVP P+ GV+VV
Sbjct: 346 ------GDLVGRHVITCRISL--DEQEFENGP--WSRGHIEWEAHTLIAVPPPLCGVIVV 395

Query: 325 GANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTG 384
           G +++ Y           +N  +S  S   L +S  +   DAA           L    G
Sbjct: 396 GCSSLLY---------IRDNSTISTVSPPFLSKSIVNC-YDAAP----DGLTYFLGQLDG 441

Query: 385 DLVLLTVVYDGRVVQRLDLSKTNPSVL--TSDITTIG----NSLFFLGSRLGDSLLVQF 437
            L LL +  +     ++ LS+   ++L  TS   ++      SL F+GSR+ DS L++ 
Sbjct: 442 TLSLLKLDIETDAEGKVTLSRMRATILGVTSPPDSLSYMHKESLLFVGSRIADSKLLRL 500


>gi|221055487|ref|XP_002258882.1| CPSF (cleavage and polyadenylation specific factor), subunit A
            [Plasmodium knowlesi strain H]
 gi|193808952|emb|CAQ39655.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
            putative [Plasmodium knowlesi strain H]
          Length = 2478

 Score = 67.4 bits (163), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 70/322 (21%), Positives = 139/322 (43%), Gaps = 36/322 (11%)

Query: 1109 TLLAIGTAYVQGEDVA--ARGRVLLFSTGRNADNPQNLVTEVYSKELK-GAISALASLQG 1165
            TL+ +GTA    E +   + G + +F   +  +  Q  +  +Y+  +  G I+ L   + 
Sbjct: 2135 TLICVGTA-NNNERITEPSSGHIYVFVAKKKTN--QFEIKHIYTYNVNCGGITHLKQFRD 2191

Query: 1166 HLLIASGPKIILHKWTGTELN-GIAFYDA-------------------PPLYVVSLNIVK 1205
             ++ A    +++        N G   Y+A                   P  +++SL++VK
Sbjct: 2192 KIVAAVNNTVLILDIRNFLTNLGTYIYNASKAMKVESNDAFLEVASFTPSSWIMSLDVVK 2251

Query: 1206 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
            N+I++GDI  S+  LS+  + A LN + +D+ ++ C A       S    +VSD + N  
Sbjct: 2252 NYIVVGDIMTSVTLLSYDFENAILNEVCRDYANIWCTAL------SEDHFLVSDMESNFL 2305

Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFAL 1324
            +   +       +  KL   ++F+ G+ V K L   +     +  +   P        ++
Sbjct: 2306 VLQKSNIKFNDEESFKLSLVSQFNHGSVVNKMLSTSLRNLVDEYESEERPNEIVQKERSI 2365

Query: 1325 LFGTLDGSIGCIAPLDE-LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
            L  + +GSI  + P    + F+R   ++  + D++  +  L+  S+R++     +     
Sbjct: 2366 LCASSEGSISTLIPFSNFIQFKRALCIEIAINDNISSLGNLSHSSYREYKITLAS--KNC 2423

Query: 1384 DSIVDCELLSHYEMLPLEEQLE 1405
              +VD EL   +  LP E QL+
Sbjct: 2424 KGVVDGELFKMFFYLPFERQLK 2445


>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
          Length = 1054

 Score = 67.0 bits (162), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 78/300 (26%), Positives = 137/300 (45%), Gaps = 28/300 (9%)

Query: 1111 LAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
            + +GTA+  + E   ++GR+L+F   R  DN   L+ EV    L   +  L    G LL 
Sbjct: 754  VVVGTAFHNEVESQQSKGRILVF---RIEDNRLILLDEV---ALPACVYCLLPFNGRLLA 807

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGA 1227
                ++    W G + N +   ++   + +S ++V   +F+L+ D+ KS+  L   +QGA
Sbjct: 808  GINKRVQAFNW-GVDTNKLTKAESYSGHTLSHSMVSRGHFVLVADLMKSMTLLVEDQQGA 866

Query: 1228 QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
             +  LA++   L  + +   +ID  T   +  D   N+ +     + S     + L +  
Sbjct: 867  -IKELARN--PLPIWLSRIEMIDDETF--IGGDNSYNLIVVQKNAEASSEIDNELLDTVG 921

Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
            +FH+G  + KF +   L TS       P  D      +LFGT+ G+IG I  + +  +  
Sbjct: 922  QFHLGETINKF-KHGSLVTS-------PDMDSPKLPTILFGTVSGAIGVIVSISKDDYEF 973

Query: 1347 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHYEMLPLEEQLE 1405
             + LQK L   V  V GL   ++R F +    H   P  + +D +L+  +  L  ++ LE
Sbjct: 974  FEKLQKGLNRVVHGVGGLPFENWRSFSTE---HMTIPSKNFIDGDLIETFLDLRHDKMLE 1030


>gi|241560031|ref|XP_002400960.1| spliceosomal protein sap, putative [Ixodes scapularis]
 gi|215501812|gb|EEC11306.1| spliceosomal protein sap, putative [Ixodes scapularis]
          Length = 1019

 Score = 67.0 bits (162), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 91/395 (23%), Positives = 167/395 (42%), Gaps = 64/395 (16%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDV 1123
            +R+L P       Q+   + ++ +E AL+V +V     T+  +E  + +G A  +Q    
Sbjct: 660  IRVLNPSDG----QSLCKVALEQNEAALSVALVRF---TSHPDEQFVVVGAAREMQLNPR 712

Query: 1124 AARGRVLLFSTGRNADNPQNLVTE------VYSKELKGAISALASLQGHLLIASGPKIIL 1177
              RG  LL  T R A NP+  +        V++  ++ A +AL   QG LL   G  + L
Sbjct: 713  VCRGGGLLL-TYRLAPNPEEPMAGPTQLELVHATPVEEAPTALCPFQGRLLAGVGKCLRL 771

Query: 1178 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
            +     +L         P  +VS+  + N +++ D+ +S +FL +K Q  QL + A D  
Sbjct: 772  YDLGRKKLLRKCENKYIPNAIVSIQAMGNRVVVSDVQESFFFLRYKRQENQLVIFADD-- 829

Query: 1238 SLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES---------------WKG-- 1279
            S+  + T   ++D  T++   +D+  N+ I      +S+                W G  
Sbjct: 830  SVPRWITASCMLDYETVA--GADKFGNVSIIRLPSSISDDVDEDPTGIKSLWDRGWLGGS 887

Query: 1280 -QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
             QK    + FH+G  V    +  ++          PG  ++    L++ TL G++G + P
Sbjct: 888  SQKADVISNFHIGETVLSLQKATLI----------PGGSES----LVYVTLSGTVGVLVP 933

Query: 1339 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
                ++  F   Q L+  +    P + G +  SFR  +   K       +++D +L   +
Sbjct: 934  FTAHEDHDF--FQHLEMHMRYENPPLCGRDHLSFRSSYFPVK-------NVIDGDLCEQF 984

Query: 1396 EMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
              L   +Q  IA +     S++   L D+    +F
Sbjct: 985  NSLDPSKQKSIAEELDRNPSEVSKKLEDIRTRYAF 1019


>gi|449684814|ref|XP_004210722.1| PREDICTED: DNA damage-binding protein 1-like, partial [Hydra
            magnipapillata]
          Length = 725

 Score = 67.0 bits (162), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 67/272 (24%), Positives = 121/272 (44%), Gaps = 19/272 (6%)

Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            T   +GT+ V  E+   + G+++LF            + ++ SK + GA+  L    G L
Sbjct: 414  TYYCVGTSMVYPEESEPKEGKIILFQLFEGK------LVQIGSKTVNGAVYVLQGFNGKL 467

Query: 1168 LIASGPKIILHKWTG-TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
            L      + +++WT   EL     Y    L  + L    +FIL+GD+ +S+  L++K  G
Sbjct: 468  LAGVNSLVSVYEWTSDKELKQECCYHNTIL-ALYLKSKGDFILVGDLMRSMTLLAYKPLG 526

Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
             +L  +A DF      A E + D + L    ++   N+ I           +   L +  
Sbjct: 527  -RLEEIAHDFSPNWMTAVEIIDDDTFLG---AENSFNLFICQKDNSSVNDEERHHLQTIG 582

Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
            ++H+G  V  F    ++   S        S      ++L+GT+ G+IG +A L + TF  
Sbjct: 583  KYHLGDFVNVFKHGSLVMHHSTEQLTPISS------SILYGTVRGAIGLVAGLPKNTFDF 636

Query: 1347 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
            L  +Q+KL  ++  V  +    +R F+++ K 
Sbjct: 637  LSQVQEKLSKTIKSVGKIEHEFWRSFYNDKKT 668


>gi|449459948|ref|XP_004147708.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
 gi|449513493|ref|XP_004164340.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
          Length = 1214

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 97/403 (24%), Positives = 172/403 (42%), Gaps = 70/403 (17%)

Query: 1053 DLHRTYTVEEYE-----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1107
            D H  Y   E E     +R+L+P  A     T   + +Q +E A +V  V   N   KE 
Sbjct: 846  DEHYGYPKAESEKWVSCIRVLDPRSA----TTTCLLELQDNEAAFSVCTV---NFHDKEY 898

Query: 1108 ETLLAIGTA----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
             TLLA+GTA    +     + A G + ++   R  ++ ++L   ++  +++G   ALA  
Sbjct: 899  GTLLAVGTAKGLQFFPKRSLVA-GYIHIY---RFLEDGKSLEL-LHKTQVEGVPLALAQF 953

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
            QG LL   G  + L+      L         P  +VS+   ++ I +GDI +S ++  ++
Sbjct: 954  QGRLLAGLGSVLRLYDLGKRRLLRKCENKLFPNTIVSIQTYRDRIYVGDIQESFHYCKYR 1013

Query: 1224 EQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE------- 1275
                QL + A D  S+  + T  + +D  T++   +D+  NI        +S+       
Sbjct: 1014 RDENQLYIFADD--SVPRWLTASYHVDFDTMA--GADKFGNIYFVRLPQDVSDEIEEDPT 1069

Query: 1276 ----SWKGQKLLSRA-------EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
                 W+  KL           +FH+G  VT   +  ++          PG  +     +
Sbjct: 1070 GGKIKWEQGKLNGAPNKVEEIIQFHIGDVVTSLQKASLI----------PGGGE----CI 1115

Query: 1325 LFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
            L+GT+ GS+G +      D++ F     L+  +    P + G +   +R       A+ P
Sbjct: 1116 LYGTVMGSLGALHAFTSRDDVDF--FSHLEMHMRQEHPPLCGRDHMGYR------SAYFP 1167

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              D ++D +L   +  LPL+ Q +IA +   T  +IL  L ++
Sbjct: 1168 VKD-VIDGDLCEQFPSLPLDMQRKIADELDRTPGEILKKLEEV 1209



 Score = 44.7 bits (104), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 73/324 (22%), Positives = 126/324 (38%), Gaps = 57/324 (17%)

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVLV   N + Y +Q      A+      +    +LP     + + AA          LL
Sbjct: 246 GVLVCAENFVIYKNQGHPDVRAV------IPRRADLPAERGVLIVSAAMHKQKTMFFFLL 299

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
            T+ GD+  +T+ ++   V+ L +   +   +T+ +  + +   F  S  G+  L QF  
Sbjct: 300 QTEYGDIFKVTLEHNNDSVKELKIKYFDTIPVTASMCVLKSGFLFAASEFGNHSLYQFQA 359

Query: 440 -GSGTSMLSSG-----LKEEFGDIEADAPSTKRLRR-SSSDALQDMVNGEELSLYGSASN 492
            G    + SS       +E F  +       K L R    ++L  +++ + ++L+     
Sbjct: 360 IGEDADVESSSATLMETEEGFQPVFFQPRRLKNLMRIDQVESLMPIMDMKIINLF----- 414

Query: 493 NTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-K 548
             E   + F+   R      GP   L+    GL I            S   + ELPG   
Sbjct: 415 -EEETPQIFTLCGR------GPRSSLRILRPGLAI------------SEMAVSELPGVPS 455

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            +WTV                  +DE+ AY+++S    T+VL   + + EV++S   F+ 
Sbjct: 456 AVWTVKKN--------------INDEFDAYIVVSFANATLVLSIGETVEEVSDS--GFLD 499

Query: 609 GRTIAAGNLFGRRRVIQVFERGAR 632
                A +L G   ++QV   G R
Sbjct: 500 TTPSLAVSLIGDDSLMQVHPNGIR 523


>gi|170580631|ref|XP_001895346.1| splicing factor 3B subunit 3 [Brugia malayi]
 gi|158597745|gb|EDP35799.1| splicing factor 3B subunit 3, putative [Brugia malayi]
          Length = 1181

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 78/364 (21%), Positives = 145/364 (39%), Gaps = 32/364 (8%)

Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1137
            +T +  P    E A  + +V  F   +     L+  G         A  G +  F    N
Sbjct: 839  ETLSHFPFAEDEAAFAIAMVQ-FQNQSDTQFVLVGCGCDLQLKPRKANGGCIYTFLLAAN 897

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
                Q L    +       ++A+   +G  L   G K+ L+     +L         P  
Sbjct: 898  GTTLQLL----HRTPTDEVVNAIHDFRGMALAGVGKKVRLYDLGKRKLLAKCENRQIPTQ 953

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            VV +  +   I++ D  +S++F+ +K+Q  QL++   D  S        L+D  T++  V
Sbjct: 954  VVDIRSMGQRIVVSDSQESVHFMRYKKQDGQLSIFC-DETSPRYVTCVCLLDYDTVA--V 1010

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA--------TSSDR 1309
             D   N+ +      ++E  +      RA +  G       +L+ +A        TS  +
Sbjct: 1011 GDRFGNVAVLRLPKGVTEEVQEDPTGVRALWDRGNLNGASQKLEAIAHLYIGDAITSMQK 1070

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1366
            T   PG++      L + T+ G IG + P    DE  F   Q+L+  +    P + G + 
Sbjct: 1071 TSLVPGAND----CLSYTTISGIIGILVPFMSRDEFEF--FQNLEMHMRVEYPPLCGRDH 1124

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
             ++R ++   K       S++D +L   Y ++PL++Q  +  + G   ++I   L D+  
Sbjct: 1125 LAYRSYYFPVK-------SVIDGDLCEQYSLMPLDKQKSVGEELGRKPTEIHKKLEDIRT 1177

Query: 1427 GTSF 1430
              +F
Sbjct: 1178 RYAF 1181


>gi|395330962|gb|EJF63344.1| hypothetical protein DICSQDRAFT_153890 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1263

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 141/314 (44%), Gaps = 42/314 (13%)

Query: 1111 LAIGTAYVQGEDV-AARGRVLLFST----GRNADNPQNLVTEVYSKELKGAISALASLQG 1165
             A+GT Y++ E+   ++GR+LLFS     G    N ++L T + S  + G + ALA+L  
Sbjct: 944  FALGTVYIRPEEREPSKGRILLFSVSSTEGARGANVRSLHT-LASVNVGGCVYALANLSE 1002

Query: 1166 HLLIAS-GPKIILHKWTGTELNGIAFYDAPPL------------YVVSLNIVKNFILLGD 1212
            +L++A+    ++L K T  E       ++ PL            +V ++ +    IL+GD
Sbjct: 1003 NLIVAAINTSVVLFKSTENEAG-----ESTPLSLEKVTEWNHNHFVTNVVVDGERILVGD 1057

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
               S+  L W E+  +L  +A+D+G L   A    I+G+   L+ ++   N+  F     
Sbjct: 1058 AISSVSVLKWNERLERLESIARDYGPLWPIA----IEGTGNGLIGANADCNLFSFSLQSV 1113

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
               ++    L     +H+     KF+R  +  TS+D        D+  + + +F T  G 
Sbjct: 1114 PHRTY----LEKDGVYHLNDVTNKFVRGAL--TSTDV-----AEDQVVKASHVFFTSTGC 1162

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS--IVDCE 1390
            IG I  ++++T   + +LQ+ +  ++    G N    R   S  + H     S   +D +
Sbjct: 1163 IGAILDMNDVTSLHMTALQRNMAKTLTGPGGDNHTKLRA-PSTPRGHTDAEASYGFLDGD 1221

Query: 1391 LLSHYEMLPLEEQL 1404
             L  Y   P  EQ 
Sbjct: 1222 FLEQYLTHPHPEQF 1235


>gi|168046759|ref|XP_001775840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672847|gb|EDQ59379.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1214

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 89/388 (22%), Positives = 166/388 (42%), Gaps = 68/388 (17%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1121
            +R+L+P  +     T   + +Q +E A ++  V   +   KE  TL+A+GTA       +
Sbjct: 862  IRVLDPKTS----TTTCLLELQENEAAFSLCAVNFHDN--KELGTLIAVGTAKNMQFMPK 915

Query: 1122 DVAARGRVLLF---STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
              ++ G + ++     GR       ++  V+   + G  +AL   QG LL+  G  + ++
Sbjct: 916  KESSGGFIHIYRFVEEGR-------ILELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIY 968

Query: 1179 KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
                 +L         P  +++++   + I +GDI +S +++ ++    QL   A D  S
Sbjct: 969  DLGKRKLLRKCENKNFPNTIIAIHTYGDRIYVGDIQESFHYVKYRRDENQLYTFADD--S 1026

Query: 1239 LDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKG------- 1279
               + T  L ID  T++   +D+  N+ +      +SE            W+        
Sbjct: 1027 CPRWLTASLHIDFDTMA--GADKFGNVYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAP 1084

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
             K+    +FHVG  VT   +  ++          PG  ++    +L+GT+ GS+G + P 
Sbjct: 1085 NKVEEIIQFHVGEVVTSLQKASLI----------PGGGES----VLYGTIMGSVGALLPF 1130

Query: 1340 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
               +++ F     L+  L    P + G +  +FR       A+ P  D ++D +L   Y 
Sbjct: 1131 SSREDVDF--FSHLEMHLRQENPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYP 1181

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            ML  E Q +IA     T  ++L  L D+
Sbjct: 1182 MLTSELQRKIADDLDRTPGEVLKKLEDI 1209


>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
 gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
          Length = 1140

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 146/368 (39%), Gaps = 44/368 (11%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D        +    +L       P    + +  +  ++  T  V
Sbjct: 780  NAEVGQEIDVHNLLVID--------QNTFEVLHAHHFVSPETISSLMSAKLGDDPNTYYV 831

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
            V                 T+ V  E+   + GR+++F    N       +T+V   ++ G
Sbjct: 832  V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
               AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +
Sbjct: 870  TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            SI  L  K+       +A+D       A E L D + L    S+   N+ +       + 
Sbjct: 930  SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              + Q L   A FH+G  V  F    ++  +  +RT    G        +L+GT +G+IG
Sbjct: 987  DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  + +  +  L  LQ++L   +  V  +    +R F  N K      +  +D +L+  
Sbjct: 1040 IVTQIPQDFYDFLHGLQERLKKIIKSVGKIEHTYYRNFQINNKVE--PSEGFIDGDLIES 1097

Query: 1395 YEMLPLEE 1402
            +  L  E+
Sbjct: 1098 FLDLSREK 1105



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 107/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVMAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTSETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|353236335|emb|CCA68332.1| probable splicing factor 3B subunit 3 [Piriformospora indica DSM
            11827]
          Length = 1243

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 113/494 (22%), Positives = 213/494 (43%), Gaps = 70/494 (14%)

Query: 965  CNHGFIYVTSQGILKICQLPS-GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
            C  GFI + +   L+I Q+P  GS        + +PL  TP +     + + + LI S  
Sbjct: 789  CPDGFIGI-ANNTLRIFQVPKLGSKLKQ----EILPLSYTPRKFVSHPQNSYFYLIES-- 841

Query: 1024 VLKPLNQVLSLLIDQEV------GHQIDNHNLS-SVDLHRTYTVEEYE----VRILEPDR 1072
                   +   +I++ V      G ++D   L     ++  +   E      +RI++P  
Sbjct: 842  ---DHRAMSETMIEERVKAIEMSGEKVDREMLELDPRIYGHFKAPEGVWASCIRIIDPVN 898

Query: 1073 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG----- 1127
                 ++ A   + ++E A ++ VV       +  E LL +GTA      +A R      
Sbjct: 899  ----LRSVAAFSLDNNEAAFSIAVVPF---AARNGELLLVVGTAV--DTHLAPRSCSTGY 949

Query: 1128 -RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELN 1186
             RV  F+ G +       +  ++  ++    +AL + QG L+   G  + L+     +L 
Sbjct: 950  LRVYSFTEGGSG------LELLHKTDIDEVPTALMAFQGRLIAGVGKALRLYDIGKKKLL 1003

Query: 1187 GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1246
              A        +V+L+   + IL GDI++SIY++++K    +L + A D  S        
Sbjct: 1004 RKAENRQFATAIVTLSTQGSRILAGDINQSIYYVAYKAAENRLLIFADD-TSARWITAST 1062

Query: 1247 LIDGSTLSLVVSDEQKNIQIFYY----APKMSESWKGQKLLSRAEFHVGA-HVTKFL--- 1298
            ++D +T  +V +D+  N+ +       + ++ +   G  +L      +GA H TK L   
Sbjct: 1063 MLDYNT--VVAADKFGNVFVNRLDEAVSKQVDDDPTGAGILHEKSVLMGAPHKTKMLTHF 1120

Query: 1299 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLV 1355
             +  + TS  R     G+    R  +++  L G+IG + PL   D++ F  + +L++ + 
Sbjct: 1121 HVGDVITSIQRVALVAGA----REVVVYFGLHGTIGILVPLVTKDDVDF--ISTLEQHMR 1174

Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
                 + G +  S+R +++  KA        VD +L  ++  LP ++QL IA +   T  
Sbjct: 1175 TENLSLVGRDHLSWRGYYTPVKA-------TVDGDLCEYFAKLPTQKQLAIAGELDRTVG 1227

Query: 1416 QILSNLNDLALGTS 1429
             +L  L  L +  S
Sbjct: 1228 DVLKKLESLRVTAS 1241


>gi|17508021|ref|NP_491953.1| Protein TEG-4 [Caenorhabditis elegans]
 gi|351060889|emb|CCD68627.1| Protein TEG-4 [Caenorhabditis elegans]
          Length = 1220

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 76/394 (19%), Positives = 169/394 (42%), Gaps = 55/394 (13%)

Query: 1070 PDRAGGPWQTRATIPMQSSENALT---------VRVVTLFNTTTKENETLLAIGTAYVQG 1120
            P  A G W +  ++   +S + L+          + V L   +   NE ++ +G    + 
Sbjct: 849  PRAARGKWASAISLISATSGDKLSYFELPQDENAKCVALVQFSKHPNEAMVLVGCGVNEV 908

Query: 1121 EDV-----------AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
             +V             RG V  F    N D    L    +  E    + A+   +G  L+
Sbjct: 909  LNVHDIDPNDTSIRPTRGCVYTFHLSANGDRFDFL----HRTETPLPVGAIHDFRGMALV 964

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
              G  + ++     +L         P+ +V++      I++ D  +S++FL +++   QL
Sbjct: 965  GFGRFLRMYDIGQKKLLAKCENKNFPVSIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQL 1024

Query: 1230 NLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
             + A D     + C     ++D  T++  V+D+  N+ +     +++E  +    +S++ 
Sbjct: 1025 VVFADDTTPRYVTCVC---VLDYHTVA--VADKFGNLAVVRLPERVNEDVQDDPTVSKSV 1079

Query: 1288 FHVGAHVTKFLRLQMLA--------TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
            +  G       ++++++        TS  +T   PG+++    AL++ T+ G+IGC+   
Sbjct: 1080 WDRGWLNGASQKVELVSNFFIGDTITSLQKTSLMPGANE----ALVYTTIGGAIGCLVSF 1135

Query: 1340 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
               DE+ F    +L+  +    P + G +  ++R +++  K       S++D ++   + 
Sbjct: 1136 MSKDEVDF--FTNLEMHVRSEYPPLCGRDHLAYRSYYAPCK-------SVIDGDICEQFS 1186

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            ++  ++Q ++A + G T S+I   L D+    +F
Sbjct: 1187 LMDTQKQKDVAEELGKTVSEISKKLEDIRTRYAF 1220


>gi|392570042|gb|EIW63215.1| hypothetical protein TRAVEDRAFT_161375 [Trametes versicolor FP-101664
            SS1]
          Length = 1213

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 127/556 (22%), Positives = 221/556 (39%), Gaps = 103/556 (18%)

Query: 919  NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
            NI  +       SRP     ++  +   P + +    A++    + C  G I + S  +L
Sbjct: 714  NIQQNPAILALSSRPWLNYTYQNFMHFTPLIFENLDYAWSFSAEL-CTEGLIGI-SGSLL 771

Query: 979  KICQLPSGSTY--DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
            +I Q+P   T    +  P+   P K  PH                     P N +L L+ 
Sbjct: 772  RIFQIPKLGTKLKQDSLPLSYTPRKFMPH---------------------PTNGLLYLI- 809

Query: 1037 DQEVGHQIDNHNLSSVDLH----RTYTVEEYEVRILEPDRAGGP------WQT--RATIP 1084
              E  H++ +   +S  L     R   ++E EV +L P++ G P      W +  R   P
Sbjct: 810  --EGDHRVMSEEAASKKLQEMRARGERIDE-EVLLLPPEQFGRPKAPAGTWASCIRIINP 866

Query: 1085 MQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQGEDVAARGRVLLF- 1132
            ++SS    TV+V+ L N              + NE  L +GTA  Q   ++ R     F 
Sbjct: 867  LESS----TVKVIHLDNNEAAFSMAIVPFAARGNELHLVVGTA--QDTFLSPRSCTSGFL 920

Query: 1133 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1192
             T R  D+ ++L   ++  E      A+ + QG L+   G  + L+     +L       
Sbjct: 921  RTYRFIDDGRDL-EFLHKTETSDVPLAVMAFQGKLIAGVGKSLRLYDVGKKKLLRKVENK 979

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
              P  +V+LN   + I++GD+ +S+++  +K    +L + A D       AT  L D +T
Sbjct: 980  GFPAAIVTLNTQGSRIIVGDMQESVFYAVYKAPENRLLVFADDAQPRWVTATTML-DYNT 1038

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESW------------KG------QKLLSRAEFHVGAHV 1294
              +V  D   N+ +     K+S+              KG       K +  + FHVG  V
Sbjct: 1039 --VVAGDRFGNVFVNRLDSKISDQIDDDPTGAGILHEKGVLFGAPHKSVMLSHFHVGDIV 1096

Query: 1295 TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKK 1353
            T   ++ ++A                R  LL+  L G+IG + P + +     L +L++ 
Sbjct: 1097 TSLHKVALVA--------------GGREVLLYTCLHGTIGILVPFVSKEDVDLLTTLEQH 1142

Query: 1354 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
            +      + G +  ++R ++   K       S+VD +L   +  LP  +Q  IA +   T
Sbjct: 1143 MRTEQLSLVGRDHLTWRGYYVPVK-------SVVDGDLCESFAKLPANKQSTIAGELDRT 1195

Query: 1414 RSQILSNLNDLALGTS 1429
              ++L  L  L +  S
Sbjct: 1196 VGEVLKKLEQLRVTAS 1211


>gi|341884150|gb|EGT40085.1| CBN-DDB-1 protein [Caenorhabditis brenneri]
          Length = 1134

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/310 (19%), Positives = 129/310 (41%), Gaps = 19/310 (6%)

Query: 1106 ENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            +N +   +GTA V  ++   + GR+++F            +TE+    ++GA   +  L 
Sbjct: 816  DNNSYYIVGTALVYPDESETKIGRIIVFEVDETDKTKLRFMTEIV---VRGAPMGIRILN 872

Query: 1165 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
            G L+ A    + + +WT  +   +       +  V L ++   I + D+ +S+  LS++ 
Sbjct: 873  GKLVAAINSSVRMFEWTAEKELRVECSTFNHIAAVDLKVLNEEIAVADVMRSVSLLSYRT 932

Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
                   +AKD+ S      EF+   S L          +++    P   +   G+ +L 
Sbjct: 933  LEGNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPVTDD---GRYVLE 989

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLDEL 1342
               +     +TK +   +L    D        D + R+   +++GT  GS+G +  +D++
Sbjct: 990  PTGYWYLGELTKVMIRAVLVPQPD--------DNSIRYTQPIMYGTNQGSLGLVVQIDDM 1041

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
              + L S++K + D+  +   +   ++R F  N +   P     +D +L+     +    
Sbjct: 1042 YKKFLLSIEKAISDAEKNCMQIEHSTYRSFTYNKRIEPPS--GFIDGDLIESILDMDRSR 1099

Query: 1403 QLEIAHQTGT 1412
             +EI  +  T
Sbjct: 1100 AIEILEKANT 1109



 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 87/354 (24%), Positives = 143/354 (40%), Gaps = 82/354 (23%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           D+  L+ VPSPI GV+V+G +++ Y S         N+  V   SS  L  + F+    +
Sbjct: 210 DSSMLIPVPSPISGVVVLGTHSLLYKSSE-------NDGEVVPYSSPLLENTIFTSHSIV 262

Query: 365 DAAHATWLQNDV--ALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGN 420
           D     ++ +D    LL      ++LL  V +  G  V+ + +     + +   I  I N
Sbjct: 263 DPTGERFIVSDTDGRLL------MLLLNAVENQSGLSVKEIRIDLLGDTSVAESINYIDN 316

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
            + F+GSR GDS L++       S   S L   +                +   ++DM+ 
Sbjct: 317 GVVFIGSRFGDSQLIRLLSEKTNSSYISVLDTYY----------------NIGPIRDMIM 360

Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
            E         ++ +    T S A +D     G L+    G+ I   A+           
Sbjct: 361 VE---------SDGQPQLVTCSGAEKD-----GSLRVIRNGIGIEELAT----------- 395

Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
            V+LPG  GI+ +   SS    AD+            Y+I+SL   T VL+      E  
Sbjct: 396 -VDLPGVVGIFPIRLDSS----ADN------------YVIVSLVEETHVLQITGEELEDV 438

Query: 601 ESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSE 651
           + +       T+ AG LFG      V+QV ER  R++    +++   + P+N E
Sbjct: 439 QFLQIDTALPTMFAGTLFGPNDSGLVVQVTERQVRLMSNGGLSK--FWEPANGE 490


>gi|307109500|gb|EFN57738.1| hypothetical protein CHLNCDRAFT_56079 [Chlorella variabilis]
          Length = 1144

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 90/385 (23%), Positives = 162/385 (42%), Gaps = 63/385 (16%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            VRI++P       QT   I M+ +E AL+V +V     +  E+ TLLA+GTA  QG    
Sbjct: 793  VRIVDPVS----LQTTHCIEMEDNEAALSVCLVEF--DSHPEHGTLLAVGTA--QGLKFY 844

Query: 1125 AR----GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
             +    G V L+   R  D+ +  +  ++   ++G   A+A+ +G LL+     + L+  
Sbjct: 845  PKECQNGFVHLY---RFLDDGKR-IELLHKTAVEGVPGAMAAFKGRLLVGVDAVLRLYDM 900

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
                +     Y   P  + +L++  + I +GD  +S +F+ +K+   Q  + A D     
Sbjct: 901  GKKRMLRKCEYRRLPTRIATLHVSGSRIYVGDGQESTFFMRYKKGDNQFYIFADDIVPRH 960

Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG------------------QKL 1282
              A   L D  TL+   +D   N+ +     ++S   +                    KL
Sbjct: 961  VTAALHL-DYDTLA--GADRFGNVFVSRLPQEVSAQVEDDPTGGKYATETGLLGGAPNKL 1017

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1339
             +   FHVG  VT   R  +           PG     R  +++GT++G+IG + P    
Sbjct: 1018 RTINSFHVGETVTALQRAVL----------QPG----GRELIVYGTINGAIGVLYPFTSK 1063

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
            ++  F   Q L+  +    P + G +  ++R F+   K        +VD +L   Y  L 
Sbjct: 1064 EDCDF--FQHLEMHMRQEHPPLLGRDHLAYRSFYFPVK-------DVVDGDLCEQYPQLA 1114

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDL 1424
             ++   +A +   +  ++L  L D+
Sbjct: 1115 ADKARGVAEELDRSPGEVLKKLEDI 1139


>gi|147779836|emb|CAN63685.1| hypothetical protein VITISV_020449 [Vitis vinifera]
          Length = 64

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/47 (74%), Positives = 42/47 (89%)

Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
           +PRSSFSVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGR+   L
Sbjct: 1   MPRSSFSVELDAANATWLSNDVAMLSTKTGELLLLTLXYDGRLFTDL 47


>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
 gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
          Length = 1140

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D        +    +L   +   P    + +  +  ++  T  V
Sbjct: 780  NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISSLMSAKLGDDPNTYYV 831

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
            V                 T+ V  E+   + GR+++F    N       +T+V   ++ G
Sbjct: 832  V----------------ATSLVIPEEPEPKVGRIIIFHYNENK------LTQVAETKVDG 869

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
               AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +
Sbjct: 870  TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            SI  L  K+       +A+D       A E L D + L    S+   N+ +       + 
Sbjct: 930  SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              + Q L   A FH+G  V  F    ++  +  +RT    G        +L+GT +G+IG
Sbjct: 987  DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  + +  +  L  L+++L   +  V  +    +R F  N K      +  +D +L+  
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIES 1097

Query: 1395 Y 1395
            +
Sbjct: 1098 F 1098



 Score = 57.0 bits (136), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 92/207 (44%), Gaps = 33/207 (15%)

Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
           NLR +D  +V D  F+HG + P ++++H+      GR    H         I+   K+  
Sbjct: 156 NLR-MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFM 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
            I W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+       
Sbjct: 204 KIAWKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA------- 249

Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSV 410
              +F       +A    N +  LL    G L +L +       G  V+ + + +     
Sbjct: 250 -PLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEIS 308

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
           +   IT + N   ++G+R GDS LV+ 
Sbjct: 309 IPECITYLDNGFLYIGARHGDSQLVRL 335


>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
 gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
          Length = 1140

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 78/371 (21%), Positives = 149/371 (40%), Gaps = 44/371 (11%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D   T+ V                      +P ++    ++ ++
Sbjct: 780  NAEVGQEIDVHNLLIID-QNTFEV----------------LHAHQFVPPETISTLMSAKL 822

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
                     +  T   + T+ V  E+   + GR+++F    N       +T+V   ++ G
Sbjct: 823  -------GDDPNTYYVVATSLVYPEEPEPKVGRIIIFHYNDNK------LTQVAETKVDG 869

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
               AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +
Sbjct: 870  TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            SI  L  K+       +A+D       A E L D + L     D   N+ +       + 
Sbjct: 930  SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGCETHD---NLFVCQKDSAATT 986

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              + Q L   A FH+G  +  F    ++  +  +RT    G        +L+GT +G+IG
Sbjct: 987  DEERQLLPELARFHLGDTINVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  + +  +  L  L+++L   +  V  ++   +R +  N K      +  +D +L+  
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKVE--PSEGFIDGDLIES 1097

Query: 1395 YEMLPLEEQLE 1405
            +  L  E+  E
Sbjct: 1098 FLDLNREKMRE 1108



 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 66/264 (25%), Positives = 109/264 (41%), Gaps = 51/264 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H       GR    H         I+   K+   I 
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHRDN---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPI GV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPICGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D     +      LL    G L +L    T    G  V+ + + +     +   
Sbjct: 260 CYA-RIDEKGLRY------LLGNMDGQLYMLFLGTTETSKGITVKDIKVEQLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
           IT + N   ++GSR GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGSRHGDSQLVRLS 336


>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
 gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
            Full=Damage-specific DNA-binding protein 1; AltName:
            Full=Protein piccolo
 gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
            melanogaster]
 gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
 gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
          Length = 1140

 Score = 65.1 bits (157), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D        +    +L   +   P    + +  +  ++  T  V
Sbjct: 780  NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVAPETISSLMSAKLGDDPNTYYV 831

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
            V                 T+ V  E+   + GR+++F    N       +T+V   ++ G
Sbjct: 832  V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
               AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +
Sbjct: 870  TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            SI  L  K+       +A+D       A E L D + L    S+   N+ +       + 
Sbjct: 930  SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              + Q L   A FH+G  V  F    ++  +  +RT    G        +L+GT +G+IG
Sbjct: 987  DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  + +  +  L  L+++L   +  V  +    +R F  N K      +  +D +L+  
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINSKVE--PSEGFIDGDLIES 1097

Query: 1395 Y 1395
            +
Sbjct: 1098 F 1098



 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|194901554|ref|XP_001980317.1| GG19434 [Drosophila erecta]
 gi|190652020|gb|EDV49275.1| GG19434 [Drosophila erecta]
          Length = 1140

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D        +    +L   +   P    + +  +  ++  T  V
Sbjct: 780  NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISSLMSAKLGDDPNTYYV 831

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
            V                 T+ V  E+   + GR+++F    N       +T+V   ++ G
Sbjct: 832  V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
               AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +
Sbjct: 870  TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            SI  L  K+       +A+D       A E L D + L    S+   N+ +       + 
Sbjct: 930  SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              + Q L   A FH+G  V  F    ++  +  +RT    G        +L+GT +G+IG
Sbjct: 987  DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  + +  +  L  L+++L   +  V  +    +R F  N K      +  +D +L+  
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIES 1097

Query: 1395 Y 1395
            +
Sbjct: 1098 F 1098



 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCMNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|157128866|ref|XP_001655232.1| DNA repair protein xp-e [Aedes aegypti]
 gi|108882187|gb|EAT46412.1| AAEL002407-PA [Aedes aegypti]
          Length = 980

 Score = 64.7 bits (156), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D DT              H +  
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154

Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
             L M+  HV+D  F++G   P ++++H+        ++ +H    I    I+   K   
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
            I W   N+  +A  L+ VP+P+GG +V+G  ++ YH   +  A+A        +N YA 
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                      + S  L        +N   LLS K   + LL  +             T 
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
           P      IT + N + F+GSR GDS LV+    +G +     + E F ++   AP     
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
                  L+    G+ ++  GS                       G L+    G+ I   
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           A             ++LPG KG+W +             R+   D  Y   L++S    T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
            +L  +    E TE   +    +T    N+ FG  ++IQV    AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480



 Score = 51.2 bits (121), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 73/149 (48%), Gaps = 13/149 (8%)

Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            T   +GTA V  E+   + GR++++     AD   NL T+V  KE+KG+  +L    G +
Sbjct: 826  TYYIVGTALVNPEEPEPKVGRIIIY---HYADG--NL-TQVSEKEIKGSCYSLVEFNGRV 879

Query: 1168 LIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
            L +    + L++WT  +   L    F +   LY  +     +FIL+GD+ +SI  L +K+
Sbjct: 880  LASINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGDLMRSITLLQYKQ 936

Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
                   +A+D+      A E L D + L
Sbjct: 937  MEGSFEEIARDYQPNWMTAVEILDDDAFL 965


>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
 gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
          Length = 1140

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 76/343 (22%), Positives = 134/343 (39%), Gaps = 40/343 (11%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D        +    +L   +   P    A +  +  ++  T  V
Sbjct: 780  NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISALMSAKLGDDPNTYYV 831

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
            V        E E  +               GR+++F    N       +T+V   ++ G 
Sbjct: 832  VATSLVIPDEPEPKV---------------GRIIIFHYHDNK------LTQVAETKVDGT 870

Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
              AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +S
Sbjct: 871  CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMRS 930

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            I  L  K+       +A+D       A E L D + L    S+   N+ +       +  
Sbjct: 931  ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987

Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
             + Q L   A FH+G  V  F    ++  +  +RT    G        +L+GT +G+IG 
Sbjct: 988  EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040

Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
            +  + +  +  L  L+++L   +  V  +    +R F  N K 
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV 1083



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 112/264 (42%), Gaps = 51/264 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMEKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELMVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D+    +      LL    G L +L +       G  V+ + + +     +   
Sbjct: 260 CYA-RVDSKGLRY------LLGNMHGQLYMLFLGTSESSKGITVKDIKVEQLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
           IT + N   ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336


>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
 gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
          Length = 1140

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 74/361 (20%), Positives = 147/361 (40%), Gaps = 44/361 (12%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D   T+ V                  +   +P ++  + ++ ++
Sbjct: 780  NAEVGQEIDVHNLLVID-QNTFEV----------------LHSHQFVPPETISSLMSAKL 822

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
                     +  T   + T+ V  ++   + GR+++F    N       +T+V   ++ G
Sbjct: 823  -------GDDPNTYYVVATSLVFPDEPEPKVGRIIIFHYNENK------LTQVAETKVDG 869

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
               AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +
Sbjct: 870  TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            SI  L  K+       +A+D       A E L D + L     D   N+ +       + 
Sbjct: 930  SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGCETHD---NLFVCQKDSAATT 986

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              + Q L   A FH+G  +  F    ++  +  +RT    G        +L+GT +G+IG
Sbjct: 987  DEERQLLPELARFHLGDTINVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  + +  +  L  L+++L   +  V  ++   +R +  N K      +  +D +L+  
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKVE--PSEGFIDGDLIES 1097

Query: 1395 Y 1395
            +
Sbjct: 1098 F 1098



 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 64/263 (24%), Positives = 107/263 (40%), Gaps = 49/263 (18%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
            +D   V D  F+HG   P ++++H+      GR    H   +     I          W
Sbjct: 159 -MDELTVYDVEFLHGCQNPTVIVIHKDN---DGRHVKSHEINLRDKEFIKVA-------W 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  L+ VPS IGGV+V+G  +I YH  S       N +AV+  + ++   + 
Sbjct: 208 KQDNVETEATMLIPVPSSIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDI 415
           ++  +D+    +      LL    G L +L    T    G  V+ + + +     +   I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGTTETSKGTTVKDIKVEQLGEISIPECI 313

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
           T + N   ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336


>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
 gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
          Length = 1140

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 75/338 (22%), Positives = 136/338 (40%), Gaps = 42/338 (12%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D        +    +L   +   P    + +  Q  ++  T  V
Sbjct: 780  NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISSLMSAQLGDDPNTYYV 831

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
            V                 T+ V  E+   + GR+++F    N       +T+V   ++ G
Sbjct: 832  V----------------ATSLVIPEEPEPKVGRIIIFHYNENK------LTQVAETKVDG 869

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
               AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +
Sbjct: 870  TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            SI  L  K+       +A+D       A E L D + L    S+   N+ +       + 
Sbjct: 930  SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              + Q L   A FH+G  V  F    ++  +  +RT    G        +L+GT +G+IG
Sbjct: 987  DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
             +  + +  +  L  L+++L   +  V  +  + +R F
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKLVGKIGHKFYRNF 1077



 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|449488592|ref|XP_004158102.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
           [Cucumis sativus]
          Length = 570

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 108/504 (21%), Positives = 194/504 (38%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A  L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR + + G +  +DP  R  G+ +Y GL  +I     
Sbjct: 95  ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVI----- 142

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
                            F  + +     N+R L+   V D  F++G   P +V+L++   
Sbjct: 143 ----------------PFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSRPTIVVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H      +       + P  WS  NL + A  L+ VP P+ GV+++G  T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS LV+            
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|357496593|ref|XP_003618585.1| Splicing factor 3B subunit [Medicago truncatula]
 gi|355493600|gb|AES74803.1| Splicing factor 3B subunit [Medicago truncatula]
          Length = 702

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 100/444 (22%), Positives = 185/444 (41%), Gaps = 63/444 (14%)

Query: 998  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV-GHQIDNHNLSSVDLHR 1056
            IPL+ TP +     ++ L  +I S       +Q      ++E  G + ++ +    D H 
Sbjct: 306  IPLRYTPMKFVLQPKRKLLVVIES-------DQGAFTAEEREANGGEDEDKDDPLSDEHY 358

Query: 1057 TYTVEEYE-----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
             Y   E +     +RIL+P        T   + +Q +E A +   V   N   KE  TLL
Sbjct: 359  GYPKAESDKWASCIRILDPKTG----NTTCLLELQDNEAAFSGCTV---NFHDKEYGTLL 411

Query: 1112 AIGTAYVQGEDVAARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
             +GTA  +G     R  +        R  ++ ++L   ++  +++G   AL+  QG LL 
Sbjct: 412  DVGTA--KGLQFTPRRSLTAGFIHIYRFLEDGRSLEL-LHKTQVEGVPLALSQFQGRLLA 468

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
              GP +  +      L         P  +VS+   ++ I +GD  +S ++  ++    QL
Sbjct: 469  GIGPVLRFYDLGKRRLLRKYENKLFPNTIVSIQTYRDRIYVGDTQESFHYCKYRWDENQL 528

Query: 1230 NLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
             + A D     C        + ID  T++ +  D     +I +   K++ +    K+   
Sbjct: 529  YIFADD-----CVPRWLTASYHIDFDTMAGIEEDPTGG-RIKWEQGKLNGA--PNKVEEI 580

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1342
             +FHVG  ++   +  ++          PG  +     +L GT+ GSIG +      D++
Sbjct: 581  VQFHVGDVISCLQKASLI----------PGGGE----CILNGTVMGSIGALHAFTSRDDV 626

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
             F     L+  +    P + G +  ++R       A+ P  D ++D +L   +  LP++ 
Sbjct: 627  DF--FSHLEMHMRQDNPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 677

Query: 1403 QLEIAHQTGTTRSQILSNLNDLAL 1426
            Q +IA +   TR +IL  L +  +
Sbjct: 678  QRKIADELDRTRGEILKKLEEYKI 701


>gi|409075182|gb|EKM75565.1| hypothetical protein AGABI1DRAFT_64324 [Agaricus bisporus var.
            burnettii JB137-S8]
          Length = 1213

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 115/466 (24%), Positives = 196/466 (42%), Gaps = 76/466 (16%)

Query: 998  IPLKATPHQITYFAEKNLYPLIV-------SVPVLKPLNQVL--SLLIDQEVGHQIDNHN 1048
            IPL  TP +   +   NL+ LI           V K LN++   +  IDQEV       N
Sbjct: 788  IPLSYTPRKFITYPLNNLFYLIEGDHRVMGQDAVDKKLNELRQQNRAIDQEV------LN 841

Query: 1049 LSSVDLHRTYTVE---EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1105
            LS     R           +RI++P       +T + +P+  +E+A ++ VV     + K
Sbjct: 842  LSPEVFGRPKAANGTWASNIRIIDPVEG----KTISVVPLDGNESAFSLAVVPF---SAK 894

Query: 1106 ENETLLAIGTAYVQGEDVAARG------RVLLF-STGRNADNPQNLVTEVYSKELKGAIS 1158
             NE  L +GTA      ++ R       RV  F   GR  +        V+  E+     
Sbjct: 895  GNELHLVVGTA--ADTKLSPRTCSTGFLRVYKFLEDGRQLE-------LVHKTEIDDVPL 945

Query: 1159 ALASLQGHLLIASGPKIILH----KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            AL + QG L+   G  + ++    K    ++    F  A    +V+L+   + IL+GD+ 
Sbjct: 946  ALMAFQGRLVAGVGKALRIYDIGKKKMLRKVENKQFGSA----IVTLSTQGSRILVGDMQ 1001

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +SI+F  +K    +L + A D       +   ++D +T  +V +D   NI +    P++S
Sbjct: 1002 ESIFFAVYKAPENRLLIFADD-SQPRWISAATMVDYNT--VVAADRFGNIFVNRLDPRVS 1058

Query: 1275 ----ESWKGQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLF 1326
                E   G  +L     ++GA H TK +    +  L TS  +     G     R  LL+
Sbjct: 1059 DQVDEDPTGAGILHEKGLYMGAPHKTKMICHFHVGDLITSIHKVSLVAG----GREVLLY 1114

Query: 1327 GTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
              L G+IG + P    +++ F  + +L++ +      + G +   +R ++   KA     
Sbjct: 1115 TGLHGTIGILVPFVTKEDVDF--ISTLEQHMRTEQVSLVGRDHLGWRGYYVPVKA----- 1167

Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
              +VD +L   Y  LP  +Q  IA +   +  ++L  L  L +  S
Sbjct: 1168 --VVDGDLCEMYAKLPGSKQSAIAGELDRSIGEVLKKLEQLRVTAS 1211



 Score = 45.4 bits (106), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ ++   V+ L +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
                       S TS  SSG+ E     +A  P    + R   + AL D +   +  + 
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423

Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
               N   N+++ Q  F+   R +  +   L+   +GL +    S+            +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467

Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG    +WT                   DD Y +Y+I+S    T+VL   + + EV ++ 
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
            +     T+A   + G   ++QV   G R                  I+  +        
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKTIVAATTNKRQVVV 571

Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
                   Y   DL  G  N      +  STVL++SI D        PY+ +G  D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630

Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
           ++  DP +   T+S+Q     P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655


>gi|124359136|gb|ABD32504.2| CPSF A subunit, C-terminal; WD40-like [Medicago truncatula]
          Length = 632

 Score = 63.9 bits (154), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 98/443 (22%), Positives = 184/443 (41%), Gaps = 65/443 (14%)

Query: 998  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1057
            IPL+ TP +     ++ L  +I S       +Q      ++E   + ++ +    D H  
Sbjct: 240  IPLRYTPMKFVLQPKRKLLVVIES-------DQGAFTAEEREAAKKDEDKDDPLSDEHYG 292

Query: 1058 YTVEEYE-----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLA 1112
            Y   E +     +RIL+P        T   + +Q +E A +   V   N   KE  TLL 
Sbjct: 293  YPKAESDKWASCIRILDPKTG----NTTCLLELQDNEAAFSGCTV---NFHDKEYGTLLD 345

Query: 1113 IGTAYVQGEDVAARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
            +GTA  +G     R  +        R  ++ ++L   ++  +++G   AL+  QG LL  
Sbjct: 346  VGTA--KGLQFTPRRSLTAGFIHIYRFLEDGRSLEL-LHKTQVEGVPLALSQFQGRLLAG 402

Query: 1171 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1230
             GP +  +      L         P  +VS+   ++ I +GD  +S ++  ++    QL 
Sbjct: 403  IGPVLRFYDLGKRRLLRKYENKLFPNTIVSIQTYRDRIYVGDTQESFHYCKYRWDENQLY 462

Query: 1231 LLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
            + A D     C        + ID  T++    ++    +I +   K++ +    K+    
Sbjct: 463  IFADD-----CVPRWLTASYHIDFDTMA----EDPTGGRIKWEQGKLNGA--PNKVEEIV 511

Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELT 1343
            +FHVG  ++   +  ++          PG  +     +L GT+ GSIG +      D++ 
Sbjct: 512  QFHVGDVISCLQKASLI----------PGGGE----CILNGTVMGSIGALHAFTSRDDVD 557

Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
            F     L+  +    P + G +  ++R       A+ P  D ++D +L   +  LP++ Q
Sbjct: 558  F--FSHLEMHMRQDNPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDLQ 608

Query: 1404 LEIAHQTGTTRSQILSNLNDLAL 1426
             +IA +   TR +IL  L +  +
Sbjct: 609  RKIADELDRTRGEILKKLEEYKI 631


>gi|225448823|ref|XP_002282354.1| PREDICTED: splicing factor 3B subunit 3-like [Vitis vinifera]
          Length = 1214

 Score = 63.9 bits (154), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 92/392 (23%), Positives = 162/392 (41%), Gaps = 77/392 (19%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RIL+P  A     T   + +Q +E A ++  V   N   KE  TLLA+GTA        
Sbjct: 863  IRILDPRTA----TTTCLLELQDNEAAFSICTV---NFHDKEYGTLLAVGTA-------- 907

Query: 1125 ARGRVLLFSTGRNAD----------NPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
               + L F   R+ D               +  ++  +++G   AL   QG LL   G  
Sbjct: 908  ---KSLQFWPKRSFDAGYIHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSV 964

Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
            + L+      L         P  +VS++  ++ I +GDI +S ++  ++    QL + A 
Sbjct: 965  LRLYDLGKRRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFAD 1024

Query: 1235 DFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKL 1282
            D  S+  + T  + ID  T++   +D+  NI        +S+            W+  KL
Sbjct: 1025 D--SVPRWLTASYHIDFDTMA--GADKFGNIYFVRLPQDVSDEVEEDPTGGKIKWEQGKL 1080

Query: 1283 LSR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
                       +FHVG  VT   +  ++          PG  +     +++GT+ GS+G 
Sbjct: 1081 NGAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CIIYGTVMGSLGA 1126

Query: 1336 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +      D++ F     L+  +    P + G +  ++R       A+ P  D ++D +L 
Sbjct: 1127 LLAFTSRDDVDF--FSHLEMHMRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLC 1177

Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              +  LPL+ Q +IA +   T  +IL  L ++
Sbjct: 1178 EQFPTLPLDLQRKIADELDRTPGEILKKLEEV 1209


>gi|426192113|gb|EKV42051.1| hypothetical protein AGABI2DRAFT_229642 [Agaricus bisporus var.
            bisporus H97]
          Length = 1213

 Score = 63.9 bits (154), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 115/466 (24%), Positives = 196/466 (42%), Gaps = 76/466 (16%)

Query: 998  IPLKATPHQITYFAEKNLYPLIV-------SVPVLKPLNQVL--SLLIDQEVGHQIDNHN 1048
            IPL  TP +   +   NL+ LI           V K LN++   +  IDQEV       N
Sbjct: 788  IPLSYTPRKFITYPLNNLFYLIEGDHRVMGQDAVDKKLNELRQQNKAIDQEV------LN 841

Query: 1049 LSSVDLHRTYTVE---EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1105
            LS     R           +RI++P       +T + +P+  +E+A ++ VV     + K
Sbjct: 842  LSPEVFGRPKAANGTWASNIRIIDPVEG----KTISVVPLDGNESAFSLAVVPF---SAK 894

Query: 1106 ENETLLAIGTAYVQGEDVAARG------RVLLF-STGRNADNPQNLVTEVYSKELKGAIS 1158
             NE  L +GTA      ++ R       RV  F   GR  +        V+  E+     
Sbjct: 895  GNELHLVVGTA--ADTKLSPRTCSTGFLRVYKFLEDGRQLE-------LVHKTEIDDVPL 945

Query: 1159 ALASLQGHLLIASGPKIILH----KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            AL + QG L+   G  + ++    K    ++    F  A    +V+L+   + IL+GD+ 
Sbjct: 946  ALMAFQGRLVAGVGKALRIYDIGKKKMLRKVENKQFGSA----IVTLSTQGSRILVGDMQ 1001

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +SI+F  +K    +L + A D       +   ++D +T  +V +D   NI +    P++S
Sbjct: 1002 ESIFFAVYKAPENRLLIFADD-SQPRWISAATMVDYNT--VVAADRFGNIFVNRLDPRVS 1058

Query: 1275 ----ESWKGQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLF 1326
                E   G  +L     ++GA H TK +    +  L TS  +     G     R  LL+
Sbjct: 1059 DQVDEDPTGAGILHEKGLYMGAPHKTKMICHFHVGDLITSIHKVSLVAG----GREVLLY 1114

Query: 1327 GTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
              L G+IG + P    +++ F  + +L++ +      + G +   +R ++   KA     
Sbjct: 1115 TGLHGTIGILVPFVTKEDVDF--ISTLEQHMRTEQVSLVGRDHLGWRGYYVPVKA----- 1167

Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
              +VD +L   Y  LP  +Q  IA +   +  ++L  L  L +  S
Sbjct: 1168 --VVDGDLCEMYAKLPGSKQSAIAGELDRSIGEVLKKLEQLRVTAS 1211



 Score = 45.4 bits (106), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ ++   V+ L +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
                       S TS  SSG+ E     +A  P    + R   + AL D +   +  + 
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423

Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
               N   N+++ Q  F+   R +  +   L+   +GL +    S+            +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467

Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG    +WT                   DD Y +Y+I+S    T+VL   + + EV ++ 
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
            +     T+A   + G   ++QV   G R                  I+  +        
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKIIVAATTNKRQVVV 571

Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
                   Y   DL  G  N      +  STVL++SI D        PY+ +G  D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630

Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
           ++  DP +   T+S+Q     P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655


>gi|395333071|gb|EJF65449.1| hypothetical protein DICSQDRAFT_178021 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1213

 Score = 63.9 bits (154), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 117/495 (23%), Positives = 199/495 (40%), Gaps = 72/495 (14%)

Query: 965  CNHGFIYVTSQGILKICQLPSGSTY--DNYWPVQKIPLKATPHQ---ITYFAEKNLYPLI 1019
            C  G I + S  +L+I Q+P   T    +  P+   P K  PH    + Y  E + + ++
Sbjct: 759  CQEGLIGI-SGSLLRIFQIPKLGTKLKQDSIPLSYTPRKLIPHPHNGLLYLIEGD-HRVM 816

Query: 1020 VSVPVLKPLNQVLSLLIDQEVGHQIDNH--NLSSVDLHRTYTVE---EYEVRILEPDRAG 1074
                  K L Q+      +E G  +D    NL      R           +RI+ P  A 
Sbjct: 817  SEEAAAKQLQQL------RESGRAVDEEMVNLPPEQFGRPKAPAGTWASCIRIISPLDA- 869

Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLF-S 1133
               QT   I + ++E A ++ +V       K NE  L +GTA  Q   +A R     F  
Sbjct: 870  ---QTVNVIHLDNNEAAFSLAIVPF---AAKNNELHLVVGTA--QDTFLAPRSCTSGFLR 921

Query: 1134 TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
            T R  D+ +NL   ++  E      A+ + QG L+   G  + L+     +L        
Sbjct: 922  TYRFTDDGRNLEL-LHKTETNDVPLAIMAFQGKLVAGVGKALRLYDIGKKKLLRKVENKT 980

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
                +V+LN   + I++GD+ +S+++  +K    +L + A D       AT  L D +T 
Sbjct: 981  LGSTIVTLNTQGSRIIIGDMQESVFYAVYKPPENRLLVFADDVQPRWVTATTML-DYNT- 1038

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESW------------KG------QKLLSRAEFHVGAHVT 1295
             +V SD   N+ +     K+S+              KG       K    A FH+G  VT
Sbjct: 1039 -VVASDRFGNVFVNRLDAKISDQIDDDPTGAGILHEKGVLFGAPHKTAMLAHFHIGDIVT 1097

Query: 1296 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKL 1354
               ++ ++A                R  +L+  L G+IG + P + +     L +L++ +
Sbjct: 1098 SLNKISLVAGG--------------REVILYTCLHGTIGILVPFVSKEDVDLLTTLEQHM 1143

Query: 1355 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
                  + G +  ++R ++   KA       +VD +L   +  LP  +Q  IA +   T 
Sbjct: 1144 RTEQLSLVGRDHLAWRGYYVPVKA-------VVDGDLCESFAKLPANKQSSIAGELDRTV 1196

Query: 1415 SQILSNLNDLALGTS 1429
             ++L  L  L +  S
Sbjct: 1197 GEVLKKLEQLRVTAS 1211


>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1156

 Score = 63.9 bits (154), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 78/345 (22%), Positives = 145/345 (42%), Gaps = 46/345 (13%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + ++G +++ HNL  +D H    +  ++                    MQ+ E A+++  
Sbjct: 795  NDQLGQEVEIHNLLIIDQHTFEVLHAHQF-------------------MQT-EYAMSIVS 834

Query: 1097 VTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
              L N    +  T   +GTA V   E    +GR+++F      D     V E   +E+KG
Sbjct: 835  TRLGN----DPNTYYIVGTANVLPDESDPKQGRIVVFHW---VDGKLEHVAE---QEIKG 884

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGT-ELNGIA--FYDAPPLYVVSLNIVKNFILLGD 1212
            A  ++    G LL A    + L +W    EL      F +   LY   L    +F+L+GD
Sbjct: 885  APYSMLEFNGKLLAAINSTVRLFEWNAERELRNECSHFNNILALY---LRAKGDFVLVGD 941

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            + +S+  L++K        +A+D+ +    + E L D + L    ++   N+ +      
Sbjct: 942  LMRSMSLLAYKPLEGNFEEIARDYQTNWMSSVEILDDDTFLG---AESTTNLFVCQKDSA 998

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             +   + Q L    +FH+G  V  F    ++      T +        + ++LFGT+ G+
Sbjct: 999  ATTDEERQHLQEVGQFHLGEFVNVFRHGSLVMQHPGETSSP------TQGSVLFGTIHGA 1052

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            IG ++ L    +  L  +Q+KL   +  V  ++   +R F +  K
Sbjct: 1053 IGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTERK 1097



 Score = 57.8 bits (138), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 90/401 (22%), Positives = 154/401 (38%), Gaps = 81/401 (20%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
           V+D  F+HG   P +V+LH+         S   H       +   +LK    +   W   
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHM-----KTYEVSLKDKEFVKGPWKQD 210

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  +A  ++AVP P  G L++G  +I YH+         + Y V    +  L R S  V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259

Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
              ++DA  + +L  D+A      G L +L +  + ++     V+ L L       +   
Sbjct: 260 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 313

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + ++GSRLGDS L++             + E F ++                 
Sbjct: 314 ITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFTNL---------------GP 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + DM                +    T S A ++     G L+    G+ I+  AS     
Sbjct: 359 IVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS----- 401

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +        +    R      E    L++S   +T VL  + 
Sbjct: 402 -------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAERDNTLVLSFVRQTRVLMLSG 454

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
              E TE   +    +T   GN+   +++IQV     R++D
Sbjct: 455 EEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 494


>gi|399216895|emb|CCF73582.1| unnamed protein product [Babesia microti strain RI]
          Length = 1232

 Score = 63.5 bits (153), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 98/406 (24%), Positives = 164/406 (40%), Gaps = 90/406 (22%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            VRI++P        T A + + + E A++     L      E    LA+GT  V G ++A
Sbjct: 866  VRIIDPT----TLSTAAKLLLDTDEAAISCCACDL------EGYRCLAVGT--VTGWNLA 913

Query: 1125 ARG----RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
                    + +++ G N +     +T ++S ++ G   AL + +G LL   GP +IL+  
Sbjct: 914  NSNSNSCHIRMYAYGPNFE-----ITFLHSTKVTGIPRALLAYEGRLLAGVGPDVILYAL 968

Query: 1181 TGTELNGIAFYD-----------APP--------LYVVSLNIVKNFILLGDIHKSIYFLS 1221
               +L   A Y            A P          V+ L    N I +GDI +SI  L 
Sbjct: 969  GKRQLLKKAEYRGGVIDIQGYGVATPRTIGNGGLFGVMWLGASGNRIFVGDIRESITVLK 1028

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES----- 1276
            + EQ A+L+L+  D           ++D  T++LV  D+     +     + S S     
Sbjct: 1029 FDEQMAKLSLICDDIRP-RWITGATVLDHHTVALV--DKFDTFAVCRVPSEASASNLSSA 1085

Query: 1277 --------------WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
                            G K    A+FH+G           L+T  D+     G  +    
Sbjct: 1086 LNSGSLEAVMPTILGVGNKFEQEAQFHLGD----------LSTCIDKVTLCSGCTE---- 1131

Query: 1323 ALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
            A+++ T+ GSIG + P    DEL    LQ L+  + +  P ++G     +R ++      
Sbjct: 1132 AVVYATILGSIGALIPFISSDELD--TLQHLELLMANENPPLSGREHSIYRSYY------ 1183

Query: 1380 RPGP-DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              GP   ++D +L   +E L    Q  IA +   T ++I+  L D+
Sbjct: 1184 --GPVQHVIDGDLCEEFESLDSITQSRIAAKIDKTVTEIIKKLRDI 1227


>gi|296814646|ref|XP_002847660.1| pre-mRNA-splicing factor rse1 [Arthroderma otae CBS 113480]
 gi|238840685|gb|EEQ30347.1| pre-mRNA-splicing factor rse1 [Arthroderma otae CBS 113480]
          Length = 1235

 Score = 63.5 bits (153), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 101/462 (21%), Positives = 188/462 (40%), Gaps = 52/462 (11%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
             C  G + +  Q +    ++ S    DN    + IPL  T   +    E   YP+     
Sbjct: 759  QCVEGMVGIQGQNL----RIFSIEKLDNNLLQESIPLAYTSRSLVRHPE---YPIFY--- 808

Query: 1024 VLKPLNQVLS-----LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGP 1076
            V+   N VLS      L+ +      DN  L   D         +   +++++P  A   
Sbjct: 809  VIGSDNNVLSPSTKAKLLSESTTVNGDNAELPPEDFGYPRGTNHWASCIQVVDPINAKA- 867

Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
                + I ++ +E A+++  V+    T++E+ET L +GT   +G  V+ R     F    
Sbjct: 868  --VMSRIELEDNEAAVSIAAVSF---TSQEDETFLVVGTG--KGMVVSPRSFTCGFIHIY 920

Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
                    +  ++  +++    AL   QG LL   GP + ++     +L         P 
Sbjct: 921  RFQEEGKELEFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQITPR 980

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
             +V L    + I++ D+ +S+ ++ +K Q   L   A D        T  ++D  T++  
Sbjct: 981  VIVGLQTQGSRIIVSDVQESVTYVVYKYQENNLIPFADDIIPRWTTCTT-MVDYETVA-- 1037

Query: 1257 VSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSD 1308
              D+  NI +    PK SE       G  L+   ++  GA     +      Q + TS  
Sbjct: 1038 GGDKFGNIWLLRCPPKASEEADEDGSGAHLIHERQYLQGAPNRLSLVAHFYSQDIPTSIQ 1097

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
            +T    G     R  L++  L G++G + P    D++ F   Q+L+ +L    P +AG +
Sbjct: 1098 KTQLVAG----GRDILVWTGLQGTVGMLIPFVTRDDVDF--FQTLEMQLTSQNPPLAGRD 1151

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
               +R +++  K        ++D +L   + +LP +++  IA
Sbjct: 1152 HLIYRGYYAPCKG-------VIDGDLCETFFLLPNDKKQAIA 1186


>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1181

 Score = 63.5 bits (153), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 78/345 (22%), Positives = 145/345 (42%), Gaps = 46/345 (13%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + ++G +++ HNL  +D H    +  ++                    MQ+ E A+++  
Sbjct: 820  NDQLGQEVEIHNLLIIDQHTFEVLHAHQF-------------------MQT-EYAMSIVS 859

Query: 1097 VTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
              L N    +  T   +GTA V   E    +GR+++F      D     V E   +E+KG
Sbjct: 860  TRLGN----DPNTYYIVGTANVLPDESDPKQGRIVVFHW---VDGKLEHVAE---QEIKG 909

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGT-ELNGIA--FYDAPPLYVVSLNIVKNFILLGD 1212
            A  ++    G LL A    + L +W    EL      F +   LY   L    +F+L+GD
Sbjct: 910  APYSMLEFNGKLLAAINSTVRLFEWNAERELRNECSHFNNILALY---LRAKGDFVLVGD 966

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            + +S+  L++K        +A+D+ +    + E L D + L    ++   N+ +      
Sbjct: 967  LMRSMSLLAYKPLEGNFEEIARDYQTNWMSSVEILDDDTFLG---AESTTNLFVCQKDSA 1023

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             +   + Q L    +FH+G  V  F    ++      T +        + ++LFGT+ G+
Sbjct: 1024 ATTDEERQHLQEVGQFHLGEFVNVFRHGSLVMQHPGETSSP------TQGSVLFGTIHGA 1077

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            IG ++ L    +  L  +Q+KL   +  V  ++   +R F +  K
Sbjct: 1078 IGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTERK 1122



 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 92/420 (21%), Positives = 156/420 (37%), Gaps = 94/420 (22%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
           V+D  F+HG   P +V+LH+         S   H       +   +LK    +   W   
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHM-----KTYEVSLKDKEFVKGPWKQD 210

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  +A  ++AVP P  G L++G  +I YH+         + Y V    +  L R S  V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259

Query: 363 ---ELDAAHATWLQNDVA-------------------LLSTKTGDLVLLTVVYDGRV--- 397
              ++DA  + +L  D+A                   LL    G L +L +  + ++   
Sbjct: 260 CYGKVDANGSRYLLGDMAGRLFMLLLEREDKMDGTXYLLGDMAGRLFMLLLEREDKMDGT 319

Query: 398 --VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
             V+ L L       +   IT + N + ++GSRLGDS L++             + E F 
Sbjct: 320 TTVKDLKLEFLGEITIAECITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFT 379

Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
           ++                 + DM                +    T S A ++     G L
Sbjct: 380 NL---------------GPIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSL 412

Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
           +    G+ I+  AS            ++LPG KG+W +        +    R      E 
Sbjct: 413 RIIRNGIGIHEHAS------------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAER 460

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
              L++S   +T VL  +    E TE   +    +T   GN+   +++IQV     R++D
Sbjct: 461 DNTLVLSFVRQTRVLMLSGEEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 519


>gi|296086939|emb|CBI33172.3| unnamed protein product [Vitis vinifera]
          Length = 934

 Score = 63.5 bits (153), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 92/392 (23%), Positives = 162/392 (41%), Gaps = 77/392 (19%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RIL+P  A     T   + +Q +E A ++  V   N   KE  TLLA+GTA        
Sbjct: 583  IRILDPRTA----TTTCLLELQDNEAAFSICTV---NFHDKEYGTLLAVGTA-------- 627

Query: 1125 ARGRVLLFSTGRNAD----------NPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
               + L F   R+ D               +  ++  +++G   AL   QG LL   G  
Sbjct: 628  ---KSLQFWPKRSFDAGYIHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSV 684

Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
            + L+      L         P  +VS++  ++ I +GDI +S ++  ++    QL + A 
Sbjct: 685  LRLYDLGKRRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFAD 744

Query: 1235 DFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKL 1282
            D  S+  + T  + ID  T++   +D+  NI        +S+            W+  KL
Sbjct: 745  D--SVPRWLTASYHIDFDTMA--GADKFGNIYFVRLPQDVSDEVEEDPTGGKIKWEQGKL 800

Query: 1283 LSR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
                       +FHVG  VT   +  ++          PG  +     +++GT+ GS+G 
Sbjct: 801  NGAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CIIYGTVMGSLGA 846

Query: 1336 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +      D++ F     L+  +    P + G +  ++R       A+ P  D ++D +L 
Sbjct: 847  LLAFTSRDDVDF--FSHLEMHMRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLC 897

Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
              +  LPL+ Q +IA +   T  +IL  L ++
Sbjct: 898  EQFPTLPLDLQRKIADELDRTPGEILKKLEEV 929


>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
 gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
          Length = 1140

 Score = 63.5 bits (153), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 114/267 (42%), Gaps = 51/267 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+++Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D     +      LL    G L +L +       G  V+ + + K     +   
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
           IT + N   ++G+R GDS LV+ +  S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339



 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 78/360 (21%), Positives = 142/360 (39%), Gaps = 42/360 (11%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D   T+ V       L   +   P    + +  +  ++  T  V
Sbjct: 780  NAEVGQEIDVHNLLVID-QNTFEV-------LHAHQFVAPETISSLMSAKLGDDPNTYYV 831

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
            V        E E  +               GR+++F    +       +T+V   ++ G 
Sbjct: 832  VATSLVIPDEPEPKV---------------GRIIIFHYHDSK------LTQVAETKVDGT 870

Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
              AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +S
Sbjct: 871  CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMRS 930

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            I  L  K+       +A+D       A E L D + L    S+   N+ +       +  
Sbjct: 931  ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987

Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
             + Q L   A FH+G  V  F    ++  +  +RT    G        +L+GT +G+IG 
Sbjct: 988  EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040

Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
            +  + +  +  L  L+++L   +  V  +    +R F  N K      +  +D +L+  +
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIESF 1098


>gi|326426696|gb|EGD72266.1| hypothetical protein PTSG_00286 [Salpingoeca sp. ATCC 50818]
          Length = 1104

 Score = 63.5 bits (153), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 76/323 (23%), Positives = 138/323 (42%), Gaps = 51/323 (15%)

Query: 1106 ENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            +N     IGTA+V   E   +RGR+L+       +N + +   V+  E  G++  L  + 
Sbjct: 779  DNTEYFIIGTAFVDPTETQPSRGRILI----SKLENKKEIAI-VHECEAAGSVYCLTKMC 833

Query: 1165 GH----LLIASGPKIILHKWTGT---------ELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            G     L+     +++  K+  T          ++G   + A    VVSL+   + +L+G
Sbjct: 834  GKDTDDLVAGINNQVVHFKYDATGQDAAKKLRAVSGNQNFGA----VVSLDSCDDIVLVG 889

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA----TEFLIDGSTLSLVVSDEQKNIQIF 1267
            D+  +++ +   +   QL   ++    +   A    T FL+     SL V   +     F
Sbjct: 890  DMLNAVFVMQKAQDKLQLVAGSQTANWVSSCALVNETVFLVASHAHSLSVCQRE-----F 944

Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT----NRFA 1323
                 M      Q L ++ E ++G  VT F+R  +        G+A   D +    N F 
Sbjct: 945  EPGSTM------QTLNAKFEIYLGETVTSFVRAAL--------GSAAAVDSSMPLRNTF- 989

Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
             +FGT+ G + C+ PL       L +L+ ++ + +  + GL+ R FR      +  +   
Sbjct: 990  FVFGTMGGGLACLLPLTPPQTELLTALECRMEEKIGGLGGLDHREFRTARDEQRMAQQVN 1049

Query: 1384 DSIVDCELLSHYEMLPLEEQLEI 1406
              +VD +L+  +  LP EEQ E+
Sbjct: 1050 PRLVDGDLVETFLQLPEEEQKEL 1072


>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
 gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
          Length = 1140

 Score = 63.5 bits (153), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 79/360 (21%), Positives = 142/360 (39%), Gaps = 42/360 (11%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D   T+ V       L   +   P    + +  +  ++  T  V
Sbjct: 780  NAEVGQEIDVHNLLVID-QNTFEV-------LHAHQFVAPETISSLMSAKLGDDPNTYYV 831

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
            V        E E  +               GR+++F    N       +T+V   ++ G 
Sbjct: 832  VATSLVIPDEPEPKV---------------GRIIIFHYHDNK------LTQVAETKVDGT 870

Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
              AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +S
Sbjct: 871  CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMRS 930

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            I  L  K+       +A+D       A E L D + L    S+   N+ +       +  
Sbjct: 931  ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987

Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
             + Q L   A FH+G  V  F    ++  +  +RT    G        +L+GT +G+IG 
Sbjct: 988  EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040

Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
            +  + +  +  L  L+++L   +  V  +    +R F  N K      +  +D +L+  +
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIESF 1098



 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 113/267 (42%), Gaps = 51/267 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+++Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELSVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D     +      LL    G L +L +       G  V+ + + K     +   
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
           IT + N   ++G+R GDS LV+ +  S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339


>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
 gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
          Length = 1140

 Score = 63.2 bits (152), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/360 (21%), Positives = 141/360 (39%), Gaps = 42/360 (11%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D        +    +L   +   P    + +  +  ++  T  V
Sbjct: 780  NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVAPETISSLMSAKLGDDPNTYYV 831

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
            V        E E  +               GR+++F    N       +T+V   ++ G 
Sbjct: 832  VATSLVIPDEPEPKV---------------GRIIIFHYHDNK------LTQVAETKVDGT 870

Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
              AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +S
Sbjct: 871  CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMRS 930

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            I  L  K+       +A+D       A E L D + L    S+   N+ +       +  
Sbjct: 931  ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987

Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
             + Q L   A FH+G  V  F    ++  +  +RT    G        +L+GT +G+IG 
Sbjct: 988  EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040

Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
            +  + +  +  L  L+++L   +  V  +    +R F  N K      +  +D +L+  +
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIESF 1098



 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 113/264 (42%), Gaps = 51/264 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKEASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLITVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D+    +      LL    G L +L +       G  V+ + + +     +   
Sbjct: 260 CYA-RVDSKGFRY------LLGNMDGQLYMLFLGTSETSKGITVKDIKVEQLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
           IT + N   ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336


>gi|322707263|gb|EFY98842.1| Pre-mRNA-splicing factor rse-1 [Metarhizium anisopliae ARSEF 23]
          Length = 1212

 Score = 63.2 bits (152), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 84/367 (22%), Positives = 160/367 (43%), Gaps = 38/367 (10%)

Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFST 1134
            Q   T+ ++++E A++  +V      +++NE+ L +GT    G+DV    R         
Sbjct: 870  QVVQTVDLENNEAAVSAAIVPF---ASQDNESFLIVGT----GKDVVVNPRNFSEAYIYV 922

Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
             R  +  + L   ++  +++    AL   QG LL   G  + ++     ++   A  +  
Sbjct: 923  YRFQEEGREL-EFIHKTKIEEPALALIPFQGKLLAGVGKTLRVYDLGMRQMLRKAQAEVA 981

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
            P  +VSLN   + I++GDI + + ++++K    +L   A D  +     T  ++D    S
Sbjct: 982  PQQIVSLNTQGSRIIVGDIQQGVTYVTYKPTTNKLIPFADDTIARWTTCTT-MVDYE--S 1038

Query: 1255 LVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGA----HVTKFLRLQMLATS 1306
            +   D+  N+ I    PK S    E   G  L++  ++  G      +      Q + TS
Sbjct: 1039 VAGGDKFGNMFIVRCPPKASEEADEEQSGLHLMNARDYLHGTSQRLDLMCHFYTQDIPTS 1098

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              +T    G        LL+  L G+IG   PL   ++  F   QSL+  L    P +AG
Sbjct: 1099 MAKTSLVVGGQD----VLLWSGLMGTIGVFIPLISREDADF--FQSLESHLRTEDPPLAG 1152

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
             +   +R +++  K        I+D +L   Y +LP +++  IA +   +  +I   ++D
Sbjct: 1153 RDHLMYRSYYAPVKG-------IIDGDLCERYTLLPNDKKQMIAGELDRSVREIERKISD 1205

Query: 1424 LALGTSF 1430
            +   ++F
Sbjct: 1206 IRTRSAF 1212


>gi|384080885|dbj|BAM11105.1| damage-specific DNA binding protein 1, 127kDa, partial
            [Siebenrockiella crassicollis]
          Length = 364

 Score = 63.2 bits (152), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 61/235 (25%), Positives = 107/235 (45%), Gaps = 30/235 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
            K+  T   +GTA V  E+   + GR+++F  S G+        +  +  KE+KGA+ ++ 
Sbjct: 152  KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 203

Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
               G LL +    + L++WT      TE N   + +   LYV +     +FIL+GD+ +S
Sbjct: 204  EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALYVKTKG---DFILVGDLMRS 258

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  L++K        +A+DF      A E L D + L    ++   N+ +       +  
Sbjct: 259  VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 315

Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1331
             + Q L     FH+G  V  F    ++  +   T + P  D     ++LFGT++G
Sbjct: 316  EERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STPTQD-----SVLFGTVNG 364


>gi|429961863|gb|ELA41407.1| hypothetical protein VICG_01512 [Vittaforma corneae ATCC 50505]
          Length = 1153

 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 86/385 (22%), Positives = 165/385 (42%), Gaps = 69/385 (17%)

Query: 1062 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1121
            +Y + +  PD     ++  ++  M+  EN L   +  +FN         L + T++ +GE
Sbjct: 804  QYSIELRSPD-----FKLISSTTME--ENELICDIKVIFNN-------FLVVCTSFPEGE 849

Query: 1122 DVAARGRVLLFSTGRNADNPQNL-VTE----VYSKELKGAISALASLQGHLLIASGPKII 1176
            D   +G+++++S      +P NL +T+    + S+ LK        ++  + +  G K++
Sbjct: 850  DKMTKGKLIVYSLVNIVPDPDNLHITKKLKLICSETLKNPCLFCEEVRSLISVCVGTKLM 909

Query: 1177 LHKWT-GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA-QLNLLAK 1234
            ++++   T L  +  ++   L   SL + KN I + DI   IYF   + +   +L+LL +
Sbjct: 910  IYEFNENTGLAAVGRHEL-SLLCTSLFVTKNLIAVSDIMNGIYFFFLRPRDPLKLHLLGR 968

Query: 1235 D--------FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
                      G +D F   F  D    S+V   +   ++IF Y+P    S  G +L+ RA
Sbjct: 969  SCLVPNCRFLGGID-FCPSFETDALQFSIVSVCKYGIVRIFTYSPYDPVSKNGNQLVKRA 1027

Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG------SIGCIAPLD 1340
            E      VTK               A P       + ++FG ++       S   +  L 
Sbjct: 1028 EI-----VTKL--------------ANP------LYKVVFGQINEFESILLSSNVMVLLR 1062

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE--LLSHYEML 1398
             + F +LQ++Q  +   + +  G+N R++ +     +   P   S++ CE  LL  +   
Sbjct: 1063 AINFPKLQAIQHCISIFISNRCGINVRNYLE---TEEFVNPECKSVI-CEKILLEFFYFK 1118

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLND 1423
            PL ++ +I    G     I+  + D
Sbjct: 1119 PLVQE-KICKLVGLDYFNIVELIED 1142


>gi|237837399|ref|XP_002367997.1| splicing factor 3B subunit 3, putative [Toxoplasma gondii ME49]
 gi|211965661|gb|EEB00857.1| splicing factor 3B subunit 3, putative [Toxoplasma gondii ME49]
 gi|221488748|gb|EEE26962.1| splicing factor 3B subunit, putative [Toxoplasma gondii GT1]
 gi|221509241|gb|EEE34810.1| splicing factor 3B subunit, putative [Toxoplasma gondii VEG]
          Length = 1233

 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 68/299 (22%), Positives = 119/299 (39%), Gaps = 49/299 (16%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            V+S  ++    AL + +G LL   G K+ L+      L     Y   P  V  + +  + 
Sbjct: 957  VHSTPVEDYPMALTAFRGMLLAGVGHKLRLYALGRKRLLKKCEYKNLPCGVAFIRVAGDR 1016

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL---SLVVSDEQKNI 1264
            + +GD+ +S++ + ++       +LA D          +L  G  L   + V +D+  ++
Sbjct: 1017 LFVGDVRESVHVMRYRLSENLFYVLADDV------VPRWLTKGEVLDYHTFVAADKFDSV 1070

Query: 1265 QIFYYAPKMSESWKGQ------------------KLLSRAEFHVGAHVTKFLRLQMLATS 1306
             I     +  E   G                   KL S   FH+G  VT   R  + + +
Sbjct: 1071 FICRVPSEAKEDELGDTTGLRLRGDTTYLTDKCFKLQSLLHFHIGEIVTALERATLTSAA 1130

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLN 1365
            S+              ++++GT+ GSIG  +P L +        L+  +    P +AG  
Sbjct: 1131 SE--------------SIVYGTIMGSIGSFSPFLTKHELDLFTHLEMVMRSEKPPLAGRE 1176

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
               FR ++   K       + VD +L   Y +LP E+Q  IA     T + IL +L D+
Sbjct: 1177 HIMFRSYYHPAK-------NTVDGDLCESYALLPYEDQKRIAQDFEKTPADILKHLEDI 1228


>gi|402222132|gb|EJU02199.1| hypothetical protein DACRYDRAFT_21931 [Dacryopinax sp. DJM-731 SS1]
          Length = 1209

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 123/569 (21%), Positives = 231/569 (40%), Gaps = 61/569 (10%)

Query: 887  LRNLRFSRTPLDAY------TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
            L+N    RT LD        TR       P + I +  N+ G        SR      ++
Sbjct: 674  LQNGVLLRTVLDPVNGQLTDTRTRFLGSRPVRLIRV--NVHGLPSILALSSRSWLNYTYQ 731

Query: 941  ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL 1000
              L   P + D    A++    + C  G I + S  +L+I Q+P+            IPL
Sbjct: 732  NLLHFTPLIFDPLEYAWSFSAEL-CPDGLIGI-SGNVLRIFQVPN---LGQKLKQDVIPL 786

Query: 1001 KATPHQITYFAEKNLYPLIVSV-PVLKPLNQVLSLLIDQEVGHQIDNHNLS-SVDLHRTY 1058
              TP ++     + L+ +I S   VL P      L   +  G ++D   +    D+    
Sbjct: 787  SYTPRKMLQHPTERLFYVIESDHRVLSPEAADKKLQKLKSTGQRLDQEVIDLPADIFGRP 846

Query: 1059 TVEEYE----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIG 1114
              +       ++I++P       ++   +P+ ++E A ++ + T      +  E  L +G
Sbjct: 847  RADAGTWASCIQIIDPANV----RSVLEVPLDNNEAAFSLAITTFI---ARPGELFLVVG 899

Query: 1115 TAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            TA    +DV    +        T + +++ ++L   ++  E+     AL S QG L+   
Sbjct: 900  TA----QDVIVSPKSCKSGFLRTYKISEDGRSL-EFLHKTEVDDVPLALLSFQGRLVAGI 954

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
            G  + +       L       +    +V+L+   + I++GD+ +SIYF ++K    +L +
Sbjct: 955  GKALRIFDMGKKRLLRKCENKSFATAIVTLSTQGSRIIVGDMAESIYFATYKPPENRLLI 1014

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAE 1287
             A D           ++D  T+     D+  N+ +    PK+ E       G  +L    
Sbjct: 1015 FADD-SQPRWITASAMVDYDTVC--AGDKFGNVFVNRLPPKVGEQVDEDPTGAGVLHEKG 1071

Query: 1288 FHVGA-HVTKFLR---LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
              +GA H T  L    +  + TS  +     G     R  +L+  L G+IG + P    +
Sbjct: 1072 LFMGAPHKTNMLAHYYVGDIITSMHKVALVTG----GRDIVLYTGLHGTIGVLIPFISKE 1127

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            ++ F R  +L++ +    P + G +  ++R ++   K        +VD +L   + +LP 
Sbjct: 1128 DVDFIR--TLEQHMRTEAPSLVGRDHLTYRGYYVPVKG-------VVDGDLCELFSLLPT 1178

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            ++Q  IA +   T S++L  L  L + T+
Sbjct: 1179 QKQQSIAGELDRTYSEVLKKLEQLRVTTT 1207



 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 87/376 (23%), Positives = 150/376 (39%), Gaps = 83/376 (22%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ ++   V+ + +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIDHEDEEVKTMKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 -TCGSGTSML--SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN-- 492
              G     +  SS    + G  +    +  R R   +  L D +N  +  +    +N  
Sbjct: 368 QKLGDDDDEIEYSSVSYPDNGMADPIPQAYFRPRPLENLVLADELNSFDPIVDAKVTNLL 427

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           NT++ Q  F+   R +  +   L+   +GL +    S+            ELPG    +W
Sbjct: 428 NTDTPQ-IFAACGRGARSSFRMLR---HGLDVEETVSS------------ELPGIPNAVW 471

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  K+              DD+Y AY+I+S    T+VL   + + EV+++  +     T
Sbjct: 472 TVKLKA--------------DDQYDAYIILSFVNGTLVLSIGETIEEVSDT-GFLSSSPT 516

Query: 612 IAAGNLFGRRRVIQVFERGAR------------------ILDGS---------------- 637
           IA   + G   ++QV+  G R                  I+  +                
Sbjct: 517 IAVQQI-GEDSLLQVYPHGIRHVLSDRRVNEWRCPQHTTIVAATTNSRQVAIALSSAQLV 575

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDPST 689
           Y   DL  G  N      S  S VL++SIA+        PY+ +G  D ++R++  DP T
Sbjct: 576 YFELDLE-GQLNEYQDRKSLGSGVLAMSIAEVPEGRQRTPYLAVGCEDQTVRIISLDPDT 634

Query: 690 C--TVSVQTPAAIESS 703
               +S+Q   A  SS
Sbjct: 635 TLENISLQALTAPPSS 650


>gi|195996829|ref|XP_002108283.1| hypothetical protein TRIADDRAFT_49802 [Trichoplax adhaerens]
 gi|190589059|gb|EDV29081.1| hypothetical protein TRIADDRAFT_49802 [Trichoplax adhaerens]
          Length = 1208

 Score = 62.4 bits (150), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 83/390 (21%), Positives = 167/390 (42%), Gaps = 61/390 (15%)

Query: 1069 EPDRAGGPWQT-------RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1121
            EP    G W +         ++ +   E AL+V +        K +ET + +G A     
Sbjct: 852  EPKAGNGQWASCIQLLAPDQSLELDQDEAALSVAICRF---AYKPDETFVVVGVAKELNL 908

Query: 1122 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
            + ++    L+ +T R A+    LV +   +E+     A+A+ QG LL+ +G  + ++   
Sbjct: 909  NPSSSSGGLM-NTYRMANGQLELVHKTVVEEVP---RAMAAFQGRLLVGTGRILRVYDLG 964

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
              +L         P  +V+++ + + +++GD+ +S++F+ ++ +  +L + A D      
Sbjct: 965  RKKLLRKCENKNFPYRIVTISSMGSRVIVGDVQESVHFVKYRAKENRLVVFADDVSPRYV 1024

Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLL 1283
             AT FL D  T++  V D+  +I I   +  +++            W         QK  
Sbjct: 1025 TATCFL-DYDTIA--VGDKFGSIAILRLSDDINDEIEEDPTGAKAFWDRGLLNGASQKAN 1081

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
              A F++G  V   + LQ       +T   PG  ++    L++ TL GSIG + P    +
Sbjct: 1082 LEASFYIGETV---MSLQ-------KTTIIPGGSES----LIYTTLSGSIGVLLPFTSRE 1127

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            E+ F   Q L+  L      + G +  ++R ++   K       +++D ++   +  L  
Sbjct: 1128 EVDF--FQHLEMHLRSENAPICGRDHLAYRSYYFPAK-------NVIDGDMCEQFNALDG 1178

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
             ++  +A +   T  +I   L D+    +F
Sbjct: 1179 SKRRTLAMELDRTPPEISKKLEDMRTRYAF 1208


>gi|374095609|gb|AEY85032.1| spliceosomal-like protein [Camellia sinensis]
          Length = 1212

 Score = 62.4 bits (150), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 85/388 (21%), Positives = 162/388 (41%), Gaps = 69/388 (17%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1121
            +R+L+P  A     T   + +Q +E A ++    L N   KE  TLLA+GTA       +
Sbjct: 861  IRVLDPRTA----NTTCLLELQDNEAAFSI---CLVNFHDKEYGTLLAVGTAKGLQFWPK 913

Query: 1122 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
               + G + ++   R  ++ ++L   ++  ++     AL   QG LL   G  + L+   
Sbjct: 914  RSISSGYIHIY---RFVEDGKSLEL-LHKTQVDDVPLALCQFQGKLLAGVGSVLRLYDLG 969

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
              +L         P  + S++  ++ I +GDI +S ++  ++    QL + A D     C
Sbjct: 970  KRKLLRKCENKLFPNTITSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFADD-----C 1024

Query: 1242 ----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR- 1285
                    + ID  T++   +D+  NI     A  +S+            W+  KL    
Sbjct: 1025 VPRWLTASYHIDFDTMA--GADKFGNIYFVRLAQDVSDEIEEDPTGGKIKWEQGKLNGAP 1082

Query: 1286 ------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
                   +FHVG  VT   +  ++ +  +               +++GT+ GS+G +   
Sbjct: 1083 NKVEEIVQFHVGDVVTCLQKASLIPSGGE--------------CVIYGTVMGSLGALLAF 1128

Query: 1340 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
               D++ F     L+  +    P + G +  ++R       A+ P  D ++D +L   + 
Sbjct: 1129 TSRDDVDF--FSHLEMHMRQENPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFP 1179

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             LP++ Q +IA +   T  +IL  L ++
Sbjct: 1180 TLPMDMQRKIADELDRTPGEILKKLEEV 1207


>gi|167998730|ref|XP_001752071.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697169|gb|EDQ83506.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 172

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 8/96 (8%)

Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
           E+ A+ ++  A    RR S+ +        +L F +     R   +HCFE PE+ +L R 
Sbjct: 28  ETAALRTEAAAPGIHRRPSLTMRLR----IILAFTEC----RCLLIHCFEYPEYQYLNRS 79

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
           RE FA    V+ D  GRC  VL+Y  Q++ LKA  G
Sbjct: 80  RERFAMDLSVRADLVGRCASVLIYNSQLVTLKAGHG 115


>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
 gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
          Length = 1140

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/344 (20%), Positives = 138/344 (40%), Gaps = 42/344 (12%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D   T+ V                      +P ++    ++ ++
Sbjct: 780  NAEVGQEIDVHNLLIID-QNTFEV----------------LHAHQFVPPETISALMSAKL 822

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
                     +  T   + T+ V  ++   + GR+++F    N       +T+V   ++ G
Sbjct: 823  -------GDDPNTYYVVATSLVFPDEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
               AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +
Sbjct: 870  TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            SI  L  K+       +A+D       A E L D + L     D   N+ +       + 
Sbjct: 930  SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGCETHD---NLFVCQKDSAATT 986

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              + Q L   A FH+G  +  F    ++  +  +RT    G        +L+GT +G+IG
Sbjct: 987  DEERQLLPELARFHLGDTINVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
             +  + +  +  L  L+++L   +  V  ++   +R +  N K 
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKV 1083



 Score = 60.8 bits (146), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 65/263 (24%), Positives = 114/263 (43%), Gaps = 49/263 (18%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       +LR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATSLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
            +D   V D  F+HG + P ++++H+           +H  C    L     +K   L W
Sbjct: 159 -MDELIVYDVEFLHGCLNPTVIVIHKDN-------DGRHVKCHEINLRDKEFMK---LAW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   + 
Sbjct: 208 KQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRV--VQRLDLSKTNPSVLTSDI 415
           ++  +D+    +      LL    G L +L +  +  G+V  V+ + + +     +   I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGINETGKVPTVKDIKVEQLGEISIPECI 313

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
           T + N   ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336


>gi|330790247|ref|XP_003283209.1| CPSF domain-containing protein [Dictyostelium purpureum]
 gi|325086890|gb|EGC40273.1| CPSF domain-containing protein [Dictyostelium purpureum]
          Length = 1233

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 63/289 (21%), Positives = 128/289 (44%), Gaps = 29/289 (10%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            VY  E +  + A+A  QG L    G  I ++     +L         P  +V+++ + + 
Sbjct: 956  VYKTEAEEPVYAMAPFQGRLCAGVGKNIRIYDMGKKKLLRKCETKNLPNTIVNIHSLGDR 1015

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
            +++GDI +SI+F+ +K+    L + A D        +  ++D  T++   +D+  NI I 
Sbjct: 1016 LVVGDIQESIHFIKYKKLENMLYVFADDLAP-RWITSSVMLDYDTVA--GADKFGNIFIL 1072

Query: 1268 YYAPKMSESWKGQKLLSRAEFHVG---------AHVTKFLRLQMLATSSDRTGAAPGSDK 1318
                 +S+  +     S+ +F  G          H+        + T +  +    GSD 
Sbjct: 1073 RLPSNVSDEVEEDPTGSKLKFESGLLNGAPHKLEHIANIFAGDAITTLNKTSLVVGGSD- 1131

Query: 1319 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
                 LL+ T+ G+IG + P    +++ F    SL+ +L +    + G +  ++R ++  
Sbjct: 1132 ----VLLYTTISGAIGALIPFVSREDVDF--FSSLELQLRNEHAPLCGRDHLAYRSYYFP 1185

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             K       +I+D +L   +  L  ++Q +IA +   + S++L  L D+
Sbjct: 1186 VK-------NIIDGDLCEQFITLDPQKQRQIAEELSRSPSEVLKKLEDI 1227


>gi|452820919|gb|EME27955.1| splicing factor 3B subunit 3 [Galdieria sulphuraria]
          Length = 1294

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 63/275 (22%), Positives = 114/275 (41%), Gaps = 26/275 (9%)

Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
            I+ +AS QGHLL+A G  + ++     + L       A P  +  +    + I L D+ +
Sbjct: 1002 ITTMASFQGHLLVAVGTSLRMYDLGKKQLLKKTQHPRATPHKITCIETCYDRIFLSDVQE 1061

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK--- 1272
            S++   +         +A D+    C  T  L+D  T++  + D+  NI I    P+   
Sbjct: 1062 SVFLYRYSAADNLFLCIADDYLPKWC-TTMCLLDYDTVA--IGDKMGNISILRLPPEAGT 1118

Query: 1273 -MSESWKGQKLLSRAEFHVGAHVTKFL--RLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
             + +   G  L   A  H       ++   +Q L+     TG  P         L +GTL
Sbjct: 1119 FIEQDPTGGLLSKEAPHHFQLEACYYVGSVIQCLSKVEWTTGDVP--------LLFYGTL 1170

Query: 1330 DGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
            DG+IG + PL   L     Q+L+ +L +    + G +  ++R +    +        ++D
Sbjct: 1171 DGAIGVMIPLRSTLDMELFQALELQLREYRSPLCGRHHLAYRSYFFPVR-------HVID 1223

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
             +L   +  L LE+Q +I  +   +   +   L D
Sbjct: 1224 GDLCEEFYRLSLEQQEKIVKELDRSIVDVHRKLED 1258


>gi|336371417|gb|EGN99756.1| hypothetical protein SERLA73DRAFT_88390 [Serpula lacrymans var.
            lacrymans S7.3]
 gi|336384183|gb|EGO25331.1| hypothetical protein SERLADRAFT_355643 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1216

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 93/379 (24%), Positives = 165/379 (43%), Gaps = 39/379 (10%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RI++P  A    +T + I + S+E A ++ +V     + + NE  L +GTA      +A
Sbjct: 861  IRIIDPVEA----KTLSMITLDSNECAFSLAIVPF---SARGNELHLVVGTA--ADTFLA 911

Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
             R     F            +  ++  E      AL + QG L+   G  + ++     +
Sbjct: 912  PRSCSSGFLRTYKFTEDGTGLELLHKTETDDVPLALMAFQGRLVAGVGKALRIYDIGKKK 971

Query: 1185 LNGIAFYDAPPLY---VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
            L       A   +   +V+LN   + I++GD+ +SI F+++K    +L + A D      
Sbjct: 972  LLRKVENKARATFSTAIVTLNTQGSRIIVGDMQESISFVAYKAPENRLLVFADDNQPRWI 1031

Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTK 1296
             AT  ++D +T++    D   NI +    PK+SE       G  +L      +GA H TK
Sbjct: 1032 TATT-MVDYTTIA--AGDRFGNIFVNRLDPKVSEQVDDDPTGAGILHEKGLLMGAPHKTK 1088

Query: 1297 FL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSL 1350
             L    +  L TS ++     G     R  LL+  L G+IG + P    +++ F  + +L
Sbjct: 1089 MLAHFHIGDLVTSINKVSLVAG----GREVLLYTGLHGTIGILVPFVSKEDVDF--ISTL 1142

Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
            ++ +      + G +  S+R +++  K       S+VD +L   Y  LP  +Q  IA + 
Sbjct: 1143 EQHMRTEQGSLVGRDQLSWRGYYTPVK-------SVVDGDLCETYARLPGTKQSAIAGEL 1195

Query: 1411 GTTRSQILSNLNDLALGTS 1429
              T  ++L  L  L +  S
Sbjct: 1196 DRTVGEVLKKLEQLRVTAS 1214


>gi|413935524|gb|AFW70075.1| hypothetical protein ZEAMMB73_605375 [Zea mays]
          Length = 1229

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 121/554 (21%), Positives = 213/554 (38%), Gaps = 91/554 (16%)

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            +S  Q      SRP    + +    + P  CD ++ +     +  C+ G + V    +  
Sbjct: 713  VSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCD-TLESAASFSSDQCSEGVVAVAGDALRI 771

Query: 980  ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL-------KPLNQVL 1032
                  G T++       IPL+ TP +     +K    +I S               + L
Sbjct: 772  FTIERLGETFNE----TAIPLRYTPRKFVILPKKKYIAVIESDKGAFSAEEREAAKKECL 827

Query: 1033 SLLIDQEVGHQIDNHNLSSVDLH------RTYTVEEYE------------VRILEPDRAG 1074
                  E G+  +   + + D         T+  E+Y             +RIL+P    
Sbjct: 828  DASGAAENGNANNGDPMENGDGQDGAEEGNTFPDEQYGYPKAESERWVSCIRILDPRSR- 886

Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---L 1131
                T   + +Q +E A+++  V   N   KE+ TLLAIGTA  +G     R R L    
Sbjct: 887  ---DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAIGTA--KGLQFWPR-RTLAGGF 937

Query: 1132 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1191
                +  D  ++L   ++  +++    AL   QG LL   G  + L+     +L      
Sbjct: 938  IHVYKFVDEGRSLEL-LHKTQVEEVPLALCQFQGRLLAGVGSVLRLYDLGKRKLLRKCEN 996

Query: 1192 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
               P  +VS++  ++ I +GD+ +S ++  ++    QL + A D        T   ID  
Sbjct: 997  KLFPRTLVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADD-SVPRWLTTAQHIDFD 1055

Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR-------AEFHVGAH 1293
            T++   +D+  NI        +S+            W+  KL           +FHVG  
Sbjct: 1056 TMA--GADKFGNIYFARLPQDISDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDV 1113

Query: 1294 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSL 1350
            VT   +  ++          PG  +     L++GT+ GS+G +      +++ F     L
Sbjct: 1114 VTCLQKASLI----------PGGGE----CLIYGTVMGSVGALLAFTSREDVDF--FSHL 1157

Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
            +  L    P + G +  ++R       A+ P  D ++D +L   Y  LP + Q +IA + 
Sbjct: 1158 EMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQYPSLPADMQRKIADEL 1210

Query: 1411 GTTRSQILSNLNDL 1424
              T  +IL  L D+
Sbjct: 1211 DRTPGEILKKLEDI 1224


>gi|326483043|gb|EGE07053.1| pre-mRNA-splicing factor rse1 [Trichophyton equinum CBS 127.97]
          Length = 1209

 Score = 61.6 bits (148), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 106/488 (21%), Positives = 201/488 (41%), Gaps = 58/488 (11%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + +  Q +    ++ S    DN    + IPL  TP       E  L+ +I S  
Sbjct: 759  QCVEGMVGIQGQNL----RIFSIEKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDN 814

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT 1082
             +L P  +  + L+ +      D+  L   D         +   I   D    P  T++ 
Sbjct: 815  NILSPATK--AKLLSESTAVNGDSAELPPEDFGYPRGTNHWASSIQVVD----PIHTKSV 868

Query: 1083 I---PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG---- 1135
            +    ++ +E A+++  V+    T++E+ET L +GT    G+D+    R   F+ G    
Sbjct: 869  LSNLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIHI 919

Query: 1136 -RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
             R  +  + L   ++  +++    AL   QG LL   GP + ++     +L         
Sbjct: 920  YRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQIT 978

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
            P  +V L    + I++ D+ +S+ ++ +K Q   L   A D        T  ++D  T++
Sbjct: 979  PRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTT-MVDYETVA 1037

Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLAT 1305
                D+  NI +     K SE       G  L+   ++  GA      V  F   Q + T
Sbjct: 1038 --GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIPT 1094

Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
            S  +T    G     R  L++  L G++G   P    D++ F   Q+L+ +L    P +A
Sbjct: 1095 SIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPLA 1148

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
            G +   +R +++  K        ++D +L   + +LP +++  IA +   +  +I   ++
Sbjct: 1149 GRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKIS 1201

Query: 1423 DLALGTSF 1430
            D+    ++
Sbjct: 1202 DMRTKVAY 1209


>gi|326469377|gb|EGD93386.1| splicing factor 3B subunit 3 [Trichophyton tonsurans CBS 112818]
          Length = 1188

 Score = 61.6 bits (148), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 106/488 (21%), Positives = 201/488 (41%), Gaps = 58/488 (11%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + +  Q +    ++ S    DN    + IPL  TP       E  L+ +I S  
Sbjct: 738  QCVEGMVGIQGQNL----RIFSIEKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDN 793

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT 1082
             +L P  +  + L+ +      D+  L   D         +   I   D    P  T++ 
Sbjct: 794  NILSPATK--AKLLSESTAVNGDSAELPPEDFGYPRGTNHWASSIQVVD----PIHTKSV 847

Query: 1083 I---PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG---- 1135
            +    ++ +E A+++  V+    T++E+ET L +GT    G+D+    R   F+ G    
Sbjct: 848  LSNLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIHI 898

Query: 1136 -RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
             R  +  + L   ++  +++    AL   QG LL   GP + ++     +L         
Sbjct: 899  YRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQIT 957

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
            P  +V L    + I++ D+ +S+ ++ +K Q   L   A D        T  ++D  T++
Sbjct: 958  PRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTT-MVDYETVA 1016

Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLAT 1305
                D+  NI +     K SE       G  L+   ++  GA      V  F   Q + T
Sbjct: 1017 --GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIPT 1073

Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
            S  +T    G     R  L++  L G++G   P    D++ F   Q+L+ +L    P +A
Sbjct: 1074 SIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPLA 1127

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
            G +   +R +++  K        ++D +L   + +LP +++  IA +   +  +I   ++
Sbjct: 1128 GRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKIS 1180

Query: 1423 DLALGTSF 1430
            D+    ++
Sbjct: 1181 DMRTKVAY 1188


>gi|427798971|gb|JAA64937.1| Putative damage-specific dna binding complex subunit ddb1, partial
            [Rhipicephalus pulchellus]
          Length = 1259

 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 84/372 (22%), Positives = 156/372 (41%), Gaps = 56/372 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDV 1123
            +R+L P  +    QT   + ++ +E AL+V +V      ++ +E  + +G A  +  +  
Sbjct: 861  IRVLNPADS----QTLCKVALEQNEAALSVALVKF---ASQPDEQYVVVGAARELSLQPW 913

Query: 1124 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
             AR   LL  T R +   +  +  V++  ++ A +AL   QG LL   G  + L+     
Sbjct: 914  HARSGGLLL-TYRLSHAGETRLELVHATSVEEAPTALCPFQGRLLAGVGKCLRLYDLGRK 972

Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
            +L         P  +VS+  + N +++GD+ +S +FL +K Q  QL + A D        
Sbjct: 973  KLLRKCENKYIPSAIVSIQSMGNRVVVGDVQESFFFLRYKRQENQLVIFADD-AVPRWIT 1031

Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES---------------WKG---QKLLSR 1285
               ++D  T++   +D+  N+ I      +S+                W G   QK    
Sbjct: 1032 ASCMLDYDTVA--GADKFGNVSIIRLPNSVSDEVDEDPTGIKSLWDRGWLGGSSQKAEVI 1089

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1342
            + FH+G  V    +  ++          PG  ++    L++ TL G+IG + P    ++ 
Sbjct: 1090 SNFHIGETVLSLQKATLI----------PGGSES----LVYVTLSGTIGVLVPFTAHEDH 1135

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
             F   Q L+  +    P + G +  SFR  +   K       +++D +L   +  L   +
Sbjct: 1136 DF--FQHLEMHMRSENPPLCGRDHLSFRSSYFPVK-------NVIDGDLCEQFNSLDPSK 1186

Query: 1403 QLEIAHQTGTTR 1414
            Q  IA +    R
Sbjct: 1187 QKSIAEELDRNR 1198



 Score = 40.8 bits (94), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 60/265 (22%), Positives = 102/265 (38%), Gaps = 51/265 (19%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ +  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTIPVAASMCVLKTGFLFVAAEFGNHCLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E GD    AP           AL++++  EEL     A   T
Sbjct: 362 ARLGEEDEEPEFSSAIPLEEGDTFFFAPR----------ALRNLLPVEELDSLSPAMGCT 411

Query: 495 ------ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC- 547
                 E   + +    R     I  L+   +GL +            S   + ELPG  
Sbjct: 412 IADLANEDTPQLYVACGRGPRSCIRVLR---HGLEV------------SEMAVSELPGNP 456

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
             +WTV  K+              D++Y AY+I+S    T+VL   + + EVT+S  +  
Sbjct: 457 NAVWTVKRKA--------------DEDYDAYIIVSFVNATLVLSIGETVEEVTDS-GFLG 501

Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR 632
              T++   + G   ++QV+  G R
Sbjct: 502 TTPTLSCAQI-GDDALVQVYPEGIR 525


>gi|302807210|ref|XP_002985318.1| hypothetical protein SELMODRAFT_181612 [Selaginella moellendorffii]
 gi|300147146|gb|EFJ13812.1| hypothetical protein SELMODRAFT_181612 [Selaginella moellendorffii]
          Length = 1207

 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 83/380 (21%), Positives = 159/380 (41%), Gaps = 52/380 (13%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +R+L+P  A     T   + +Q +E A ++  V   +   K   T+LA+GTA        
Sbjct: 855  IRVLDPKAAA----TTCLLELQENEAAFSICCVNFHDN--KNLGTVLAVGTAKDLEWWPK 908

Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
             R         R  ++ ++L   V+   + G  +AL   QG LL   G  + ++     +
Sbjct: 909  RRSMGGFIHIYRFVEDGRSLEL-VHKTPIDGVPTALCQFQGRLLAGIGQILRIYDLGKRK 967

Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------- 1235
            L         P  + S++   + I +GDI +S +++ ++    QL   A D         
Sbjct: 968  LLRKCENKNFPNTITSIHSYGDRIYVGDIQESFHYVKYRRDENQLYAFADDSSPRWLTAS 1027

Query: 1236 ----FGSL---DCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
                F ++   D F   F +     LS  + D+    +I +   +++ +    K+    +
Sbjct: 1028 LHIDFDTMAAGDKFGNLFFVRLPQDLSEEIEDDPTGGKIKWEQGRLNGA--PNKVEEIIQ 1085

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTF 1344
            FHVG  VT   +  ++          PG  ++    +++GT+ GS+G + P    +++ F
Sbjct: 1086 FHVGEVVTCMQKASLI----------PGGGES----VIYGTVMGSVGALLPFSSREDVDF 1131

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
                 L+  +    P + G +  +FR       A+ P  D ++D +L   Y  LP + Q 
Sbjct: 1132 --FSHLEMHMRQEHPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYPTLPPDLQR 1182

Query: 1405 EIAHQTGTTRSQILSNLNDL 1424
            +IA +   T  +++  L D+
Sbjct: 1183 KIAEELDRTPGEVMKKLEDI 1202


>gi|302773427|ref|XP_002970131.1| hypothetical protein SELMODRAFT_171237 [Selaginella moellendorffii]
 gi|300162642|gb|EFJ29255.1| hypothetical protein SELMODRAFT_171237 [Selaginella moellendorffii]
          Length = 1207

 Score = 60.8 bits (146), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 83/380 (21%), Positives = 159/380 (41%), Gaps = 52/380 (13%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +R+L+P  A     T   + +Q +E A ++  V   +   K   T+LA+GTA        
Sbjct: 855  IRVLDPKAAA----TTCLLELQENEAAFSICCVNFHDN--KNLGTVLAVGTAKDLEWWPK 908

Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
             R         R  ++ ++L   V+   + G  +AL   QG LL   G  + ++     +
Sbjct: 909  RRSMGGFIHIYRFVEDGRSLEL-VHKTPIDGVPTALCQFQGRLLAGIGQILRIYDLGKRK 967

Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------- 1235
            L         P  + S++   + I +GDI +S +++ ++    QL   A D         
Sbjct: 968  LLRKCENKNFPNTITSIHSYGDRIYVGDIQESFHYVKYRRDENQLYAFADDSSPRWLTAS 1027

Query: 1236 ----FGSL---DCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
                F ++   D F   F +     LS  + D+    +I +   +++ +    K+    +
Sbjct: 1028 LHIDFDTMAAGDKFGNLFFVRLPQDLSEEIEDDPTGGKIKWEQGRLNGA--PNKVEEIIQ 1085

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTF 1344
            FHVG  VT   +  ++          PG  ++    +++GT+ GS+G + P    +++ F
Sbjct: 1086 FHVGEVVTCMQKASLI----------PGGGES----VIYGTVMGSVGALLPFSSREDVDF 1131

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
                 L+  +    P + G +  +FR       A+ P  D ++D +L   Y  LP + Q 
Sbjct: 1132 --FSHLEMHMRQEHPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYPTLPPDLQR 1182

Query: 1405 EIAHQTGTTRSQILSNLNDL 1424
            +IA +   T  +++  L D+
Sbjct: 1183 KIAEELDRTPGEVMKKLEDI 1202


>gi|301124447|ref|XP_002909707.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262106897|gb|EEY64949.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 328

 Score = 60.8 bits (146), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 65/280 (23%), Positives = 110/280 (39%), Gaps = 74/280 (26%)

Query: 925  GFFLSGSRPCWCMVFRERLRVHPQLCDGS-------------------IVAFTVLHNVNC 965
            G F  G+ P W +  R      P     S                   +++FT  H+ +C
Sbjct: 3    GAFFRGAHPMWILGDRGHASFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSC 62

Query: 966  NHGFIYVTSQGILKICQLPSGST-----YDNYWPVQKIPLKATPHQITYFA--------- 1011
             +GFIY  S+G L++C+LPS  T         + +QK    AT H + Y           
Sbjct: 63   PNGFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVAE 122

Query: 1012 --EKNLYPLIVSVPVLKPLNQVLSLL------------IDQEVGHQIDNHNLSSVDL--- 1054
              E   Y ++ S   LKP +   +              +D        N    + ++   
Sbjct: 123  ALEAPTYAVVCSA-RLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFAD 181

Query: 1055 ----HRTYTVEE-YEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT---- 1103
                H  +T E+ YE+R+++ D   G W  R    +  +  E  L+V+++ L++++    
Sbjct: 182  YETDHMAHTEEDVYELRLVQTDEF-GEWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKE 240

Query: 1104 ---------TKENETLLAIGTAYV--QGEDVAARGRVLLF 1132
                      K+    L +GT +V   GED + RGR+LL+
Sbjct: 241  EVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLY 280


>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
 gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
          Length = 1148

 Score = 60.8 bits (146), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 66/259 (25%), Positives = 113/259 (43%), Gaps = 26/259 (10%)

Query: 1113 IGTAYV-QGEDVAARGRVLLFS--TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
            +GTA V   E    +GR+++F    G+        + +V  KE+KGA  +L    G LL 
Sbjct: 831  VGTAIVLPDESDPKQGRIIIFHWVDGK--------LQQVAEKEIKGAPYSLLEFNGKLLA 882

Query: 1170 ASGPKIILHKWTGT-ELNGIA--FYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
            +    + L +W    EL+     F +   LY   L    +FIL+GD+ +S+  L++K   
Sbjct: 883  SINSTVRLFEWNAERELHNECSHFNNILALY---LKTKGDFILVGDLMRSMSLLAYKPLE 939

Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
                 +A+D+ +    A E L D + L    ++   N+ +       +     Q L    
Sbjct: 940  GSFEEIARDYQTNWMCAVEILDDDTFLG---AESTTNLFVCQKDSAATTDEDRQHLQEVG 996

Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
            +FH+G  V  F R   L        ++P      + ++LFGT+ G+IG +A L    +  
Sbjct: 997  QFHLGEFVNIF-RHGSLVMQHPGEASSP-----TQGSVLFGTIHGAIGLVAQLPSDFYNF 1050

Query: 1347 LQSLQKKLVDSVPHVAGLN 1365
            L  +Q  L   +  V  ++
Sbjct: 1051 LLEVQGNLTKVIKSVGKID 1069



 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 94/402 (23%), Positives = 155/402 (38%), Gaps = 95/402 (23%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
           V+D  F+HG   P +V+LH+         S   H   +    IS  LK    +   W   
Sbjct: 166 VQDMEFLHGCKTPTIVLLHQD--------SQARH---MKTYEIS--LKDKEFVKGPWKQD 212

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  +A  ++AVP P      +G  +I YH+      +           +  L R S  V
Sbjct: 213 HVESEATIVIAVPEPFCDARCIGQESITYHNGDQDVVI-----------TPHLIRQSTIV 261

Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
              ++DA  + +L  D+A      G L +L +  + ++     V+ L L       +   
Sbjct: 262 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 315

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           +T + N + ++GSRLGDS L++         L+S   E+   +E     T          
Sbjct: 316 MTYLDNGVVYVGSRLGDSQLIK---------LNSERNEQGSYVEVMEVFTN--------- 357

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L         +    F           G L+    G+ I+  AS     
Sbjct: 358 LGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS----- 403

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KGIW +        N DSSR           L++S   +T VL  + 
Sbjct: 404 -------IDLPGIKGIWPLR------VNTDSSR--------DNTLVLSFVGQTRVLMLSG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
              E TE   + +  +T   GN+    ++IQV     R++DG
Sbjct: 443 EEVEETELAGFDISQQTFFCGNV-RNNQLIQVTAAAVRLVDG 483


>gi|242060436|ref|XP_002451507.1| hypothetical protein SORBIDRAFT_04g003000 [Sorghum bicolor]
 gi|241931338|gb|EES04483.1| hypothetical protein SORBIDRAFT_04g003000 [Sorghum bicolor]
          Length = 1232

 Score = 60.8 bits (146), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 92/384 (23%), Positives = 162/384 (42%), Gaps = 61/384 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RIL+P        T   + +Q +E A+++  V   N   KE+ TLLA+GTA  +G    
Sbjct: 881  IRILDPRSR----DTTCLLELQENEAAVSICTV---NFHDKEHGTLLAVGTA--KGLQFW 931

Query: 1125 ARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
             R R L        +  D  ++L   ++  +++    AL   QG LL   G  + L+   
Sbjct: 932  PR-RTLAGGFIHIYKFVDEGRSLEL-LHKTQVEEVPLALCQFQGRLLAGVGSVLRLYDLG 989

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
              +L         P  +VS++  ++ I +GD+ +S ++  ++    QL + A D      
Sbjct: 990  KRKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWL 1049

Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR----- 1285
             A +  ID  T++   +D+  NI        +S+            W+  KL        
Sbjct: 1050 TAAQH-IDFDTMA--GADKFGNIYFARLPQDISDEIEEDPTGGKIKWEQGKLNGAPNKVE 1106

Query: 1286 --AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
               +FHVG  VT   +  ++          PG  +     L++GT+ GS+G +      +
Sbjct: 1107 EIVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGALLAFTSRE 1152

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            ++ F     L+  L    P + G +  ++R       A+ P  D ++D +L   Y  LP 
Sbjct: 1153 DVDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQYPSLPA 1203

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDL 1424
            + Q +IA +   T  +IL  L D+
Sbjct: 1204 DMQRKIADELDRTPGEILKKLEDI 1227


>gi|342320507|gb|EGU12447.1| Pre-mRNA-splicing factor RSE1 [Rhodotorula glutinis ATCC 204091]
          Length = 1212

 Score = 60.5 bits (145), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 114/543 (20%), Positives = 211/543 (38%), Gaps = 89/543 (16%)

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G        SRP     +R  L+  P + D    A++    + C  G I +     L+
Sbjct: 716  VQGSPAILALSSRPWLNYAYRGILQFTPLIFDALDYAWSFSAEL-CPEGLIGIVGNS-LR 773

Query: 980  ICQLPSGSTYDNYWPVQK--IPLKATPHQI-TYFAEKNLYPLIVSVPVLKP--LNQVLS- 1033
            I   P          VQ+  I L  TP Q+ T    + LY +        P  + + +S 
Sbjct: 774  IFTFPRLGQ-----KVQQTVIDLSYTPRQLLTSPHSRLLYTVEADHRTFSPSAIQKTISD 828

Query: 1034 -LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE-VRILEPDRAGGPWQTRATIPMQSSENA 1091
              + + EV  ++ N +     L R    +    VR+++P  A    +T   + ++ +E A
Sbjct: 829  MRMAEMEVDEEVLNLDPKEFGLPRGPAGQWASCVRVIDPVTA----ETVFKVDLEQNEAA 884

Query: 1092 LTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---------LFSTGRNADNPQ 1142
             +  +VT     +  NE  L +GT    G+D +   R           L   GR      
Sbjct: 885  FSAAIVTFH---SHPNEVFLVVGT----GQDTSLAPRACKQAYLHTYKLLEEGRQ----- 932

Query: 1143 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1202
              +  ++  E+     AL + QG L+   G  + L+     +L   A        +++LN
Sbjct: 933  --LELLHKTEVDDIPKALIAFQGRLVAGVGKALRLYDLGKKKLLRKAENKGFATMIMTLN 990

Query: 1203 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST---------- 1252
                 I++GD  +S+Y+  +K    +L + A D       A+  ++D  T          
Sbjct: 991  TQGTRIIVGDAQESVYYALYKAPENRLLIFADDISPRWTTAS-IMVDYETVAAGDKFGNF 1049

Query: 1253 --------LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1304
                    +S  V D+     I +  P +  +     LL  A +H+G  +T   ++ ++A
Sbjct: 1050 FVNRLPKGVSSDVDDDPTGAGIMHEKPYLMGAPHRTHLL--AHYHIGDIITSLHKVALVA 1107

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
               D               L++  L G++G + P    +++ F    +L+  L    P +
Sbjct: 1108 GGRD--------------LLVYTGLMGTVGVLVPFVSNEDVDF--FTTLEMHLRSEAPSL 1151

Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
             G    ++R  ++  KA        VD +L   Y  LP+ +Q +IA +   T S+++  L
Sbjct: 1152 CGREHLAYRSAYTPVKA-------TVDGDLCEVYRSLPMAKQGQIAGELERTVSEVIKKL 1204

Query: 1422 NDL 1424
            +++
Sbjct: 1205 DNV 1207


>gi|403411971|emb|CCL98671.1| predicted protein [Fibroporia radiculosa]
          Length = 1212

 Score = 60.5 bits (145), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 121/494 (24%), Positives = 199/494 (40%), Gaps = 70/494 (14%)

Query: 965  CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLI----- 1019
            C  G I + S  +L+I Q+P   T         IPL  TP +        L+ LI     
Sbjct: 758  CPEGLIGI-SGSVLRIFQIPKLGTK---LKQDAIPLSYTPRKFISHPTNGLFYLIEGDHR 813

Query: 1020 -----VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE---EYEVRILEPD 1071
                  S   L  L Q   ++ D+ V       NL      R           +RI+ P 
Sbjct: 814  VRSDEASAKALGELRQQGKMVDDELV-------NLPPETFGRQKAPAGTWASCIRIINPV 866

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1131
             A    +T   IP+ ++E A ++ VV+    + +  E  L +GTA  Q   +A R     
Sbjct: 867  DA----KTVNIIPLDNNEAAFSLAVVSF---SARSGELHLVVGTA--QDTFLAPRSCTSG 917

Query: 1132 F-STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL----N 1186
            F  T R  D+  NL   ++  E      A+   QG L+   G  + L+     +L     
Sbjct: 918  FLRTYRFTDDGTNLEL-LHKTETNDVPLAVLGFQGRLVAGVGKALRLYDMGKKKLLRKVE 976

Query: 1187 GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1246
               F  A    +VSL    + IL+GD+ +S+ F  +K    +L + A D       A   
Sbjct: 977  NKTFASA----IVSLATQGSRILVGDMQESVSFAVYKPPENKLLVFADDTQPRWTSAMT- 1031

Query: 1247 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTKFL--- 1298
            ++D +T++   +D   NI +    PK+SE       G  +L       GA H T+ L   
Sbjct: 1032 MVDYNTVA--SADRFGNIYVNRLDPKVSEQVDDDPTGAGILHEKGLLAGAPHKTELLSHF 1089

Query: 1299 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLV 1355
             +  + TS ++     G     R  LL+  L G+IG + P    +++ F  + +L++ + 
Sbjct: 1090 HVGDIVTSINKVSLVAG----GREVLLYTGLHGTIGILVPFVSKEDVDF--ISTLEQHMR 1143

Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
                 + G +  ++R ++   KA       +VD +L   +  LP  +Q  IA +   T  
Sbjct: 1144 TEQLSLVGRDHLTWRGYYVPVKA-------VVDGDLCETFARLPASKQSAIAGELDRTVG 1196

Query: 1416 QILSNLNDLALGTS 1429
            ++L  L  L +  S
Sbjct: 1197 EVLKKLEQLRVTAS 1210


>gi|380490733|emb|CCF35810.1| pre-mRNA-splicing factor rse-1 [Colletotrichum higginsianum]
          Length = 1212

 Score = 60.1 bits (144), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 94/399 (23%), Positives = 168/399 (42%), Gaps = 55/399 (13%)

Query: 1064 EVRILEPDRAGGP-----WQTRATIPMQSSENALTVRVVTLFNT-----------TTKEN 1107
            + RIL PD  G P     W +  ++    SE    ++ V L N             +++N
Sbjct: 837  DARILPPDEFGYPKGKGRWASCISVIDPLSEEQRVLQTVDLDNNEAAVSAAIVSFASQDN 896

Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALAS 1162
            E+ L +GT    G+D+    R   FS G     R +++   L   ++  +++   SAL  
Sbjct: 897  ESFLIVGT----GKDMIVNPR--QFSEGYIHVYRFSEDGHEL-EFIHKTKVEEPPSALLG 949

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
             QG LL   G  + ++     ++   A  D  P  +VSL+   + I++GD+   I ++ +
Sbjct: 950  FQGRLLAGIGQTLRIYDLGLRQMLRKAQADVAPQLIVSLSTQGSRIIVGDVQHGITYVVY 1009

Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS----ESWK 1278
            K    +L     D  S     T  ++D    S+V  D+  NI +     K S    E   
Sbjct: 1010 KPTTNKLIPFVDDTISRWVTCTT-MVDYE--SVVGGDKFGNIFLVRCPEKASQEADEESG 1066

Query: 1279 GQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            G  LL +R   H   H    L     Q + TS  +T    G        LL+  ++G+IG
Sbjct: 1067 GLHLLNTRDYLHGTPHRLSLLGHSYTQDVPTSITKTSLVVGGQD----VLLWSGINGTIG 1122

Query: 1335 CIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
               P    +++ F   Q+L++ +      +AG +   +R ++   K        ++D +L
Sbjct: 1123 VFIPFVTREDVDF--FQNLEQHMRTEDAPLAGRDHLMYRSYYVPVKG-------VIDGDL 1173

Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
               Y +LP E++  IA +   +  +I   ++D+   ++F
Sbjct: 1174 CERYTLLPSEKKQMIAGELDRSVREIERKISDIRTRSAF 1212



 Score = 45.8 bits (107), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 137/603 (22%), Positives = 226/603 (37%), Gaps = 129/603 (21%)

Query: 74  QEEGSKESK--NSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
           Q  G+KE     +  ++  +L    S   +  V  + + G + S+A     G++    +D
Sbjct: 27  QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITVLSHDIFGIIRSMAAFRLAGSN----KD 82

Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVDP 187
            +ILA +  +I+++E+        I + + F+    LHL    K G      G  +  DP
Sbjct: 83  YLILATDSGRITIIEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACDP 131

Query: 188 QGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +GR      V    L  ++ + SQ        E T  S     A      V+++  LD+ 
Sbjct: 132 KGRACLIASVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVLSMVALDV- 181

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLIW 299
                    GY  PV   L E E T A     G  + +  T ++    +   L      W
Sbjct: 182 ---------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRKW 230

Query: 300 SAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHY-HSQSASCALAL--NNYAVSLDSSQ 353
           S    P  A  L  VP       GVLV G   I Y HS   +  + +     A    S +
Sbjct: 231 SESVDP-TASMLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRK 289

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDLSKT 406
               S    +L  +   +      LL T+ GDL   T+      DG     V+RL +   
Sbjct: 290 RHAVSGVMHKLKGSAGAFF----FLLQTEDGDLFKATLDMVEDTDGNPTGEVKRLKIKYF 345

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
           +   ++S +  + +   +  S+ G+    QF              E+ GD + +      
Sbjct: 346 DTIPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDDE------ 385

Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
           L  SS D   D   G +   +          +   + A+ +S+ ++ PL D         
Sbjct: 386 LEFSSDDFPTDPKAGYDAVYF--------HPRPLENLALVESIDSMNPLLDCKVANLTGE 437

Query: 527 DA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSSRMA 569
           DA    +A G   +S + ++            ELPG    +WT+  K +RG         
Sbjct: 438 DAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLNRG--------- 486

Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
              D+Y AY+++S    T+VL   + + EV++S   F+      A  L G   +IQV  +
Sbjct: 487 ---DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQVHPK 541

Query: 630 GAR 632
           G R
Sbjct: 542 GIR 544


>gi|358391805|gb|EHK41209.1| hypothetical protein TRIATDRAFT_135379 [Trichoderma atroviride IMI
            206040]
          Length = 1212

 Score = 60.1 bits (144), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 103/481 (21%), Positives = 189/481 (39%), Gaps = 107/481 (22%)

Query: 998  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1057
            IPL  TP ++    E  L+ +I                       + DNH LS  DL   
Sbjct: 791  IPLTYTPKKMVKHPEHPLFYVI-----------------------EADNHTLSP-DLCAK 826

Query: 1058 YTVEEYEV----RILEPDRAGGP-----W--------------QTRATIPMQSSENALTV 1094
               +   V    ++L P+  G P     W              Q    I +  +E A+++
Sbjct: 827  LLADPARVNGDAKVLSPEEFGHPRGNRRWASCISVVDPLAEDGQALQKIDLDENEAAVSL 886

Query: 1095 RVVTLFNTTTKENETLLAIGT-------------AYVQGEDVAARGRVLLFSTGRNADNP 1141
             +VT     +++NET L +GT             AYV        GR L+F         
Sbjct: 887  AIVTF---ASQDNETFLVVGTGKDMVVNPRSFSDAYVHIYRFEQEGRGLVF--------- 934

Query: 1142 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1201
                  ++  +++    A+   QG +L+  G  + ++     +L      +  P  + SL
Sbjct: 935  ------IHKTKVEEPPMAIIPFQGRVLVGIGKILRIYDLGMRQLLRKTQAEVAPQLINSL 988

Query: 1202 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDE 1260
            +   N I++GD+ + I ++ +K+   +L     D  ++  + T   ++D  T++    D+
Sbjct: 989  STQGNRIIVGDVQQGITYVVYKQTTNKLIPFVDD--TVARWTTCSTMVDYETVA--GGDK 1044

Query: 1261 QKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKF---LRLQMLATSSDRTGA 1312
              NI +     K SE    ++     L +R   H  +H       L  Q + TS  +T  
Sbjct: 1045 FGNIFVVRSPQKASEEADEEQAGLHLLNARDYLHGRSHRLDLMCHLYTQDIPTSITKTSL 1104

Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
              G        LL+  L G+IG + P    ++  F   QSL+  L    P +AG +   +
Sbjct: 1105 VVGGQD----VLLWSGLMGTIGVLIPFVTREDTDF--FQSLELHLRAEDPPLAGRDHLMY 1158

Query: 1370 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            R +++  K        ++D +L   Y +LP +++  IA +   +  +I   ++D+   ++
Sbjct: 1159 RSYYAPVKG-------VIDGDLCERYTLLPNDKKQMIAAELDRSVREIERKISDIRTRSA 1211

Query: 1430 F 1430
            F
Sbjct: 1212 F 1212


>gi|401407861|ref|XP_003883379.1| putative Splicing factor 3B subunit 3 [Neospora caninum Liverpool]
 gi|325117796|emb|CBZ53347.1| putative Splicing factor 3B subunit 3 [Neospora caninum Liverpool]
          Length = 1233

 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 68/299 (22%), Positives = 117/299 (39%), Gaps = 49/299 (16%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            V+S  ++    ALA  +G LL   G K+ L+      L     Y   P  V  + +  + 
Sbjct: 957  VHSTPVEDYPMALAPFRGMLLAGVGHKLRLYALGKKRLLKKCEYKNLPCGVAFIRVAGDR 1016

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL---SLVVSDEQKNI 1264
            + +GD+ +S++ + ++       +LA D          +L  G  L   + V +D+  ++
Sbjct: 1017 LFVGDLRESVHVMRYRLSENLFYVLADDV------VPRWLTKGEVLDYHTFVAADKFDSV 1070

Query: 1265 QIFYYAPKMSESWKGQ------------------KLLSRAEFHVGAHVTKFLRLQMLATS 1306
             I     +  +   G                   KL S   FH+G  VT   R  + A +
Sbjct: 1071 FICRVPSEAKQDELGDTTGLRLRGDTTYLTDKCFKLQSLLHFHIGEVVTALERATLTAGA 1130

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLN 1365
            S+              ++++GT+ GSIG  +P L +        L+  L    P + G  
Sbjct: 1131 SE--------------SIIYGTIMGSIGAFSPFLTKHELDLFTHLEMVLRSEKPPLGGRE 1176

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
               FR ++   K       + VD +L   Y +LP + Q  IA     T + IL +L D+
Sbjct: 1177 HIMFRSYYHPAK-------NTVDGDLCESYALLPYDVQKRIAQDFEKTPADILKHLEDI 1228


>gi|440792421|gb|ELR13643.1| splicing factor 3b subunit 3, putative [Acanthamoeba castellanii str.
            Neff]
          Length = 1227

 Score = 60.1 bits (144), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 59/293 (20%), Positives = 118/293 (40%), Gaps = 44/293 (15%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            V+  +++G  +A+   QG LL+  G  + ++     +L         P  + S+      
Sbjct: 952  VHKTQVEGVPTAVCGFQGRLLVGIGKMLRIYDLGKRKLLRKCENKGFPHCIQSITTQGER 1011

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKD----------------FGSLDCFATEFLIDGS 1251
            I++GD+ +S +F+ +++   QLN+ A D                    D F   F++   
Sbjct: 1012 IIVGDLAESFHFVKYRKAENQLNVYADDSNPRWLTASQMLDYDTMAGADKFGNVFIV--R 1069

Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1311
              S V  + + N    +   K S +    KL +   FHVG  +    +  +    +D   
Sbjct: 1070 LPSEVNEELEDNPMGNFLMSKQSLNGAAFKLQTLINFHVGDTINSMTKASLFTGGAD--- 1126

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                        L++ TL G +G + P    +++ F     L+  +   +P + G +  +
Sbjct: 1127 -----------VLVYTTLMGGMGALLPFVSREDVDF--FSHLEMHMRSELPPLCGRDHLA 1173

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +R ++   K        ++D +L   + +LP E+Q  IA +   T  ++L  L
Sbjct: 1174 YRSYYFPVK-------DVIDGDLCEQFSLLPPEKQRTIAEELDRTPGEVLKKL 1219


>gi|294875343|ref|XP_002767276.1| spliceosome factor, putative [Perkinsus marinus ATCC 50983]
 gi|239868839|gb|EEQ99993.1| spliceosome factor, putative [Perkinsus marinus ATCC 50983]
          Length = 1258

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 82/383 (21%), Positives = 152/383 (39%), Gaps = 54/383 (14%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDV 1123
            +R+++P  A   ++    + +   E A  + V   +    K+N   L +GTA  V   + 
Sbjct: 902  IRVVDPLTASTSFK----LDLDVDEAATAMTVCYFYQL--KDNRPCLVVGTATGVDPHNP 955

Query: 1124 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA------SGPKIIL 1177
            +       +      D   NL   ++   L+G  SA+   +G LL+A        P + +
Sbjct: 956  SRSAHGKCYIKTYLYDESYNLQL-IHVTPLEGVPSAMYPFEGRLLVALRGSPTVAPVLRI 1014

Query: 1178 HKWTGTELNGIAFYDAPPLY--VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
            ++     L     Y   P    ++ L++ K+ I   D   SI  L W+    Q+ +++ D
Sbjct: 1015 YELGKKRLLKKCEYKFLPESGGIMWLDVNKDRIFAADSRDSILVLRWRYSDNQMQVISDD 1074

Query: 1236 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA--PKMSESWKGQ---------KLLS 1284
                 C     ++D +T+  VV D+  NI +       K + +W            K+  
Sbjct: 1075 TYP-RCITAAAVLDYNTI--VVGDKFDNIAVLRVPGDAKDAGAWGRDNDYASGNTFKMDL 1131

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD---E 1341
               FHVG  +T   R+ M+A  ++               +++ T+ G+IG + P     E
Sbjct: 1132 IGHFHVGETITSLQRVTMVAGGAE--------------IVIYSTVLGTIGALYPFSSKRE 1177

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
              F +   +  +   + P + G     +R F+   K       + VD +L   Y  LP E
Sbjct: 1178 HGFLQALEMHMRNTAASPSLTGREHVMYRSFYHPIK-------NFVDADLCEVYYQLPAE 1230

Query: 1402 EQLEIAHQTGTTRSQILSNLNDL 1424
            +Q +IA     T  +++  L D+
Sbjct: 1231 KQRQIAVDMDKTPQEVMKKLEDI 1253


>gi|308504990|ref|XP_003114678.1| hypothetical protein CRE_28194 [Caenorhabditis remanei]
 gi|308258860|gb|EFP02813.1| hypothetical protein CRE_28194 [Caenorhabditis remanei]
          Length = 270

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/260 (21%), Positives = 123/260 (47%), Gaps = 37/260 (14%)

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----FGSLDCFATEFLIDG 1250
            P+ +V++      I++ D  +S++FL +++   QL + A D    + S  C     ++D 
Sbjct: 24   PVSIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVSCVC-----VLDY 78

Query: 1251 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA------ 1304
             T++  V+D+  N+ +     +++E  +    +S++ +  G       +++++A      
Sbjct: 79   HTVA--VADKFGNLAVVRLPERVNEDVQDDPTVSKSVWDRGWLNGASQKVELVANFFIGD 136

Query: 1305 --TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVP 1359
              TS  +T   PG+++    AL++ T+ G+IGC+      DE+ F    +L+  +    P
Sbjct: 137  TITSLQKTSLMPGANE----ALVYTTIGGAIGCLVSFMSKDEVDF--FTNLEMHVRSEYP 190

Query: 1360 HVAGLNPRSFRQFHSNGKAHRPGP---------DSIVDCELLSHYEMLPLEEQLEIAHQT 1410
             + G +  ++R +++  K                S++D ++   + ++ L +Q E+A + 
Sbjct: 191  PLCGRDHLAYRSYYAPCKVCFNFLLFRSIVSLFQSVIDGDICEQFSLMDLSKQKEVAEEL 250

Query: 1411 GTTRSQILSNLNDLALGTSF 1430
            G T S+I   L D+    +F
Sbjct: 251  GKTVSEISKKLEDIRTRYAF 270


>gi|301630307|ref|XP_002944263.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Xenopus (Silurana) tropicalis]
          Length = 92

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 36/93 (38%), Positives = 54/93 (58%), Gaps = 1/93 (1%)

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  +S+ +  +    +++D ELL+ Y  L
Sbjct: 1    MQEKTYRRLLMLQNALT-VLPHHAGLNPRAFRMLNSSRRMLQNPVRNVLDGELLNRYLYL 59

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
               E+ E+A + GTT   IL +L ++   TS  
Sbjct: 60   SNMERSELARKIGTTTDIILDDLLEIDRVTSLF 92


>gi|401883281|gb|EJT47496.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 2479]
          Length = 1216

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 68/285 (23%), Positives = 128/285 (44%), Gaps = 30/285 (10%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            A+A+ QG+LL   G  + L++     L      +  P  V ++N+V   I++GD+ +S +
Sbjct: 949  AVAAFQGYLLAGVGKSLRLYEMGKKALLRKCENNGFPTGVATINVVGARIIVGDLQESTF 1008

Query: 1219 FLSWKE-QGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  ++     QL + A D  S   F T    +D  T+    +D+  NI +     ++SE 
Sbjct: 1009 YCVYRSIPSRQLLIFADD--SQPRFLTAVCNVDYDTV--CCADKFGNIFVNRLEERVSEK 1064

Query: 1277 WK----GQKLLSRAEFHVG-AHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                  G  +L    F +G A+ T  +    +  + TS  +   APG     R  +++ T
Sbjct: 1065 VDDDPTGAVILHEKGFLMGSANKTDLIAHYNVGSVVTSLTKVSVAPG----GRDVVVYTT 1120

Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
            + G++G + P    D++ F  + +L+  +      + G +  ++R +++  KA       
Sbjct: 1121 ISGAVGALVPFISNDDVEF--MTTLEMHIRSLNTSLVGRDHLAYRGYYAPVKA------- 1171

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            +VD +L   + MLP  +Q  IA        ++L  L  L  G+ F
Sbjct: 1172 VVDGDLCESFNMLPYPQQQAIAADLDRNVGEVLKKLEQLRTGSVF 1216



 Score = 51.2 bits (121), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 130/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)

Query: 85  GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
           G T+  +L    S   L+ +C     G V ++A     G      +D I+L+ +  ++S+
Sbjct: 34  GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89

Query: 145 LEFDDSIHGLRITSMHCFES-PEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
           +E       L I+    FES  + ++ K G      G  + VDP+GR    G V    L 
Sbjct: 90  IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
            I+ + ++G                  A    + V+N+   D           GY  P+ 
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183

Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
             L   EL +          A R + KH T     L ++  +++    WS    P D   
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233

Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
            L V  P             GGVLV   + +   H  +++    +      ++       
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289

Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
            +SS  + + AA    ++     LL ++ GDL   T+ ++G  V+ L +   +   + + 
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
           +  + +   F+ S  GD  L QF            S T     GL EE          P 
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
              L   +  +L  +++ + ++L G+AS+  +     ++   R      GP    +   +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458

Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
           GL IN               LVE  LPG    +WT+                + DDEY +
Sbjct: 459 GLDINV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           Y+++S    T+VL   + + EV ++  +   G T+A   L G   ++QV   G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543


>gi|406698009|gb|EKD01256.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 8904]
          Length = 1216

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 68/285 (23%), Positives = 128/285 (44%), Gaps = 30/285 (10%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            A+A+ QG+LL   G  + L++     L      +  P  V ++N+V   I++GD+ +S +
Sbjct: 949  AVAAFQGYLLAGVGKSLRLYEMGKKALLRKCENNGFPTGVATINVVGARIIVGDLQESTF 1008

Query: 1219 FLSWKE-QGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  ++     QL + A D  S   F T    +D  T+    +D+  NI +     ++SE 
Sbjct: 1009 YCVYRSIPSRQLLIFADD--SQPRFLTAVCNVDYDTV--CCADKFGNIFVNRLEERVSEK 1064

Query: 1277 WK----GQKLLSRAEFHVG-AHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                  G  +L    F +G A+ T  +    +  + TS  +   APG     R  +++ T
Sbjct: 1065 VDDDPTGAVILHEKGFLMGSANKTDLIAHYNVGSVVTSLTKVSVAPG----GRDVVVYTT 1120

Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
            + G++G + P    D++ F  + +L+  +      + G +  ++R +++  KA       
Sbjct: 1121 ISGAVGALVPFISNDDVEF--MTTLEMHIRSLNTSLVGRDHLAYRGYYAPVKA------- 1171

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            +VD +L   + MLP  +Q  IA        ++L  L  L  G+ F
Sbjct: 1172 VVDGDLCESFNMLPYPQQQAIAADLDRNVGEVLKKLEQLRTGSVF 1216



 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 129/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)

Query: 85  GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
           G T+  +L    S   L+ +C     G V ++A     G      +D I+L+ +  ++S+
Sbjct: 34  GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89

Query: 145 LEFDDSIHGLRITSMHCFES-PEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
           +E       L I+    FES  + ++ K G      G  + VDP+GR    G V    L 
Sbjct: 90  IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
            I+ + ++G                  A    + V+N+   D           GY  P+ 
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183

Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
             L   EL +          A R + KH T     L ++  +++    WS    P D   
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233

Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
            L V  P             GGVLV   + +   H  +++    +      ++       
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289

Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
            +SS  + + AA    ++     LL ++ GDL   T+ ++G  V+ L +   +   + + 
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
           +  + +   F+ S  GD  L QF            S T     GL EE          P 
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
              L   +  +L  +++ + ++L G+AS+  +     ++   R      GP    +   +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458

Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
           GL +N               LVE  LPG    +WT+                + DDEY +
Sbjct: 459 GLDVNV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           Y+++S    T+VL   + + EV ++  +   G T+A   L G   ++QV   G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543


>gi|42409127|dbj|BAD10377.1| putative splicing factor 3b, subunit 3, 130kDa [Oryza sativa Japonica
            Group]
 gi|42409258|dbj|BAD10521.1| putative splicing factor 3b, subunit 3, 130kDa [Oryza sativa Japonica
            Group]
 gi|125538000|gb|EAY84395.1| hypothetical protein OsI_05771 [Oryza sativa Indica Group]
          Length = 1234

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 159/391 (40%), Gaps = 75/391 (19%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RIL+P        T   + +Q +E A+++  V   N   KE+ TLLA+GTA        
Sbjct: 883  IRILDPKSR----DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAVGTA-------- 927

Query: 1125 ARGRVLLFSTGRNAD----NPQNLVTEVYSKEL--KGAIS----ALASLQGHLLIASGPK 1174
               + L F   RN      +    V E  S EL  K  +     AL   QG LL   G  
Sbjct: 928  ---KGLQFWPKRNLSAGFIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSV 984

Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
            + L+     +L         P  +VS++  ++ I +GD+ +S ++  ++    QL + A 
Sbjct: 985  LRLYDLGKRKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFAD 1044

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLL 1283
            D       A    ID  T++   +D+  NI        +S+            W+  KL 
Sbjct: 1045 DSVPRWLTAANH-IDFDTMA--GADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLN 1101

Query: 1284 SR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
                      +FHVG  VT   +  ++          PG  +     L++GT+ GS+G +
Sbjct: 1102 GAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGAL 1147

Query: 1337 APL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
                  +++ F     L+  L    P + G +  ++R       A+ P  D ++D +L  
Sbjct: 1148 LAFTSREDVDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCE 1198

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +  LP + Q +IA +   T  +IL  L D+
Sbjct: 1199 QFPSLPADMQRKIADELDRTPGEILKKLEDI 1229


>gi|347829304|emb|CCD45001.1| similar to pre-mRNA-splicing factor rse1 [Botryotinia fuckeliana]
          Length = 1212

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 87/369 (23%), Positives = 168/369 (45%), Gaps = 48/369 (13%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG----- 1135
            +TI ++ +E A+++ VV      ++E+ET L +GT    G+D+    R   FS G     
Sbjct: 873  STIHLEDNECAVSIAVVAF---ASQEDETFLCVGT----GKDMVVSPRS--FSAGFIHVY 923

Query: 1136 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
            R  ++ + L   ++  +++    AL + QG LL   G  + ++     +L   A  +  P
Sbjct: 924  RFHEDGKEL-EFIHKTKVEEPPMALLAFQGRLLAGVGKDLRIYDLGMRQLLRKAQSEVAP 982

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
              +V L    + I++ D+ +SI  + +K Q  +L     D  +     T  ++D  T++ 
Sbjct: 983  NMIVGLQTQGSRIIVSDVQESITMVVYKFQENRLIPFVDDTIARWTSCTT-MVDYETVA- 1040

Query: 1256 VVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA-HVTKFLRLQMLATSSDRT 1310
               D+  N+ +     K SE       G  LL   ++  GA H     RL ++A +  + 
Sbjct: 1041 -GGDKFGNLWLLRCPAKASEEADEEGSGAHLLHERQYLAGAPH-----RLTLMAHNFSQD 1094

Query: 1311 GAAPGS-DKTN-----RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
               P S  KTN     R  LL+  L G++G + P    +++ F   Q+L++ L    P +
Sbjct: 1095 --IPMSIQKTNLVAGGRDCLLWSGLQGTLGILIPFVSREDVDF--FQTLEQHLRSEDPPL 1150

Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            AG +   +R ++   K        ++D +L   Y +LP +++  IA +   +  +I   +
Sbjct: 1151 AGRDHLIYRSYYVPVKG-------VIDGDLCERYTLLPTDKKQMIAGELDRSVREIERKI 1203

Query: 1422 NDLALGTSF 1430
            +D+   +++
Sbjct: 1204 SDIRTRSAY 1212



 Score = 41.2 bits (95), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 85/377 (22%), Positives = 151/377 (40%), Gaps = 52/377 (13%)

Query: 320 GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDV 376
           GVLV G + I Y HS   +  +A+     + +  Q        V  +L  A   +     
Sbjct: 253 GVLVCGEDNITYRHSNQEAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF---- 308

Query: 377 ALLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
            LL T  GDL  +T+  V D        V+RL +   +   + + +  + +   F+ S  
Sbjct: 309 FLLQTDDGDLFKITIEMVEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEF 368

Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLY 487
           G+    QF         +  + ++F  G  E+  P     R + + +L + ++     + 
Sbjct: 369 GNHQFYQFEKLGDDDEETEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMD 428

Query: 488 GSASNNT-ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
              +N T E A + +S     +      LK   +GL ++    +            ELPG
Sbjct: 429 CKVANLTDEDAPQIYSICGTGARSTFRTLK---HGLEVSEIVES------------ELPG 473

Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
               +WT   K +RG            D Y AY+I+S    T+VL   + + EVT++  +
Sbjct: 474 VPSAVWTT--KLTRG------------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GF 518

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
                T+A   L G   +IQV  +G R +   +   + +  P +    + + N   ++V+
Sbjct: 519 LSSAPTLAVQQL-GEDSLIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVA 576

Query: 666 IAD-PYVLLGM-SDGSI 680
           ++    V   M SDGS+
Sbjct: 577 LSSGEIVYFEMDSDGSL 593


>gi|327309050|ref|XP_003239216.1| pre-mRNA splicing factor rse1 [Trichophyton rubrum CBS 118892]
 gi|326459472|gb|EGD84925.1| pre-mRNA splicing factor rse1 [Trichophyton rubrum CBS 118892]
          Length = 1209

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 105/491 (21%), Positives = 202/491 (41%), Gaps = 64/491 (13%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
             C  G + +  Q +    ++ S    DN    + IPL  TP       E   YPL     
Sbjct: 759  QCVEGMVGIQGQNL----RIFSIEKLDNNLLQEPIPLAYTPRNFVRHPE---YPLFY--- 808

Query: 1024 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLH-------RTYTVEEYEVRILEPDRAGGP 1076
            V+   N +LS     ++  +    N  S +L        R        +++++P      
Sbjct: 809  VIGSDNNILSPATKAKLLSESTTVNGDSAELPPEGFGYPRGTNHWASSIQVVDPIHTKS- 867

Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG- 1135
                +++ ++ +E A+++  V+    T++E+ET L +GT    G+D+    R   F+ G 
Sbjct: 868  --VLSSLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGF 916

Query: 1136 ----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1191
                R  +  + L   ++  +++    AL   QG LL   GP + ++     +L      
Sbjct: 917  IHIYRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQA 975

Query: 1192 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
               P  +V L    + I++ D+ +S+ ++ +K Q   L   A D        T  ++D  
Sbjct: 976  QITPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALISFADDIIPRWTTCTT-MVDYE 1034

Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQM 1302
            T++    D+  NI +     K SE       G  L+   ++  GA      V  F   Q 
Sbjct: 1035 TVA--GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QD 1091

Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVP 1359
            + TS  +T    G     R  L++  L G++G   P    D++ F   Q+L+ +L    P
Sbjct: 1092 IPTSIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNP 1145

Query: 1360 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
             +AG +   +R +++  K        ++D +L   + +LP +++  IA +   +  +I  
Sbjct: 1146 PLAGRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIAGELDRSVREIER 1198

Query: 1420 NLNDLALGTSF 1430
             ++D+    ++
Sbjct: 1199 KISDMRTKVAY 1209


>gi|297598550|ref|NP_001045829.2| Os02g0137400 [Oryza sativa Japonica Group]
 gi|255670583|dbj|BAF07743.2| Os02g0137400 [Oryza sativa Japonica Group]
          Length = 845

 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 159/391 (40%), Gaps = 75/391 (19%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RIL+P        T   + +Q +E A+++  V   N   KE+ TLLA+GTA        
Sbjct: 494  IRILDPKSR----DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAVGTA-------- 538

Query: 1125 ARGRVLLFSTGRNAD----NPQNLVTEVYSKEL--KGAIS----ALASLQGHLLIASGPK 1174
               + L F   RN      +    V E  S EL  K  +     AL   QG LL   G  
Sbjct: 539  ---KGLQFWPKRNLSAGFIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSV 595

Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
            + L+     +L         P  +VS++  ++ I +GD+ +S ++  ++    QL + A 
Sbjct: 596  LRLYDLGKRKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFAD 655

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLL 1283
            D       A    ID  T++   +D+  NI        +S+            W+  KL 
Sbjct: 656  DSVPRWLTAANH-IDFDTMA--GADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLN 712

Query: 1284 SR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
                      +FHVG  VT   +  ++          PG  +     L++GT+ GS+G +
Sbjct: 713  GAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGAL 758

Query: 1337 APL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
                  +++ F     L+  L    P + G +  ++R       A+ P  D ++D +L  
Sbjct: 759  LAFTSREDVDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCE 809

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +  LP + Q +IA +   T  +IL  L D+
Sbjct: 810  QFPSLPADMQRKIADELDRTPGEILKKLEDI 840


>gi|342885857|gb|EGU85809.1| hypothetical protein FOXB_03657 [Fusarium oxysporum Fo5176]
          Length = 1189

 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 103/480 (21%), Positives = 205/480 (42%), Gaps = 64/480 (13%)

Query: 964  NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQK-IPLKATPHQITYFAEKNLYPLIV 1020
             C  G + +  Q   I  I +L  G T      +QK IPL  TP ++    ++ L+  I 
Sbjct: 761  QCEEGIVGIQGQSLRIFNIDRL--GDTL-----IQKSIPLTYTPKKLVKHPDQPLFYTIE 813

Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1078
            +     P      LL D ++ +  D+  L   D         +   + +++P    G  Q
Sbjct: 814  ADNNTLPPELRAQLLADPKIVNG-DSRVLPPEDFGYPKGTRRWASCINVIDPLSEEG--Q 870

Query: 1079 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNA 1138
               TI ++++E A++  +V+    ++++NE+ L +GT    G+D+    R   +S G   
Sbjct: 871  VVQTIDLENNEAAVSAAIVSF---SSQDNESFLVVGT----GKDMVVNPRS--YSEG--- 918

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
                     +Y  +  G    L + QG + +A G ++ ++     ++   +  +     +
Sbjct: 919  ------YLHIYRFQDGGENLTLLAFQGRVAVAVGTQLRIYDLGMRQMLRKSQAEVAAQQI 972

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            VSLN   + I++GD+ + + ++ +K    +L     D  +     T  ++D    S+   
Sbjct: 973  VSLNTQGSRIIVGDVQQGVTYVVYKPASNKLIPFVDDTIARWTTCTT-MVDYE--SVAGG 1029

Query: 1259 DEQKNIQIFYYAPKMSES----WKGQKLLSRAEF-HVGAHVTKFLRLQMLATSSDRTGAA 1313
            D+  N+ I     K SE       G  L++  E+ H   H       + + TS  +T   
Sbjct: 1030 DKFGNMFIVRCPEKASEEADEEQTGLHLINAREYLHGTPH-------RDIPTSITKTSLV 1082

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
             G  +     LL+  + G+IG   P    ++  F   Q+L++ L    P +AG +   +R
Sbjct: 1083 VGGQEI----LLWSGIMGTIGVFIPFISREDADF--FQNLEQHLRTEDPPLAGRDHLMYR 1136

Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
             +++  K        ++D +L   Y +LP +++L IA +   +  +I   ++D+   ++F
Sbjct: 1137 GYYAPVKG-------VIDGDLCERYNLLPNDKKLMIAGELDRSVREIERKISDIRTRSAF 1189


>gi|388582014|gb|EIM22320.1| hypothetical protein WALSEDRAFT_60013 [Wallemia sebi CBS 633.66]
          Length = 1208

 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 84/357 (23%), Positives = 144/357 (40%), Gaps = 72/357 (20%)

Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE------VYSKELKGAISA 1159
            E+ET L +GTA         + R+++  + + A      VT+      ++  ++     A
Sbjct: 892  EDETHLVVGTA---------KDRMMMPQSHKEAYLRVYKVTQDSQLELLHKTDIDDVPYA 942

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            + + +G LL   G  + L+      L       +    +V+LN+V + I +GD+ +S+ F
Sbjct: 943  IHAFKGRLLAGVGKALRLYDLGKKRLLRKCENKSFAAGIVNLNVVGSRIYVGDMQESVSF 1002

Query: 1220 LSWKEQGAQLNLLAKDFGSL----------------DCFATEFL--IDGSTLSLVVSDEQ 1261
              +K    +L + A D  S                 D F   F+  +D ST   V  DE 
Sbjct: 1003 AVYKAPENRLLVFADDIMSRWTTTATPVDYDTVAGGDKFGNIFITRVDKSTSEWVDEDES 1062

Query: 1262 KN-----IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
                     +++ AP  S      KLL  A F+VG  VT   + Q+ A   D        
Sbjct: 1063 GGGLLHARGLYHGAPNRS------KLL--AHFYVGDIVTSITKSQLSAGGRD-------- 1106

Query: 1317 DKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
                   L++  L G++G I P    D++ F  + +L+  +    P + G +   FR ++
Sbjct: 1107 ------VLVYTCLHGTVGMIIPFASKDDIEF--MSTLELHMRQESPSLVGRDHLGFRSYY 1158

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
               KA        VD +L   Y  LP+ +Q  IA++   T  ++L  +  L     F
Sbjct: 1159 IPCKA-------FVDGDLCELYASLPVTKQQAIANELDRTSGEVLKKIESLRSAAGF 1208


>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
            occidentalis]
          Length = 1154

 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 66/304 (21%), Positives = 134/304 (44%), Gaps = 30/304 (9%)

Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTA++  E+   + GR+ +    R  D     + E   KE  GA  ++      L IA 
Sbjct: 846  VGTAFINQEESEPKVGRIFVL---RWHDGKLETIAE---KEAAGAPYSIREFHQKLAIAI 899

Query: 1172 GPKIILHKWTGT---ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
               + L+ W      +     F++   + ++ L  + ++IL+GD+ +S+  L++      
Sbjct: 900  NSTVRLYSWNAEKDLQSECTPFFN---IVILHLKCLGDYILVGDLMRSMTLLNYNADITS 956

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            L  + +D+ +    A E L + + L+   ++   N+ +    P  ++  + Q +   A +
Sbjct: 957  LEEIGRDYQTNWTTAVEILDEDTFLA---AESNLNLYVCKRDPSAADDTR-QHMHEVALY 1012

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT--NRFALLFGTLDGSIGCIAPLDELTFRR 1346
            H+G  V   ++  ++        A PG      N+ + L+G+L G++G I P+ +  +  
Sbjct: 1013 HLGEMVNVIVKGSLVM-------AQPGDMPLPLNK-SFLYGSLHGAVGVIVPIKQELYAI 1064

Query: 1347 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1406
            L  +Q  L  ++  V  +    +R F +  K   P     +D +L+     LP +E LE 
Sbjct: 1065 LNQIQTNLAKTIKSVGKIEHGFWRTFLAERKIE-PAT-GFIDGDLIEQLLDLP-KEALES 1121

Query: 1407 AHQT 1410
              Q+
Sbjct: 1122 VSQS 1125



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 99/428 (23%), Positives = 165/428 (38%), Gaps = 82/428 (19%)

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           +PV+ I++E + T       +H    + AL     L + P  W   NL  +A  L+ V  
Sbjct: 178 DPVLAIVYEEQQT-------RHMKTHVIALR-DKELMKGP--WGQRNLDLEADMLIPVED 227

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
              GV++VG  TI YH           +Y     S     + S    +D          V
Sbjct: 228 TETGVIIVGGETIVYHYG--------QDYICIQPSFLRTTKISCYCRIDNNRL------V 273

Query: 377 ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
            +L    G L +LT+  + + V    L       +   ++ + N + F+GSRLGDS L++
Sbjct: 274 FILGGICGRLFILTLRRENKKVVSHSLDLLGSVSIPECLSYLDNGVVFVGSRLGDSQLIR 333

Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-DALQDMVNGEELSLYGSASNNTE 495
                               + A  P  + L   ++  A+ DM+   +L   G     T 
Sbjct: 334 --------------------MHAQEPFIEVLESYTNLGAILDMI-VVDLEKQGQDQLITC 372

Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYH 555
           S Q              G L+    G+ I+  A             VEL G KGIW +  
Sbjct: 373 SGQGA-----------CGSLRIIRNGIGIHELAC------------VELSGIKGIWAL-- 407

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--TADLLTEVTESVDYFVQGRTIA 613
              R + A        DD     L++S   +T V    + + L +VT    + +  +T  
Sbjct: 408 ---RMNTAQLEEDTPTDDT----LVLSFVGQTRVFNCSSTEELEQVTLPAAFDIDSQTFC 460

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQ-DLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
           A N+ G  +VIQV ++   ++  +  T+ D  F P        + N   +++++ +  V 
Sbjct: 461 ARNVLG-NQVIQVTDKRVNLISVTSKTRVDQWFPPEGEIITQCACNDVQVALALKNVLVY 519

Query: 673 LGMSDGSI 680
           L + DGS+
Sbjct: 520 LEIRDGSL 527


>gi|125580741|gb|EAZ21672.1| hypothetical protein OsJ_05303 [Oryza sativa Japonica Group]
          Length = 1224

 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 159/391 (40%), Gaps = 75/391 (19%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RIL+P        T   + +Q +E A+++  V   N   KE+ TLLA+GTA        
Sbjct: 873  IRILDPKSR----DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAVGTA-------- 917

Query: 1125 ARGRVLLFSTGRNAD----NPQNLVTEVYSKEL--KGAIS----ALASLQGHLLIASGPK 1174
               + L F   RN      +    V E  S EL  K  +     AL   QG LL   G  
Sbjct: 918  ---KGLQFWPKRNLSAGFIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSV 974

Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
            + L+     +L         P  +VS++  ++ I +GD+ +S ++  ++    QL + A 
Sbjct: 975  LRLYDLGKRKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFAD 1034

Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLL 1283
            D       A    ID  T++   +D+  NI        +S+            W+  KL 
Sbjct: 1035 DSVPRWLTAANH-IDFDTMA--GADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLN 1091

Query: 1284 SR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
                      +FHVG  VT   +  ++          PG  +     L++GT+ GS+G +
Sbjct: 1092 GAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGAL 1137

Query: 1337 APL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
                  +++ F     L+  L    P + G +  ++R       A+ P  D ++D +L  
Sbjct: 1138 LAFTSREDVDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCE 1188

Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +  LP + Q +IA +   T  +IL  L D+
Sbjct: 1189 QFPSLPADMQRKIADELDRTPGEILKKLEDI 1219


>gi|346971485|gb|EGY14937.1| pre-mRNA-splicing factor RSE1 [Verticillium dahliae VdLs.17]
          Length = 1230

 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 103/486 (21%), Positives = 199/486 (40%), Gaps = 53/486 (10%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
             C  G + +  Q +        G T       + IPL  TP ++    E  ++  I S  
Sbjct: 779  QCEEGVVGIQGQSLRIFAIEKLGDTLTQ----KSIPLTYTPRRMVKHPEHPMFYTIESDN 834

Query: 1024 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATI 1083
               P      LL D  V +  D   L   +         +   I   D      QT  T+
Sbjct: 835  NTLPPELRAQLLADPSVVNG-DARTLPPAEFGYPRAKGRWASCISVIDPLSEELQTLQTV 893

Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RNA 1138
             + ++E A++  +V     T+++NE+ L +GT    G+D+    R   F+ G     R +
Sbjct: 894  DLDNNEAAVSAAIVPF---TSQDNESFLVVGT----GKDMIVNPR--QFTEGYIHIYRFS 944

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++   +AL + QG L+   G  + ++     ++   A  D  P  +
Sbjct: 945  EDGREL-EFIHKTKVEEPPTALLAFQGRLVAGVGKTLRIYDLGQKQMLRKAQADVAPQLI 1003

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLV 1256
            VSL+   + I++GD+ + + ++ +K    +L     D  +  + C     ++D    S+ 
Sbjct: 1004 VSLSTQGSRIVVGDVQQGVTYVVYKPLSNKLIPFVDDTVARWMTCTT---MVDYE--SVA 1058

Query: 1257 VSDEQKNIQIFYYAPKMS----ESWKGQKLL-SRAEFHVGAH----VTKFLRLQMLATSS 1307
              D+  NI I     K S    E   G  L  +R   H   H    V+ F    +L + +
Sbjct: 1059 GGDKFGNIFIVRAPEKASQEADEEGAGLHLTNTRDYLHGTPHRLSLVSHFYSQDVLTSIT 1118

Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGL 1364
              +    G D      LL+  + G+IG   P    ++  F   Q+L++ L      +AG 
Sbjct: 1119 KTSLVVGGQD-----VLLWSGISGTIGVFIPFVTREDADF--FQTLEQHLRTEDAPLAGR 1171

Query: 1365 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +   +R +++  K        ++D +L   Y +LP +++  IA +   +  +I   ++D+
Sbjct: 1172 DHLMYRGYYAPVKG-------VIDGDLCERYTLLPNDKKQMIAGELDRSVREIERKISDI 1224

Query: 1425 ALGTSF 1430
               ++F
Sbjct: 1225 RTRSAF 1230



 Score = 46.2 bits (108), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 122/548 (22%), Positives = 203/548 (37%), Gaps = 125/548 (22%)

Query: 128 RRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDP 187
           R  + +ILA +  +I+++E+  + +  +   +  F        K G      G  +  DP
Sbjct: 97  RGFNYLILATDSGRIAIIEYLPAQNRFQRLHLETFG-------KSGIRRVVPGEFLACDP 149

Query: 188 QGRCGGVLVYGLQ-----MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           +GR    L+  L+      ++ + SQ        E T  S     A     HV+++  LD
Sbjct: 150 KGRA--CLIASLEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVHVLSMVALD 198

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL----- 297
           +          GY  PV   L E + T A +          +AL + T L  + L     
Sbjct: 199 V----------GYANPVFAAL-ETDYTEADQDPTGQ-----AALDVETQLVYYELDLGLN 242

Query: 298 -IWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVS 348
            +    + P D    L    P G     GVLV G   I Y HS   +  + +     A  
Sbjct: 243 HVVRKWSEPVDNTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQEAFRVPVPRRRGATE 302

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRL 401
             S +    +    +L  +   +      LL T+ GDL  +T+      DG     V+RL
Sbjct: 303 DPSRKRCIVAGVMHKLKGSAGAFF----FLLQTEDGDLFKITIDMIEDRDGNPTGEVKRL 358

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
            +   +   + S +  + +   ++ S+ G+    QF              E+ GD     
Sbjct: 359 KIKYFDTIPVASSLCILKSGFLYVASQFGNYQFYQF--------------EKLGD----- 399

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
              + L  SS D   D     E   +          ++  + A+ +S+ ++ PL D    
Sbjct: 400 -DDEELEFSSDDFPTDPKQSYEAVFF--------HPRELENLALVESIDSMNPLIDCKVA 450

Query: 522 LRINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNAD 564
                DA    +A G   +S + ++            ELPG    +WT+  K SRG    
Sbjct: 451 NLTGEDAPQIYTACGNGARSTFRILKHGLEVNEIVASELPGIPSAVWTL--KLSRG---- 504

Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
                   D+Y AY+++S    T+VL   + + EV +S   F+      A  L G   +I
Sbjct: 505 --------DQYDAYIVLSFTNATLVLSIGETVEEVNDS--GFLTSVPTLAAQLLGGEGLI 554

Query: 625 QVFERGAR 632
           QV  +G R
Sbjct: 555 QVHPKGIR 562


>gi|322693432|gb|EFY85292.1| Pre-mRNA-splicing factor RSE1 [Metarhizium acridum CQMa 102]
          Length = 1221

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 150/346 (43%), Gaps = 38/346 (10%)

Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFST 1134
            Q   T+ ++++E A++  +V      +++NE+ L +GT    G+D+    R         
Sbjct: 870  QVVQTVDLENNEAAVSAAIVPF---ASQDNESFLIVGT----GKDIVVNPRNFSEAYIYV 922

Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
             R  +  + L   ++  +++    AL   QG LL   G  + ++     ++   A  +  
Sbjct: 923  YRFQEEGREL-EFIHKTKIEEPALALIPFQGKLLAGVGKTLRVYDLGMRQMLRKAQAEVA 981

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
            P  +VSLN   + I++GD+ + + ++++K    +L   A D  +     T  ++D    S
Sbjct: 982  PQQIVSLNTQGSRIIVGDVQQGVTYVTYKPTTNKLIPFADDIIARWITCTT-MVDYE--S 1038

Query: 1255 LVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGA----HVTKFLRLQMLATS 1306
            +   D+  N+ I    PK S    E   G  L++  ++  G      +      Q + TS
Sbjct: 1039 VAGGDKFGNMFIVRCPPKASEEADEEQSGLHLMNARDYLHGTSQRLDLMCHFYTQDIPTS 1098

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              +T    G        LL+  L G+IG   PL   ++  F   QSL+  L    P +AG
Sbjct: 1099 MAKTSLVVGGQDV----LLWSGLMGTIGVFIPLISREDADF--FQSLESHLRTEDPPLAG 1152

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
             +   +R +++  K        I+D +L   Y +LP +++  IA +
Sbjct: 1153 RDHLMYRSYYAPVKG-------IIDGDLCERYTLLPNDKKQMIAGE 1191


>gi|410045300|ref|XP_508472.4| PREDICTED: DNA damage-binding protein 1 [Pan troglodytes]
          Length = 1107

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 90/199 (45%), Gaps = 20/199 (10%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 830  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 883

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 884  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 938

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 939  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 995

Query: 1279 GQKLLSRAEFHVGAHVTKF 1297
             Q L     FH+G  V  F
Sbjct: 996  RQHLQEVGLFHLGEFVNVF 1014


>gi|258570355|ref|XP_002543981.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
 gi|237904251|gb|EEP78652.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
          Length = 1209

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 79/365 (21%), Positives = 162/365 (44%), Gaps = 44/365 (12%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR------GRVLLFSTGR 1136
            I ++ +E A++V  V     +++++ET L +GT    G+D+         G + ++   R
Sbjct: 872  IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVNPPSSSCGYIHIY---R 921

Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
              ++ + L   ++  +++    AL + QG LL   G  + ++     +L      +  P 
Sbjct: 922  FQEDGKEL-EFIHKTKVESPPQALLAFQGRLLAGIGTNLRIYDLGMKQLLRKCQAEVVPR 980

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
             +V L    + I++ D+ +S+ ++ +K Q  +L   A D  +     T  ++D  T++  
Sbjct: 981  MIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDIIARWTTCTT-MVDYETVA-- 1037

Query: 1257 VSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSD 1308
              D+  N+ +     K SE       G  L+   ++  GA     L +    Q + TS  
Sbjct: 1038 GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMIHFYPQDIPTSIQ 1097

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
            +T    G     R  L++  L G+IG + P    +++ F   QSL+ +L    P +AG +
Sbjct: 1098 KTQLVAG----GRDILVWTGLQGTIGMLIPFVSREDVDF--FQSLEMQLTSQTPPIAGRD 1151

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               +R +++  K         +D +L   Y  LP +++L IA +   +  +I   ++D+ 
Sbjct: 1152 HLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISDMR 1204

Query: 1426 LGTSF 1430
               ++
Sbjct: 1205 TKVAY 1209



 Score = 41.2 bits (95), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 65/265 (24%), Positives = 108/265 (40%), Gaps = 40/265 (15%)

Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
           LL T+ GDL  +T+  V D        V+RL L   +   + S +  + N   F+ S  G
Sbjct: 307 LLQTEDGDLFKVTIDMVEDDNGQPTGEVRRLKLKYFDTVPIASSLCILKNGFLFVASENG 366

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGD--IEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
           +    QF         +    ++F    +E  AP   R R + +  L + +N     +  
Sbjct: 367 NHHFYQFEKLGDDDEETEFTSDDFSSDPLEPLAPVYFRPRPAENLNLVESINSVNPLMSC 426

Query: 489 SASNNTES-AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
             +N TE  A + ++     +      LK   +GL +         S+    EL  +P  
Sbjct: 427 KVANLTEDDAPQLYTLCGTGARSTFRTLK---HGLEV---------SEIVESELPSVPS- 473

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
             +WT   K +R            +D+Y AY+I+S    T+VL   + + EVT++  +  
Sbjct: 474 -AVWTT--KLTR------------NDQYDAYIILSFTNGTLVLSIGETVEEVTDT-GFLS 517

Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR 632
              T+A   L G   +IQV  +G R
Sbjct: 518 SAPTLAVQQL-GEDSLIQVHPKGIR 541


>gi|124505011|ref|XP_001351247.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
            putative [Plasmodium falciparum 3D7]
 gi|7768292|emb|CAB11136.2| CPSF (cleavage and polyadenylation specific factor), subunit A,
            putative [Plasmodium falciparum 3D7]
          Length = 2870

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/218 (22%), Positives = 103/218 (47%), Gaps = 16/218 (7%)

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
             P  +++++++  ++I++GDI  S+  L +  + +QL  + +D+ ++ C +   L   S 
Sbjct: 2631 TPSSWIMTVDVYGDYIVVGDIMTSVTILQYDYENSQLFEVCRDYSNIWCTS---LCALSK 2687

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL---QMLATSSDR 1309
              +VVSD   N  I   +          KL S + F+ G+ + K L L    ++    D+
Sbjct: 2688 SHIVVSDMDANFIILQKSKFKYNDEDSYKLSSVSLFNHGSIINKMLPLSNTNLIEEDYDK 2747

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL-TFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  +D      +L  + +GSI  + P      F++   ++  + D++  +  L+  +
Sbjct: 2748 RNILTKND-----GILCASSEGSISVLIPFSSFANFKKALCIEIAITDNISSIGNLSHNA 2802

Query: 1369 FRQFHSNGKA-HRPGPDSIVDCELLSHYEMLPLEEQLE 1405
            +R++  N ++ H  G   IVD ELL  +  +  E+Q +
Sbjct: 2803 YREYKVNFRSKHCKG---IVDGELLKMFFHMSFEKQYK 2837


>gi|167390599|ref|XP_001739420.1| DNA damage-binding protein [Entamoeba dispar SAW760]
 gi|165896898|gb|EDR24200.1| DNA damage-binding protein, putative [Entamoeba dispar SAW760]
          Length = 1088

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 79/349 (22%), Positives = 158/349 (45%), Gaps = 44/349 (12%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQN 1143
            ++ +E+AL++  + +      + + +  IGTA+ +  +V  + GR+L+       D    
Sbjct: 756  LKENEHALSIEEIVV------DEKEMFVIGTAFAKPNEVEPSSGRILIVQI---KDGKLE 806

Query: 1144 LVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP---- 1195
            +V   + K++ GA+ ++ +L + +L ++   K+++ ++     NG   +   +       
Sbjct: 807  IV---FEKDVNGAVYSIKTLLKKYLAMSIEKKLVIFEYQRIITNGEFEVKLQEKGSCNVK 863

Query: 1196 ---LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL---LAKDFGSLDCFATEFLID 1249
               LYV ++    N IL+GD+ KSI   S+   G   N    +++DF +    A EF+ +
Sbjct: 864  LIGLYVKTMG---NKILVGDLMKSISVYSFDNNGNNKNCLNEVSRDFYASYTTAIEFVDE 920

Query: 1250 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
               LS   SD   N+ +F      +ES +  +L + A  HVG  +    +  +  T S  
Sbjct: 921  NCYLS---SDSNSNLLVFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHSTY 976

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-PRS 1368
                   +   +  +LFG + G IG I  +    +  L  +Q +++  +  +     P  
Sbjct: 977  -------ETIQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDE 1029

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
            +++   + K  R    +I+D  ++  Y  +  E+Q EIAH +G    QI
Sbjct: 1030 WKKVIDDWK--RMPSSNIIDGNIVESYLEMSKEKQCEIAHLSGVNEEQI 1076


>gi|255081708|ref|XP_002508076.1| predicted protein [Micromonas sp. RCC299]
 gi|226523352|gb|ACO69334.1| predicted protein [Micromonas sp. RCC299]
          Length = 1199

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 86/388 (22%), Positives = 162/388 (41%), Gaps = 72/388 (18%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            VRI++P  A     T+  + M  +E AL    V       + +E  LA+G+A        
Sbjct: 851  VRIVDPASA----STKQIVEMTGNEAALCCCHVYF----PQADELFLAVGSA-------- 894

Query: 1125 ARGRVLLFSTGRNADN--------PQNLVTEVYSKE-LKGAISALASLQGHLLIASGPKI 1175
                V L  + R+++          Q+   E++ K  L G   A+   +G LL+  G  +
Sbjct: 895  ----VSLTFSPRDSEGGFIHLYRYTQDGGIELFHKTPLDGVPGAMCGFKGRLLVGVGNTL 950

Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
             L+ +   +L         P ++ +++     I +GD+ +S +++ +K +   + ++A D
Sbjct: 951  RLYDFGKKKLLRKVENRNFPNFIKTIHAQGERIYVGDVQESFHYVRYKREDGSMYIVADD 1010

Query: 1236 ----------------FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
                                D F   F+   S L+  VSDE +        P   ++  G
Sbjct: 1011 VQPRHVTAACPLDYDTIAGGDRFGNVFV---SRLAQDVSDEIEE------DPTGGKTAYG 1061

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
            Q  L+ A   +   VT+F   + +   +  T  A G +     ++++ TL G++G + P 
Sbjct: 1062 QGALNGASHKIN-QVTQFHVGETVCALTKGTLQAGGLE-----SMIYATLMGTLGALMPF 1115

Query: 1340 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
               +++ F     L+  +   +P + G +  +FR       ++ P  D ++D +L   Y 
Sbjct: 1116 GNREDVDF--CTHLEMHMRQELPPLLGRDHLAFR------SSYFPVKD-VIDGDLCEMYT 1166

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +LP E Q  +A     T S++L  L DL
Sbjct: 1167 VLPHEAQRRVAEDMDRTVSEVLKKLEDL 1194


>gi|242772631|ref|XP_002478075.1| nuclear mRNA splicing factor, putative [Talaromyces stipitatus ATCC
            10500]
 gi|218721694|gb|EED21112.1| nuclear mRNA splicing factor, putative [Talaromyces stipitatus ATCC
            10500]
          Length = 1209

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 106/484 (21%), Positives = 204/484 (42%), Gaps = 62/484 (12%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
             C  G + +  Q +    ++ S    DN    + IPL  TP       E+ L+ +I S  
Sbjct: 759  QCLEGMVGIQGQNL----RIFSIEKLDNNVLQESIPLAYTPRHFVKHPEQPLFYVIESEN 814

Query: 1024 -VLKPLNQVLSLLIDQEV--GHQI----DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1076
             VL P  Q   LL D ++  G  +    +         H    +E     +++P  +   
Sbjct: 815  NVLAPATQT-RLLEDSKLQNGEAVIPPAETFGFPRATGHWASCIE-----VVDPINSKS- 867

Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG- 1135
                + + ++ +E+A++V  V+     +++NET L +GT    G+DV    R   FS G 
Sbjct: 868  --VLSRLELEENESAVSVAAVSF---ASQDNETFLVVGT----GKDVVTYPRS--FSAGF 916

Query: 1136 ----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1191
                R  ++ + L   ++  +++    AL + QG L+   G  + ++     ++      
Sbjct: 917  IHIYRFQEDGREL-EFIHKTKIEEPPLALLAFQGRLVAGIGKNLRVYDLGMKQMLRKCQV 975

Query: 1192 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
            +A P  +V L    + I++ D+ +S+ ++ +K Q  QL     D  +    AT  ++D  
Sbjct: 976  EASPNLIVGLQTQGSRIIVSDVQESVTYVVYKYQENQLIPFVDDVIARWTTATT-MVDYE 1034

Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSES----WKGQKLLSRAEFHVGA----HVTKFLRLQML 1303
            T +    D+  N+ +     K+SE       G  L+    +  G      +      Q +
Sbjct: 1035 TTA--GGDKFGNLWLVRCPKKVSEESDEDGSGAHLIHERSYLQGTPNRLDLMVHFYTQDI 1092

Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1360
             TS  +T    G     R  L++  L G+IG + P    +++ F   Q+L+ +L    P 
Sbjct: 1093 PTSLHKTNLVVG----GRDILVWTGLQGTIGMMIPFISREDVDF--FQNLEMQLASQNPP 1146

Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
            +AG     +R ++   K        ++D +L   Y +LP +++L IA +   +  +I   
Sbjct: 1147 LAGREHLIYRSYYVPVKG-------VIDGDLCESYFLLPNDKKLMIAGELDRSVREIERK 1199

Query: 1421 LNDL 1424
            ++D+
Sbjct: 1200 ISDM 1203


>gi|298713790|emb|CBJ27162.1| spliceosomal protein sap, putative [Ectocarpus siliculosus]
          Length = 1256

 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/378 (22%), Positives = 153/378 (40%), Gaps = 55/378 (14%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +R+L+P        T   + +  +E AL+V  V   N   +  E  +A+GTA  +     
Sbjct: 909  IRLLDPVEG----TTVECLDLDDNEAALSVAPVAFHN---RNGEAFVAVGTA--KSLTFH 959

Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
             RG    F         +N +  ++  E+     A+   QG LL+  G  + ++     +
Sbjct: 960  PRGHEGCFVHVYRI--LENRLVLLHKTEVPDVPLAMKEFQGRLLVGVGQSLRMYDLGRKK 1017

Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----FGSLD 1240
            L         P  VVSL +  + +  GD  +S +   ++    +L   A D    F +  
Sbjct: 1018 LLRKCENKRMPSMVVSLTVTGDRVFAGDQMESCHCFKYRRAENRLVEFADDQVPRFMTKT 1077

Query: 1241 CFATEFLIDGST---------LSLVVSDEQKNI---QIFYYAPKMSESWKGQKLLSRAEF 1288
            C      I G+          + L VSD+  N    ++ + +  +S +    K+  + +F
Sbjct: 1078 CLLDYDSIAGADKFGNIFVLRVPLDVSDDVDNPTGNRLLWDSGHLSGA--PNKVQQQLQF 1135

Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFR 1345
            HVG  V+          S  RT   PG  +     LL+ T++GSIG + P    D++ F 
Sbjct: 1136 HVGEVVS----------SLRRTTLVPGGAEV----LLYSTINGSIGALLPFKSRDDVDF- 1180

Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
                ++  +    P + G +  S+R ++   K        ++D +L   +  LP E+Q  
Sbjct: 1181 -FTHMEMYMRQEKPTLCGRDHISYRSYYLPAK-------DVIDGDLCEQFSSLPFEKQKL 1232

Query: 1406 IAHQTGTTRSQILSNLND 1423
            +A+    T  +++  L D
Sbjct: 1233 VANGLDRTVGEVVKKLED 1250


>gi|429850956|gb|ELA26181.1| DNA damage-binding protein 1 [Colletotrichum gloeosporioides Nara
            gc5]
          Length = 1409

 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 137/321 (42%), Gaps = 40/321 (12%)

Query: 1113 IGTAYV----QGEDVAARGRVLLFSTGRNAD-NPQNLVTEVYSKELKGAISALASLQGHL 1167
            +GT+Y+      E+   +GR+L+   G ++D NP  +V    S ELKGA  +LA +   L
Sbjct: 826  VGTSYLADPEMDENSEVKGRILVL--GVDSDKNPYQIV----SHELKGACRSLAVMGDKL 879

Query: 1168 LIASGPKIILHKW------TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
            +      ++++ +      +G+ L    F   P  + V L++  N I + D+ +S+  + 
Sbjct: 880  VAGLSKTVVVYDYAEESSTSGSLLKLATFR--PSTFPVDLDVNGNMIGVADLMQSMTLIE 937

Query: 1222 W--KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
            +   + G +  L+ +       +AT        L  + +D Q N+ +    P        
Sbjct: 938  FIPAQDGNKARLVERARHFQYIWATAVCHLEQDL-WIEADAQGNLMVLRRNPNAPTEHDK 996

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
            +++   +EFH+G  + K   L +++  +D     P   K         T++GSI   A +
Sbjct: 997  KQMEVISEFHLGEQINKIRPLDVVSGEND-----PIEPKA-----FLATIEGSIYVFADI 1046

Query: 1340 DE------LTFR-RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
                    L F+ RL  + K L  +    AGL+  S+R F  N K    GP   VD EL+
Sbjct: 1047 KPEYQSLLLQFQERLAGVIKTLGQADEPGAGLSFMSWRGFR-NAKRSADGPFRFVDGELI 1105

Query: 1393 SHYEMLPLEEQLEIAHQTGTT 1413
              +  L    Q  +    G T
Sbjct: 1106 ERFLDLDAGRQEAVVQGLGPT 1126


>gi|389638952|ref|XP_003717109.1| pre-mRNA-splicing factor RSE1 [Magnaporthe oryzae 70-15]
 gi|148887431|sp|Q52E49.2|RSE1_MAGO7 RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|351642928|gb|EHA50790.1| pre-mRNA-splicing factor RSE1 [Magnaporthe oryzae 70-15]
          Length = 1216

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/369 (22%), Positives = 164/369 (44%), Gaps = 52/369 (14%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
            I + ++E AL++ VV+     +++ E+ L +GT    G+D+    R   F+ G     R 
Sbjct: 879  IDLDNNEAALSMAVVSF---ASQDGESFLVVGT----GKDMVVNPR--RFTEGYIHVYRF 929

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
            +++ + L   ++  +++   +AL   QG L+   G  + ++     +L   A  +  P  
Sbjct: 930  SEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGIGRMLRIYDLGLRQLLRKAQAEVAPQL 988

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            +VSLN   + I++GD+   + ++++K +  +L   A D  +     T  +   ST     
Sbjct: 989  IVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAG--- 1045

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL-----RLQMLA-------- 1304
            +D+  N+ I     K S+          +E H+  H   +L     RL ++A        
Sbjct: 1046 ADKFGNLWILRCPEKASQESDEPG----SEVHL-VHSRDYLHGTSNRLALMAHVYTQDIP 1100

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
            TS  +T    G  +     LL+G   G+IG + P    ++  F   QSL++ L    P +
Sbjct: 1101 TSICKTNLVVGGQE----VLLWGGFQGTIGVLIPFVSREDADF--FQSLEQHLRSEDPPL 1154

Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            AG +   +R  +   K        ++D +L   Y MLP +++  IA +   +  +I   +
Sbjct: 1155 AGRDHLMYRGCYVPVKG-------VIDGDLCERYTMLPNDKKQMIAGELDRSVREIERKI 1207

Query: 1422 NDLALGTSF 1430
            +D+   ++F
Sbjct: 1208 SDIRTRSAF 1216


>gi|212531303|ref|XP_002145808.1| nuclear mRNA splicing factor, putative [Talaromyces marneffei ATCC
            18224]
 gi|210071172|gb|EEA25261.1| nuclear mRNA splicing factor, putative [Talaromyces marneffei ATCC
            18224]
          Length = 1209

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 105/483 (21%), Positives = 201/483 (41%), Gaps = 60/483 (12%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
             C  G + +  Q +    ++ S    DN    + IPL  TP       E+ L+ +I S  
Sbjct: 759  QCLEGMVGIQGQNL----RIFSIEKLDNNVLQESIPLAYTPRHFVKHPEQPLFYVIESEN 814

Query: 1024 -VLKPLNQVL-----SLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
             VL P  Q        L   + V    +         H    +E     +++P  A    
Sbjct: 815  NVLAPATQTRLLEESKLQNGEAVIPPAETFGYPRATGHWASCIE-----VVDPVNAKS-- 867

Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-- 1135
               + + ++ +E+A+++  V+     +++NET L +GT    G+DV    R   FS G  
Sbjct: 868  -VLSRLELEENESAVSIAAVSF---ASQDNETFLVVGT----GKDVVTYPRS--FSAGFI 917

Query: 1136 ---RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1192
               R  ++ + L   ++  +++    AL + QG L+   G  + ++     ++      +
Sbjct: 918  HIYRFQEDGREL-EFIHKTKIEEPPLALLAFQGRLVAGIGKNLRIYDLGMKQMLRKCQVE 976

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
            A P  +V L    + I++ D+ +S+ ++ +K Q  QL     D  +    AT  ++D  T
Sbjct: 977  AVPNLIVGLQTQGSRIIVSDVQESVTYVVYKYQENQLIPFVDDVIARWTTATT-MVDYET 1035

Query: 1253 LSLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLRL----QMLA 1304
             +    D+  N+ +     K S    E   G  L+    +  G      L +    Q + 
Sbjct: 1036 TA--GGDKFGNLWLVRCPQKASDDSDEDGSGAHLIHERSYLQGTANRLNLMIHYYTQDIP 1093

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
            TS  +T    G     R  L++  L G+IG + P    +++ F   Q+L+ +L    P +
Sbjct: 1094 TSLHKTNLVVG----GRDILVWTGLQGTIGIMVPFISREDVDF--FQNLETQLASQNPPL 1147

Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            AG +   +R ++   K        ++D +L   Y +LP +++L IA +   +  +I   +
Sbjct: 1148 AGRDHLIYRSYYVPSKG-------VIDGDLCESYFLLPNDKKLMIAGELDRSVREIERKI 1200

Query: 1422 NDL 1424
            +D+
Sbjct: 1201 SDM 1203


>gi|312370905|gb|EFR19207.1| hypothetical protein AND_22901 [Anopheles darlingi]
          Length = 1287

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 68/311 (21%), Positives = 126/311 (40%), Gaps = 43/311 (13%)

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            DN  N +  V+  E+  A  AL + QG LL   G  + ++     +L         P  +
Sbjct: 1001 DNQTNELEHVHRTEIDDAPGALCAFQGRLLAGIGKVLRMYDLGKKKLLRKCENKHIPNQI 1060

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V++  +   + + D+ +S+Y L +K    QL + A D        +  L+D  T++    
Sbjct: 1061 VNIQGMGQRVYVSDVQESVYCLKYKRPENQLIIFADDTHP-RWVTSATLLDYDTVA--TG 1117

Query: 1259 DEQKNIQIFYYAPKMS----ESWKGQKLL----------SRAE----FHVGAHVTKFLRL 1300
            D+  NI +      +S    E   G K L           +AE    FH+G  V    + 
Sbjct: 1118 DKFGNIAVLRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGEIVMSLQKA 1177

Query: 1301 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVP 1359
             ++          PG  ++    L++ T+ G++G + P      +   Q L+  + +  P
Sbjct: 1178 TLI----------PGGSES----LIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENP 1223

Query: 1360 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
             + G +  S+R ++   K       +++D +L   +  L   +Q  IA   G T S++  
Sbjct: 1224 PLCGRDHLSYRSYYYPVK-------NVMDGDLCEQFTSLDPAKQKSIASDLGRTPSEVAK 1276

Query: 1420 NLNDLALGTSF 1430
             L D+    +F
Sbjct: 1277 KLEDIRTRYAF 1287


>gi|340367933|ref|XP_003382507.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Amphimedon
            queenslandica]
          Length = 1214

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 76/389 (19%), Positives = 160/389 (41%), Gaps = 59/389 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1122
            +R++ P++     +T   +  + +E A ++ V      +  + E  + +GTA   +    
Sbjct: 862  LRVMHPNQG----KTLDIVQFEQNEAAFSLAVCQF--VSKGDLEWFVVVGTAKDMIITPR 915

Query: 1123 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
              + G +++F    +       +  V++ +L     A+A  QG LL+  G  + ++    
Sbjct: 916  AISSGSLIVFRLSPDGSK----LEHVHTTQLDDVPIAMAPFQGRLLVGVGKLLRIYDIGK 971

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
             ++         P  VV + ++   + +GD+ ++++FL ++    QL + A +     C 
Sbjct: 972  KKMLRKCENKHLPYLVVDIKVMGRRVYVGDVQEAVHFLYYRPHENQLVIFADEVVPRFC- 1030

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1284
             T  ++D +T++   +D+  NI I      +++            W         QK   
Sbjct: 1031 TTSCILDYNTVA--SADKFGNITILRLPSDVTDQVDEDPSGSRSLWDRGFLNGATQKANV 1088

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1341
               +HVG  +    ++ ++          PG  +     L++ TL GSIG + P    ++
Sbjct: 1089 MTSYHVGEGINTLHKVSLI----------PGGSE----VLVYTTLSGSIGILVPFSSKED 1134

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
              F   Q L+  +     ++ G +  SFR ++   K       S++D +L   Y  L   
Sbjct: 1135 SDF--FQHLEMHMRSEWSNLVGRDHLSFRSYYVPVK-------SVIDGDLCEVYNSLDPS 1185

Query: 1402 EQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            ++ EIA     + S++   L DL    +F
Sbjct: 1186 KRREIALDLDRSPSEVAKKLEDLRTRYAF 1214


>gi|301110252|ref|XP_002904206.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
 gi|262096332|gb|EEY54384.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
          Length = 1197

 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 67/298 (22%), Positives = 126/298 (42%), Gaps = 48/298 (16%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            V++ E+     A+   QG LL++ G  + ++     ++         P  +V L    + 
Sbjct: 922  VHTTEIDDIPHAMCEFQGRLLVSVGRALRIYDLGKKKMLRKCENRNFPSILVELKAAGDR 981

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE-FLIDGSTLSLVVSDEQKNIQI 1266
            I   D+H+S +F+ +K+   QL + A D   +  F T   L+D  TL    +D+  N+ +
Sbjct: 982  IYASDMHESFHFVKYKKDENQLVIFADD--CVPRFITSSVLLDYDTL--CGADKFGNVFV 1037

Query: 1267 FYYAPKMSES----------WKG-------QKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
                 ++S+           W          KL   A+FHVG  VT  +R  ++      
Sbjct: 1038 SRLPSEVSDEIDNPTGNRILWDSGLLNGAPNKLEQVAQFHVGDVVTSMVRSSLV------ 1091

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1366
                PG  +    A+++ T+ G IG + P    +++ F     L+  +    P + G + 
Sbjct: 1092 ----PGGTE----AVIYATIMGRIGALIPFTSREDVDF--YTHLEMYMRQEQPPLCGRDH 1141

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             S+R ++   K       +I D +L   +  L +E+Q  +A     T +++L  L D+
Sbjct: 1142 LSYRSYYIPVK-------NITDGDLCEQFSSLSVEKQASVAEDLDRTPAEVLKKLEDI 1192



 Score = 46.6 bits (109), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 68/306 (22%), Positives = 114/306 (37%), Gaps = 86/306 (28%)

Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVE--LDAAHATWLQND 375
           VLV+G NT+ Y ++     +CA+               PR        +  + AT  Q D
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAI---------------PRREGEHRDIIIVSAATHKQRD 291

Query: 376 V--ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
           +   LL ++ GDL  +++ Y G VV+ + +   +   + S +      L F  S   +  
Sbjct: 292 LFFVLLQSELGDLYKISLDYSGNVVEEIKIQFFDTIPVASSMCITKTGLLFCASEFSNHY 351

Query: 434 LVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA---------------LQD 477
           L QF + G G        K     ++    ST  LR+ ++ A               + D
Sbjct: 352 LFQFLSIGEG----DDAAKCSSLAMDPTEFSTFPLRKLTNLALASSSASLSPVTQLLVDD 407

Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
           + N +   +Y    NN  S+                 L+   +GL I   A++       
Sbjct: 408 LANEQTPQMYALCGNNNRSS-----------------LRVLRHGLPITEMAASA------ 444

Query: 538 NYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
                 LPG  K +W +                +Y D Y  Y+++S E  T+VLE  + +
Sbjct: 445 ------LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETV 484

Query: 597 TEVTES 602
            EV +S
Sbjct: 485 EEVAQS 490


>gi|164656549|ref|XP_001729402.1| hypothetical protein MGL_3437 [Malassezia globosa CBS 7966]
 gi|159103293|gb|EDP42188.1| hypothetical protein MGL_3437 [Malassezia globosa CBS 7966]
          Length = 1207

 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 85/361 (23%), Positives = 143/361 (39%), Gaps = 35/361 (9%)

Query: 1074 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS 1133
            G   QT A   ++  E AL++ +V         NE  L +G+A +         R    S
Sbjct: 859  GTTMQTCAEYALEKDEAALSMALVPF---AACGNELFLVVGSA-LGVTHAPLTWRAAFLS 914

Query: 1134 TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
            T R  DN   L   V+  E+     AL +  G LL  +GP + +      +L        
Sbjct: 915  TYRLTDNGCGLAL-VHKTEVDHVPLALRAFHGRLLAGTGPYVRIFDMGTKKLLRKCQSRP 973

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
             P  VVSL +    +++GD+ +S+++  +K     L   A D       +   ++D  T+
Sbjct: 974  FPSKVVSLQVQGYRVIVGDMQESVHYSVYKPATNTLVAFADDIMPRWTTSALLMLDYDTV 1033

Query: 1254 SLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLRLQMLA----- 1304
              +  D+  N+ +       S    E   G  L +   + +GA      R Q+LA     
Sbjct: 1034 --MAGDKFGNVFVLRIDSSASLSADEDPTGLMLQNERSYLMGAA----HRAQLLAHYHVG 1087

Query: 1305 ---TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPH 1360
               TS       PG     R  +L+  ++G+IG + P +     R   +L+  +      
Sbjct: 1088 DIITSLSMESLVPG----GRPVVLYTCVNGTIGALVPFISREDVRLFTTLEMHMRQENLS 1143

Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
            + G +  ++R  ++  KA       +VD +L   Y  LP E+Q  IA +   T + I   
Sbjct: 1144 LTGRDHLAYRGHYTPVKA-------VVDADLCELYTALPHEKQESIADELDRTPADIAKK 1196

Query: 1421 L 1421
            L
Sbjct: 1197 L 1197


>gi|358060450|dbj|GAA93855.1| hypothetical protein E5Q_00501 [Mixia osmundae IAM 14324]
          Length = 1153

 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 107/510 (20%), Positives = 192/510 (37%), Gaps = 106/510 (20%)

Query: 965  CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPV 1024
            C  G I VT    L+I  LP   T      +  IPL  TP +  +     L  +I     
Sbjct: 703  CPEGIIGVTGD-TLRIFTLPKIGTK---VKMDSIPLSLTPRRTAFHPAGTLLYMI----- 753

Query: 1025 LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA----------- 1073
                              Q D+  LS +      T EE    ++EP  A           
Sbjct: 754  ------------------QSDHRTLSPI------TQEEKAKDLMEPSEAMWTAEINGLMR 789

Query: 1074 --GGPWQTRATI--PMQSSENALTVRV----------VTLFNTTTKENETLLAIGTAYVQ 1119
               G W +  +I  P +  ENA   ++          V +     +  +  L +GTA  Q
Sbjct: 790  AEAGQWSSCISIIDPTEP-ENATVTQIYLDNNEAAFSVAVAQFAERPGKWFLLVGTA--Q 846

Query: 1120 GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK 1179
               V+ R     F            +  ++  EL     ++A+ QG  ++  G  + L+ 
Sbjct: 847  DTTVSPRTCTHGFIRTYEITEAGRSLELLHKTELDDVPLSIAAFQGRAVVGVGRALRLYT 906

Query: 1180 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD---- 1235
               + L   +   + P  VVSL +  + I   D   S+YF+++K    +L + A D    
Sbjct: 907  MGKSRLLRKSENKSFPAAVVSLQVQGSRIYASDAQDSVYFVAYKAADNRLLIFADDTQQR 966

Query: 1236 ------------FGSLDCFATEFLIDGSTL-SLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
                          S D F   F+     L S  V ++Q    I +  P    +    +L
Sbjct: 967  WITCNTVVDYDTVASGDKFGNVFVSRVDKLVSEDVDEDQTGAGILHEKPLFMGAPHRLQL 1026

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1339
            L+   F+VG  +T   ++ ++A                R  LL+  L G++G + P    
Sbjct: 1027 LT--HFNVGDILTCIQKVSLVAG--------------GREILLYTCLGGTVGMLIPFISK 1070

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
            +++ F    +L+  +    P + G +  ++R ++   KA        VD +L   + +LP
Sbjct: 1071 EDVEFS--STLEMHMRAENPSIVGRDHLAYRGYYVPQKA-------TVDGDLCETFALLP 1121

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            +++Q +IA +   + S++L  ++ + + +S
Sbjct: 1122 MQKQAQIAGELDRSVSEVLKKIDSMRILSS 1151


>gi|408400551|gb|EKJ79630.1| hypothetical protein FPSE_00190 [Fusarium pseudograminearum CS3096]
          Length = 1212

 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 105/488 (21%), Positives = 208/488 (42%), Gaps = 57/488 (11%)

Query: 964  NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQK-IPLKATPHQITYFAEKNLYPLIV 1020
             C  G + +  Q   I  I +L  G T      +QK IPL  TP ++    ++ L+  I 
Sbjct: 761  QCEEGIVGIQGQSLRIFNIDRL--GETL-----IQKSIPLTYTPKKLVKHPDQPLFYTIE 813

Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1078
            +     P      LL D  V +  D+  L   D         +   + +++P    G  Q
Sbjct: 814  ADNNTLPPELRAQLLADPGVVNG-DSKVLPPEDFGYPKGTRRWASCINVIDPLSEEG--Q 870

Query: 1079 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1135
               TI ++++E A++  +V+    ++++NE+ L IGT    G+D+    R   FS G   
Sbjct: 871  VLQTIDLENNEAAVSAAIVSF---SSQDNESFLVIGT----GKDMVVNPRS--FSEGYLH 921

Query: 1136 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
              R  +  + L   ++  +++    AL + QG +L+A G  + ++     ++   +  + 
Sbjct: 922  IYRFLEGGREL-EFIHKTKVEEPPLALLAFQGRVLVAVGTSLRIYDLGMRQMLRKSQAEV 980

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
                +VSLN   + I++GD+ + + ++ +K    +L     D  +     T  ++D    
Sbjct: 981  ATQQIVSLNTQGSRIIVGDVQQGVTYVVYKPASNKLIPFVDDTIARWTTCTT-MVDYE-- 1037

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQMLAT 1305
            S+   D+  N+ I     K SE    ++     + +R   H   H    +     Q + T
Sbjct: 1038 SVAGGDKFGNMFIVRCPEKASEEADEEQSGLHLINARDYLHGTPHRVSLMCHFYTQDIPT 1097

Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
            S  +     G  +     LL+  + G+IG   P    ++  F   Q+L++ L    P +A
Sbjct: 1098 SITKASLVVGGQE----VLLWSGIMGTIGVFIPFVSREDADF--FQNLEQHLRTEDPPLA 1151

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
            G +   +R +++  K        ++D +L   Y +LP +++  IA +   +  +I   ++
Sbjct: 1152 GRDHLMYRGYYAPVKG-------VIDGDLCERYNLLPNDKKQMIAGELDRSVREIERKIS 1204

Query: 1423 DLALGTSF 1430
            D+   ++F
Sbjct: 1205 DIRTRSAF 1212


>gi|302831461|ref|XP_002947296.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f. nagariensis]
 gi|300267703|gb|EFJ51886.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f. nagariensis]
          Length = 1221

 Score = 57.8 bits (138), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 135/344 (39%), Gaps = 60/344 (17%)

Query: 1108 ETLLAIGTA----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL--KGAISALA 1161
            E LL +G A    Y+  +  AA  RV      R AD  + L  E+  K +   G   AL 
Sbjct: 906  EKLLVVGCAKGLRYMPTDCEAAYIRVY-----RLADGGKRL--ELVHKTIVDGGVPGALC 958

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
              +G LL   GP + L+     +L     Y+  P  ++++ +    I +GD  +S++ + 
Sbjct: 959  GFKGRLLAGVGPTLRLYDMGKKKLLRKCEYNRLPHQIMNITVQGPRIYVGDAQESVHMMR 1018

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG-- 1279
            +K+      + A D        T   +D  TL+    D+  N  +     + S+  +   
Sbjct: 1019 YKKADNAFYIFADDIAP-RYLTTILPLDYDTLA--AGDKFGNFVVLRLPREASQQVEDDP 1075

Query: 1280 ----------------QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
                             KL    +FHVG  +T   R +M A   +               
Sbjct: 1076 TGGKMAAASGRLNGAPHKLEEVVKFHVGDTITSLQRAEMQAGGQE--------------V 1121

Query: 1324 LLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
            LL+ T+ G+IG + P    +++ F     L+  L    P + G +  SFR       A+ 
Sbjct: 1122 LLYSTVMGAIGVLYPFTNREDVDF--FGHLEMHLRQEHPPLCGRDHLSFR------SAYF 1173

Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            P   S VD +L   Y  +P ++Q  IA     T  ++L  L D+
Sbjct: 1174 P-VRSCVDGDLCGQYASIPAKKQQMIAEAMDRTPGEMLKKLEDI 1216



 Score = 42.0 bits (97), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 71/353 (20%), Positives = 130/353 (36%), Gaps = 76/353 (21%)

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGAN 327
           A  ++ KH T     L ++  L++    W+   + + A  L+AVP      GGVLV   N
Sbjct: 199 AASMAQKHLTFYEMDLGLNNVLRK----WTE-PIDNGANLLVAVPGGADGPGGVLVCAEN 253

Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
            I Y +Q      A+      L   + +   S++     A++ +L      + ++ GD+ 
Sbjct: 254 FIIYKNQDHEEVRAVIPRRSDLPGDRGVLIVSYATHKKKAYSFFL------VQSEYGDIY 307

Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
            +T+ Y+G  V  L +   +     + I  +     F  S  G+  L QF          
Sbjct: 308 KVTLAYEGEAVTELKIKYFDTIPPCTSIAVLKTGFLFAASEYGNHALYQFV--------- 358

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
            G  E+  D+E           SSS AL     G +   +          +   +  + D
Sbjct: 359 -GTGEDDEDVE-----------SSSAALVQTEEGFQPVFF--------EPRPLKNLLLID 398

Query: 508 SLVNIGPLKDFS-----------------YGLRINADASATGISKQSNYELVELPGCK-G 549
            + ++ P+ D                   +G R +      G++  +   +  LPG    
Sbjct: 399 EMASLMPITDMKVANLLNEEIPQIYALCGHGPRASLSVLRPGLAV-TELAVSPLPGAPTA 457

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
           +WTV   ++              DE+ A++++S    T+V    + + E  ES
Sbjct: 458 VWTVRRNAT--------------DEFDAFIVVSFANATLVFSIGEEVKETNES 496


>gi|348667612|gb|EGZ07437.1| hypothetical protein PHYSODRAFT_565381 [Phytophthora sojae]
          Length = 1197

 Score = 57.8 bits (138), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 67/298 (22%), Positives = 124/298 (41%), Gaps = 48/298 (16%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            V++ E+     A+   QG LL++ G  + ++     ++         P  +V L    + 
Sbjct: 922  VHTTEIDDIPHAMCEFQGRLLVSVGRALRIYDLGKKKMLRKCENRNFPSILVELKAAGDR 981

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE-FLIDGSTLSLVVSDEQKNIQI 1266
            I   D+H+S +F+ +K+   QL + A D   +  F T   L+D  TL    +D+  N+ +
Sbjct: 982  IYASDMHESFHFVKYKKDENQLVIFADD--CVPRFITSSVLLDYDTL--CGADKFGNVFV 1037

Query: 1267 FYYAPKMSES----------WKG-------QKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
                 ++S+           W          KL   A+FHVG  VT  +          R
Sbjct: 1038 SRLPSEVSDEIDNPTANRILWDSGLLNGAPNKLEQVAQFHVGDVVTSMV----------R 1087

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1366
            T   PG  +    A+++ T+ G IG + P     ++ F     L+  +    P + G + 
Sbjct: 1088 TSLVPGGIE----AIIYATIMGRIGALIPFTSRQDVDF--YTHLEMYMRQEQPPLCGRDH 1141

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             S+R ++   K       ++ D +L   +  L +E+Q  +A     T +++L  L D+
Sbjct: 1142 LSYRSYYIPVK-------NVTDGDLCEQFSSLSVEKQASVAEDLDRTPAEVLKKLEDI 1192



 Score = 50.8 bits (120), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 68/301 (22%), Positives = 116/301 (38%), Gaps = 76/301 (25%)

Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV- 376
           VLV+G NT+ Y ++     +CA+            Q  PR    V    + AT  Q D+ 
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAIP---------RRQGEPRDIVIV----SAATHKQRDLF 293

Query: 377 -ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
             LL ++ GDL  +++ Y G  V+ + +   +   + S +      L F  S   +  L 
Sbjct: 294 FVLLQSELGDLYKISLDYSGNAVEEIKIQFFDTVPVASSMCITKTGLLFCASEFSNHYLF 353

Query: 436 QF-TCGSGT------------SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
           QF + G G             + LS+    +  +++  A S   L   +   + D+ N +
Sbjct: 354 QFLSIGEGDDTAKCSSLAMDPTELSTFPLRKLTNLQL-ASSMPSLSPVTQLLVDDLANEQ 412

Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
              +Y    N+  S+                 L+   +GL I   A++            
Sbjct: 413 TPQMYALCGNSNRSS-----------------LRVLRHGLPITEMAASA----------- 444

Query: 543 ELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
            LPG  K +W +                +Y D Y  Y+++S E  T+VLE  + + EVT+
Sbjct: 445 -LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETVEEVTQ 489

Query: 602 S 602
           S
Sbjct: 490 S 490


>gi|443918546|gb|ELU38987.1| CPSF A subunit region domain-containing protein [Rhizoctonia solani
            AG-1 IA]
          Length = 1037

 Score = 57.8 bits (138), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 79/348 (22%), Positives = 146/348 (41%), Gaps = 53/348 (15%)

Query: 1096 VVTLFNTTTKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1154
            +VT     T    + +  GTA +  GE+    GR++LF       + +N++    SK+++
Sbjct: 704  MVTSIGVYTHGGNSYILAGTAIINPGENEPLAGRIILF-----GQDEENMIKFKASKDVE 758

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTEL---NGIAFYDAPPLYVVSLNIVK-NFILL 1210
            G +S++  L   ++ A G  I L+     E+   + +A ++    Y+V   IV+ N I++
Sbjct: 759  GGVSSIKQLGARIIAAIGHGIYLYNLGRGEVTISDPVARWERG--YIVHDIIVRPNMIVV 816

Query: 1211 GDIHKSIYFLSWKEQGA-----------------QLNLLAKDFGSLDCFATEFLIDGSTL 1253
             D  +S+  L + E+ +                 Q   +A D  ++   + E L D  T 
Sbjct: 817  SDRLRSVSVLRFIERTSTPESHEEIETEEDSTILQFETVAMDMHAVWPTSVEVLPDNKT- 875

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
             ++ S    NI        ++   +   L  RA FH G  + KF+       S+ ++ A 
Sbjct: 876  -IIASQTDGNI--------LTWELEDGNLEPRAAFHTGEIIHKFI------ASTAKSSAG 920

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
            P      R   +F T  G IG ++ +D+    +L  L+ KL D++  +  +    +R   
Sbjct: 921  P------RTVAIFVTNTGRIGTLSTVDDADALQLTRLEMKLGDAIKGLGNIKHPEWRAPK 974

Query: 1374 SNGKAHRPGP-DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
                  +P P   + D + +  +  L  EE   I   +G+    I SN
Sbjct: 975  LLHTGTKPPPRRGVTDGDFIKKFLELSSEEAKRIL-SSGSAAETIGSN 1021


>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
          Length = 1116

 Score = 57.4 bits (137), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 69/309 (22%), Positives = 126/309 (40%), Gaps = 30/309 (9%)

Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            +E    + +GT Y    D    GR+L+F   +  D+   L+ E     ++G+I  + +  
Sbjct: 810  EEQSEYIVVGTTY-HCHDRKECGRILVF---KMIDSRLILLDET---TVRGSIFCMIAFN 862

Query: 1165 GHLLIASGPKIILHKWTGT----ELNGIAFY--DAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            G LL+A    +  + W+G     +L G   Y      LY+       +F+L+GD+ KS+ 
Sbjct: 863  GQLLVAINKSVHRYTWSGDSSSGKLTGEEIYGGHTASLYLAGRG---DFVLVGDMMKSMA 919

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L  +  G  +  L++           F+ D + L    SD   N+ +     + +    
Sbjct: 920  LL--QASGKDVKELSRSSQPFWLTGLTFIDDDTYLG---SDNSYNLILMKKNTETANEVD 974

Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
             Q L +    H G  + +F     LAT +D     P S       ++F T+ G IG I+ 
Sbjct: 975  SQLLDNIGHIHTGEFINRF-HHGTLATLTDVDSPKPNS-------IIFATISGCIGVIST 1026

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + +  +     LQ  L   +  + G +   +R F  N         + +D +L+  +  L
Sbjct: 1027 ISKQDYDFFSKLQVGLNRVIRGIGGFSHDRWRSFQ-NEHISNIESRNFIDGDLVEQFLHL 1085

Query: 1399 PLEEQLEIA 1407
              ++ LE+ 
Sbjct: 1086 RHDKMLEVT 1094



 Score = 42.7 bits (99), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 44/196 (22%), Positives = 77/196 (39%), Gaps = 41/196 (20%)

Query: 504 AVRDSLVNIGPLKDF----------------SYGLR---INADASATGISKQSNYELVEL 544
            V D+  N+GP+ DF                S G +   +    +  GI++Q++   ++L
Sbjct: 335 TVLDTFANLGPIPDFCLVDIEKQGQNQIVACSGGFKEGSLRVIRNGIGITEQAS---IDL 391

Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           PG K IW++   S R                  YLI+S  + T VLE      E TE   
Sbjct: 392 PGIKAIWSLARGSDR------------------YLILSFISSTKVLEFQGEDIEETEIAG 433

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
           + +Q  T+  GN+   ++++Q+   G  ++D         + PS+      S     + +
Sbjct: 434 FDLQSPTLYCGNV-ADKQILQISTSGIYLVDHETNLNYDVWKPSSGSINLASHQGNQILI 492

Query: 665 SIADPYVLLGMSDGSI 680
           S     +   + D  I
Sbjct: 493 SFGKTLIYFEIKDQKI 508


>gi|169848339|ref|XP_001830877.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
 gi|116508046|gb|EAU90941.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
          Length = 1213

 Score = 57.4 bits (137), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 102/484 (21%), Positives = 199/484 (41%), Gaps = 73/484 (15%)

Query: 977  ILKICQLPS-GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1035
            +L+I  +P  GS          +PL  TP ++    E N + LI S   +       + L
Sbjct: 770  VLRIFTIPKLGSKLKQ----DTLPLSYTPRKLITHPENNYFYLIESDHRVYSEEATKAKL 825

Query: 1036 ID-QEVGHQIDNH--NLSSVDLHRTYTVE---EYEVRILEPDRAGGPWQTRATIPMQSSE 1089
             + Q+ G +ID    +L   +  R           +RI++P       +T A  P+ ++E
Sbjct: 826  DELQKKGKKIDEEIISLPPSEFGRPKAPAGTWASNIRIIDPVEN----KTVAVFPLDNNE 881

Query: 1090 NALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVT 1146
             A ++ +V     + +  E  L +GTA    +D     R        T +  +N   L  
Sbjct: 882  AAFSIAIVPF---SARNGELHLVVGTA----KDTTVSPRTCESGFLRTYKFTENGTGLEL 934

Query: 1147 EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
             ++  E      AL + QG L    G  + ++     +L       +    +V+L    +
Sbjct: 935  -LHKTETDDVPMALLAFQGRLAAGVGKALRIYDIGKKKLLRKVENKSFTTAIVTLTTQGS 993

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAK-------------DFGSL---DCFATEFL--I 1248
             IL+GD+ +S+ ++ +K+   +L   A              D+ ++   D F   F+  +
Sbjct: 994  RILVGDMQESVQYVVYKQPENRLLTFADDTQPRWVTAITMVDYNTIVAGDRFGNIFVNRL 1053

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            D S +S  V ++     I +  P +  +    K++  A FHVG  +T   ++ ++A    
Sbjct: 1054 D-SKVSDQVDEDPTGAGILHEKPILMGAPHKTKMI--AHFHVGDIITSLHKVSLVA---- 1106

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
                        R  +++  L G+IG + P    +++ F  + +L++ +    P + G +
Sbjct: 1107 ----------GGREVIVYTGLHGTIGILMPFISKEDVDF--ISTLEQHMRTEQPSLVGRD 1154

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
              ++R ++   KA       +VD +L   Y  LP  +Q  IA++   T  ++L  L  + 
Sbjct: 1155 QLAYRGYYVPVKA-------VVDGDLCETYAHLPASKQSSIANELDRTVGEVLKKLEQMR 1207

Query: 1426 LGTS 1429
            + +S
Sbjct: 1208 VTSS 1211


>gi|409045147|gb|EKM54628.1| hypothetical protein PHACADRAFT_210427 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1213

 Score = 57.4 bits (137), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 92/391 (23%), Positives = 165/391 (42%), Gaps = 66/391 (16%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RI+ P  A    QT   IP+ ++E A ++ VV     T +  E  L +GTA  Q   +A
Sbjct: 861  IRIVSPVDA----QTVNFIPLDNNEAAFSIAVVPF---TARGGELTLVVGTA--QDTFLA 911

Query: 1125 ARGRVLLF-STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
             R     F  T R  D+ ++ +  ++  E      A+ + QG L    G  + ++     
Sbjct: 912  PRSCTSGFLRTYRFLDDGRD-IELLHKTETNDVPLAIMAFQGRLAAGIGKALRIYDIGKK 970

Query: 1184 EL----NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1239
            +L        F +A    +V+LN   + I++GD+ +SI++  +K    +L + A D    
Sbjct: 971  KLLRKVESKNFSNA----IVTLNTQGSRIIVGDMQESIFYAVYKPPENRLLIFADDAQPR 1026

Query: 1240 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW------------KG------QK 1281
               A   +ID +T++    D   N+ +    PK+S+              KG       K
Sbjct: 1027 WITAVT-MIDYNTVA--AGDRFGNVFVNRLDPKISDQVDDDPTGAGILHEKGILMGAPHK 1083

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
                A FHVG  VT   ++ ++A                R  LL+  L G++G + P   
Sbjct: 1084 TAMIAHFHVGDIVTSIHKVSLVA--------------GGRELLLYTGLHGTVGILVPFVS 1129

Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
             +++ F  + +L++ +      + G +  ++R ++   KA       +VD +L   +  L
Sbjct: 1130 KEDVDF--ISTLEQHMRTEQLSLVGRDHLAWRGYYVPVKA-------VVDGDLCEMFARL 1180

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            P  +Q  IA +   T  ++L  L  L +  S
Sbjct: 1181 PASKQSSIATELDRTVGEVLKKLEQLRVTAS 1211


>gi|432089478|gb|ELK23419.1| DNA damage-binding protein 1 [Myotis davidii]
          Length = 1047

 Score = 57.4 bits (137), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 75/155 (48%), Gaps = 17/155 (10%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
             L++K        +A+DF      A E L D + L
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFL 966


>gi|389740093|gb|EIM81285.1| hypothetical protein STEHIDRAFT_86633 [Stereum hirsutum FP-91666 SS1]
          Length = 1213

 Score = 57.4 bits (137), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 91/389 (23%), Positives = 167/389 (42%), Gaps = 62/389 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RI++P       ++ A IP+ ++E A ++ +V     + + NE  L +GTA  Q   +A
Sbjct: 861  IRIVDPAEG----KSVAEIPIDNNEAAFSLAIVPF---SVRNNEYHLVVGTA--QDTFLA 911

Query: 1125 ARGRVLLF-STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
             R     F  T +  D+   L   ++  E      +L + QG L+   G  + ++     
Sbjct: 912  PRSCTSGFLRTYKFVDDGAGLEL-LHKTETDDIPMSLLAFQGRLVAGIGKALRIYDIGKK 970

Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDC 1241
            +L   A        ++SLN   + I++GD+ +SI +  +K    +L + A D  +  + C
Sbjct: 971  KLLRKAESKTFASAIISLNTQGSRIIVGDMQESIAYAVYKAPENKLLVFADDTQARWVTC 1030

Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW------------KG------QKLL 1283
                 ++D +T++    D   NI I     K+S+              KG       K  
Sbjct: 1031 ---STMVDYTTVA--AGDRFGNIFINRLDSKVSDQVDDDPTGAGILHEKGILMGAPHKTA 1085

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
              A FHVG  VT   ++ ++A                R  LL+  L G+IG + PL   +
Sbjct: 1086 MLAHFHVGDLVTSIHKVSLVA--------------GGREVLLYTGLHGTIGMLVPLVSKE 1131

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            ++ F  + +L++ +      + G +  ++R ++   KA       +VD +L   +  LP 
Sbjct: 1132 DVDF--ISTLEQHIRTEQTSLVGRDHLAWRGYYVPVKA-------VVDGDLCETFARLPA 1182

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
             +Q  IA +   T S++L  L+ L +  S
Sbjct: 1183 AKQSMIAGELDRTVSEVLKKLDQLRVTAS 1211


>gi|402223178|gb|EJU03243.1| hypothetical protein DACRYDRAFT_115454 [Dacryopinax sp. DJM-731 SS1]
          Length = 1175

 Score = 57.4 bits (137), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 110/440 (25%), Positives = 180/440 (40%), Gaps = 80/440 (18%)

Query: 926  FFLSGSRPCWCMVFRERLRVHP-QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
             F  G RP    +  +RL   P +L D  I A +VLH       FI+ ++  +L I Q+ 
Sbjct: 718  IFACGDRPALLFLKNDRLTASPIKLRD--IHAGSVLHIPQFPSSFIFASASTLL-IGQIR 774

Query: 985  SGSTYDNYWPVQKIPLKA-TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQ 1043
                 D    V+ I L   TP ++TY      Y ++      K LN+      D+E+   
Sbjct: 775  ESQKID----VRTISLGLDTPIRLTYHRGLRAYGVVCQ---RKELNRE----DDREIYS- 822

Query: 1044 IDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTT 1103
                  SS  L    T E        PD                      + V T+ ++T
Sbjct: 823  ------SSFKLFDDITFEYLNNFTARPDEQ-------------------MMCVTTIPDST 857

Query: 1104 TKENETLLAIGTAYVQG-EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
             +E+   L +GT    G E+  ++GR+L+F      + P   +  V S ++ G + A+ +
Sbjct: 858  GEEDSDFLVVGTYEATGAEEDVSKGRILIFE-----EVPNRKLKLVVSHDVGGCVYAVTN 912

Query: 1163 LQGHLLIASGPKI---ILHK-WTGTELNGIAFYDAPPLYVVSLNIVK-NFILLGDIHKSI 1217
            +  +L  A    +    LH+      +  +A + +   YV S  I + N +L+GD  +++
Sbjct: 913  VGANLAAAINGTLQVFSLHRSHDDIRIESVAKWSSA--YVASSLICRGNTLLVGDAMRAV 970

Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
              L W   GA+L  L  D+ SL     E + +G     V+  E  N  +         +W
Sbjct: 971  CILRWT--GAKLETLYHDYASLWIQTLESIDEGG----VIGAELNNNIV---------TW 1015

Query: 1278 KGQKLLSR-AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIG 1334
            +    L R   ++ G  + +F R  + A       AAPG+   N     L+F T  G IG
Sbjct: 1016 RKDGKLERDGMWYFGEGINRFRRASLNA-------AAPGAGGNNAGRGNLVFCTNTGRIG 1068

Query: 1335 CIAPLDELTFRRLQSLQKKL 1354
             +A LDE    +L +LQ+ +
Sbjct: 1069 IVASLDEDLSMQLSNLQRNI 1088


>gi|398019848|ref|XP_003863088.1| CPSF-domain protein, putative [Leishmania donovani]
 gi|322501319|emb|CBZ36398.1| CPSF-domain protein, putative [Leishmania donovani]
          Length = 1347

 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 70/308 (22%), Positives = 123/308 (39%), Gaps = 37/308 (12%)

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1160
              + +E + LL IG+++   ++  AR   + + T R     Q L   + SK++ GA+   
Sbjct: 913  GVSEEEWQHLLLIGSSFTFPDEQRARSGRITWCTLREERQRQRL-HLIASKDIGGALQCC 971

Query: 1161 ASL---QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFI 1208
            A++   +G + +     + L++W   +           G+      PLY  SL    + +
Sbjct: 972  AAVPHYKGRIALGVNGCVCLYQWNTEDQTFVAEERCRVGLTVTKLIPLYHTSL--AASVL 1029

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            +  D+  S +F+        L +L +D             D   L L   D+  N     
Sbjct: 1030 VALDVRHSAFFIEVDTLQGNLKVLCRDADLRGIMDGHVGSDAENLCLF--DDSLNFTALK 1087

Query: 1269 YAPKMSESWKGQ-----------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
              P   E+  G            +   RA+ H+G  VT  +R    A +S     A  S 
Sbjct: 1088 VVPLPVEARDGDAAAAARATAQYRFEVRAQCHLGDLVT-CVRQGSFAATSLMEAPAHCSS 1146

Query: 1318 KTNRF--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
              N+          L+F T  G  G + P+   T+  L++L+  LV +VP V GL+ ++F
Sbjct: 1147 AQNQLLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRALEASLVRTVPPVGGLSHQAF 1206

Query: 1370 RQFHSNGK 1377
            R+    G+
Sbjct: 1207 REVLRAGQ 1214


>gi|219110831|ref|XP_002177167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411702|gb|EEC51630.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1303

 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 67/289 (23%), Positives = 126/289 (43%), Gaps = 47/289 (16%)

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
            G + +LA  QG LL+  G  + L++    +L   +     P +V ++  V     +GD+ 
Sbjct: 1036 GPVLSLAHFQGRLLVGIGTTLRLYEMGKRQLLRKSELRNFPTFVKTVQTVGERAYIGDMM 1095

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +SI  + +     +L L+A D         E L+D +T++  V D+  NI +    P+ +
Sbjct: 1096 QSIQIVRYDVSANRLVLIANDASPRPIVCQE-LLDWNTVA--VGDKFGNISVMRL-PRGA 1151

Query: 1275 ES----WKGQKLL---SR----------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
            ++      GQ+ L   SR           +++VG  VT   R  ++A  ++         
Sbjct: 1152 DTSAIDVTGQRALWDSSREDMIPKLELLCQYYVGEVVTSMTRSSLVAGGAE--------- 1202

Query: 1318 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
                 +L++ T+ G IG   P    +++ F     L+ +L        G +P+S+R +++
Sbjct: 1203 -----SLIYVTVSGRIGAFVPFTNRNDVDF--YSQLESELRGDASRPTGRDPQSYRSYYA 1255

Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
                H      +VD +L   +  L  E+Q +IA +   T  +I+  L D
Sbjct: 1256 P-MMH------VVDGDLCDAFNSLGPEKQNKIAEKLDRTVGEIMKKLED 1297


>gi|146094112|ref|XP_001467167.1| putative CPSF-domain protein [Leishmania infantum JPCM5]
 gi|134071531|emb|CAM70220.1| putative CPSF-domain protein [Leishmania infantum JPCM5]
          Length = 1347

 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 70/304 (23%), Positives = 122/304 (40%), Gaps = 37/304 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL- 1163
            +E + LL IG+++   ++  AR   + + T R     Q L   + SK++ GA+   A++ 
Sbjct: 917  EEWQHLLLIGSSFTFPDEQRARSGRITWCTLREERQRQRL-HLIASKDIGGALQCCAAVP 975

Query: 1164 --QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFILLGD 1212
              +G + +     + L++W   +           G+      PLY  SL    + ++  D
Sbjct: 976  HYKGRIALGVNGCVCLYQWNTEDQTFVAEERCRVGLTVTKLIPLYHTSL--AASVLVALD 1033

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            +  S +F+        L +L +D             D   L L   D+  N       P 
Sbjct: 1034 VRHSAFFIEVDTLQGNLKVLCRDADLRGIMDGHVGSDAENLCLF--DDSLNFTALKVVPL 1091

Query: 1273 MSESWKGQ-----------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1321
              E+  G            +   RA+ H+G  VT  +R    A +S     A  S   N+
Sbjct: 1092 PVEARDGDAAAAARATAQYRFEVRAQCHLGDLVT-CVRQGSFAATSLMEAPAHCSSAQNQ 1150

Query: 1322 F--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
                      L+F T  G  G + P+   T+  L++L+  LV +VP V GL+ ++FR+  
Sbjct: 1151 LLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRALEASLVRTVPPVGGLSHQAFREVL 1210

Query: 1374 SNGK 1377
              G+
Sbjct: 1211 RAGQ 1214


>gi|156084934|ref|XP_001609950.1| splicing factor 3b, subunit 3, 130kD [Babesia bovis T2Bo]
 gi|154797202|gb|EDO06382.1| splicing factor 3b, subunit 3, 130kD, putative [Babesia bovis]
          Length = 1169

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 105/489 (21%), Positives = 191/489 (39%), Gaps = 87/489 (17%)

Query: 965  CNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
            CN G++ ++     I + C+L  G T+  +    ++PL  TP ++     +         
Sbjct: 734  CNDGYVAISGSNLRIFRCCRL--GETFSEH----RLPLDYTPRKLVMMPNE--------A 779

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT 1082
            P +  LN +++++      +  +N    S  L             ++ D   G     A 
Sbjct: 780  PNVGGLNYMVAVVESDHNAYGPENVAEISKALGD-----------IKLDNEVGDLLPLAN 828

Query: 1083 IPMQSSENALTVRVVTLFNTTT------KENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
                +   A  VR+V   N TT      + NE   A     + G      G  + +   +
Sbjct: 829  YKAGTGRWASCVRIVNPLNLTTAAKLLFETNEAATAAAVVVLDGMQCLCIGTTVGYDL-K 887

Query: 1137 NADNPQNLVTEVYS------------KELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
            N D+ ++ +  VY               + G + A    +G LL + G +I L+     +
Sbjct: 888  NTDDVESYI-RVYCYGANFEIRLLHVTRVGGVVRAFTGYEGRLLASVGKRIRLYALGKKQ 946

Query: 1185 LNGIAFYDAPPLY-VVSLNIVKNFILLGDIHKSIYFLSWK---EQGAQLNLLAKDFGSLD 1240
            L   A +     +  + LN V + I  GDI + I  L  K   E+ A+   +    G   
Sbjct: 947  LLLKAEHRTCSDHGFIWLNAVGSRIFAGDIREGIQILRIKFYSEEAAEFEWVGGATGP-- 1004

Query: 1241 CFATEFLIDGSTL--SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1298
                 +L   + L  S V++ ++ +  IF       ES +  +L +  +FH+G       
Sbjct: 1005 ----RWLTSCAQLDYSTVIAGDKFD-SIFVTRVPQEESTRHIQLENVCQFHLGD------ 1053

Query: 1299 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLV 1355
                L T+ D+   +  +       +L+GT+ GSIG + P    DEL F  LQ L+  + 
Sbjct: 1054 ----LPTAMDKAALSQSTH-----VVLYGTVMGSIGALVPFQSKDELDF--LQHLEMLMA 1102

Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
               P + G     +R ++   +        +VD +L   +  L   +Q ++A Q  TT +
Sbjct: 1103 TEAPPLCGREHSFYRSYYVPVQ-------QVVDGDLCEQFRHLTEAQQRKVAQQLDTTVN 1155

Query: 1416 QILSNLNDL 1424
             +L  L+D+
Sbjct: 1156 NVLRKLDDI 1164


>gi|328770812|gb|EGF80853.1| hypothetical protein BATDEDRAFT_29900 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1213

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 65/295 (22%), Positives = 127/295 (43%), Gaps = 29/295 (9%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            ++   +KG    + S QG LL+  G  + ++     ++         P  +V+L+   N 
Sbjct: 936  LHKTPIKGIPKVMCSFQGRLLVGVGSLLRIYDLGKKKMLRKCECKGFPTTIVTLHTQGNR 995

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
            I+LGD  +S+++  ++    ++ + A D       AT  ++D  T  +V  D+  NI + 
Sbjct: 996  IILGDAQESVHYAMYRAFDNRIVIFADDTIPRWVTAT-CMVDYDT--VVGGDKMGNIFVN 1052

Query: 1268 YYAPKMS-----ESWKGQKLLSRAEF----HVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1318
              + ++S     ++   Q +  R       H   H   F   + L TS  +T   PG   
Sbjct: 1053 RLSAEVSKGIDEDTTGNQAIFDRGYLQGAPHKVHHEADFFLGETL-TSLTKTSLVPG--- 1108

Query: 1319 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
              R  LL+ TL G IG + P    D++ F   Q+L+  +    P + G +  ++R F++ 
Sbjct: 1109 -GREILLYTTLMGGIGLLIPFISKDDVDF--FQTLEMTMRSECPPLCGRDHLAYRSFYTP 1165

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
              A       I+D +L   + ++  +++  IA     + + +   L D+    +F
Sbjct: 1166 VHA-------IIDGDLCEMFNVMVGDKKRGIAESVDRSVADVGKKLEDMRTRVAF 1213


>gi|393217872|gb|EJD03361.1| hypothetical protein FOMMEDRAFT_108572 [Fomitiporia mediterranea
            MF3/22]
          Length = 1213

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 114/512 (22%), Positives = 194/512 (37%), Gaps = 106/512 (20%)

Query: 965  CNHGFIYVTSQGILKICQLPSGSTY--DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
            C  G I + S  +L+I Q+P   T    +  P+   P K  PH                 
Sbjct: 759  CPEGLIGI-SGSVLRIFQIPRLGTKLKQDSMPLTYTPRKFIPH----------------- 800

Query: 1023 PVLKPLNQVLSLLIDQEVGHQI---DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT 1079
                P+NQ   ++   E  H++   D       +L +     + EV  L P+  G P   
Sbjct: 801  ----PMNQYFYMI---EADHRVMGDDAAKEKLAELRQRGVKYDQEVVDLPPEVFGRPKAP 853

Query: 1080 RAT----IPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQGEDVA 1124
              T    I +    N  TV+VV L N              + +E  L +GTA      +A
Sbjct: 854  AGTWGSCIRILDPINKATVKVVHLDNNEAAFSIAIVPFAARNSELFLCVGTA--SSTFLA 911

Query: 1125 ARG------RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
             R       R   F+ G  AD     +  V+  E      AL + QG L    G  + ++
Sbjct: 912  PRSCSSGFIRTYAFTNG-GAD-----LELVHKTEADDVPMALMAFQGRLCAGVGKSLRIY 965

Query: 1179 KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
            +    +L            +V+LN   + I++GD+ +SI +  +K    +L + A D   
Sbjct: 966  EIGKKKLLRKVETKTYGSAIVTLNTQGSRIIVGDMQESIVYAVFKPPENRLLIFADD-SQ 1024

Query: 1239 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSR--------- 1285
                 +  ++D +T++    D+  N+ I     K+S    E   G  +L           
Sbjct: 1025 PRWTTSAVMVDYTTIA--AGDKFGNVFINRLDSKISDQVDEDPTGAGILHEKGLLMGAPH 1082

Query: 1286 -----AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL- 1339
                 A FHVG  VT   ++ ++A                R  LL+  L G+IG + P  
Sbjct: 1083 KTGMIAHFHVGDIVTSIHKISLVA--------------GGREVLLYTCLHGTIGILVPFV 1128

Query: 1340 --DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
              +++ F  + +L++ +      + G +  ++R ++   KA       +VD +L   +  
Sbjct: 1129 SKEDVDF--ISTLEQHMRSEKLSLVGRDHLAWRGYYVPVKA-------VVDGDLCEQFAR 1179

Query: 1398 LPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
            LP  +Q  IA +   T  ++L  L  L +  S
Sbjct: 1180 LPANKQSAIAVELDRTVGEVLKKLEQLRVTAS 1211


>gi|393243160|gb|EJD50676.1| hypothetical protein AURDEDRAFT_112250 [Auricularia delicata
            TFB-10046 SS5]
          Length = 1140

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 67/281 (23%), Positives = 124/281 (44%), Gaps = 33/281 (11%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQN 1143
            MQ  +N     V +L        E +  +GTAY++  ++  +RGR+L+F +  ++    +
Sbjct: 797  MQLDDNEEITSVASLPIMPESRTE-MFVVGTAYIKDSEMEPSRGRILVFGSLEDSGTGGS 855

Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL------- 1196
             +T     ++ GA+ +L S+ G ++      +IL++     L+         L       
Sbjct: 856  WLTAFL--QVTGAVLSLTSVDGLIVAGVNTAVILYELRRNTLSEAERASHLTLRQKKEWN 913

Query: 1197 --YVV-SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
              YVV SL    + I +GD   SI  L WK +   L+ +A+ FG +   A + +  G   
Sbjct: 914  HNYVVTSLAARGDTIYIGDSVASIAILRWKHE--TLHTIARHFGPIFPLALDVMSSG--- 968

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
            S++ ++   N+  F+      ES   +KL     +H+G  V KF+  ++         +A
Sbjct: 969  SVITANIDYNLHTFH-----QESPTDRKLEIDGSYHLGDQVNKFIPGRL---------SA 1014

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
            P    +     +F T  G IG +A  D+     L +L++ +
Sbjct: 1015 PTVGASIVLEQVFVTSLGRIGIVAEADKDASWALSALERNI 1055


>gi|16197726|emb|CAC94909.1| damaged-DNA recognition protein 1 [Mus musculus]
          Length = 994

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 75/155 (48%), Gaps = 17/155 (10%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
             L++K        +A+DF      A E L D + L
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFL 966


>gi|70992271|ref|XP_750984.1| UV-damaged DNA binding protein [Aspergillus fumigatus Af293]
 gi|66848617|gb|EAL88946.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
            Af293]
 gi|159124553|gb|EDP49671.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
            A1163]
          Length = 1140

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 129/314 (41%), Gaps = 40/314 (12%)

Query: 1113 IGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTAY+  E D + RGR+L+F      DN + L T+V    +KGA  ALA L   ++ A 
Sbjct: 834  VGTAYLDDEGDESIRGRILIF----EVDNGRKL-TQVAELPVKGACRALAMLGDKIVAAL 888

Query: 1172 GPKIILHK-----WTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
               ++++K     +    L  +A Y    AP    V + +  N I + D+ KS+  + +K
Sbjct: 889  VKTVVVYKVINNNFGAMRLEKLASYRTSTAP----VDVTVTGNLIAVSDLMKSMCLVEYK 944

Query: 1224 E----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
            E        +  +A+ F ++       +   + L    SD + N+ + +      E    
Sbjct: 945  EGENGTPDTMTEVARHFQTVWATGVANIAPDTFLE---SDAEGNLIVLHRNTTGVEEDDK 1001

Query: 1280 QKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1337
            ++L    E  +G  V +   + +Q LA+        P +          GT++GSI   A
Sbjct: 1002 RRLEVTGEISLGEMVNRIRPVNIQQLAS----VAVTPRA--------FLGTVEGSIYLFA 1049

Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
             ++      L  LQ  +   V  V  +    FR F S  +  +  P   VD EL+  +  
Sbjct: 1050 IINPDHQDFLMRLQATIAGKVELVGNIPFNEFRGFRSMVREAKE-PYRFVDGELIERFLT 1108

Query: 1398 LPLEEQLEIAHQTG 1411
                 Q EI    G
Sbjct: 1109 CEPSLQEEIVSTVG 1122


>gi|392593521|gb|EIW82846.1| hypothetical protein CONPUDRAFT_81012 [Coniophora puteana RWD-64-598
            SS2]
          Length = 1213

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 85/385 (22%), Positives = 166/385 (43%), Gaps = 58/385 (15%)

Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR 1126
            I++P  A    +T  T+ + ++E+A +V VV       ++NE  L +GTA      +  R
Sbjct: 863  IIDPIEA----RTIHTVELDNNESAFSVAVVPF---AARDNELHLVVGTA--ADTLLTPR 913

Query: 1127 G-RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1185
              R     T R  D  ++L   ++  E      A+ + QG L+   G  + L++    +L
Sbjct: 914  SCRSGYLRTYRFTDEGRSLEL-LHKTETDDVPLAVMAFQGRLIAGVGKSLRLYEIGKKKL 972

Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
               A   +    +V+LN   + I++GD+ +S++F ++K    +L + A D       A  
Sbjct: 973  LRKAENKSFASAIVTLNTQGSRIIVGDMQESVHFAAYKAPENRLLIFADDMQPRWVTALT 1032

Query: 1246 FLIDGSTLSL------------------VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
             ++D +T+++                   V D+     I +   ++S +    KLL    
Sbjct: 1033 -MVDYTTIAVGDRFGNVFINRLDMRVSDQVDDDPTGAGILHEKGQLSGAPHKTKLL--CH 1089

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTF 1344
            FHVG  +T   ++ ++A                R  LL+  + G+IG + P    +++ F
Sbjct: 1090 FHVGDLITSIHKVSLVA--------------GGREVLLYTGIHGTIGILVPFVSKEDVDF 1135

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
              + +L++ +      + G +  S+R +++  KA       +VD +L   +  L   +Q 
Sbjct: 1136 --ISTLEQHMRSEQSSLVGRDQLSWRGYYTPVKA-------VVDGDLCEAFARLTGSKQS 1186

Query: 1405 EIAHQTGTTRSQILSNLNDLALGTS 1429
             IA +   T  ++L  L  L + +S
Sbjct: 1187 AIAGELDRTVGEVLKKLEQLRVTSS 1211



 Score = 46.2 bits (108), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 87/384 (22%), Positives = 146/384 (38%), Gaps = 97/384 (25%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ +D   V+ L +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIDHDEDEVKSLKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
                       S TS  S G+ E F  +     +  R R   + AL D +   +  L  
Sbjct: 368 QKLGDDDDEPEFSSTSFPSFGMAESFIPLPH---AHFRPRGLDNLALADEIESLDPILDA 424

Query: 489 SASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
              N   N+++ Q  F+   R S      L+   +GL +    S+            ELP
Sbjct: 425 KVMNILPNSDTPQ-IFTACGRGSRSTFRMLR---HGLEVEESVSS------------ELP 468

Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           G    +WT                   DD Y +Y+I+S    T+VL   + + EV ++  
Sbjct: 469 GIPNAVWTTKRTE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT-G 513

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGA------------RILDGS--------------- 637
           +     T+A   + G   ++QV  +G             R+  G                
Sbjct: 514 FLSSAPTLAVQQI-GSDALLQVHPQGIRHVLSDRRVNEWRVPQGKTIVCATTNKRQVVVA 572

Query: 638 -------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
                  Y   DL  G  N      +  STVL++S+ +        PY+ +G  D ++R+
Sbjct: 573 LSSAELVYFELDLD-GQLNEYQDWKAMGSTVLALSVGEVPEGRQRTPYLAVGCEDQTVRI 631

Query: 683 LVGDPSTC--TVSVQT----PAAI 700
           +  DP +   T+S+Q     P+AI
Sbjct: 632 ISLDPESTLETISLQALTAPPSAI 655


>gi|358338734|dbj|GAA31211.2| DNA damage-binding protein 1, partial [Clonorchis sinensis]
          Length = 1515

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 111/266 (41%), Gaps = 37/266 (13%)

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD--EDTFGSGGGFSARIESSHVINLRD 240
           V VDP   C  V +Y   + I+  +  G  L  D  E    +   ++ RIE  +++    
Sbjct: 101 VLVDPGANCVVVRLYHGLLRIIPLNGIGEKLTTDSLEVNQYAANTYNVRIEEGNIV---- 156

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
                  D  F+HGY  P   +++E EL          H            L+   L   
Sbjct: 157 -------DMAFLHGYTLPTFAMIYEDELVL--------HMKTYEISGREPALRNVQLTLD 201

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
           ++    D+  L+ VP P GGV++VG N I+YH++       ++ Y     +SQ L  ++ 
Sbjct: 202 SIE--PDSKLLIPVPKPFGGVILVGDNIIYYHTKDGP---HISQYIPQAKASQVLCYAAV 256

Query: 361 SVEL----DAAHATWLQNDVALLSTKTGDLVLLT----VVYDGRVVQ-RLDLSKTNPSVL 411
             +     D A   ++ + +A   T +G+ +L +     V   R+   R++L     +  
Sbjct: 257 DAQRYLLGDMAGRLYMVHLLAEDHTPSGNGLLGSTSSAAVPSARIGSIRIEL--LGETAT 314

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
              I  + N + F+G  LGDS L++ 
Sbjct: 315 PESIAYVDNGVVFIGCTLGDSQLIRL 340


>gi|71004436|ref|XP_756884.1| hypothetical protein UM00737.1 [Ustilago maydis 521]
 gi|74704394|sp|Q4PGM6.1|RSE1_USTMA RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|46095609|gb|EAK80842.1| hypothetical protein UM00737.1 [Ustilago maydis 521]
          Length = 1221

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 88/374 (23%), Positives = 150/374 (40%), Gaps = 54/374 (14%)

Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFST 1134
            QT   + M  +E A ++ VV      + E E +L +G+A     DV    R       +T
Sbjct: 878  QTTHRLEMDDNEAAFSIAVVPF---ASAEKEVMLVVGSAV----DVVLSPRSCKKAYLTT 930

Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
             R  DN + L   ++  E+      L + QG LL   G  + ++     +L       + 
Sbjct: 931  YRLLDNGRELEL-LHKTEVDDIPLVLRAFQGRLLAGIGKALRIYDLGKKKLLRKCENRSF 989

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----------------FGS 1238
            P  VVSL+   + I++GD+ +SI F S+K    +L   A D                  +
Sbjct: 990  PTAVVSLDAQGSRIVVGDMQESIIFASYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAA 1049

Query: 1239 LDCFATEFL--IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
             D F   ++  +DG+T S  V ++   + I +  P +  +     L+  A F VG  +T 
Sbjct: 1050 ADKFGNIYVLRLDGNT-SRSVDEDPTGMTIVHEKPVLMGAAHKASLV--AHFFVGDIITS 1106

Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLV 1355
              R  M+A                R  LL+  L GSIG + P + +     L +L+  L 
Sbjct: 1107 LHRTAMVA--------------GGREVLLYTGLSGSIGALVPFVSKEDVDTLSTLESHLR 1152

Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
                 + G +  ++R  ++  K       S++D +L   + +L   +Q  IA +      
Sbjct: 1153 QENNSIVGRDHLAYRSSYAPVK-------SVIDGDLCETFGLLSPAKQNAIAGELDRKPG 1205

Query: 1416 QILSNLNDLALGTS 1429
            +I   L  L  G +
Sbjct: 1206 EINKKLAQLREGAT 1219


>gi|121699866|ref|XP_001268198.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
            1]
 gi|119396340|gb|EAW06772.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
            1]
          Length = 1140

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 77/317 (24%), Positives = 129/317 (40%), Gaps = 46/317 (14%)

Query: 1113 IGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTA++  E D + RGR+L+F      DN + L T+V    +KGA  ALA L   ++ A 
Sbjct: 834  VGTAFLDDEGDESIRGRILIF----EVDNGRKL-TQVAELPVKGACRALAMLGNRIVAAL 888

Query: 1172 GPKIILHK-----WTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
               ++++K     +   +L  +A Y    AP    V + +  N I + D+ KS+  + +K
Sbjct: 889  VKTVVVYKAVSNNFGAMKLEKLASYRTSTAP----VDVTVTGNLIAVSDLMKSVCLVEYK 944

Query: 1224 EQ----GAQLNLLAKDFGS-----LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            E        L  +A+ F +     + C A +  ++        SD + N+ I        
Sbjct: 945  EGEDGLPDTLTEVARHFQTVWATGVACIAQDTFLE--------SDAEGNLIILCRNTTGV 996

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
            E    ++L    E  +G  V +   + +   +S      P +           T++GSI 
Sbjct: 997  EEDDKRRLEVTGEISLGEMVNRIRPVNIQQLTS--VAVTPRA--------FLATVEGSIY 1046

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
              A ++      L  LQ  +   V  V  +    FR FHS  +  +  P   VD EL+  
Sbjct: 1047 LFAMINPDHQDFLMRLQATIAGKVELVGNMPFNEFRGFHSMVREAQE-PYRFVDGELIER 1105

Query: 1395 YEMLPLEEQLEIAHQTG 1411
            +       Q EI    G
Sbjct: 1106 FLACEPSVQEEIVSIVG 1122



 Score = 41.6 bits (96), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 25/136 (18%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+L++G  +I Y               V  D+++ + R      LD A   
Sbjct: 248 LIPVPAPLGGLLILGETSIKY---------------VDADNNEIISRP-----LDEATIF 287

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFL 425
             W Q D    LL+   G L  L +V D    V+   L     +   S +  +G  + FL
Sbjct: 288 VAWEQVDSQRWLLADDYGRLFFLMLVLDSDNQVESWKLDLLGKTSRASVLVYLGGGVLFL 347

Query: 426 GSRLGDSLLVQFTCGS 441
           GS  GDS +++ + GS
Sbjct: 348 GSHQGDSQVLRISNGS 363


>gi|302654423|ref|XP_003019019.1| hypothetical protein TRV_07032 [Trichophyton verrucosum HKI 0517]
 gi|291182709|gb|EFE38374.1| hypothetical protein TRV_07032 [Trichophyton verrucosum HKI 0517]
          Length = 460

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 79/344 (22%), Positives = 150/344 (43%), Gaps = 44/344 (12%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG----- 1135
            + + ++ +E A+++  V+    T++E+ET L +GT    G+D+    R   F+ G     
Sbjct: 102  SNLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIHIY 152

Query: 1136 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
            R  +  + L   ++  +++    AL   QG LL   GP + ++     +L         P
Sbjct: 153  RFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQITP 211

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
              +V L    + I++ D+ +S+ ++ +K Q   L   A D        T  ++D  T++ 
Sbjct: 212  RVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTT-MVDYETVA- 269

Query: 1256 VVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLATS 1306
               D+  NI +     K SE       G  L+   ++  GA      V  F   Q + TS
Sbjct: 270  -GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIPTS 327

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              +T    G     R  L++  L G++G   P    D++ F   Q+L+ +L    P +AG
Sbjct: 328  IQKTQLVAGG----RDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPLAG 381

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
             +   +R +++  K        ++D +L   + +LP +++  IA
Sbjct: 382  RDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIA 418


>gi|322787057|gb|EFZ13281.1| hypothetical protein SINV_13198 [Solenopsis invicta]
          Length = 986

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 97/222 (43%), Gaps = 43/222 (19%)

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
            E+G +I+ HNL  +D H    +  +                     + ++E AL+     
Sbjct: 778  EIGQEIEVHNLLIIDQHTFEVLHAH--------------------TLMATEYALS----- 812

Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
            L +T   E+ T    +GTA++  ++   + GR+LL+  S G+         T+V  KE+K
Sbjct: 813  LISTRLGEDPTSYFVVGTAFINPDETEPKMGRILLYHWSEGK--------FTQVAEKEIK 864

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            G+  +L    G LL +    + L +WT  +   L    F +   LY   L    +F+L+G
Sbjct: 865  GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 921

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
            D+ +S+  L +K        +A+D+      + E L D + L
Sbjct: 922  DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFL 963



 Score = 53.5 bits (127), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 48/203 (23%), Positives = 92/203 (45%), Gaps = 31/203 (15%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    I+   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFSKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSP+ G +++G  +I YH          N+Y   +    +    +
Sbjct: 208 RQDNVEREAMMVIPVPSPMCGAIIIGQESILYHDG--------NSYVAVVPPIIKQSTIT 259

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV-VQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L +      DG + V+ L +       +   
Sbjct: 260 CYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTLSVKDLKVELLGEISIPEC 313

Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
           IT + N + ++GSRLGDS L++ 
Sbjct: 314 ITYLDNGVIYVGSRLGDSQLIKL 336


>gi|300122534|emb|CBK23104.2| unnamed protein product [Blastocystis hominis]
          Length = 172

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 65/128 (50%), Gaps = 15/128 (11%)

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
            + ++ +A+FH+ + +T  L + +           P     N    +  T +G++G    +
Sbjct: 26   RNVVRQADFHLASQITSILPISL-----------PDGQCIN----VILTAEGAMGVFLFV 70

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
                + +L SLQK+L++++P  A LN  +FR++ S+G    P    ++D  ++  Y ML 
Sbjct: 71   TGEEYTKLSSLQKRLIEALPQNAALNNFNFRKYMSDGMMKYPRRKGVLDMGVIRKYLMLS 130

Query: 1400 LEEQLEIA 1407
             +EQ +IA
Sbjct: 131  TQEQEDIA 138


>gi|302504585|ref|XP_003014251.1| hypothetical protein ARB_07556 [Arthroderma benhamiae CBS 112371]
 gi|291177819|gb|EFE33611.1| hypothetical protein ARB_07556 [Arthroderma benhamiae CBS 112371]
          Length = 460

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 79/344 (22%), Positives = 150/344 (43%), Gaps = 44/344 (12%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG----- 1135
            + + ++ +E A+++  V+    T++E+ET L +GT    G+D+    R   F+ G     
Sbjct: 102  SNLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIHIY 152

Query: 1136 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
            R  +  + L   ++  +++    AL   QG LL   GP + ++     +L         P
Sbjct: 153  RFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQITP 211

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
              +V L    + I++ D+ +S+ ++ +K Q   L   A D        T  ++D  T++ 
Sbjct: 212  RVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTT-MVDYETVA- 269

Query: 1256 VVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLATS 1306
               D+  NI +     K SE       G  L+   ++  GA      V  F   Q + TS
Sbjct: 270  -GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIPTS 327

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              +T    G     R  L++  L G++G   P    D++ F   Q+L+ +L    P +AG
Sbjct: 328  IQKTQLVAGG----RDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPLAG 381

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
             +   +R +++  K        ++D +L   + +LP +++  IA
Sbjct: 382  RDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIA 418


>gi|148709424|gb|EDL41370.1| damage specific DNA binding protein 1 [Mus musculus]
          Length = 968

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 73/151 (48%), Gaps = 17/151 (11%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLID 1249
             L++K        +A+DF      A E L D
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDD 962


>gi|119471789|ref|XP_001258220.1| UV-damaged DNA binding protein, putative [Neosartorya fischeri NRRL
            181]
 gi|119406372|gb|EAW16323.1| UV-damaged DNA binding protein, putative [Neosartorya fischeri NRRL
            181]
          Length = 1140

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 129/314 (41%), Gaps = 40/314 (12%)

Query: 1113 IGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTAY+  E D + RGR+L+F      DN + L T+V    +KGA  ALA L   ++ A 
Sbjct: 834  VGTAYLDDEGDESIRGRILIF----EVDNGRKL-TQVAELPVKGACRALAMLGDKIVAAL 888

Query: 1172 GPKIILHK-----WTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
               +++++     +    L  +A Y    AP    V + +  N I + D+ KS+  + +K
Sbjct: 889  VKTVVVYRVINNNFGAMRLEKLASYRTSTAP----VDVTVTGNLIAVSDLMKSMCLVEYK 944

Query: 1224 E----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
            E        +  +A+ F ++       +   + L    SD + N+ + +      E    
Sbjct: 945  EGENGTPDTMTEVARHFQTVWATGVANIAPDTFLE---SDAEGNLIVLHRNTTGVEEDDK 1001

Query: 1280 QKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1337
            ++L    E  +G  V +   + +Q LA+        P +          GT++GSI   A
Sbjct: 1002 RRLEVTGEISLGEMVNRIRPVNIQQLAS----VAVTPRA--------FLGTVEGSIYLFA 1049

Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
             ++      L  LQ  +   V  V  +    FR F S  +  +  P   VD EL+  +  
Sbjct: 1050 IINPDHQDFLMRLQATIAGKVELVGNMPLNEFRGFRSMVREAKE-PYRFVDGELIERFLT 1108

Query: 1398 LPLEEQLEIAHQTG 1411
                 Q EI    G
Sbjct: 1109 CEPSLQEEIVSTVG 1122


>gi|320037168|gb|EFW19106.1| pre-mRNA-splicing factor rse1 [Coccidioides posadasii str. Silveira]
          Length = 970

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 77/367 (20%), Positives = 162/367 (44%), Gaps = 48/367 (13%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1136
            I ++ +E A++V  V     +++++ET L +GT    G+D+      ++ G + ++    
Sbjct: 633  IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIYRFQE 685

Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
            +       +  ++  +++    AL + QG LL   G  + ++     +L      +  P 
Sbjct: 686  DGKE----LEFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 741

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1254
             +V L    + I++ D+ +S+ ++ +K Q  +L   A D  +    C A   ++D  T++
Sbjct: 742  LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 798

Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1306
                D+  N+ +     K SE       G  L+   ++  GA     L +    Q + TS
Sbjct: 799  --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 856

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              +T    G     R  L++  L G++G + P    +++ F   QSL+ +L    P +AG
Sbjct: 857  IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 910

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
             +   +R +++  K         +D +L   Y  LP +++L IA +   +  +I   ++D
Sbjct: 911  RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISD 963

Query: 1424 LALGTSF 1430
            +    ++
Sbjct: 964  MRTKVAY 970


>gi|300120114|emb|CBK19668.2| unnamed protein product [Blastocystis hominis]
          Length = 1240

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 124/542 (22%), Positives = 208/542 (38%), Gaps = 102/542 (18%)

Query: 927  FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV-------------LHNVNCNHGFIYV- 972
             L+G  PC        L V P LC  +    T+               N +C+ G + V 
Sbjct: 715  ILAGGNPCVLA-----LSVKPWLCYCANNTLTLTSLVSDPLDLAAPFCNEDCSEGIVCVA 769

Query: 973  -TSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV 1031
             T+  I++I  L    T         IPL  TP ++       +YP    + +L+  +  
Sbjct: 770  GTNLNIIRIDDLTQPFT------ATSIPLSYTPRELV------VYPGQPRLLLLETDHNA 817

Query: 1032 LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRI---LEPDRAGG--------PWQTR 1080
             S L  Q    Q   HN+S V+        EY+       EPD+             QT 
Sbjct: 818  YSELEKQSFYQQ---HNVSYVN--------EYDCGAPIPAEPDKWASCIRVVDAISLQTL 866

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG------RVLLFST 1134
              + +  +E A ++ V       +K +E  + IGTA  +   +  R        V  F  
Sbjct: 867  ERLELADNEAAFSMCVCRF---ASKGDEPFVVIGTA--KNLKIHPRSCSQGFISVFRFVE 921

Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
            G +       +  ++  E+    +AL    G L    G  + ++     +L       A 
Sbjct: 922  GHS-------LQLLHRTEVDEVPAALCEFDGKLAAGIGRSVRVYDLGKKKLLRKCENKAM 974

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
            P +V  L  +   +  GD+  ++ F+ +++   QL   A D G         ++D +T+ 
Sbjct: 975  PHFVTKLRAMGERLYAGDLTDNVSFVKFRKGTNQLVEFA-DGGIPRSITALDVLDYNTV- 1032

Query: 1255 LVVSDEQKNIQIFYYAPKM---------SESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
             V  D+  N+ +    PK+         S S     LLS A     A     + L  + T
Sbjct: 1033 -VCGDKGGNLFVERVDPKVDDDIANPTGSRSLWNSGLLSAAPNK--AEQAASIYLGEIVT 1089

Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
            S  +T   PG D+     +L+GT+ G+IG + P+   D+L    L  ++  +    P + 
Sbjct: 1090 SVQKTVLIPGGDEV----VLYGTIFGTIGALLPMPSRDDL--HHLMHIEMYIRKQEPSLV 1143

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
            G +  S+R  ++  K        I+D  L   + MLP  +Q EIA+    + S I+  + 
Sbjct: 1144 GRDILSWRSAYTPMKG-------IIDGNLCETFSMLPQIKQEEIANALVLSVSSIVKKME 1196

Query: 1423 DL 1424
            DL
Sbjct: 1197 DL 1198


>gi|336469942|gb|EGO58104.1| pre-mRNA splicing factor RSE1 [Neurospora tetrasperma FGSC 2508]
          Length = 1192

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/392 (22%), Positives = 168/392 (42%), Gaps = 62/392 (15%)

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQG 1120
            RA G W +  +I    SE    ++ + L N             ++E E+ L +GT    G
Sbjct: 830  RAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEGESFLVVGT----G 885

Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
            +D+    R   F+ G     R  ++ ++L   ++   ++    AL   QG LL   G  +
Sbjct: 886  KDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPLALIPFQGRLLAGVGKTL 942

Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
             ++     +L   A  D  P  +VSL    N I++GD+ + I ++ +K +G +L   A D
Sbjct: 943  RIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGITYVVYKAEGNRLIPFADD 1002

Query: 1236 FGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
              +L+ + T   ++D    S+   D+  NI I     ++S+          +E H+  H 
Sbjct: 1003 --TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPERVSQETDEPG----SEIHL-MHA 1053

Query: 1295 TKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
              +L     RL        Q L TS  +T    G        LL+  L G++G   P   
Sbjct: 1054 RNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVS 1109

Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
             +++ F   Q+L+  +    P +AG +   +R +++  K        ++D +L   + +L
Sbjct: 1110 REDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLL 1160

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            P +++  IA +   +  +I   ++D+   ++F
Sbjct: 1161 PNDKKQMIAGELDRSVREIERKISDIRTRSAF 1192


>gi|164429062|ref|XP_957282.2| pre-mRNA splicing factor RSE1 [Neurospora crassa OR74A]
 gi|157072394|gb|EAA28046.2| pre-mRNA splicing factor RSE1 [Neurospora crassa OR74A]
          Length = 1192

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/392 (22%), Positives = 168/392 (42%), Gaps = 62/392 (15%)

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQG 1120
            RA G W +  +I    SE    ++ + L N             ++E E+ L +GT    G
Sbjct: 830  RAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEGESFLVVGT----G 885

Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
            +D+    R   F+ G     R  ++ ++L   ++   ++    AL   QG LL   G  +
Sbjct: 886  KDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPLALIPFQGRLLAGVGKTL 942

Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
             ++     +L   A  D  P  +VSL    N I++GD+ + I ++ +K +G +L   A D
Sbjct: 943  RIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGITYVVYKAEGNRLIPFADD 1002

Query: 1236 FGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
              +L+ + T   ++D    S+   D+  NI I     ++S+          +E H+  H 
Sbjct: 1003 --TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPERVSQETDEPG----SEIHL-MHA 1053

Query: 1295 TKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
              +L     RL        Q L TS  +T    G        LL+  L G++G   P   
Sbjct: 1054 RNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVS 1109

Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
             +++ F   Q+L+  +    P +AG +   +R +++  K        ++D +L   + +L
Sbjct: 1110 REDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLL 1160

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            P +++  IA +   +  +I   ++D+   ++F
Sbjct: 1161 PNDKKQMIAGELDRSVREIERKISDIRTRSAF 1192


>gi|391341057|ref|XP_003744848.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Metaseiulus
            occidentalis]
          Length = 1211

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 58/247 (23%), Positives = 104/247 (42%), Gaps = 27/247 (10%)

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
            P  +V++N V N I++GD+ +S +F+ ++    QL + A DF      A   ++D  T  
Sbjct: 981  PNLIVTINAVGNRIVVGDVQESFFFIRYRMLENQLIIFADDFTPRWTTAA-CMVDYRT-- 1037

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG--------AHVTKFLRLQMLATS 1306
            +V  D+  N+ I       S+         R+ +  G        A V     +  L  S
Sbjct: 1038 VVGGDKFGNVYILRLPGNTSDDVDEDPTGVRSLWDRGWLGGAGQKAEVLSMTHVGELIVS 1097

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              +T   PG  +    A+++ T+ G +G + P    D+  F   Q L+  +    P + G
Sbjct: 1098 LQKTALIPGGPE----AIVYTTIAGGVGALIPFSSKDDHEF--FQHLEMYMRTEHPPICG 1151

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
             +  SFR ++   KA       ++D +L   Y  L   +Q +IA +      ++   L D
Sbjct: 1152 RDHLSFRSYYFPVKA-------VIDGDLCEQYNSLDANKQKQIADELERLPHEVAKKLED 1204

Query: 1424 LALGTSF 1430
            +    +F
Sbjct: 1205 IRTKFAF 1211


>gi|303324325|ref|XP_003072150.1| Splicing factor 3B subunit 3, putative [Coccidioides posadasii C735
            delta SOWgp]
 gi|240111860|gb|EER30005.1| Splicing factor 3B subunit 3, putative [Coccidioides posadasii C735
            delta SOWgp]
          Length = 1209

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 79/367 (21%), Positives = 165/367 (44%), Gaps = 48/367 (13%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1136
            I ++ +E A++V  V     +++++ET L +GT    G+D+      ++ G + ++   R
Sbjct: 872  IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIY---R 921

Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
              ++ + L   ++  +++    AL + QG LL   G  + ++     +L      +  P 
Sbjct: 922  FQEDGKEL-EFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 980

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1254
             +V L    + I++ D+ +S+ ++ +K Q  +L   A D  +    C A   ++D  T++
Sbjct: 981  LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 1037

Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1306
                D+  N+ +     K SE       G  L+   ++  GA     L +    Q + TS
Sbjct: 1038 --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 1095

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              +T    G     R  L++  L G++G + P    +++ F   QSL+ +L    P +AG
Sbjct: 1096 IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 1149

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
             +   +R +++  K         +D +L   Y  LP +++L IA +   +  +I   ++D
Sbjct: 1150 RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISD 1202

Query: 1424 LALGTSF 1430
            +    ++
Sbjct: 1203 MRTKVAY 1209


>gi|391341059|ref|XP_003744849.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Metaseiulus
            occidentalis]
          Length = 1223

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 58/247 (23%), Positives = 104/247 (42%), Gaps = 27/247 (10%)

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
            P  +V++N V N I++GD+ +S +F+ ++    QL + A DF      A   ++D  T  
Sbjct: 993  PNLIVTINAVGNRIVVGDVQESFFFIRYRMLENQLIIFADDFTPRWTTAA-CMVDYRT-- 1049

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG--------AHVTKFLRLQMLATS 1306
            +V  D+  N+ I       S+         R+ +  G        A V     +  L  S
Sbjct: 1050 VVGGDKFGNVYILRLPGNTSDDVDEDPTGVRSLWDRGWLGGAGQKAEVLSMTHVGELIVS 1109

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              +T   PG  +    A+++ T+ G +G + P    D+  F   Q L+  +    P + G
Sbjct: 1110 LQKTALIPGGPE----AIVYTTIAGGVGALIPFSSKDDHEF--FQHLEMYMRTEHPPICG 1163

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
             +  SFR ++   KA       ++D +L   Y  L   +Q +IA +      ++   L D
Sbjct: 1164 RDHLSFRSYYFPVKA-------VIDGDLCEQYNSLDANKQKQIADELERLPHEVAKKLED 1216

Query: 1424 LALGTSF 1430
            +    +F
Sbjct: 1217 IRTKFAF 1223


>gi|17541566|ref|NP_502299.1| Protein DDB-1 [Caenorhabditis elegans]
 gi|74965443|sp|Q21554.2|DDB1_CAEEL RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|5824558|emb|CAA92824.2| Protein DDB-1 [Caenorhabditis elegans]
          Length = 1134

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/281 (19%), Positives = 119/281 (42%), Gaps = 17/281 (6%)

Query: 1104 TKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            T ++ T   +GT  +  ++   + GR+++F      D  ++ +  V+   ++G+  A+  
Sbjct: 814  TNDSSTYYVVGTGLIYPDETETKIGRIVVFEVD---DVERSKLRRVHELVVRGSPLAIRI 870

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            L G L+ A    I L +WT  +   +       +  + L ++   + + D+ +S+  LS+
Sbjct: 871  LNGKLVAAINSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSY 930

Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
            +        +AKD+ S      EF+   S L          +++    P   +   G+ +
Sbjct: 931  RMLEGNFEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDD---GRYV 987

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLD 1340
            L    +     + K +    L    +        D   +++  ++FGT  G+IG I  +D
Sbjct: 988  LEPTGYWYLGELPKVMTRSTLVIQPE--------DSIIQYSQPIMFGTNQGTIGMIVQID 1039

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
            +   + L +++K + DSV +   +   S+R F    +A  P
Sbjct: 1040 DKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQKRAEPP 1080



 Score = 47.8 bits (112), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 100/396 (25%), Positives = 166/396 (41%), Gaps = 96/396 (24%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           D+  L+ VP  IGGV+V+G+N++ Y        +    Y  SL     L  ++F+    +
Sbjct: 210 DSSVLIPVPHAIGGVIVLGSNSVLYKPNDNLGEVV--PYTCSL-----LENTTFTCHGIV 262

Query: 365 DAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
           DA+   +L      LS   G L++L    T    G  V+ + +     + +   I  I N
Sbjct: 263 DASGERFL------LSDTDGRLLMLLLNVTESQSGYTVKEMRIDYLGETSIADSINYIDN 316

Query: 421 SLFFLGSRLGDSLLVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
            + F+GSRLGDS L++  T  +G S   S + E + +I                 ++DMV
Sbjct: 317 GVVFVGSRLGDSQLIRLMTEPNGGSY--SVILETYSNI---------------GPIRDMV 359

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
             E         ++ +    T + A +D     G L+    G+ I+  AS          
Sbjct: 360 MVE---------SDGQPQLVTCTGADKD-----GSLRVIRNGIGIDELAS---------- 395

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
             V+L G  GI+ +   S    NAD+            Y+I+SL   T VL+      E 
Sbjct: 396 --VDLAGVVGIFPIRLDS----NADN------------YVIVSLSDETHVLQITGEELED 437

Query: 600 TESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
            + ++      TI A  LFG      ++Q  E+  R++  S +++   + P+N E  S  
Sbjct: 438 VKLLEINTDLPTIFASTLFGPNDSGIILQATEKQIRLMSSSGLSK--FWEPTNGEIISK- 494

Query: 657 ENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
                +SV+ A+  ++L   D ++ LL     TC V
Sbjct: 495 -----VSVNAANGQIVLAARD-TVYLL-----TCIV 519


>gi|71413926|ref|XP_809084.1| cleavage and polyadenylation specificity factor-like protein
           [Trypanosoma cruzi strain CL Brener]
 gi|70873410|gb|EAN87233.1| cleavage and polyadenylation specificity factor-like protein,
           putative [Trypanosoma cruzi]
          Length = 499

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 54/260 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIDSSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQNDV---AL 378
           NN  +                   D   E   + F V L  A+ T     + N++    +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFMVNLSLANCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S I  IG+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIE 458
           C +   M  S   + F  IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499


>gi|392869416|gb|EJB11761.1| pre-mRNA-splicing factor rse1 [Coccidioides immitis RS]
          Length = 1209

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 77/367 (20%), Positives = 162/367 (44%), Gaps = 48/367 (13%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1136
            I ++ +E A++V  V     +++++ET L +GT    G+D+      ++ G + ++    
Sbjct: 872  IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIYRFQE 924

Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
            +       +  ++  +++    AL + QG LL   G  + ++     +L      +  P 
Sbjct: 925  DGKE----LEFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 980

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1254
             +V L    + I++ D+ +S+ ++ +K Q  +L   A D  +    C A   ++D  T++
Sbjct: 981  LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 1037

Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1306
                D+  N+ +     K SE       G  L+   ++  GA     L +    Q + TS
Sbjct: 1038 --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 1095

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              +T    G     R  L++  L G++G + P    +++ F   QSL+ +L    P +AG
Sbjct: 1096 IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 1149

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
             +   +R +++  K         +D +L   Y  LP +++L IA +   +  +I   ++D
Sbjct: 1150 RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISD 1202

Query: 1424 LALGTSF 1430
            +    ++
Sbjct: 1203 MRTKVAY 1209


>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
            nagariensis]
 gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
            nagariensis]
          Length = 1104

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 67/268 (25%), Positives = 112/268 (41%), Gaps = 30/268 (11%)

Query: 1113 IGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAI-SALASLQGHLLIA 1170
            +GTA++  E+    +GR+L+    R       LVTE   KE+KGA  + L  ++  +L +
Sbjct: 835  VGTAFIVPEEPEPTKGRILVLEHVR-------LVTE---KEVKGAAYNVLPFVKDKILAS 884

Query: 1171 SGPKIILHKWTGTELNGIAFYDAPP------LYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
               K+     +G +L G+    A        +  + L    N +++GD+ +S+  LS+  
Sbjct: 885  VNSKV---PASGCDLGGVRVELASECSYLGNILALYLATRGNLVVVGDLMRSVSLLSYNV 941

Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
            +   L   A D+ S    + E L D + L     D   N+ +       +   +  +L  
Sbjct: 942  EQGVLEHRAADYNSGWTTSVEALDDDTYLE---GDNHLNLVVLRRNADSATDEERARLQV 998

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
              E+H G  V +F    ++    D          +    LLFG  DG +G IA L    +
Sbjct: 999  VGEYHTGTFVNRFRHGSLVMRPPDSEFV------SLPVPLLFGGTDGRLGVIARLPPGLY 1052

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
              L  LQ  L   V  V GL+  ++  F
Sbjct: 1053 EMLTKLQSALRQVVRGVGGLSHEAWIAF 1080



 Score = 47.0 bits (110), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 94/242 (38%), Gaps = 55/242 (22%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL  + G + LL + +DG  V  L       +   S +  + + L F+GSR GDS LV+ 
Sbjct: 281 LLGNRQGGMQLLVLAHDGSRVSGLRTEPLGYTCAPSCLAYLDSGLTFVGSRSGDSQLVRI 340

Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
           +                     + P T      S  +L  +V+   + L           
Sbjct: 341 SAQP-----------------VNQPPTYLELVDSFPSLAPIVDFVVMDL---------ER 374

Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
           Q      +   + + G L+    G+ IN  A+            VELPG KG+W++    
Sbjct: 375 QGQGQLVMCSGIDSDGSLRVVRNGIGINRQAT------------VELPGIKGVWSL---- 418

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
            R H         YDDEY  YL+++   E R + L T + L E  E   +    +T+  G
Sbjct: 419 -RSH---------YDDEYDKYLLLTFVGETRLLALNTEEELDE-AELPGFDSGSQTLWCG 467

Query: 616 NL 617
           N+
Sbjct: 468 NM 469


>gi|189044515|sp|Q7RYR4.2|RSE1_NEUCR RecName: Full=Pre-mRNA-splicing factor rse-1
          Length = 1209

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/392 (22%), Positives = 168/392 (42%), Gaps = 62/392 (15%)

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQG 1120
            RA G W +  +I    SE    ++ + L N             ++E E+ L +GT    G
Sbjct: 847  RAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEGESFLVVGT----G 902

Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
            +D+    R   F+ G     R  ++ ++L   ++   ++    AL   QG LL   G  +
Sbjct: 903  KDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPLALIPFQGRLLAGVGKTL 959

Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
             ++     +L   A  D  P  +VSL    N I++GD+ + I ++ +K +G +L   A D
Sbjct: 960  RIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGITYVVYKAEGNRLIPFADD 1019

Query: 1236 FGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
              +L+ + T   ++D    S+   D+  NI I     ++S+          +E H+  H 
Sbjct: 1020 --TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPERVSQETDEPG----SEIHL-MHA 1070

Query: 1295 TKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
              +L     RL        Q L TS  +T    G        LL+  L G++G   P   
Sbjct: 1071 RNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVS 1126

Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
             +++ F   Q+L+  +    P +AG +   +R +++  K        ++D +L   + +L
Sbjct: 1127 REDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLL 1177

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            P +++  IA +   +  +I   ++D+   ++F
Sbjct: 1178 PNDKKQMIAGELDRSVREIERKISDIRTRSAF 1209


>gi|350290373|gb|EGZ71587.1| Pre-mRNA-splicing factor rse-1 [Neurospora tetrasperma FGSC 2509]
          Length = 1209

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/392 (22%), Positives = 168/392 (42%), Gaps = 62/392 (15%)

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQG 1120
            RA G W +  +I    SE    ++ + L N             ++E E+ L +GT    G
Sbjct: 847  RAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEGESFLVVGT----G 902

Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
            +D+    R   F+ G     R  ++ ++L   ++   ++    AL   QG LL   G  +
Sbjct: 903  KDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPLALIPFQGRLLAGVGKTL 959

Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
             ++     +L   A  D  P  +VSL    N I++GD+ + I ++ +K +G +L   A D
Sbjct: 960  RIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGITYVVYKAEGNRLIPFADD 1019

Query: 1236 FGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
              +L+ + T   ++D    S+   D+  NI I     ++S+          +E H+  H 
Sbjct: 1020 --TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPERVSQETDEPG----SEIHL-MHA 1070

Query: 1295 TKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
              +L     RL        Q L TS  +T    G        LL+  L G++G   P   
Sbjct: 1071 RNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVS 1126

Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
             +++ F   Q+L+  +    P +AG +   +R +++  K        ++D +L   + +L
Sbjct: 1127 REDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLL 1177

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            P +++  IA +   +  +I   ++D+   ++F
Sbjct: 1178 PNDKKQMIAGELDRSVREIERKISDIRTRSAF 1209


>gi|302680006|ref|XP_003029685.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
 gi|300103375|gb|EFI94782.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
          Length = 1213

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 87/376 (23%), Positives = 157/376 (41%), Gaps = 36/376 (9%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RI++P +      T A IP+ ++E A ++ VV     + +  E  L +GTA      V+
Sbjct: 861  IRIIDPTQN----STVAVIPLDNNEAAFSIAVVPF---SARNGELFLVVGTA--ANTRVS 911

Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
             R     +       N    +   +  E      AL + QG L    G  + ++     +
Sbjct: 912  PRTCSSGYLRTYQFTNDGAGLELHHKTETDDVPLALLAFQGRLAAGVGKALRIYDIGKKK 971

Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
            L   A        +V+LN   + I+ GD+ +S+++  +K    +L + A D       + 
Sbjct: 972  LLRKAENKGFGTTIVTLNTQGSRIIAGDMQESLFYAVYKAPENRLLVFADD-SQPRWISA 1030

Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTKFL- 1298
              ++D  T++    D   N+ +     K+SE       G  +L      +GA H TK L 
Sbjct: 1031 ATMVDYYTVA--AGDRFGNVFVNRLDYKVSEQVDDDPTGAGILHEKGILMGAPHKTKLLC 1088

Query: 1299 --RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKK 1353
               +  L TS  +     G     R  LL+  L G+IG + P    +++ F  + +L++ 
Sbjct: 1089 HFHVGDLITSIHKVALVAG----GREVLLYTGLHGTIGMLVPFVSKEDVDF--ISTLEQH 1142

Query: 1354 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
            +      + G +  S+R ++   KA       +VD +L   +  LP  +Q  IA++   T
Sbjct: 1143 MRSEQSSLVGRDHLSWRGYYVPVKA-------VVDGDLCETFAKLPASKQSAIANELDRT 1195

Query: 1414 RSQILSNLNDLALGTS 1429
              ++L  L+ L   TS
Sbjct: 1196 VGEVLKKLDSLRTTTS 1211


>gi|219125301|ref|XP_002182922.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
            1055/1]
 gi|217405716|gb|EEC45658.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
            1055/1]
          Length = 1284

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 66/286 (23%), Positives = 121/286 (42%), Gaps = 30/286 (10%)

Query: 1110 LLAIGTAYVQ-GEDVAARGRVLLFST----GRNADNPQNLVTEVYSKELKGAISALASL- 1163
             L +GTAY    ED  +RGR+L++S               V ++     +G + ++    
Sbjct: 931  FLLVGTAYAMPDEDEPSRGRILVYSCQADEASGTPTSTRAVRQITEMSTQGGVYSICQFY 990

Query: 1164 QGHLLIASGPKIILHKWTGT------ELNGIAFYDAPPLYVVSLNI---VKNFILLGDIH 1214
             G+ L     K  + +          E  GI  +     ++VSL +    K   ++GD+ 
Sbjct: 991  DGNFLCTVNSKTHVVQIVADCGVLRLEYVGIGHHG----HIVSLFVKSRAKPLAIVGDLM 1046

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +S+  + +  Q   L  +A+DF      A E L D     + +  E  N        K +
Sbjct: 1047 RSVSLMQYYPQHETLEEVARDFNPNWTTAVEMLTD----DVYIGAENWNNLFCLRRNKAA 1102

Query: 1275 ESWKGQ-KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
             S + + +L +  EFH+G    KF+   ++         +  S  ++R A LFGT++GS+
Sbjct: 1103 TSEEIRCRLDNIGEFHLGEMCNKFMSGSLVMP------VSSNSTTSSRRATLFGTVEGSL 1156

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
            G I  LD  T     +L++ +  ++  V G + + +R   +  + H
Sbjct: 1157 GVILGLDGRTAAFFITLERAIAKTIQPVGGFSHQLYRSCQAELRVH 1202


>gi|392580116|gb|EIW73243.1| hypothetical protein TREMEDRAFT_37240 [Tremella mesenterica DSM 1558]
          Length = 1214

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 78/390 (20%), Positives = 160/390 (41%), Gaps = 61/390 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA-------- 1116
            +RIL+P       QT +TI +   E A ++ +    N      E  L +GTA        
Sbjct: 862  IRILDPLTN----QTVSTIELDEDEAAFSLTIAYFENMA---GEPSLVVGTAVKTTLTPR 914

Query: 1117 -----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
                 +++   +   GR L F               ++  +L      +A  QG+LL+ +
Sbjct: 915  GCKEGWLRVYAIKENGRTLEF---------------MHKTKLDEIPLCVAGFQGYLLVGA 959

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
            G  + L++     L      ++ P  + ++N++   I++GD+ +S +F  ++    +  L
Sbjct: 960  GKSLRLYEAGKKALLRKCENNSFPTVIATINVIGARIIVGDMQESTFFCVYRSIPTRQLL 1019

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAE 1287
            +  D            +D  T++    D+  N+ +      +SE       G  +L    
Sbjct: 1020 VFGDDTQPRFLTCVTNVDYDTVA--CGDKFGNVFVNRMDQAVSEKVDDDPTGAGILHEKG 1077

Query: 1288 FHVG-AHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP---LD 1340
            F +G AH T  +   ++  + TS  +    PG     R  L++ T+ G++G + P   +D
Sbjct: 1078 FLMGAAHKTTLIAHYQVGSVVTSLTKVSLVPG----GRDVLVYTTISGAVGALVPFISMD 1133

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            ++ F  + +L+  +      + G +  ++R +++           +VD +L   Y  LP 
Sbjct: 1134 DVEF--MTTLEMHMRSQNISLVGRDHLAYRGYYAPVMG-------VVDGDLCDAYSSLPY 1184

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
             +Q  IA++   +   +L  L  +   ++F
Sbjct: 1185 TKQSSIANELDRSVGDVLKKLEQMRTSSAF 1214


>gi|388855100|emb|CCF51231.1| probable splicing factor 3B subunit 3 [Ustilago hordei]
          Length = 1221

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 129/581 (22%), Positives = 227/581 (39%), Gaps = 84/581 (14%)

Query: 887  LRNLRFSRTPLDAYTREETPH-----GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
            L N    RT LDA T + T       G+   R+ I   + G        +R      ++ 
Sbjct: 685  LSNGVLLRTVLDAMTGQLTDTRTRFLGSKAVRL-IRTKVHGQSAVMALSTRTWLSFTYQS 743

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
            RL+  P + D    A++    + C  G I +     L+I  +PS ++         + L 
Sbjct: 744  RLQFTPLIFDALDHAWSFSAEL-CPEGLIGIVG-STLRIFTIPSLASK---LKQDSVALS 798

Query: 1002 ATPHQITYFAEKN--LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT-Y 1058
             TP +I +  ++    Y +      L P  Q   +   + +  ++  H    +DL    +
Sbjct: 799  YTPRKIAHHPDEQGLFYVVEADRRTLSPGAQRRRV---EALEKELKPHQRGVLDLKPAEF 855

Query: 1059 TVEEYE-------VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
             +   E       VR+++    G   QT   I +  +E A +V +V      + E ++ L
Sbjct: 856  GLIRGEAGNWASCVRVVD----GPQSQTTHKIELDDNEAAFSVAIVPF---ASAEKQSFL 908

Query: 1112 AIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSK-ELKGAISALASLQGHL 1167
             +G+A     DV    R       +T R  +  + L  EV+ K E+      L   QG L
Sbjct: 909  VVGSAV----DVVLSPRSFKKAYLTTYRLINGGREL--EVHHKTEIDDIPLVLRPFQGRL 962

Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
            L   G  + ++     +L       + P  +VSL+   + I++GD+ +SI F S+K    
Sbjct: 963  LAGVGKALRIYDLGKKKLLRKCENKSFPTAIVSLDAQGSRIVVGDMQESIVFTSYKPLEN 1022

Query: 1228 QLNLLAKD----------------FGSLDCFATEFL--IDGSTLSLVVSDEQKNIQIFYY 1269
            +L   A D                  + D F   ++  ID  T S  V ++   + I + 
Sbjct: 1023 RLVTFADDVMPKFVTRCTMLDYDTVAAADKFGNLYVLRIDADT-SRSVDEDPTGMTIVHE 1081

Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
             P +  +     LL  A + VG           + TS +RT   PG     R  L++  +
Sbjct: 1082 KPVLMGAAHKATLL--AHYFVGD----------IITSLNRTVMVPG----GREVLMYTGI 1125

Query: 1330 DGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
             G+IG + P + +     L +LQ +L      + G +  ++R  ++  K       S++D
Sbjct: 1126 SGTIGALVPFVSKEDVDTLSTLQTQLRQENNSLVGRDHLAYRSSYAPVK-------SVID 1178

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
             +L   + +L   +Q  IA +     S+I   L  L  G +
Sbjct: 1179 GDLCETFGLLQPAKQNAIAQELDRKPSEINKKLAQLREGAT 1219


>gi|295666353|ref|XP_002793727.1| pre-mRNA-splicing factor rse1 [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278021|gb|EEH33587.1| pre-mRNA-splicing factor rse1 [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1209

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 77/362 (21%), Positives = 156/362 (43%), Gaps = 38/362 (10%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNAD 1139
            I ++ +E A++V  V+    +++++ET L +GT    G+D+    R          R  +
Sbjct: 872  IELEENEAAVSVAAVSF---SSQDDETFLVVGT----GKDMVVNPRSCSAGFIHIYRFQE 924

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
            + + L   ++  +++    AL   QG LL   G  + ++     ++         P  VV
Sbjct: 925  DGKEL-EFIHKTKVEQPPVALLGFQGRLLAGIGTDVRIYDLGMRQMLRKCQASVVPHLVV 983

Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
             L    + I++ D+ +S+ ++ +K Q  +L     D  S     T  ++D  T++    D
Sbjct: 984  GLQTQGSRIIVSDVQESVTYVVYKSQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGD 1040

Query: 1260 EQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTG 1311
            +  N+ +     K SE       G  L+   ++  GA    ++      Q L TS  +  
Sbjct: 1041 KFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQ 1100

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
               G     R  L++  L G++G + P    +E+ F   QSL+ +L    P +AG +   
Sbjct: 1101 LVTG----GRDILVWTGLQGTVGMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLI 1154

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
            +R +++  K         +D +L   Y +LP +++ +IA +   +  +I   + D+    
Sbjct: 1155 YRSYYAPAKG-------TIDGDLCETYLLLPNDKKQQIAGELDRSVREIERKIADMRTKV 1207

Query: 1429 SF 1430
            ++
Sbjct: 1208 AY 1209


>gi|390342012|ref|XP_793599.3| PREDICTED: uncharacterized protein LOC588842 [Strongylocentrotus
           purpuratus]
          Length = 1161

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/281 (24%), Positives = 111/281 (39%), Gaps = 46/281 (16%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           GP+  +DP+ R  G+ +Y     I+   +    L            F+ R+E  +VI+++
Sbjct: 50  GPIGIIDPECRMIGLRLYDGLFKIIPLDRDNKEL----------KAFNIRLEELNVIDVQ 99

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
                      F++G  +P +V LH+      GR     H              + P  W
Sbjct: 100 -----------FLYGCHQPTIVFLHQDP---HGR-----HVKTYEVNLREKEFNRGP--W 138

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++AVP P GG L++G  +I YH      A+A             +  S+
Sbjct: 139 KQDNVETEATMVIAVPQPYGGALIIGQESITYHKGDNYVAIA----------PPTIKNST 188

Query: 360 FSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDIT 416
                 LD   + +L  D  L       L+      DG   V+ L L     + +   +T
Sbjct: 189 LVCYGRLDNNGSRYLLGD--LTGRLFLLLLDKEESMDGAATVKDLKLEFLGETSIAECLT 246

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
            + N + F+GSRLGDS LV+    S  S     + E F ++
Sbjct: 247 YLDNGVVFIGSRLGDSQLVRLNTESDESGSYVTMMETFTNL 287


>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
 gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
          Length = 1064

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 76/336 (22%), Positives = 141/336 (41%), Gaps = 38/336 (11%)

Query: 1102 TTTKENE---TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAIS 1158
            T+TK ++   T LA+GT+ +   D    GRVLLF    N +  + LV  +     +  + 
Sbjct: 749  TSTKFDDDPCTYLAVGTS-INIPDRQTSGRVLLF----NINEAKKLVL-LEEISFRSGVL 802

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----NFILLGDI 1213
             L    G L+ A   ++   +++ ++        +  ++     I+K     +F+L+GD+
Sbjct: 803  YLHQFNGRLIAAVLKRLYSIRYSYSKEKNCKVISSENVHKGHTMILKLASRGHFMLVGDM 862

Query: 1214 HKSIYFLSWKEQGAQLNLLAKD-----FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
             KS+  L   E G+ L  +AK+       S+     ++ I   T +  V  ++ N     
Sbjct: 863  MKSMSLLGQSENGS-LVQIAKNPQPIWIRSIAMINDDYFIGSETSNNFVVVKKNN----- 916

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                 +   + + L S   +H+G  +       ML  S  R    P SD      +L+ +
Sbjct: 917  ---DSTNELERELLDSVGHYHIGESIN-----SMLCGSLVR---LPDSDAPPIPTILYAS 965

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
            ++GSIG IA + +  +     LQK L   V  + G    S+R F ++   H     + +D
Sbjct: 966  VNGSIGVIASISKEDYEFFSKLQKGLNRVVNGIGGFTHESWRAFSND--HHTVESRNFID 1023

Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +L+  +  L +E   ++      T  + L  +  L
Sbjct: 1024 GDLIEMFPDLKIESMAKVIQDMNVTLDETLKRIESL 1059



 Score = 42.4 bits (98), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 128/565 (22%), Positives = 214/565 (37%), Gaps = 117/565 (20%)

Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
           ++G +  L + S GG     ++D + ++ E  K  +L +D     +   +    E     
Sbjct: 14  IYGRISVLKLFSAGG-----KQDYLFISTESFKFCILAYDSEKKEIVTKASGNAED---- 64

Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
               GR + A G L  +DP GR          +I L   +G   L+  E       G + 
Sbjct: 65  --TIGRPTEA-GQLGIIDPDGR----------LIALHLYEGLLKLINIEK------GLNN 105

Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
            I+ +   N R L+   V D  F++G   P + +L      +      KH    I    +
Sbjct: 106 PIQKTAA-NTR-LEELQVMDMTFLYGCKIPTIAVL------FKDTKDEKH----IVTYEV 153

Query: 289 STTLKQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
           S   ++  P  WS  N+    Y  + V  P+GGVLVV  N I Y +   + ++A      
Sbjct: 154 SQKDQELCPGPWSQSNV--GVYSSMLVAVPLGGVLVVADNGITYMNGRTTRSIA------ 205

Query: 348 SLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                  +P + F     +D   + +L  D        G L +L ++   + V  L    
Sbjct: 206 -------IPYTKFLAYDRVDKDGSRYLFGD------HFGRLSVLVLLNHQQRVTELKFET 252

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + + + F+GS  GDS L++                   + E D P+T 
Sbjct: 253 LGRTSIPSSISYLDSGVVFIGSSSGDSQLIRL------------------NTEKD-PATD 293

Query: 466 RLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVNIGPLKDFSYGLR 523
               S    L++  N G  +      +     AQ  T S   RD     G L+    G+ 
Sbjct: 294 ----SYISHLENFTNIGPIVDFCLVDTEKQGQAQIVTCSGTYRD-----GTLRVIRNGI- 343

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
                   GI++++   L+EL G KG+W +        N  S  +   D     YLI+S 
Sbjct: 344 --------GIAEKA---LIELEGVKGLWPI------KENDPSDPLNPKD----QYLIVSF 382

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
              T VL+      E TE         TI   N+     ++QV  +   +++  ++   D
Sbjct: 383 IGYTKVLQFQGEEIEETEFEGLDSNSSTILCSNIDKENVIVQVTNQAINLINPITFKRVD 442

Query: 643 LSFGPSNSESGSGSENSTVLSVSIA 667
               PS S     S N + +++SI 
Sbjct: 443 QWKSPSGSPINLVSSNQSQIALSIG 467


>gi|336257679|ref|XP_003343663.1| hypothetical protein SMAC_08834 [Sordaria macrospora k-hell]
 gi|380091896|emb|CCC10625.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1209

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 82/370 (22%), Positives = 163/370 (44%), Gaps = 54/370 (14%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
            I + ++E A++  +V      ++E E+ L +GT    G+D+    R   F+ G     R 
Sbjct: 872  IDLDNNEAAVSAAIVPF---ASQEGESFLVVGT----GKDMVLNPR--QFTEGYIHVYRF 922

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
             ++ ++L   ++   ++    AL   QG LL   G  + ++     +L   A  D  P  
Sbjct: 923  HEDGRDL-EFIHKTRVEEPPMALIPFQGRLLAGVGKTLRIYDLGLKQLLRKAQADVTPTL 981

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLV 1256
            +VSL    N I++GD+ + + ++ +K +G +L     D  +L+ + T   ++D    S+ 
Sbjct: 982  IVSLQSQGNRIIVGDLQQGVTYVVYKAEGNRLIPFVDD--TLNRWTTCTTMVDYE--SVA 1037

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL-----RL--------QML 1303
              D+  NI I     ++S+          +E H+  H   +L     RL        Q L
Sbjct: 1038 SGDKFGNISIVRCPERVSQDTDEPG----SEIHL-MHARNYLHGTPNRLSLQVHFFTQDL 1092

Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1360
             TS  +T    G        LL+  L G++G   P    +++ F   Q+L+  +    P 
Sbjct: 1093 PTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVSREDVDF--FQNLENHMRAEDPP 1146

Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
            +AG +   +R +++  K        ++D +L   + +LP +++  IA +   +  +I   
Sbjct: 1147 LAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLLPNDKKQMIAGELDRSVREIERK 1199

Query: 1421 LNDLALGTSF 1430
            ++D+   ++F
Sbjct: 1200 ISDIRTRSAF 1209


>gi|315053737|ref|XP_003176243.1| pre-mRNA-splicing factor rse1 [Arthroderma gypseum CBS 118893]
 gi|311338089|gb|EFQ97291.1| pre-mRNA-splicing factor rse1 [Arthroderma gypseum CBS 118893]
          Length = 1181

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 102/466 (21%), Positives = 187/466 (40%), Gaps = 60/466 (12%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
             C  G + +  Q +    ++ S    DN    + IPL  TP       E   YPL     
Sbjct: 725  QCVEGMVGIQGQNL----RIFSIEKLDNNLLQEPIPLAYTPRNFVRHPE---YPLFY--- 774

Query: 1024 VLKPLNQVLS-----LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGP 1076
            V+   N VLS      L+ +      D+  L   D         +   + +++P      
Sbjct: 775  VIGSDNNVLSPSTKAKLLGESTAVNGDSAELPPEDFGYPRGTNHWASCIEVVDPINTKS- 833

Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFS 1133
                + + ++ +E A+++  V+    T++E+ET L +GT    G+D+    R        
Sbjct: 834  --VLSKLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRTYTCGFIH 884

Query: 1134 TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
              R  +  + L   ++  +++    AL   QG LL   GP + ++     +L        
Sbjct: 885  IYRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGVGPDLRIYDLGMRQLLRKCQAQI 943

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
             P  +V L    + I++ D+ +S+ ++ +K Q   L   A D  S     T  ++D  T+
Sbjct: 944  TPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIISRWTTCTT-MVDYETV 1002

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLA 1304
            +    D+  NI +     K SE       G  L+   ++  GA      V  F   Q + 
Sbjct: 1003 A--GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVVHFYS-QDIP 1059

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
            TS  +T    G     R  L++  L G++G   P    D++ F   Q+L+ +L      +
Sbjct: 1060 TSIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNAPL 1113

Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
            AG +   +R +++  K        ++D +L   + +LP +++  IA
Sbjct: 1114 AGRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIA 1152


>gi|225683909|gb|EEH22193.1| pre-mRNA-splicing factor rse1 [Paracoccidioides brasiliensis Pb03]
          Length = 1209

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 77/362 (21%), Positives = 156/362 (43%), Gaps = 38/362 (10%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNAD 1139
            I ++ +E A++V  V+    +++++ET L +GT    G+D+    R          R  +
Sbjct: 872  IELEENEAAVSVAAVSF---SSQDDETFLVVGT----GKDMVVNPRSCSAGFIHIYRFQE 924

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
            + + L   ++  +++    AL   QG LL   G  + ++     ++         P  VV
Sbjct: 925  DGKEL-EFIHKTKVEQPPVALLGFQGRLLAGIGTDVRIYDLGMRQMLRKCQASVVPHLVV 983

Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
             L    + I++ D+ +S+ ++ +K Q  +L     D  S     T  ++D  T++    D
Sbjct: 984  GLQTQGSRIIVSDVQESVTYVVFKSQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGD 1040

Query: 1260 EQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTG 1311
            +  N+ +     K SE       G  L+   ++  GA    ++      Q L TS  +  
Sbjct: 1041 KFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQ 1100

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
               G     R  L++  L G++G + P    +E+ F   QSL+ +L    P +AG +   
Sbjct: 1101 LVTG----GRDILVWTGLQGTVGMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLI 1154

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
            +R +++  K         +D +L   Y +LP +++ +IA +   +  +I   + D+    
Sbjct: 1155 YRSYYAPAKG-------TIDGDLCETYLLLPNDKKQQIAGELDRSVREIERKIADMRTKV 1207

Query: 1429 SF 1430
            ++
Sbjct: 1208 AY 1209


>gi|440478305|gb|ELQ59147.1| pre-mRNA-splicing factor rse-1 [Magnaporthe oryzae P131]
          Length = 1223

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 85/368 (23%), Positives = 162/368 (44%), Gaps = 56/368 (15%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
            I + ++E AL++ VV+     +++ E+ L +GT    G+D+    R   F+ G     R 
Sbjct: 879  IDLDNNEAALSMAVVSF---ASQDGESFLVVGT----GKDMVVNPR--RFTEGYIHVYRF 929

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
            +++ + L   ++  +++   +AL   QG L+   G  + ++     +L   A  +  P  
Sbjct: 930  SEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGIGRMLRIYDLGLRQLLRKAQAEVAPQL 988

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            +VSLN   + I++GD+   + ++++K +  +L   A D  +     T  +   ST     
Sbjct: 989  IVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAG--- 1045

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL-----RLQMLA-------- 1304
            +D+  N+ I     K S+          +E H+  H   +L     RL ++A        
Sbjct: 1046 ADKFGNLWILRCPEKASQESDEPG----SEVHL-VHSRDYLHGTSNRLALMAHVYTQDIP 1100

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
            TS  +T    G  +     LL+G   G+IG + P    ++  F   QSL++ L    P +
Sbjct: 1101 TSICKTNLVVGGQE----VLLWGGFQGTIGVLIPFVSREDADF--FQSLEQHLRSEDPPL 1154

Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT----RSQI 1417
            AG +   +R  +   K        ++D +L   Y MLP +++  IA +   +      +I
Sbjct: 1155 AGRDHLMYRGCYVPVKG-------VIDGDLCERYTMLPNDKKQMIAGELDRSVREIERKI 1207

Query: 1418 LSNLNDLA 1425
             +N  D+A
Sbjct: 1208 STNFVDIA 1215


>gi|340721347|ref|XP_003399083.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus terrestris]
 gi|350406701|ref|XP_003487854.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus impatiens]
          Length = 1217

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 80/389 (20%), Positives = 153/389 (39%), Gaps = 60/389 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1122
            +RI+ P        T  T  +   E  L    ++L   + + ++  L +G A  +     
Sbjct: 866  IRIIAP-------TTGQTFEVHRLEQNLAALCLSLVKFSNQGDQLFLIVGIAKEFQLNPR 918

Query: 1123 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
            V++ G +  +       N    +  V+   L     A+   QG +L+  G  + L+    
Sbjct: 919  VSSGGFLYTYRVNSECTN----LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLYDMGK 974

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
             +L         P  VVS+N +   I + D+ +S+Y + +K Q  QL + A D       
Sbjct: 975  KKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHP-RWI 1033

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1284
             T  ++D  T++   +D+  NI +   A  +++            W         QK  +
Sbjct: 1034 TTTCVLDYDTVA--TADKFGNIAVIRLASGINDDVDEDPTGNKALWDRGLLNGASQKADT 1091

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1341
             A FHVG  V    +  ++          PG  ++    L++ TL G++G + P    ++
Sbjct: 1092 VACFHVGETVMSLQKATLI----------PGGSES----LVYTTLSGTVGVLVPFTSHED 1137

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
              F   Q L+  +    P + G +  SFR ++   K       +++D +L   +  +   
Sbjct: 1138 HDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK-------NVIDGDLCEQFNSIEPT 1188

Query: 1402 EQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            +Q  I+     T S++   L D+    +F
Sbjct: 1189 KQKSISGDLERTASEVSKKLEDIRTRYAF 1217



 Score = 42.4 bits (98), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
                       SS +  E GD    AP   R  +     D+L  ++  +   L   A+ 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           +T     T     R +L      +   +GL +            S   + ELPG    +W
Sbjct: 419 DTPELYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  +               D+EY AY+I+S    T+VL   + + EVT+S   F+    
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504

Query: 612 IAAGNLFGRRRVIQVFERGAR 632
             + +  G   ++QV+  G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525


>gi|425768510|gb|EKV07031.1| Pre-mRNA-splicing factor rse1 [Penicillium digitatum PHI26]
 gi|425775700|gb|EKV13954.1| Pre-mRNA-splicing factor rse1 [Penicillium digitatum Pd1]
          Length = 1209

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 100/462 (21%), Positives = 186/462 (40%), Gaps = 66/462 (14%)

Query: 990  DNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
            DN    + IPL  TP +     +++L+ +I S   VL P  +    LID       +  +
Sbjct: 781  DNNMLQESIPLSYTPRRFVKHPDQHLFYVIESDNNVLSPATR--QRLIDDSQAQNGEVAD 838

Query: 1049 LSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1106
            L   D         +   V+I++P          +T+ ++ +E A+++  V+    ++++
Sbjct: 839  LPPADFGYPRATGHWASCVQIVDPITTKS---VISTLDLEDNEAAVSLAAVSF---SSQD 892

Query: 1107 NETLLAIGTA-------------YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
            +ET L +GTA             ++        GR L F      D P            
Sbjct: 893  DETFLVVGTAKDMTVSPPSSSCGFIHIYRFQEDGRELEFIHKTQVDEPP----------- 941

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
                 AL   QG LL   GP + ++     +L         P  +V L    + I++ DI
Sbjct: 942  ----LALLGFQGRLLAGIGPVLRVYDLGMKQLLRKCQAPVVPKTIVGLQTQGSRIIVSDI 997

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
             +S+ ++ +K Q   L   A D  +    +T  ++D  T +    D+  N+ +     K+
Sbjct: 998  RESVTYVVYKYQDNVLIPFADDSIARWTSSTT-MVDYETTA--GGDKFGNLWLVRCPSKI 1054

Query: 1274 SESW----KGQKLL-SRAEFHVGAHVTKFLR---LQMLATSSDRTGAAPGSDKTNRFALL 1325
            SE       G  L+  +   H   H  + +     Q + TS  +T    G     R  ++
Sbjct: 1055 SEQADEDGSGAHLIHEKGYLHGTPHRLELMVHFFAQDIPTSLHKTQLVAG----GRDIVV 1110

Query: 1326 FGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
            +  L G+IG   P    +++ F   Q L+ +L    P +AG +   +R +++  K     
Sbjct: 1111 WTGLQGTIGMFVPFVSREDVDF--FQLLETQLASQQPPLAGRDHLMYRGYYAPVKG---- 1164

Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
               ++D +L   Y +LP + +L IA +   +  +I   ++D+
Sbjct: 1165 ---VIDGDLCEMYLLLPNDTKLMIAGELDRSVREIERKISDM 1203


>gi|323508292|emb|CBQ68163.1| probable splicing factor 3B subunit 3 [Sporisorium reilianum SRZ2]
          Length = 1221

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 90/375 (24%), Positives = 152/375 (40%), Gaps = 56/375 (14%)

Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFST 1134
            QT   + +  +E A +V VV      + E E +L +G+A     DV    R       +T
Sbjct: 878  QTTHKLELDDNEAAFSVAVVPF---ASAEKEAMLVVGSAV----DVVLSPRSFKKAYLTT 930

Query: 1135 GRNADNPQNLVTEVYSK-ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
             R  +N + L  EV  K E+      L   QG LL   G  + ++     +L       +
Sbjct: 931  YRLTNNGREL--EVLHKTEVDDIPLVLRPFQGRLLAGIGKALRIYDLGKKKLLRKCENKS 988

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----------------FG 1237
                +VSL+   + I++GD+ +SI F S+K    +L   A D                  
Sbjct: 989  FATAIVSLDAQGSRIVVGDMQESIIFTSYKPLENRLVTFADDVMPKFVTRCAMLDYDTVA 1048

Query: 1238 SLDCFATEFL--IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 1295
            + D F   ++  ID  T S  V ++   + I +  P +  +     L+  A F VG    
Sbjct: 1049 AADKFGNVYVLRIDADT-SRSVDEDPTGMTIVHEKPVLMGAAHKATLV--AHFFVGD--- 1102

Query: 1296 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKL 1354
                   + TS +RT   PG     R  LL+  + G+IG + P + +     L +L+  L
Sbjct: 1103 -------IVTSLNRTVMVPG----GREVLLYTGVSGTIGALVPFVSKEDVDTLSTLESHL 1151

Query: 1355 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
                  + G +  ++R  ++  K       S++D +L   + +LP  +Q  IA +     
Sbjct: 1152 RQENSSLVGRDHLAYRSSYAPVK-------SVIDGDLCETFGLLPPAKQNAIATELDRKP 1204

Query: 1415 SQILSNLNDLALGTS 1429
            S+I   L  L  G++
Sbjct: 1205 SEINKKLAQLREGST 1219


>gi|310793065|gb|EFQ28526.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1212

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 105/506 (20%), Positives = 199/506 (39%), Gaps = 93/506 (18%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
             C  G + +  Q +        G T       + IPL  TP ++    E   YP+  ++ 
Sbjct: 761  QCEEGVVGIQGQSLRIFAIENLGDTITQ----KSIPLSYTPRRLLKHPE---YPMFYTI- 812

Query: 1024 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE----EYEVRILEPDRAGGP--- 1076
                               + DN+ L   DL      E      + RIL PD  G P   
Sbjct: 813  -------------------EADNNTLPP-DLRAKLIAEPGVVNGDARILPPDEFGYPKGK 852

Query: 1077 --W--------------QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG 1120
              W              +   TI + ++E A++  +V+     ++++E+ L +GT    G
Sbjct: 853  GRWASCISVIDPLAEDQRVLQTIDLDNNEAAVSAAIVSF---ASQDSESFLIVGT----G 905

Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
            +D+    R   FS G     R  ++   L   ++  +++   SAL   QG LL   G  +
Sbjct: 906  KDMVVNPR--QFSEGYIHVYRFGEDGHEL-EFIHKTKVEEPPSALLGFQGRLLAGIGKTL 962

Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
             ++     ++   A  D  P  +VSL+   + I++GD+   I ++ +K    +L     D
Sbjct: 963  RIYDLGLRQMLRKAQADVTPQLIVSLSTQGSRIIVGDVQHGITYVVYKPTTNKLIPFVDD 1022

Query: 1236 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHV 1290
              S     T  ++D    S+   D+  N+ +   + K ++    +      + +R   H 
Sbjct: 1023 TVSRWVTCTT-MVDYE--SVAGGDKFGNMFLVRCSEKATQEADDESGGLHLINTRDYLHG 1079

Query: 1291 GAHVTKFLR---LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTF 1344
              H    L     Q + TS  +T    G        LL+  ++G+IG   P    +++ F
Sbjct: 1080 TPHRLSLLAHSYTQDVPTSITKTSLVVGGQD----VLLWSGINGTIGVFIPFVTREDVDF 1135

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
               Q+L++ +      +AG +   +R ++   K        ++D +L   Y +LP E++ 
Sbjct: 1136 --FQNLEQHMRTEDAPLAGRDHLMYRGYYVPVKG-------VIDGDLCERYTLLPSEKKQ 1186

Query: 1405 EIAHQTGTTRSQILSNLNDLALGTSF 1430
             IA +   +  +I   ++D+   ++F
Sbjct: 1187 MIAGELDRSVREIERKISDIRTRSAF 1212



 Score = 44.3 bits (103), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 137/606 (22%), Positives = 225/606 (37%), Gaps = 135/606 (22%)

Query: 74  QEEGSKESKNSGETKRRVLM---DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
           Q  G+KE      +  R+ +   D      + L+ H  + G + S+A     G++    +
Sbjct: 27  QFSGTKEQNIITASGSRLTLLRPDPSQGKVITLLSH-DIFGIIRSMAAFRLAGSN----K 81

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVD 186
           D +ILA +  +I+++E+        I + + F+    LHL    K G      G  +  D
Sbjct: 82  DYLILATDSGRITIIEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACD 130

Query: 187 PQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           P+GR      V    L  ++ + SQ        E T  S     A      V+++  LD+
Sbjct: 131 PKGRACLIASVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVLSMVALDV 181

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLI 298
                     GY  PV   L E E T A     G  + +  T ++    +   L      
Sbjct: 182 ----------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRK 229

Query: 299 WSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALAL--NNYAVSLD 350
           WS    P D    L    P G     GVLV G   I Y HS   +  + +     A    
Sbjct: 230 WSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDP 286

Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDL 403
           S +    S    +L  +   +      L+ T+ GDL   T+      DG     V+RL +
Sbjct: 287 SRKRHVVSGVMHKLKGSAGAFF----FLIQTEDGDLFKATIDMVEDADGNPTGEVKRLKI 342

Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
              +   ++S +  + +   +  S+ G+    QF              E+ GD       
Sbjct: 343 KYFDTIPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGD------D 382

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR 523
            + L  SS D   D   G +   +          +   + A+ +S+ ++ PL D      
Sbjct: 383 DEELEFSSDDFPTDPKAGYDAVYF--------HPRPLENLALVESIDSMNPLLDCKVANL 434

Query: 524 INADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSS 566
              DA    +A G   +S + ++            ELPG    +WT+  K +RG      
Sbjct: 435 TGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLNRG------ 486

Query: 567 RMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQV 626
                 D+Y AY+++S    T+VL   + + EV++S   F+      A  L G   +IQV
Sbjct: 487 ------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQV 538

Query: 627 FERGAR 632
             +G R
Sbjct: 539 HPKGIR 544


>gi|66553024|ref|XP_623333.1| PREDICTED: splicing factor 3B subunit 3 isoform 1 [Apis mellifera]
 gi|380015815|ref|XP_003691890.1| PREDICTED: splicing factor 3B subunit 3-like [Apis florea]
          Length = 1217

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 80/389 (20%), Positives = 152/389 (39%), Gaps = 60/389 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1122
            +RI+ P        T  T  +   E  L    ++L     + ++  L +G A  +     
Sbjct: 866  IRIIAP-------STGQTFEVHRLEQNLAALCLSLVKFANQGDQLFLIVGIAKEFQLNPR 918

Query: 1123 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
            V++ G +  +       N    +  V+   L     A+   QG +L+  G  + L+    
Sbjct: 919  VSSGGFLYTYKVNSECTN----LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLYDMGK 974

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
             +L         P  VVS+N +   I + D+ +S+Y + +K Q  QL + A D       
Sbjct: 975  KKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHP-RWI 1033

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1284
             T  ++D  T++   +D+  NI +   A  +++            W         QK  +
Sbjct: 1034 TTTCVLDYDTVA--TADKFGNIAVIRLASGINDDVDEDPTGNKALWDRGLLNGASQKADT 1091

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1341
             A FHVG  V    +  ++          PG  ++    L++ TL G++G + P    ++
Sbjct: 1092 VACFHVGETVMSLQKATLI----------PGGSES----LVYTTLSGTVGVLVPFTSHED 1137

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
              F   Q L+  +    P + G +  SFR ++   K       +++D +L   +  +   
Sbjct: 1138 HDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPIK-------NVIDGDLCEQFNSIEPA 1188

Query: 1402 EQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            +Q  I+     T S++   L D+    +F
Sbjct: 1189 KQKSISSDLERTASEVSKKLEDIRTRYAF 1217



 Score = 42.4 bits (98), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
                       SS +  E GD    AP   R  +     D+L  ++  +   L   A+ 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           +T     T     R +L      +   +GL +            S   + ELPG    +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  +               D+EY AY+I+S    T+VL   + + EVT+S   F+    
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504

Query: 612 IAAGNLFGRRRVIQVFERGAR 632
             + +  G   ++QV+  G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525


>gi|190345965|gb|EDK37945.2| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 1206

 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 123/611 (20%), Positives = 236/611 (38%), Gaps = 102/611 (16%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           +  ++ +CH ++ G ++++  + +GG++     D +++  +  ++S+LEFD         
Sbjct: 55  SGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD------- 103

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
            +  F   +  H K G      G  + VDPQ R   +       ++ KA    + L    
Sbjct: 104 ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL---- 159

Query: 218 DTFGSGGGFSARIES----SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                    S+ +ES    +  I +  LD           GY  P++  +   E  +A  
Sbjct: 160 -------ELSSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHY 199

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANT 328
            +   +    S L++     +  L + A     + +P  +  L+ +P+PIGGV+V G++ 
Sbjct: 200 DASLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSF 259

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I YH+ +    L L     S   S  +P   ++V     H     N   LL  + GD   
Sbjct: 260 IFYHNPTIDQQLYLP--IPSRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFR 312

Query: 389 LTVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSM 445
           + + Y  D   V  L +   +    ++ I        F      D +L Q    G   S 
Sbjct: 313 VLIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSY 372

Query: 446 LSSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
           +SS       D+ + +     + R   + AL  +++       G+    +ES +   +  
Sbjct: 373 ISSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMI 432

Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
              S      LK  ++G+ I+               LV  P      +V+          
Sbjct: 433 TGHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF---------- 463

Query: 565 SSRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
           ++R++A + +   Y++IS  A  +T+VL   +++ EV +S   FV  +        G + 
Sbjct: 464 TTRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKS 520

Query: 623 VIQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
           +IQ++  G R +     +G    +   + P            T++S S     VL+G+S+
Sbjct: 521 LIQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSN 572

Query: 678 GSIRLLVGDPS 688
             +     DP+
Sbjct: 573 RELCYFEIDPT 583


>gi|299751161|ref|XP_001830098.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
 gi|298409248|gb|EAU91763.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
          Length = 1205

 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 89/389 (22%), Positives = 167/389 (42%), Gaps = 61/389 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1122
            + I +P  A    +T AT+P++++E A ++ VV   +T     E  L +GTA  ++    
Sbjct: 852  IHIFDPMEA----KTVATLPLKANEAAFSIAVVPFASTG---GEYHLVVGTAMHHLVTPP 904

Query: 1123 VAARGRVLLFSTGRNADNPQNL-VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
             A+   + ++         + L  T +   EL     AL + QG LL   G  + ++   
Sbjct: 905  QASASYLKVYKIVNEGTGLELLHETPIQDSELP---RALLAFQGRLLAGVGKALRIYDLG 961

Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK------- 1234
              +L   A   +P   +VSL    + I++GD+ +S  F  +KE   +L +          
Sbjct: 962  KKKLLRKAETKSP-TAIVSLATQGSRIVIGDMQESTLFAVYKEAENRLLIFGDDTQPRWV 1020

Query: 1235 ------DFGSL---DCFATEFL--IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
                  D+ ++   D F   F+  +D ST+S  V ++     I +    ++ +    K+L
Sbjct: 1021 SAMTMVDYNTVAVGDKFGNIFVNRLD-STISDQVDEDPTGAGILHEKATLNGAPHKTKML 1079

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
              A FHVG  +T   ++ ++                 R  LL+  L G+IG + PL   +
Sbjct: 1080 --AHFHVGDIITSIHKVSLVV--------------GGREVLLYTGLQGTIGILVPLTSKE 1123

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
            ++ F  L  L++ + +    + G +  S+R ++   KA       ++D +L   Y  L  
Sbjct: 1124 DIEF--LTMLEQHIRNEQGSLVGRDHLSWRGYYVPVKA-------VIDGDLCETYGGLSS 1174

Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
             +Q  IA +   T   +L  L+ + + +S
Sbjct: 1175 SKQSAIASELDRTVGDVLKKLDQMRVASS 1203


>gi|302916981|ref|XP_003052301.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256733240|gb|EEU46588.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1212

 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 105/488 (21%), Positives = 206/488 (42%), Gaps = 57/488 (11%)

Query: 964  NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQK-IPLKATPHQITYFAEKNLYPLIV 1020
             C  G + +  Q   I  I +L  G T      +QK IPL  TP ++    E+ L+  I 
Sbjct: 761  QCEEGIVGIQGQSLRIFNIDRL--GDTL-----IQKSIPLTYTPKKLVKHPEQPLFYTIE 813

Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1078
            +     P      LL D  V +  D+  L   D         +   + +++P    G  Q
Sbjct: 814  ADNNTLPPELRAQLLADPNVVNG-DSQVLPPEDFGYPRANRRWASCINVVDPLSEEG--Q 870

Query: 1079 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1135
               T+  +++E A++  VV+     +++NE  L +GT    G+D+    +   +S G   
Sbjct: 871  VLQTVHFENNEAAVSATVVSF---ASQDNENFLVVGT----GKDMIVNPQS--YSEGYLY 921

Query: 1136 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
              R  ++ + L   ++  +++    AL   QG + +A G ++ ++     ++   A  + 
Sbjct: 922  IYRFVEDGREL-EFIHKTKIEEPPLALLPFQGKVAVAVGTQLRIYDLGMRQMLRKAQAEV 980

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
                +VSLN   + I++GD+ + +  + +K    +L   A D  +     T  ++D    
Sbjct: 981  SAQRIVSLNTQGSRIVVGDVQQGVTLVVYKSATNKLIPFADDTVARWTTCTT-MVDYE-- 1037

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQMLAT 1305
            S+   D+  N+ I     K SE    ++     + +R   H   H    +     Q + T
Sbjct: 1038 SIAGGDKFGNMFIVRCPEKASEEADEEQSGLHLINARDYLHGTPHRLGLMCHFYTQDVPT 1097

Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
            S  +T    G  +     LL+  + G+IG   P    ++  F   Q+L++ L    P +A
Sbjct: 1098 SITKTSLVVGGQEI----LLWSGIMGTIGVFIPFVSREDADF--FQNLEQHLRTEDPPLA 1151

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
            G +   +R +++  K        ++D +L   Y +LP +++  IA +   +  +I   ++
Sbjct: 1152 GRDHLMYRGYYAPVKG-------VIDGDLCERYNLLPNDKKQMIAGELDRSVREIERKIS 1204

Query: 1423 DLALGTSF 1430
            D+   ++F
Sbjct: 1205 DIRTRSAF 1212


>gi|19114492|ref|NP_593580.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe 972h-]
 gi|46395602|sp|O13807.1|DDB1_SCHPO RecName: Full=DNA damage-binding protein 1; AltName:
           Full=Damage-specific DNA-binding protein 1
 gi|2330717|emb|CAB11219.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe]
          Length = 1072

 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 95/492 (19%), Positives = 190/492 (38%), Gaps = 97/492 (19%)

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
           RES   GPL+ VDP  R   + VY   + I+   +     +   +       FS RI+  
Sbjct: 111 RES-QSGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQEL 169

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V+           D   ++    P + +L++   +     ++K              ++
Sbjct: 170 NVV-----------DIAMLYNSSRPSLAVLYKDSKSIVHLSTYK------------INVR 206

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
           +  +    + + HD  +   +PS  GGV V G   ++Y S+    +  L  Y        
Sbjct: 207 EQEIDEDDV-VCHDIEEGKLIPSENGGVFVFGEMYVYYISKDIQVSKLLLTY-------- 257

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
             P ++FS  +     T L + + +++ ++G L     ++    V  ++L K   S + S
Sbjct: 258 --PITAFSPSISNDPETGLDSSIYIVADESGMLYKFKALFTDETVS-MELEKLGESSIAS 314

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +  + ++  F+GS   +S+L+Q                         PS  +      +
Sbjct: 315 CLIALPDNHLFVGSHFNNSVLLQL------------------------PSITK-NNHKLE 349

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            LQ+ VN   +S +    + T S+  T S A +D     G L+     + I         
Sbjct: 350 ILQNFVNIAPISDFIIDDDQTGSSIITCSGAYKD-----GTLRIIRNSINI--------- 395

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
               N  L+E+ G K  ++V            S  A YD+  + +L +  E R +++   
Sbjct: 396 ---ENVALIEMEGIKDFFSV------------SFRANYDN--YIFLSLICETRAIIVSPE 438

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
            +    + + D   +  TI    ++G  +++Q+  +  R+ DG  +   +S  P +   G
Sbjct: 439 GVF---SANHDLSCEESTIFVSTIYGNSQILQITTKEIRLFDGKKLHSWIS--PMSITCG 493

Query: 654 SGSENSTVLSVS 665
           S   ++  ++V+
Sbjct: 494 SSFADNVCVAVA 505



 Score = 42.0 bits (97), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 64/290 (22%), Positives = 124/290 (42%), Gaps = 38/290 (13%)

Query: 1111 LAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
            + +GT +    +D    GR+++F     +DN   +  E    +++G+++ L  L  HL++
Sbjct: 770  VVVGTGFNFPDQDAPDSGRLMVFEM--TSDNNIEMQAE---HKVQGSVNTLV-LYKHLIV 823

Query: 1170 A--SGPKIILHKWTGTE--LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
            A  +    I     GT    N I      P Y + +++ ++ I+  D+ KSI  L + + 
Sbjct: 824  AGINASVCIFEYEHGTMHVRNSIR----TPTYTIDISVNQDEIIAADLMKSITVLQFIDD 879

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY---YAPKMSESWKGQKL 1282
              QL  +A+D+  L   + E L   S     V++   N  I      +P++S+    +KL
Sbjct: 880  --QLIEVARDYHPLWATSVEIL---SERKYFVTEADGNAVILLRDNVSPQLSDR---KKL 931

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
                +F++G  + K  R        D++   P         LL  T+DGS+  +      
Sbjct: 932  RWYKKFYLGELINK-TRHCTFIEPQDKSLVTP--------QLLCATVDGSLMIVGDAGMS 982

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
                L  LQ  +   +P   GL+ + ++++    +     P  ++D  L+
Sbjct: 983  NTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET---SPSDLIDGSLI 1029


>gi|66811906|ref|XP_640132.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|74854972|sp|Q54SA7.1|SF3B3_DICDI RecName: Full=Probable splicing factor 3B subunit 3
 gi|60468134|gb|EAL66144.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1256

 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 54/289 (18%), Positives = 127/289 (43%), Gaps = 29/289 (10%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            +Y  E++  + A+A  QG L+   G  I ++     +L         P  +V+++ + + 
Sbjct: 979  LYKTEVEEPVYAMAQFQGKLVCGVGKSIRIYDMGKKKLLRKCETKNLPNTIVNIHSLGDR 1038

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
            +++GDI +SI+F+ +K     L + A D        +  ++D  T++   +D+  NI + 
Sbjct: 1039 LVVGDIQESIHFIKYKRSENMLYVFADDLAP-RWMTSSVMLDYDTVA--GADKFGNIFVL 1095

Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGA---------HVTKFLRLQMLATSSDRTGAAPGSDK 1318
                 +S+  +     ++ +F  G          H+  F     + T +  +    G + 
Sbjct: 1096 RLPLLISDEVEEDPTGTKLKFESGTLNGAPHKLDHIANFFVGDTVTTLNKTSLVVGGPE- 1154

Query: 1319 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
                 +L+ T+ G+IG + P    +++ F    +L+  +      + G +  ++R ++  
Sbjct: 1155 ----VILYTTISGAIGALIPFTSREDVDF--FSTLEMNMRSDCLPLCGRDHLAYRSYYFP 1208

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             K       +I+D +L   +  L  ++QL I+ +   + S+++  L ++
Sbjct: 1209 VK-------NIIDGDLCEQFSTLNYQKQLSISEELSRSPSEVIKKLEEI 1250



 Score = 43.9 bits (102), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 124/338 (36%), Gaps = 75/338 (22%)

Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQN 374
           GGVLV   + I Y +Q  +            +    +PR   S      L  +H++  Q 
Sbjct: 256 GGVLVASEDYIVYRNQDHA------------EVRSRIPRRYGSDPNKGVLIISHSSHKQK 303

Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
            +   L+ ++ GDL  +T+ Y G  V  ++++  +  VL + +T + N   F  S  GD 
Sbjct: 304 GMFFFLVQSEHGDLYKITLDYQGDQVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDH 363

Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL----RRSSSDALQDMVNGEELSLYG 488
            L  F         S G +EE G  +        L    R S    ++++ N E  S   
Sbjct: 364 TLYFFK--------SIGDEEEEGQAKRLEDKDGHLWFTPRNSCGTKMEELKNLEPTSHLS 415

Query: 489 SASNNTESAQKTFSFAVRDSLVNIGP-------------LKDFSYGLRINADASATGISK 535
           S S           F V D +    P             LK   +GL +    +A     
Sbjct: 416 SLS-------PIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTAN---- 464

Query: 536 QSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                   LPG   GIWTV   +S   NA         D+   Y+++S    T VL   D
Sbjct: 465 --------LPGVPSGIWTVPKSTS--PNA--------IDQTDKYIVVSFVGTTSVLSVGD 506

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            + E  ES    ++  T       G   +IQVF  G R
Sbjct: 507 TIQENHES--GILETTTTLLVKSMGDDAIIQVFPTGFR 542


>gi|331238007|ref|XP_003331659.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|309310649|gb|EFP87240.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1213

 Score = 54.3 bits (129), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 91/398 (22%), Positives = 169/398 (42%), Gaps = 68/398 (17%)

Query: 1070 PDRAGGPWQT--RATIPMQSS----------ENALTVRVVTL-FNTTTKENETLLAIGTA 1116
            P    G W +  R + P++ +          E A ++ +V+  F       E LL +G+A
Sbjct: 847  PRAEAGKWASCIRISDPIEKTTLVREDLGDNEAATSLAIVSFAFAAHHPATEPLLVVGSA 906

Query: 1117 ---YVQGEDVAARG--RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
               +VQ       G  RV  F       N   ++  V+  E+    SAL   QG L    
Sbjct: 907  KDAFVQPR-TCKNGFLRVYRFV------NDGKVIELVHKTEVDEMPSALVGFQGRLAAGV 959

Query: 1172 GPKIILHKWTGTEL----NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
            G  + ++     +L       +F  A    ++SL++  + IL+GD   S+ +  +K    
Sbjct: 960  GKALRIYDLGKKKLLRKVENKSFGSA----IISLSVQGSRILVGDSQDSVSYAVYKPAEN 1015

Query: 1228 QLNLLAKDF-GSLDCFATEFLIDGSTLS-------LVVSDEQKNIQIFYYAPKMSESWKG 1279
            +L + A D        AT  ++D  T++       L VS   KN+     + ++ E   G
Sbjct: 1016 RLIVFADDVVPRWTTCAT--MVDYDTVAGGDRFGNLWVSRLPKNV-----SDEVDEDPTG 1068

Query: 1280 QKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
              ++    + +GA H  K L    L  + TS  +T   PG     R  LL+  + GSIG 
Sbjct: 1069 AGIMHEKGYLMGAPHKLKNLVHFHLNDIPTSIQKTSLVPG----GREVLLYTGVQGSIGI 1124

Query: 1336 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            + P    +++ F   Q+L+  + + +P + G +  ++R ++   K       + VD +L 
Sbjct: 1125 LVPFISKEDVDF--FQTLEMHMRNEMPSLVGRDHLAYRGYYFPVK-------NCVDGDLC 1175

Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
              + +LP  +QL++A +   + S +L  +  + + + +
Sbjct: 1176 ESFALLPSAKQLQVASELDRSVSDVLKKIEAVRVSSGY 1213


>gi|440473070|gb|ELQ41892.1| pre-mRNA-splicing factor rse-1 [Magnaporthe oryzae Y34]
          Length = 1229

 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 82/356 (23%), Positives = 158/356 (44%), Gaps = 52/356 (14%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
            I + ++E AL++ VV+     +++ E+ L +GT    G+D+    R   F+ G     R 
Sbjct: 879  IDLDNNEAALSMAVVSF---ASQDGESFLVVGT----GKDMVVNPR--RFTEGYIHVYRF 929

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
            +++ + L   ++  +++   +AL   QG L+   G  + ++     +L   A  +  P  
Sbjct: 930  SEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGIGRMLRIYDLGLRQLLRKAQAEVAPQL 988

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            +VSLN   + I++GD+   + ++++K +  +L   A D  +     T  +   ST     
Sbjct: 989  IVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAG--- 1045

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL-----RLQMLA-------- 1304
            +D+  N+ I     K S+    +     +E H+  H   +L     RL ++A        
Sbjct: 1046 ADKFGNLWILRCPEKASQ----ESDEPGSEVHL-VHSRDYLHGTSNRLALMAHVYTQDIP 1100

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
            TS  +T    G  +     LL+G   G+IG + P    ++  F   QSL++ L    P +
Sbjct: 1101 TSICKTNLVVGGQE----VLLWGGFQGTIGVLIPFVSREDADF--FQSLEQHLRSEDPPL 1154

Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
            AG +   +R  +   K        ++D +L   Y MLP +++  IA +   +  +I
Sbjct: 1155 AGRDHLMYRGCYVPVKG-------VIDGDLCERYTMLPNDKKQMIAGELDRSVREI 1203


>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
            3D7]
 gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
            3D7]
          Length = 1329

 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 67/286 (23%), Positives = 113/286 (39%), Gaps = 45/286 (15%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
               S  G L+ + G K+ ++     +L     Y   P  +VS+ I  N I   DI +S+ 
Sbjct: 1064 CFCSYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKISGNRIFACDIRESVL 1123

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
               +      L L++ D        +E L   + ++         L V +E K  +    
Sbjct: 1124 IFFYDPNQNTLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFILRVPEEAKQDEYGIT 1183

Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
               +Y    M+ S K +KL     FH+G  VT   ++++  TSS+               
Sbjct: 1184 NKCWYGGEIMNSSTKNRKLEHMMSFHIGEIVTSMQKVRLSPTSSE--------------C 1229

Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
            +++ T+ G+IG   P D     ELT    Q L+  L    P + G     FR ++   + 
Sbjct: 1230 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTEKPPLCGREHIFFRSYYHPVQ- 1284

Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                  ++VD +L   +  L  + Q +IA+    T   IL  L D+
Sbjct: 1285 ------NVVDGDLCEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324



 Score = 48.9 bits (115), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 80/379 (21%), Positives = 147/379 (38%), Gaps = 64/379 (16%)

Query: 304 LPHDAYKLLAVPSPIG-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQEL 355
           LP D    L +P P G     GVL+   N + Y     +   CA     Y   L+  Q+ 
Sbjct: 261 LPIDITAHLLIPLPGGQQGPSGVLICCENFLVYKKVDHEDIYCA-----YPRRLEIGQDK 315

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
             S     +      +      L+ ++ GDL  + V ++  +V+ +     +   + + I
Sbjct: 316 NISIICWTMHRIKKFFF----ILIQSEYGDLYKIEVDHEDGIVKEIVCKYFDTVPIGNSI 371

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           + + +   F+ +  G+    QF+  G              G     A  T +L+      
Sbjct: 372 SVLKSGSLFVAAEFGNHYFYQFSGIGDDNKQFMCTSNHPLGKNAIIAFKTNKLKNL---Y 428

Query: 475 LQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADAS 529
           L D +      L +    + NT + Q  ++   R      GP   L+   +GL I   A 
Sbjct: 429 LVDQIYSLSPILDMKIIDAKNTHTPQ-IYTLCGR------GPRSSLRILQHGLSIEELAD 481

Query: 530 ATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
                        ELPG  K IWT+   +                EY  Y+++S E  T+
Sbjct: 482 N------------ELPGKPKYIWTIKKDNL--------------SEYDGYIVVSFEGNTL 515

Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPS 648
           +LE  + + EV++++   +   T    N+      IQV++ G R ++G  + + ++  P 
Sbjct: 516 ILEIGESVEEVSDTL--LLNNVTTLHINILYDNSFIQVYDTGIRHINGKVVQEWVA--PK 571

Query: 649 NSESGSGSENSTVLSVSIA 667
           N +  + S NS+ + +S++
Sbjct: 572 NKQIKAASSNSSQIVISLS 590


>gi|402077250|gb|EJT72599.1| pre-mRNA-splicing factor RSE1 [Gaeumannomyces graminis var. tritici
            R3-111a-1]
          Length = 1216

 Score = 53.9 bits (128), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 111/505 (21%), Positives = 214/505 (42%), Gaps = 87/505 (17%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK-IPLKATPHQI----------TYFAE 1012
             C  G + V  Q  L+IC +  G   D+   +QK I L  TP ++          T  AE
Sbjct: 761  QCEEGMVGVQGQ-FLRICAI--GKLGDSM--IQKSISLAYTPKKLIKNPTHPIFYTIEAE 815

Query: 1013 KNLYPLIVSVPVLKP---LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
             N  P  +   +L     +N    +L  +E G+   N   +S             + +++
Sbjct: 816  NNTLPPELREQLLAAPTAVNGDTKVLPPEEFGYPRGNGRWASC------------ISVVD 863

Query: 1070 PDRAGGPWQTRA--TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG 1127
            P   G   +      I ++++E  ++V VV      ++++E+ L +GT    G+D+    
Sbjct: 864  PLGDGEEREPSVLQQIHLENNEATVSVAVVPF---ASQDSESFLVVGT----GKDMVLNP 916

Query: 1128 RVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
            R   F+ G     R  ++ + L   ++  +++    AL + QG L+   G  + ++    
Sbjct: 917  RC--FTEGYIHVYRFLEDGREL-EFIHKTKVEEPPMALLAFQGKLVAGVGRSLRIYDLGL 973

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
             +L   A  +  P  +VSL    + I++GD    + ++++K++  +L   A D  S+  +
Sbjct: 974  RQLLRKAQSEVAPRVIVSLQTQGSRIVVGDSQHGLIYVAYKQEANKLIAFADD--SIQRW 1031

Query: 1243 AT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL--- 1298
             T   ++D    S    D+  NI I     K S+          +E H+  H   +L   
Sbjct: 1032 TTCSTMVDYE--STAGGDKFGNIWILRCPEKASQEADQPG----SEVHL-MHARDYLHGT 1084

Query: 1299 --RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFR 1345
              RL        Q +ATS  +T    G  +     LL+G + G+IG + P    ++  F 
Sbjct: 1085 SNRLALMAHVYTQDIATSICKTNLVVGGQE----VLLWGGIQGTIGVLIPFVSREDADF- 1139

Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
              Q+L++ +    P +AG +   +R ++   K        ++D +L   + MLP +++  
Sbjct: 1140 -FQTLEQHMRSEDPPLAGRDHLMYRSYYVPVKG-------VIDGDLCERFTMLPNDKKQM 1191

Query: 1406 IAHQTGTTRSQILSNLNDLALGTSF 1430
            IA +   +  +I   ++D+   ++F
Sbjct: 1192 IAGELDRSVREIERKISDIRTRSAF 1216



 Score = 43.5 bits (101), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 60/279 (21%), Positives = 110/279 (39%), Gaps = 68/279 (24%)

Query: 378 LLSTKTGDLVLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
           LL T+ GDL  +T+      +G     VQRL +   +   ++S++  + +   F+ S  G
Sbjct: 310 LLQTEDGDLFKVTIDMLEDAEGNTTGEVQRLKIKYFDTIPVSSNLCILKSGFLFVASEFG 369

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
           +    QF              E+ GD        + L  SS +   D     E + +   
Sbjct: 370 NHHFYQF--------------EKLGD------DDEELEFSSENFPSDPAEPYEPAYF--- 406

Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV---- 542
                  + T + A+ +S+ ++ PL D       + DA    + +G   +S + ++    
Sbjct: 407 -----YPRPTENLALVESVESMNPLMDLKVANLTDEDAPQIYTVSGNGARSTFRMLKHGL 461

Query: 543 --------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   +LPG    +WT                 A DD+Y +Y+++S    T+VL   
Sbjct: 462 EVNEIVASQLPGTPSAVWTT--------------KIARDDQYDSYIVLSFTNGTLVLSIG 507

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           + + EV+++   F+   +  A    G   ++QV  RG R
Sbjct: 508 ETVEEVSDT--GFLSSVSTLAVQQLGEDGLVQVHPRGIR 544


>gi|336369683|gb|EGN98024.1| hypothetical protein SERLA73DRAFT_109335 [Serpula lacrymans var.
            lacrymans S7.3]
 gi|336382464|gb|EGO23614.1| hypothetical protein SERLADRAFT_449959 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1257

 Score = 53.9 bits (128), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 121/269 (44%), Gaps = 26/269 (9%)

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
            T +  GT     E   ++GR+L+F    +    +  +T   S E++G + A+ S+ G ++
Sbjct: 940  TYICAGTYKYVDEVEPSQGRLLVFD-AEDGSLLREKITMAVSLEVRGCVYAVGSVNGMII 998

Query: 1169 IASGPKIILHKW---TGTEL---NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
             A    +++++      T+L   + I  ++   L V +L    + IL+GD   SI FL  
Sbjct: 999  AAINSSVVVYRPEIDASTQLLALHKITEWNHNYL-VTNLVCRGDKILVGDAINSISFLRM 1057

Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
             E  +Q+  LA+D+GSL     E L   S +         +  +F +A  + E+   + L
Sbjct: 1058 VE--SQIQCLARDYGSLWPVCVEMLDQSSIIG-----ANSDYNLFTFA--LQETELRKSL 1108

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DE 1341
                 +++G  V KF+   +  T+ D +   P   K      LF T  G IG I  + DE
Sbjct: 1109 ERDGSYYIGDMVNKFIPGAL--TAHDVSVDMPLEPKQ-----LFFTSTGCIGVIVDMGDE 1161

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
            L+   + +LQ+ +   +    G+    FR
Sbjct: 1162 LSL-HMTALQRNMSTYLSQTKGVTHTKFR 1189


>gi|310796681|gb|EFQ32142.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1163

 Score = 53.9 bits (128), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 73/322 (22%), Positives = 136/322 (42%), Gaps = 41/322 (12%)

Query: 1113 IGTAYVQGEDVA----ARGRVLLFSTGRNAD-NPQNLVTEVYSKELKGAISALASLQGHL 1167
            +GT+Y+   D+      +GR+L+   G ++D NP  +V    S ELKGA  +L  +   L
Sbjct: 847  VGTSYLADPDIDESGDTKGRILVL--GVDSDKNPYLIV----SHELKGACRSLGVMGEKL 900

Query: 1168 LIASGPKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            +      ++++ +     T   L  +A +  P  + V +++  N I + D+ +S+  + +
Sbjct: 901  VAGLSKTVVVYDYVEESTTSGALRKLATF-RPSTFPVDIDVHGNMIGIADLMQSLTLVEF 959

Query: 1223 --KEQGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
               + G +  L+ +       +AT    ++G   S + +D Q N+ +    P        
Sbjct: 960  VPAQDGNKAKLVERARHFQYIWATSVCHLEGH--SWLEADAQGNLMVLRRNPNAPTEHDR 1017

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
            +++    EFH+G  V K   L +    +D     P   K         T++GS+   A +
Sbjct: 1018 KQMEVTGEFHLGEQVNKIRSLDITPNEND-----PIIPKA-----FLATVEGSLYVFADI 1067

Query: 1340 DELTFRRLQSLQKKLVDSVPHV--------AGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
                   L   Q++L + V  +        +GL+  ++R F  N K    GP   VD EL
Sbjct: 1068 KSEYQSLLIQFQERLAEVVRALGQADGEPGSGLSFTTWRGFR-NAKRAAEGPFRFVDGEL 1126

Query: 1392 LSHYEMLPLEEQLEIAHQTGTT 1413
            +  +  L   +Q  +    G T
Sbjct: 1127 IERFLDLDEAKQEAVVQGLGPT 1148


>gi|159489018|ref|XP_001702494.1| UV-damaged DNA binding complex subunit 1 protein [Chlamydomonas
            reinhardtii]
 gi|158280516|gb|EDP06273.1| UV-damaged DNA binding complex subunit 1 protein [Chlamydomonas
            reinhardtii]
          Length = 1147

 Score = 53.9 bits (128), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 61/248 (24%), Positives = 105/248 (42%), Gaps = 28/248 (11%)

Query: 1144 LVTEVYSKELKGAISALASLQGHLLIAS-GPKIILHKWTGTELNGI-----AFYDAPPLY 1197
            LVTE   KE+KGA   +    G  ++AS   K+ +++W   E +G      A+  A   +
Sbjct: 895  LVTE---KEVKGAAYNVRPFAGDKILASVNNKVTVYRWVVREGSGGPGGCGAYELASECH 951

Query: 1198 ----VVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
                V++L +      +++GD+ +S+  LS+  +   L   A D+ S    A E L D +
Sbjct: 952  HLGNVLALYLAARGGLVVVGDLMRSVSLLSYNAEQGVLEHRAADYNSGWTTAVEILDDDN 1011

Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR--LQMLATSSDR 1309
             ++   +D   N+ +       +   +  +L    EFH G  + +     L M    S+ 
Sbjct: 1012 YIA---ADNHCNLYVVRRNADSATDEERARLQVVGEFHTGTFINQMRNGSLVMRLPDSEH 1068

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
             G  P         LLF   DG +G +A L    +     LQ  +   V  V GL+   +
Sbjct: 1069 AGLPP--------PLLFAGTDGRLGVVARLPPALYEWATKLQTAMRSVVRGVGGLDHEQW 1120

Query: 1370 RQFHSNGK 1377
            R F ++ +
Sbjct: 1121 RAFANDRR 1128


>gi|47230701|emb|CAF99894.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 953

 Score = 53.5 bits (127), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 56/216 (25%), Positives = 92/216 (42%), Gaps = 34/216 (15%)

Query: 1174 KIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
            ++ L++WT      TE N   + +   LY   L    +FIL+GD+ +S+  L++K     
Sbjct: 707  QVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPMEGN 761

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
               +A+DF      A E L D + L    ++   N+ +       +   + Q L     F
Sbjct: 762  FEEIARDFNPNWMSAIEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVGVF 818

Query: 1289 HVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
            H+G  V  F    L LQ L  +S  T            ++LFGT++G IG +  L E  +
Sbjct: 819  HLGEFVNVFCHGSLVLQNLGETSTPTQG----------SVLFGTVNGMIGLVTSLSEGWY 868

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
              L  LQ +L   +         ++R FH+  K  +
Sbjct: 869  SLLLDLQNRLNKVI-------KTTWRSFHTERKTEQ 897



 Score = 48.1 bits (113), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 71/341 (20%), Positives = 130/341 (38%), Gaps = 64/341 (18%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 68  WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 123

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 124 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 173

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+       S L         +++++   +      +  
Sbjct: 174 CLTYLDNGVVFVGSRLGDSQLVKVRVTHSLSEL---------NVDSNDQGSFVTVMETFT 224

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            L  +V+   + L         +    F           G L+    G+ I+  AS    
Sbjct: 225 NLGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS---- 271

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  ++ R              E    L++S   +T VL  +
Sbjct: 272 --------IDLPGIKGLWPLRSEAGR--------------ETDDMLVLSFVGQTRVLMLS 309

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+    ++IQ+     R++
Sbjct: 310 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 349


>gi|383847297|ref|XP_003699291.1| PREDICTED: splicing factor 3B subunit 3-like [Megachile rotundata]
          Length = 1217

 Score = 53.5 bits (127), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            A+   QG +L+  G  + L+     +L         P  +VS+N +   I + D+ +S+Y
Sbjct: 951  AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAIVSINAIGQRIYVSDVQESVY 1010

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1276
             + +K Q  QL + A D        T  ++D  T++   +D+  NI +   A  +++   
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLASGINDDVD 1067

Query: 1277 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
                     W         QK  + A FHVG  V    +  ++          PG  ++ 
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116

Query: 1321 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
               L++ TL G++G + P    ++  F   Q L+  +    P + G +  SFR ++   K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171

Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
                   +++D +L   +  +   +Q  IA     T S++   L D+    +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPAKQKSIAGDLERTPSEVSKKLEDIRTRYAF 1217



 Score = 42.4 bits (98), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
                       SS +  E GD    AP   R  +     D+L  ++  +   L   A+ 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           +T     T     R +L      +   +GL +            S   + ELPG    +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  +               D+EY AY+I+S    T+VL   + + EVT+S   F+    
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504

Query: 612 IAAGNLFGRRRVIQVFERGAR 632
             + +  G   ++QV+  G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525


>gi|307166104|gb|EFN60356.1| Splicing factor 3B subunit 3 [Camponotus floridanus]
          Length = 1201

 Score = 53.5 bits (127), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 65/297 (21%), Positives = 122/297 (41%), Gaps = 55/297 (18%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            A+   QG +L+  G  + L+     +L         P  VVS+N +   I + D+ +S+Y
Sbjct: 935  AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 994

Query: 1219 FLSWKEQGAQLNLLAKD----FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
             + +K Q  QL + A D    F +  C     ++D  T++   +D+  NI +   A  ++
Sbjct: 995  AVRYKRQENQLIVFADDTHPRFITTTC-----VLDYDTVA--TADKYGNIAVIRLATGIN 1047

Query: 1275 ES-----------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
            +            W         QK  + A FHVG  V    +  ++          PG 
Sbjct: 1048 DDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGG 1097

Query: 1317 DKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
             ++    L++ TL G++G + P    ++  F   Q L+  +    P + G +  SFR ++
Sbjct: 1098 SES----LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYY 1151

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
               K       +++D +L   +  +   +Q  I+     T S++   L D+    +F
Sbjct: 1152 YPVK-------NVIDGDLCEQFNSIEPVKQKSISGDLERTPSEVSKKLEDIRTRYAF 1201


>gi|268536658|ref|XP_002633464.1| C. briggsae CBR-DDB-1 protein [Caenorhabditis briggsae]
          Length = 1134

 Score = 53.5 bits (127), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 53/271 (19%), Positives = 115/271 (42%), Gaps = 15/271 (5%)

Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GT  +  ++   + GR+++F      D  +  +  V+   ++G+  AL  L G L+ A 
Sbjct: 823  VGTGLIYPDESDTKLGRIIVFEVD---DVERTKLRRVHELVVRGSPLALRILNGKLVAAI 879

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
               + L +WT  ++  +   +   +  + L ++   + + D+ +S+  LS++        
Sbjct: 880  NSSVRLFEWTADKVLRLECSNFNHIVALDLKVMNEEVAVADLMRSVSLLSYRMMEGNFEE 939

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF-HV 1290
            +AKD+ S      EF+   S L          +++    P   +   G+ +L    + ++
Sbjct: 940  VAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPITDD---GRYVLEPTGYWYL 996

Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1350
            G      +R  ++    D T             ++FGT  G+IG +  +D+   + L S+
Sbjct: 997  GELPKVMVRASLVVQPEDST-------IEYSHPIMFGTNQGTIGMLVQIDDKWKKFLVSI 1049

Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
            +K + DSV +   +   ++R F    +   P
Sbjct: 1050 EKAISDSVKNCMQIEHSTYRSFIFQKRIEPP 1080



 Score = 48.9 bits (115), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 71/137 (51%), Gaps = 19/137 (13%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           D+  L+ VP+P+GGV+V+GAN+  Y +   +  +    Y+ SL     L  + F+    +
Sbjct: 210 DSQVLIPVPAPVGGVIVLGANSALYKASDVNGDVV--PYSCSL-----LKNTIFTCHGIV 262

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV-VYDGR---VVQRLDLSKTNPSVLTSDITTIGN 420
           DA+       D  LL+   G L++L + + +GR    V+ + +     + +   +  + N
Sbjct: 263 DAS------GDRFLLADTDGRLLMLLLNIGEGRSGTTVKEMRIEYLGETSVADSVNYVDN 316

Query: 421 SLFFLGSRLGDSLLVQF 437
            + F+GSRLGDS L++ 
Sbjct: 317 GVVFVGSRLGDSQLIRL 333


>gi|325189950|emb|CCA24429.1| splicing factor putative [Albugo laibachii Nc14]
          Length = 1644

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 72/315 (22%), Positives = 129/315 (40%), Gaps = 45/315 (14%)

Query: 299 WSAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHYHSQS---ASCALALNNYAVSLDSS 352
           WS + +P  A KL+AVP      GGVLV+    I Y +++    SC+  L +     +  
Sbjct: 660 WSQV-VPRSANKLVAVPGGNDGPGGVLVIAQGLIQYQNENHPPLSCSFPLRSTG-GPNPV 717

Query: 353 QELPRSSFSVELDAAHATWLQNDV--ALLSTKTGDLVLLTVVYDGRVVQRLDLS--KTNP 408
           Q+  +  + + +  + AT  Q D+   L+ ++ GDL  +++ Y G  VQ+L +    T P
Sbjct: 718 QDERKQGYPMMI-VSTATHKQRDLFFVLMQSEWGDLFKISLEYAGSSVQKLRIQYFDTIP 776

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
             L   IT  G  L F  S   +  L QF         +  +     D E + PS    +
Sbjct: 777 VALALCITKTG--LLFAASEFSNHYLFQFLSIGEDDDAAQCVSAAENDQEPEIPSFSVRK 834

Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA 528
             +   + ++ +   ++         E   + ++   +    N   L+   +GL +   A
Sbjct: 835 LKNLAMISNIPSISPITQLLVDDFANEQTPQLYALCGQG---NRSSLRILRHGLPVMEMA 891

Query: 529 SATGISKQSNYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           ++             LPG  K +W +                ++ D    Y+++S E  T
Sbjct: 892 ASA------------LPGVAKAVWCLKE--------------SFTDTCDKYIVVSFEDAT 925

Query: 588 MVLETADLLTEVTES 602
           +VLE  D + E+T+S
Sbjct: 926 LVLEIGDTVEEITDS 940



 Score = 47.0 bits (110), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 66/300 (22%), Positives = 123/300 (41%), Gaps = 54/300 (18%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIA---FYDAPPLYVVSLNIV 1204
            V++  + G   A+   QG LL++ G  + ++     +L       ++ +P   ++ L   
Sbjct: 1369 VHTTPVDGIPYAMIEFQGRLLVSVGKVLRIYDLGKRKLLRKCENRYFTSP---MIDLKSA 1425

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKN 1263
             + I   D+H+SI+F+ +K +  QL   A D   +  F T   L+D  T++    D+  N
Sbjct: 1426 GDRIYASDVHESIHFVKYKAEDNQLITFADD--CVPHFMTSSTLLDYDTIA--GGDKFGN 1481

Query: 1264 IQIFYYAPKMSES----------WKG-------QKLLSRAEFHVGAHVTKFLRLQMLATS 1306
            + +     ++S+           W          KL   A+FHVG  +T  L        
Sbjct: 1482 VFVTRLPAEVSDEIDNPTGNRMLWDTGLLNGAPHKLEQIAQFHVGEVITSVL-------- 1533

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              RT   PG  +     +L+ T+ G IG + P    D++ F     L+  +      + G
Sbjct: 1534 --RTSLVPGGMEV----ILYTTILGRIGALVPFTSRDDVDF--YTHLEMYMRQEKAPLCG 1585

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
             +  S+R +    K       ++ D +L   +  L  ++Q  IA     T ++++  L D
Sbjct: 1586 RDHLSYRSYFIPVK-------NVTDGDLCEQFSSLGPDKQKNIAEDLDRTPTEVVKKLQD 1638


>gi|405117821|gb|AFR92596.1| pre-mRNA-splicing factor RSE1 [Cryptococcus neoformans var. grubii
            H99]
          Length = 1217

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 65/285 (22%), Positives = 120/285 (42%), Gaps = 32/285 (11%)

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            LA  QG LL   G  + L++     L      +  P  VV++N+    I++GD+ +S ++
Sbjct: 951  LAGFQGFLLAGVGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1010

Query: 1220 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
              ++     QL + A D     + C  +   +D  T++    D+  NI I    P +SE 
Sbjct: 1011 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1065

Query: 1277 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                  G  +L    F +GA H T+ +    +  + TS  +     G     R  L++ T
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIAHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1121

Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
            + G++G + P    D++ F     +  +  D  P   G +  ++R ++   K        
Sbjct: 1122 ISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISP--VGRDHIAYRGYYVPIKG------- 1172

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            +VD +L   + +LP  +Q  IA     +   +L  L  +   ++F
Sbjct: 1173 VVDGDLCESFSLLPYPKQQAIATDLDRSVGDVLKKLEQMRTSSAF 1217


>gi|332026090|gb|EGI66238.1| Splicing factor 3B subunit 3 [Acromyrmex echinatior]
          Length = 1217

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            A+   QG +L+  G  + L+     +L         P  VVS+N +   I + D+ +S+Y
Sbjct: 951  AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 1010

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1276
             + +K Q  QL + A D        T  ++D  T++   +D+  NI +   A  +++   
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLATGINDDVD 1067

Query: 1277 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
                     W         QK  + A FHVG  V    +  ++          PG  ++ 
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116

Query: 1321 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
               L++ TL G++G + P    ++  F   Q L+  +    P + G +  SFR ++   K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171

Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
                   +++D +L   +  +   +Q  I+     T S++   L D+    +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPTKQKSISGDLERTPSEVSKKLEDIRTRYAF 1217


>gi|307205956|gb|EFN84082.1| Splicing factor 3B subunit 3 [Harpegnathos saltator]
          Length = 1217

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            A+   QG +L+  G  + L+     +L         P  VVS+N +   I + D+ +S+Y
Sbjct: 951  AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 1010

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1276
             + +K Q  QL + A D        T  ++D  T++   +D+  NI +   A  +++   
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLATGINDDVD 1067

Query: 1277 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
                     W         QK  + A FHVG  V    +  ++          PG  ++ 
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116

Query: 1321 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
               L++ TL G++G + P    ++  F   Q L+  +    P + G +  SFR ++   K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171

Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
                   +++D +L   +  +   +Q  I+     T S++   L D+    +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPAKQKSISGDLERTPSEVSKKLEDIRTRYAF 1217


>gi|340960602|gb|EGS21783.1| hypothetical protein CTHT_0036510 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1100

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 76/308 (24%), Positives = 133/308 (43%), Gaps = 48/308 (15%)

Query: 1113 IGTAYVQGEDVA---------ARGRVLLFSTGRNAD-NPQNLVTEVYSKELKGAISALAS 1162
            +GT+Y+   D +         ARGR+L+   G ++D NP     ++ S +LKGA   LA 
Sbjct: 783  VGTSYLPDPDYSPAPSHGNPEARGRILVL--GIDSDRNPY----QILSYQLKGACRCLAV 836

Query: 1163 LQ-GHLLIASGPKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
            +  G +++     + + ++     T  +L  +A Y  P  Y V + I    I + D+ KS
Sbjct: 837  MDDGKVVVGLTKAVTVCEYKETSSTTAQLTKLASY-RPSTYPVEIAIHGRTIAVADLMKS 895

Query: 1217 IYFLSW--KEQGAQLNLL--AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            I  + +   E+G Q  L+  A+ + S    A  ++ +G  L    +D Q N+Q+      
Sbjct: 896  ISLVDYIPAEEGGQAKLVERARHYQSAWSTAVGYVQNGLWLE---ADAQGNLQVLRQNVD 952

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
                   +++   AE ++G  V +   + +   +S      P +          GT++G 
Sbjct: 953  GITEDDRKRMELTAEINLGEMVNRIRSITV--ETSPEALIIPRA--------FLGTVEGG 1002

Query: 1333 I---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP--GPDSIV 1387
            I   G IAP        L   Q+K+ D +  V   +  +FR + +   A R   GP   +
Sbjct: 1003 IYMFGTIAP---HALDLLLRFQEKVADVIKAVGDSDNANFRSYRAFKNAERVGHGPFRFL 1059

Query: 1388 DCELLSHY 1395
            D ELL  +
Sbjct: 1060 DGELLERF 1067


>gi|331221690|ref|XP_003323519.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|309302509|gb|EFP79100.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1213

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 70/302 (23%), Positives = 134/302 (44%), Gaps = 43/302 (14%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL----NGIAFYDAPPLYVVSLNI 1203
            V+  E+    SAL   QG L    G  + ++     +L       +F  A    ++SL++
Sbjct: 936  VHKTEVDEMPSALVGFQGRLAAGVGKALRIYDLGKKKLLRKVENKSFGSA----IISLSV 991

Query: 1204 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF-GSLDCFATEFLIDGSTLS-------L 1255
              + IL+GD   S+ +  +K    +L + A D        AT  ++D  T++       L
Sbjct: 992  QGSRILVGDSQDSVSYAVYKPAENRLIVFADDVVPRWTTCAT--MVDYDTVAGGDRFGNL 1049

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTG 1311
             VS   KN+     + ++ E   G  ++    + +GA H  K L    L  + TS  +T 
Sbjct: 1050 WVSRLPKNV-----SDEVDEDPTGAGIMHEKGYLMGAPHKLKNLVHFHLNDIPTSIQKTS 1104

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
              PG     R  LL+  + GSIG + P    +++ F   Q+L+  + + +P + G +  +
Sbjct: 1105 LVPG----GREVLLYTGVQGSIGILVPFISKEDVDF--FQTLEMHMRNEMPSLVGRDHLA 1158

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
            +R ++   K       + VD +L   + +LP  +QL++A +   + S +L  +  + + +
Sbjct: 1159 YRGYYFPVK-------NCVDGDLCESFALLPSAKQLQVASELDRSVSDVLKKIEAVRVSS 1211

Query: 1429 SF 1430
             +
Sbjct: 1212 GY 1213


>gi|221508103|gb|EEE33690.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 1878

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 67/292 (22%), Positives = 119/292 (40%), Gaps = 59/292 (20%)

Query: 1111 LAIGTAYVQGEDVAARGRVLLFS----------TGRNADNPQNLVT-------EVYSK-E 1152
            LA G      E+V   GRV LF            G   D P            E+++   
Sbjct: 1472 LAAGVGVPLSENVECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIV 1531

Query: 1153 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
            L G ++ + S       + +++ + GP++ +H+  G++    AF DA  + V S+  ++N
Sbjct: 1532 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDAS-VCVTSVANIRN 1590

Query: 1207 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1260
            F LLGD  K +  +SW+        ++  +++ F   +L   A  FL   + L ++ +D 
Sbjct: 1591 FFLLGDALKGLNLVSWEYHAEADSRKMIRVSRTFPKSNLPVVACSFLTYENLLGMLATDI 1650

Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
              N+++F Y                   +   +   F +L +L   ++   AA    K  
Sbjct: 1651 DGNVRLFCY-------------------NADKNSPGFEKLDILQCDAEDRCAAGCVVKLQ 1691

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRR--------LQSLQKKLVDSVPHVAGL 1364
            +F +   T+  S+G  A    L FR         LQ+LQ ++   +P   GL
Sbjct: 1692 QFVVDSETV-ASLGEAADSSTLVFRLLASDSYSFLQTLQDRMAQYLPEPLGL 1742


>gi|401426063|ref|XP_003877516.1| putative CPSF-domain protein [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322493761|emb|CBZ29051.1| putative CPSF-domain protein [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 1347

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 69/308 (22%), Positives = 120/308 (38%), Gaps = 37/308 (12%)

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1160
              + +E + LL IG+++   ++  AR   + +   R     Q L   + SK++ GA+   
Sbjct: 913  GVSKEEWQQLLLIGSSFTFPDEQRARSGRITWCALREEHQRQRL-HLIASKDIGGALQCC 971

Query: 1161 ASL---QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFI 1208
            A++   +G + +     + L+ W   +           G+      PLY  SL    + +
Sbjct: 972  AAVPHYKGRIALGVNGCVCLYNWNTEDQTFVAEERCRIGLTVTKLIPLYDTSL--AASVL 1029

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            +  D+  S +F+        L +L +D             D   L L   D+  N     
Sbjct: 1030 VALDVRHSAFFIEVDTLQGSLKVLCRDADLRGVMDGHIGSDAENLCLF--DDSLNFTAMK 1087

Query: 1269 YAPKMSE----------SWKGQ-KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
              P   E          S   Q +   RA+ H+G  VT  +R    A +S     A  S 
Sbjct: 1088 VVPLPVEAGDGDAAAAGSVTAQYRFEVRAQCHLGDLVT-CVRPGCFAATSLMEAPAACSL 1146

Query: 1318 KTNRF--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
              NR          L+F T  G  G + P+   T+  L++L+  LV ++  V GL+ ++F
Sbjct: 1147 SRNRLLLPGIAGPQLVFATAHGGFGVVTPVQAATYLVLRALEASLVRTLQPVGGLSHQAF 1206

Query: 1370 RQFHSNGK 1377
            R+    G+
Sbjct: 1207 REVLRAGQ 1214


>gi|221486318|gb|EEE24579.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 2804

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 68/286 (23%), Positives = 121/286 (42%), Gaps = 47/286 (16%)

Query: 1111 LAIGTAYVQGEDVAARGRVLLFS----------TGRNADNPQNLVT-------EVYSK-E 1152
            LA G      E+V   GRV LF            G   D P            E+++   
Sbjct: 2398 LAAGVGVPLSENVECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIV 2457

Query: 1153 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
            L G ++ + S       + +++ + GP++ +H+  G++    AF DA  + V S+  ++N
Sbjct: 2458 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDAS-VCVTSVANIRN 2516

Query: 1207 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1260
            F LLGD  K +  +SW+        ++  +++ F   +L   A  FL   + L ++ +D 
Sbjct: 2517 FFLLGDALKGLNLVSWEYHAEADSRKMIRVSRTFPKSNLPVVACSFLTYENLLGMLATDI 2576

Query: 1261 QKNIQIFYY-APKMSESWKGQKLLS-RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1318
              N+++F Y A K S  ++   +L   AE    A     ++LQ     S+   +    + 
Sbjct: 2577 DGNVRLFCYNADKNSPGFEKLDILQCDAEDRCAAGCV--VKLQQFVVDSETVASL--GEA 2632

Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
             +   L+F  L          D  +F  LQ+LQ ++   +P   GL
Sbjct: 2633 ADSSTLVFRLLAS--------DSYSF--LQTLQDRMAQYLPEPLGL 2668


>gi|322797581|gb|EFZ19622.1| hypothetical protein SINV_00421 [Solenopsis invicta]
          Length = 1217

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            A+   QG +L+  G  + L+     +L         P  VVS+N +   I + D+ +S+Y
Sbjct: 951  AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 1010

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1276
             + +K Q  QL + A D        T  ++D  T++   +D+  NI +   A  +++   
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLATGINDDVD 1067

Query: 1277 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
                     W         QK  + A FHVG  V    +  ++          PG  ++ 
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116

Query: 1321 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
               L++ TL G++G + P    ++  F   Q L+  +    P + G +  SFR ++   K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171

Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
                   +++D +L   +  +   +Q  I+     T S++   L D+    +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPTKQKSISGDLERTPSEVSKKLEDIRTRYAF 1217


>gi|157872916|ref|XP_001684981.1| putative CPSF-domain protein [Leishmania major strain Friedlin]
 gi|68128051|emb|CAJ06910.1| putative CPSF-domain protein [Leishmania major strain Friedlin]
          Length = 1347

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 69/311 (22%), Positives = 126/311 (40%), Gaps = 43/311 (13%)

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1160
              + +E + LL IG+++   ++  AR   + +   R     Q L   + SK++ GA+   
Sbjct: 913  GVSEEEWQHLLLIGSSFTFPDEQRARSGRITWCALREEHQQQRL-HLIASKDIGGALQCC 971

Query: 1161 ASL---QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFI 1208
            A++   +G + +     + L+KW   +           G+      PLY  SL    + +
Sbjct: 972  AAVPHYKGRIALGVNGCVCLYKWNTEDQTFVAEERCRVGLTVTKLIPLYHTSL--AASVL 1029

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDF---GSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
            +  D+  S +F+        L +L +D    G +D       I     +L + D+  N  
Sbjct: 1030 VALDVRHSAFFIEVDTLQGSLKVLCRDAELRGVMDGH-----IGSDAENLCLFDDSLNFT 1084

Query: 1266 IFYYAPKMSE----------SWKGQ-KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1314
                 P   E          S   Q +   RA+ H+G  VT  +R    A +S     A 
Sbjct: 1085 ALRVVPLPVEAGDGDAAAAASVTAQYRFEVRAQCHLGDLVT-CVRQGSFAATSLMEAPAS 1143

Query: 1315 GSDKTNRF--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1366
             +   NR          L+F T  G  G + P+   T+  L++L+  LV ++  + GL+ 
Sbjct: 1144 CASAQNRLLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRTLEASLVRTLQPLGGLSH 1203

Query: 1367 RSFRQFHSNGK 1377
            ++FR+   +G+
Sbjct: 1204 QAFREVLRSGQ 1214


>gi|237833631|ref|XP_002366113.1| hypothetical protein TGME49_024280 [Toxoplasma gondii ME49]
 gi|211963777|gb|EEA98972.1| hypothetical protein TGME49_024280 [Toxoplasma gondii ME49]
          Length = 2804

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 67/292 (22%), Positives = 119/292 (40%), Gaps = 59/292 (20%)

Query: 1111 LAIGTAYVQGEDVAARGRVLLFS----------TGRNADNPQNLVT-------EVYSK-E 1152
            LA G      E+V   GRV LF            G   D P            E+++   
Sbjct: 2398 LAAGVGVPLSENVECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIV 2457

Query: 1153 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
            L G ++ + S       + +++ + GP++ +H+  G++    AF DA  + V S+  ++N
Sbjct: 2458 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDAS-VCVTSVANIRN 2516

Query: 1207 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1260
            F LLGD  K +  +SW+        ++  +++ F   +L   A  FL   + L ++ +D 
Sbjct: 2517 FFLLGDALKGLNLVSWEYHAEADSRKMIRVSRTFPKSNLPVVACSFLTYENLLGMLATDI 2576

Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
              N+++F Y                   +   +   F +L +L   ++   AA    K  
Sbjct: 2577 DGNVRLFCY-------------------NADKNSPGFEKLDILQCDAEDRCAAGCVVKLQ 2617

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRR--------LQSLQKKLVDSVPHVAGL 1364
            +F +   T+  S+G  A    L FR         LQ+LQ ++   +P   GL
Sbjct: 2618 QFVVDSETV-ASLGEAADSSTLVFRLLASDSYSFLQTLQDRMAQYLPEPLGL 2668


>gi|46125735|ref|XP_387421.1| hypothetical protein FG07245.1 [Gibberella zeae PH-1]
          Length = 1208

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 103/475 (21%), Positives = 200/475 (42%), Gaps = 57/475 (12%)

Query: 964  NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQK-IPLKATPHQITYFAEKNLYPLIV 1020
             C  G + +  Q   I  I +L  G T      +QK IPL  TP ++    ++ L+  I 
Sbjct: 761  QCEEGIVGIQGQSLRIFNIDRL--GETL-----IQKSIPLTYTPKKLVKHPDQPLFYTIE 813

Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1078
            +     P      LL D  V +  D+  L   D         +   + +++P    G  Q
Sbjct: 814  ADNNTLPPELRAQLLADPGVVNG-DSRVLPPEDFGYPKGTRRWASCINVIDPLSEEG--Q 870

Query: 1079 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1135
               TI ++++E A++  +V     ++++NE+ L IGT    G+D+    R   FS G   
Sbjct: 871  VLQTIDLENNEAAVSAAIVPF---SSQDNESFLVIGT----GKDMVVNPRS--FSEGYLH 921

Query: 1136 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
              R  +  + L   ++  +++    AL + QG +L+A G  + ++     ++   +  + 
Sbjct: 922  IYRFLEGGREL-EFIHKTKVEEPPLALLAFQGRVLVAVGTSLRIYDLGMRQMLRKSQAEV 980

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
                +VSLN   + I++GD+ + + ++ +K    +L     D  +     T  ++D    
Sbjct: 981  ATQQIVSLNTQGSRIIVGDVQQGVTYVVYKPASNKLIPFVDDTIARWTTCTT-MVDYE-- 1037

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQMLAT 1305
            S+   D+  N+ I     K SE    ++     + +R   H   H    +     Q + T
Sbjct: 1038 SVAGGDKFGNMFIVRCPEKASEEADEEQSGLHLINARDYLHGTPHRVSLMCHFYTQDIPT 1097

Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
            S  +     G  +     LL+  + G+IG   P    ++  F   Q+L++ L    P +A
Sbjct: 1098 SITKASLVVGGQE----VLLWSGIMGTIGVFIPFVSREDADF--FQNLEQHLRTEDPPLA 1151

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
            G +   +R +++  K        ++D +L   Y +LP +++  IA +   +  +I
Sbjct: 1152 GRDHLMYRGYYAPVKG-------VIDGDLCERYNLLPNDKKQMIAGELDRSVREI 1199


>gi|321249291|ref|XP_003191408.1| U2 snRNA binding protein [Cryptococcus gattii WM276]
 gi|317457875|gb|ADV19621.1| U2 snRNA binding protein, putative [Cryptococcus gattii WM276]
          Length = 1217

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 117/546 (21%), Positives = 206/546 (37%), Gaps = 80/546 (14%)

Query: 919  NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
            N+ G        SR      +++ L+  P + D    A++ L    C  G I + S   L
Sbjct: 718  NVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWS-LSAAMCPDGLIGI-SGNTL 775

Query: 979  KICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1038
            +I  +P           +K+   +TP  +TY   K          +  P N V  ++   
Sbjct: 776  RIFSIPKLG--------EKLKQDSTP--LTYTPRKF---------ISHPFNPVFYMI--- 813

Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV- 1097
                + D+   S   + R    +E E R ++      P          +   A  VRV+ 
Sbjct: 814  ----EADHRTYSKGAIERIVKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLD 869

Query: 1098 -----TLFNTTTKENETLLAIGTAYVQ---GEDVAARG---RVLLFSTGRN-------AD 1139
                 T+      E+E   +I  AY +   GE     G   +  L   G         A 
Sbjct: 870  PLANETIMTFDLDEDEAAFSIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAI 929

Query: 1140 NPQNLVTEVYSKELKGAIS-ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
              Q  V E   K     I   LA  QG LL   G  + L++     L      +  P  V
Sbjct: 930  KEQGRVLEFLHKTKTDDIPLCLAGFQGFLLAGVGKSLRLYEMGKKALLRKCENNGFPTAV 989

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSL 1255
            V++N+    I++GD+ +S ++  ++     QL + A D     + C  +   +D  T++ 
Sbjct: 990  VTINVQGARIIVGDMQESTFYCVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA- 1045

Query: 1256 VVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSS 1307
               D+  NI I    P +SE       G  +L    F +GA H T+ +    +  + TS 
Sbjct: 1046 -CGDKFGNIFINRLDPSISEKVDDDPTGATILHEKSFLMGAAHKTEMIAHYNIGSVVTSI 1104

Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGL 1364
             +     G     R  L++ T+ G++G + P    D++ F  + +L+  +      + G 
Sbjct: 1105 TKIPLVAG----GRDVLVYTTISGAVGALVPFVSPDDIEF--MSTLEMHMRTQDISLVGR 1158

Query: 1365 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +  ++R ++   K        +VD +L   + +LP  +Q  IA     +   +L  L  +
Sbjct: 1159 DHIAYRGYYVPIKG-------VVDGDLCESFSLLPYPKQQAIASDLDRSVGDVLKKLEQM 1211

Query: 1425 ALGTSF 1430
               ++F
Sbjct: 1212 RTSSAF 1217


>gi|302406266|ref|XP_003000969.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
 gi|261360227|gb|EEY22655.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
          Length = 1059

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 129/574 (22%), Positives = 214/574 (37%), Gaps = 133/574 (23%)

Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
           V  + + G + S+A     G++    +D +ILA +  +I+++E+  + +  +   +  F 
Sbjct: 59  VLSHDVFGIIRSMAAFRIAGSN----KDYLILATDSGRIAIIEYLPAQNRFQRLHLETFG 114

Query: 164 SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-----MIILKASQGGSGLVGDED 218
                  K G      G  +  DP+GR    L+  L+      ++ + SQ        E 
Sbjct: 115 -------KSGIRRVVPGEFLACDPKGRA--CLIASLEKNKLVYVLNRNSQA-------EL 158

Query: 219 TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
           T  S     A     HV+++  LD+          GY  PV   L E + T A +     
Sbjct: 159 TISSP--LEAHKPGVHVLSMVALDV----------GYANPVFAAL-ETDYTEADQDPTGQ 205

Query: 279 HTCMISALSISTTLKQHPL------IWSAMNLPHDAYKLLAVPSPIG-----GVLVVGAN 327
                +AL + T L  + L      +    + P D    L    P G     GVLV G  
Sbjct: 206 -----AALDVETQLVYYELDLGLNHVVRKWSEPVDNTASLLFQVPGGNDGPSGVLVCGEE 260

Query: 328 TIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTK 382
            I Y HS   +  + +         + E P    S+     H   L+        LL T+
Sbjct: 261 NITYRHSNQEAFRVPVPRRR----GATEDPSRKRSIVAGVMHK--LKGSAGAFFFLLQTE 314

Query: 383 TGDLVLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
            GDL  +T+      DG     V+RL +   +   + S +  + +   ++ S+ G+    
Sbjct: 315 DGDLFKITIDMIEDRDGNPTGEVKRLKIKYFDTIPVASSLCILKSGFLYVASQFGNYQFY 374

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE 495
           QF              E+ GD        + L  SS D   D     E   +        
Sbjct: 375 QF--------------EKLGD------DDEELEFSSDDFPTDPKQSYEAVFF-------- 406

Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV--------- 542
             ++  + A+ +S+ ++ PL D         DA    +A G   +S + ++         
Sbjct: 407 HPRELENLALVESIDSMNPLIDCKVANLTGEDAPQIYTACGNGARSTFRILKHGLEVNEI 466

Query: 543 ---ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
              ELPG    +WT+  K SRG            D+Y AY+++S    T+VL   + + E
Sbjct: 467 VASELPGIPSAVWTL--KLSRG------------DQYDAYIVLSFTNATLVLSIGETVEE 512

Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           V +S   F+      A  L G   +IQV  +G R
Sbjct: 513 VNDS--GFLTSVPTLAAQLLGGEGLIQVHPKGIR 544



 Score = 46.6 bits (109), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 60/267 (22%), Positives = 114/267 (42%), Gaps = 24/267 (8%)

Query: 964  NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
             C  G + +  Q   I  I +L    T       + IPL  TP ++    E  ++  I S
Sbjct: 761  QCEEGVVGIQGQSLRIFAIEKLSDTLTQ------KSIPLTYTPRRMVKHPEHPMFYTIES 814

Query: 1022 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1081
                 P      LL D  V +  D   L  V+         +   I   D      QT  
Sbjct: 815  DNNTLPPELRAQLLADPSVVNG-DARTLPPVEFGYPRAKGRWASCISVIDPLSEELQTLQ 873

Query: 1082 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----R 1136
            T+ + ++E A++  +V     T+++NE+ L +GT    G+D+    R   F+ G     R
Sbjct: 874  TVDLDNNEAAVSAAIVPF---TSQDNESFLVVGT----GKDMIVNPR--QFTEGYIHIYR 924

Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
             +++ + L   ++  +++   +AL + QG L+   G  + ++     ++   A  D  P 
Sbjct: 925  FSEDGREL-EFIHKTKVEEPPTALLAFQGRLVAGVGKTLRIYDLGQKQMLRKAQADVAPQ 983

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             +VSL+   + I++GD+ + + ++ +K
Sbjct: 984  LIVSLSTQGSRIVVGDVQQGVTYVVYK 1010


>gi|119173562|ref|XP_001239205.1| hypothetical protein CIMG_10227 [Coccidioides immitis RS]
          Length = 1208

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 75/344 (21%), Positives = 152/344 (44%), Gaps = 48/344 (13%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1136
            I ++ +E A++V  V     +++++ET L +GT    G+D+      ++ G + ++    
Sbjct: 872  IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIYRFQE 924

Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
            +       +  ++  +++    AL + QG LL   G  + ++     +L      +  P 
Sbjct: 925  DGKE----LEFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 980

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1254
             +V L    + I++ D+ +S+ ++ +K Q  +L   A D  +    C A   ++D  T++
Sbjct: 981  LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 1037

Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1306
                D+  N+ +     K SE       G  L+   ++  GA     L +    Q + TS
Sbjct: 1038 --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 1095

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
              +T    G     R  L++  L G++G + P    +++ F   QSL+ +L    P +AG
Sbjct: 1096 IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 1149

Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
             +   +R +++  K         +D +L   Y  LP +++L IA
Sbjct: 1150 RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIA 1186


>gi|392578232|gb|EIW71360.1| hypothetical protein TREMEDRAFT_71141 [Tremella mesenterica DSM 1558]
          Length = 1250

 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 69/309 (22%), Positives = 130/309 (42%), Gaps = 46/309 (14%)

Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------------AARGRVLL 1131
            P++S E   T+  VTL       ++ LLA+G A    +D             A  GR++L
Sbjct: 894  PLESREIVTTIAYVTL------GDQNLLAVGIATFSEDDEDLPDDLDMVTISAQSGRLVL 947

Query: 1132 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-------WTGTE 1184
            +    + D+ +  + E+ S  L+ A++ +  ++  L +A+G  + ++K          T 
Sbjct: 948  YEPVVDQDSAEPNLIELTSVGLESAVNDIKVIKNLLAVATGSNVTIYKHEKASHLLIPTS 1007

Query: 1185 LNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
                AF  A  L V   + +  +  +++GD  +SI+ L   E    +    +D  +    
Sbjct: 1008 RFASAFV-AKSLVVAPPDKLHPEERLVVGDGMRSIFVLDIDEGTGMIMGDERDMATHSVM 1066

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
            A E L DG   +++V+D   NI  F            +++ + A F +   ++ F R  +
Sbjct: 1067 AMEGLRDGGQ-AVIVADAHSNISTFRLR---------EEIETAATFGLHEDISVFRRGSL 1116

Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
               SS     +P         ++F T+DG +G +  L     R L  LQ+ +   +    
Sbjct: 1117 APASSAEDVLSP--------EIIFATIDGRLGIVGELTPSAARTLDDLQRNMDRYIRGPG 1168

Query: 1363 GLNPRSFRQ 1371
             +  RS+R+
Sbjct: 1169 DIAWRSYRR 1177


>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
          Length = 1286

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 127/604 (21%), Positives = 225/604 (37%), Gaps = 114/604 (18%)

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
           L+       L L+    + G +  L      G++    +D +++  +  ++ +L+F +  
Sbjct: 41  LLRADKQGKLNLIASKDVFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
           +      +HC       + K G      G  + VDP+GR   +        VY L     
Sbjct: 97  NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149

Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
            Q+ I              D  G   GF   I +S   N    D K V +   + G    
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPIFASIEQNYEMYD-KQVTNTNEIDGCTRK 208

Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            ++ L E +L                   ++  +++H        LP D+   L +P P 
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDSSAHLLIPIPG 243

Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           G     GV+V   N + Y         CA     Y   L++ QE    + S+   A H  
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSALHRI 295

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
             +    L+ ++ GDL  + + ++  +V+ +     +   + + I  + +   F+ +  G
Sbjct: 296 R-KFFFILIQSEFGDLYKIEMDHEDGIVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354

Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
           +    QF+  G   +      K   G     A  TK+L   ++  L D V      L + 
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411

Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
              + N  S Q         +L   GP   L+   +GL I   A              EL
Sbjct: 412 ILDAKNANSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN------------EL 452

Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG  K IWT+     +  NA          +Y  Y+I+S E  T++LE  + + EV +S+
Sbjct: 453 PGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDSL 498

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
              +   T    N+     +IQV + G R ++G  + + +   P N +  + + NST + 
Sbjct: 499 --LLTNVTTIHVNILYDNTLIQVHDTGIRHINGKVVHEWVP--PKNKQIKAATSNSTQIV 554

Query: 664 VSIA 667
           +S++
Sbjct: 555 ISLS 558



 Score = 50.1 bits (118), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 63/286 (22%), Positives = 111/286 (38%), Gaps = 45/286 (15%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
                  G LL + G K+ ++     +L     Y   P  ++S+ +  + I   DI +S+ 
Sbjct: 1021 CFCPFNGRLLASIGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVL 1080

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
               +      L L++ D        +E L   + ++         L V +E K  +    
Sbjct: 1081 IFFYDSNMNTLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFVLRVPEEAKQEEYGIS 1140

Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
               +Y    M+ S K ++L     FHVG  VT   ++++  TSS+               
Sbjct: 1141 NKCWYGGEIMAGSNKNRRLEHIMSFHVGEIVTSLQKVKLSPTSSE--------------C 1186

Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
            +++ T+ G+IG   P D     ELT    Q L+  L    P + G     FR ++   + 
Sbjct: 1187 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTENPPLCGREHIFFRSYYHPVQ- 1241

Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                   ++D +L   +  LP + Q ++A     T   IL  L D+
Sbjct: 1242 ------HVIDGDLCEQFSSLPYDVQRKVAADLERTPDDILRKLEDI 1281


>gi|146420838|ref|XP_001486372.1| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 1206

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 120/610 (19%), Positives = 237/610 (38%), Gaps = 96/610 (15%)

Query: 96  ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
           + +  ++ +CH ++ G ++++  + +GG++     D +++  +  ++S+LEFD       
Sbjct: 53  LESGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD----- 103

Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
              +  F   +  H K G      G  + VDPQ R   +       ++ KA    + L  
Sbjct: 104 --ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL-- 159

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
                       +  +++  I +  LD           GY  P++  +   E  +A   +
Sbjct: 160 -----ELLSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHYDA 201

Query: 276 WKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
              +    S L++     +  L + A     + +P  +  L+ +P+PIGGV+V G++ I 
Sbjct: 202 SLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSFIF 261

Query: 331 YHSQSASCALALNNYAVSLDS-SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
           YH+ +    L L    + L + S  +P   ++V     H     N   LL  + GD   +
Sbjct: 262 YHNPTIDQQLYL---PIPLRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFRV 313

Query: 390 TVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSML 446
            + Y  D   V  L +   +    ++ I        F      D +L Q    G   S +
Sbjct: 314 LIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSYI 373

Query: 447 SSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
           SS       D+ + +     + R   + AL  +++       G+    +ES +   +   
Sbjct: 374 SSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMIT 433

Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADS 565
             S      LK  ++G+ I+               LV  P      +V+          +
Sbjct: 434 GHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF----------T 464

Query: 566 SRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
           +R++A + +   Y++IS  A  +T+VL   +++ EV +S   FV  +        G + +
Sbjct: 465 TRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKSL 521

Query: 624 IQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           IQ++  G R +     +G    +   + P            T++S S     VL+G+S+ 
Sbjct: 522 IQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSNR 573

Query: 679 SIRLLVGDPS 688
            +     DP+
Sbjct: 574 ELCYFEIDPT 583


>gi|429859776|gb|ELA34542.1| pre-mRNA-splicing factor rse1 [Colletotrichum gloeosporioides Nara
            gc5]
          Length = 1212

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 78/365 (21%), Positives = 159/365 (43%), Gaps = 42/365 (11%)

Query: 1082 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----R 1136
            T+ +  +E A++  +V+     +++NE  L +GT    G+D+    R   F+ G     R
Sbjct: 874  TVDLDDNEAAVSAAIVSF---ASQDNENFLIVGT----GKDMIVNPR--QFTEGYIHVYR 924

Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
              ++   L   ++  +++   +AL + QG LL   G  + ++     ++   +  D  P 
Sbjct: 925  FGEDGHEL-EFIHKTKVEEPPTALLAFQGRLLAGVGKTLRIYDLGLRQMLRKSQADVAPQ 983

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
             +VSL+   + I++GD+   + ++ +K    +L     D  +     T  ++D    S+ 
Sbjct: 984  LIVSLSTQGSRIVVGDVQHGVTYVVYKPTTNKLIPFVDDTIARWVTCTT-MVDYE--SVA 1040

Query: 1257 VSDEQKNIQIFYYAPKMS----ESWKGQKLL-SRAEFHVGAHVTKFLR---LQMLATSSD 1308
              D+  N+ +     K S    E   G  LL +R   H   H    L     Q + TS  
Sbjct: 1041 GGDKFGNMFLVRCPEKASQEADEEQAGLHLLNTRDYLHGAPHRLNLLSHSYTQDVPTSIT 1100

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
            +T    G        LL+  ++G+IG   P    +++ F   Q+L++ +      +AG +
Sbjct: 1101 KTSLVVGGQDV----LLWSGINGTIGVFIPFVTREDVDF--FQNLEQHMRTEDAPLAGRD 1154

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               +R ++   K        ++D +L   Y +LP +++L IA +   +  +I   ++D+ 
Sbjct: 1155 HLMYRGYYVPVKG-------VIDGDLCERYTLLPNDKKLMIAGELDRSVREIERKISDIR 1207

Query: 1426 LGTSF 1430
              ++F
Sbjct: 1208 TRSAF 1212



 Score = 44.3 bits (103), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 138/607 (22%), Positives = 226/607 (37%), Gaps = 137/607 (22%)

Query: 74  QEEGSKESKNSGETKRRVLM---DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
           Q  G+KE      +  R+ +   D      + L+ H  + G + S+A     G++    +
Sbjct: 27  QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITLLSH-DIFGIIRSMAAFRLAGSN----K 81

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVD 186
           D +ILA +  +I+++E+        I + + F+    LHL    K G      G  +  D
Sbjct: 82  DYLILATDSGRITIVEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACD 130

Query: 187 PQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           P+GR      V    L  ++ + +Q        E T  S     A      V+++  LD+
Sbjct: 131 PKGRACLIASVEKNKLVYVLNRNAQA-------ELTISSP--LEAHKPGVLVLSMVALDV 181

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLI 298
                     GY  PV   L E E T A     G  + +  T ++    +   L      
Sbjct: 182 ----------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRK 229

Query: 299 WSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVSLD 350
           WS    P D    L    P G     GVLV G   I Y HS   +  + +     A    
Sbjct: 230 WSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDP 286

Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL--VLLTVVYDGR-----VVQRLDL 403
           S +    S    +L  +   +      LL T+ GDL   ++ +V D        V+RL +
Sbjct: 287 SRKRHIVSGVMHKLKGSAGAFF----FLLQTEDGDLFKAVIDMVEDADGNPTGEVKRLKI 342

Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
              +   ++S +  + +   +  S+ G+    QF              E+ GD + +   
Sbjct: 343 KYFDTVPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDEEK-- 386

Query: 464 TKRLRRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
                 SS D   D   G + +  Y     N          A+ +S+ ++ PL D     
Sbjct: 387 ----EFSSDDFPADPKAGYDAVYFYPRPLEN---------LALVESIDSMNPLLDCKVAN 433

Query: 523 RINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADS 565
               DA    +A G   +S + ++            ELPG    +WT+  K SRG     
Sbjct: 434 LTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLSRG----- 486

Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
                  D+Y AY+++S    T+VL   + + EV++S   F+      A  L G   +IQ
Sbjct: 487 -------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQ 537

Query: 626 VFERGAR 632
           V  +G R
Sbjct: 538 VHPKGIR 544


>gi|255946770|ref|XP_002564152.1| Pc22g01070 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591169|emb|CAP97395.1| Pc22g01070 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1209

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 99/465 (21%), Positives = 185/465 (39%), Gaps = 72/465 (15%)

Query: 990  DNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
            DN    + IPL  TP +     E+ L+ +I S   VL P  +    LID       +  +
Sbjct: 781  DNNMLQESIPLSYTPRRFVKHPEQPLFYVIESDNNVLSPATR--QRLIDDSQAQNGEATD 838

Query: 1049 LSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTRATI---PMQSSENALTVRVVTLFNTT 1103
            L   D         +   ++++EP        T++ I    ++ +E A+++  V+    +
Sbjct: 839  LPPADFGYPRATGHWASCIQVVEP------ITTKSVIFNLDLEDNEAAVSLAAVSF---S 889

Query: 1104 TKENETLLAIGTA-------------YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYS 1150
            ++++ET L +GTA             ++        GR L F      D P         
Sbjct: 890  SQDDETFLVVGTAKDMTVSPPSSSCGFIHIYRFQEDGRELEFIHKTQVDEPP-------- 941

Query: 1151 KELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILL 1210
                    AL   QG LL   GP + ++     +L         P  +V L    + I++
Sbjct: 942  -------LALLGFQGRLLAGIGPILRVYDLGMKQLLRKCQAPVVPKTIVGLQTQGSRIIV 994

Query: 1211 GDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA 1270
             D+ +S+ ++ +K Q   L   A D  +    +T  ++D  T +    D+  N+ +    
Sbjct: 995  SDVRESVTYVVYKYQDNVLIPFADDSIARWTSSTT-MVDYETTA--GGDKFGNLWLVRCP 1051

Query: 1271 PKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRF 1322
             K+SE       G  L+  +   H   H  + +     Q + TS  +T    G     R 
Sbjct: 1052 SKVSEQADEDGSGAHLIHEKGYLHGTPHRLELMVHFYAQDIPTSLHKTQLVAG----GRD 1107

Query: 1323 ALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
             +++    G+IG   P    +++ F   Q L+ +L    P +AG +   +R +++  K  
Sbjct: 1108 IVVWTGFQGTIGMFVPFASREDVDF--FQLLETQLASQQPPLAGRDHLMYRGYYAPVKG- 1164

Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                  ++D +L   Y +LP + +L IA +   +  +I   ++D+
Sbjct: 1165 ------VIDGDLCEMYLLLPNDTKLMIAGELDRSVREIERKISDM 1203


>gi|402595041|gb|EJW88967.1| hypothetical protein WUBG_00126 [Wuchereria bancrofti]
          Length = 621

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 87/361 (24%), Positives = 139/361 (38%), Gaps = 100/361 (27%)

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
           +W   NL  +A  +++VP P+GG L+ G + I YH +    AL    YA    S      
Sbjct: 201 LWKHDNLEGEANIVISVPEPVGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLT 412
            +    +D     +L  D+A      G+L +L           L+L K      N +V+ 
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLL----------LELGKDQEQDENSAVIV 300

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTC--GSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
            D+             LG++ + +  C   +G   + S     FGD         +L R 
Sbjct: 301 RDMKV---------ESLGETCIAECMCYLDNGVCFIGS----RFGD--------SQLIRL 339

Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA-- 528
           S++                       A  T   ++ DS  N+ P++D +  +R N     
Sbjct: 340 STEP---------------------RADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQQI 377

Query: 529 -SATGISKQSNY----------EL--VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
            + +G  K              EL  VEL G K ++T+    +RG            DE+
Sbjct: 378 LTCSGAYKDGTIRIIRNGIGIEELASVELKGIKNMFTL---RTRG------------DEF 422

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
             YLI+S ++ T VL       E TE   + V G T+ AG LF  + ++QV      ++D
Sbjct: 423 DDYLILSFDSETHVLFINGEELEDTEITGFAVDGATLWAGCLFHSKTILQVTHGEVILID 482

Query: 636 G 636
           G
Sbjct: 483 G 483


>gi|225560964|gb|EEH09245.1| pre-mRNA-splicing factor rse1 [Ajellomyces capsulatus G186AR]
          Length = 1209

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 103/486 (21%), Positives = 201/486 (41%), Gaps = 66/486 (13%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
             C  G + +  Q  L+I  L      +N    + IPL+ TP    +F +   YPL     
Sbjct: 759  QCVEGMVGIQGQN-LRIFSL---EKLENNLLQETIPLQYTPR---HFIKHPEYPLFY--- 808

Query: 1024 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE-----------VRILEPDR 1072
            V++  N +LS     ++ +  D  N  +  L      EE+            ++I++P  
Sbjct: 809  VIEAENNILSPGTRTKLLNDSDAVNGDTTPL----PPEEFGYPRGTGHWASCIQIVDPIN 864

Query: 1073 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL-- 1130
            +    +  + I ++ +E A++V  V     +++++ET L +GT    G+D+    R    
Sbjct: 865  SK---RVISQIELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVNPRSCTA 914

Query: 1131 -LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIA 1189
                  R  +  + L   ++  +++    AL   QG LL   G  + ++     ++    
Sbjct: 915  GFIHIYRFQEEGKEL-EFIHKTKVEQPPMALLGFQGRLLAGIGTDLRIYDLGMKQMLRKC 973

Query: 1190 FYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID 1249
                 P  VV L    + I++ D+ +S+ ++ +K Q  +L     D  S     T  ++D
Sbjct: 974  QASVVPHLVVGLQTQGSRIIVSDVQESLTYVVYKYQENRLIPFVDDVISRWTTCTT-MVD 1032

Query: 1250 GSTLSLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQ 1301
              T++    D+  N+ +     K SE       G  L+   ++  GA    ++      Q
Sbjct: 1033 YETVA--GGDKFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQ 1090

Query: 1302 MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSV 1358
             L TS  +     G     R  L++  L G++  + P    +E+ F   QSL+ +L    
Sbjct: 1091 DLPTSIQKAQLVTG----GRDILVWTGLQGTVSMLIPFISREEVDF--FQSLEMQLAAQN 1144

Query: 1359 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
            P +AG +   +R +++  K         +D +L   Y +LP +++ +IA +   +  +I 
Sbjct: 1145 PPLAGRDHLIYRSYYAPAKG-------TIDGDLCETYLLLPNDKKQQIAGELDRSVREIE 1197

Query: 1419 SNLNDL 1424
              + D+
Sbjct: 1198 RKIADM 1203


>gi|357606250|gb|EHJ64976.1| putative Splicing factor 3B subunit [Danaus plexippus]
          Length = 1216

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 82/392 (20%), Positives = 156/392 (39%), Gaps = 60/392 (15%)

Query: 1064 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV 1123
            ++RIL+     G   T   +P++ +E A+++ VV     T      ++ +    +     
Sbjct: 860  QIRILDMSGGVGGCSTVCLLPLEQNEAAVSLCVVRWAALTDNTPHLVVGVAKDALLSPRS 919

Query: 1124 AARGRV---LLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
             + G +    +++TG+       LV +    E  G   ALA+  G LL   G  + L+  
Sbjct: 920  CSEGSLHVYKIYNTGK-----LELVHKTPIDEYPG---ALAAFNGKLLAGVGRMLRLYDI 971

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
               +L         P  +  +  ++  I + D+ +S++ + +K++  QL + A D     
Sbjct: 972  GRRKLLRKCENRHIPNLIADIKTIRQRIFVSDVQESVFCVKYKKRENQLIIFADDTNP-R 1030

Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS----ESWKGQK------LLSRA---- 1286
                  ++D  T+++  +D+  N+ +      +S    E   G K      LL+ A    
Sbjct: 1031 WITNTCILDYDTVAM--ADKFGNVAVLRLPQSVSDDVDEDPTGNKALWDRGLLNGASQKG 1088

Query: 1287 ----EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
                 FHVG  VT   R  ++          PG  +    ALL+ T+ G++G   P    
Sbjct: 1089 DITVNFHVGETVTSLQRATLI----------PGGSE----ALLYATVSGALGVFLP---F 1131

Query: 1343 TFRR----LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            T R      Q L+  +      + G +  SFR ++   K       +++D +L   +  L
Sbjct: 1132 TSREDHDFFQHLEMHMRSENSPLCGRDHLSFRSYYYPVK-------NVIDGDLCEQFNSL 1184

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
               +Q  IA     T +++   L D+    +F
Sbjct: 1185 EPAKQKAIAGDLERTPAEVSKKLEDIRTRYAF 1216


>gi|440636768|gb|ELR06687.1| pre-mRNA-splicing factor rse1 [Geomyces destructans 20631-21]
          Length = 1212

 Score = 51.2 bits (121), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 80/381 (20%), Positives = 167/381 (43%), Gaps = 43/381 (11%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            ++I++P R     Q    I ++ +E A+++  V+     ++E+E  L +GT    G+D+ 
Sbjct: 860  IQIVDPIREKKVLQQ---IDLEDNEAAVSMATVSF---ASQEDEVFLVVGT----GKDMV 909

Query: 1125 ARGRV----LLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
            A  R      +     + D  +  +  ++  +++    AL   QG LL+  G ++ ++  
Sbjct: 910  ASPRSSSGGFIHVYRFHEDGKE--IEFIHKTKVEEPPLALLGFQGRLLVGIGRELRIYDL 967

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
               +L   A  +     +V L    + I++ D+ +SI F+ +K Q  +L   A D  +  
Sbjct: 968  GMRQLLRKAQTEIAASLIVGLQTQGSRIIVSDVQESITFVVYKFQENKLIPFADDTIARW 1027

Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA-HVT 1295
               T  ++D  T++    D+  N+ +     K SE       G  L+   ++  GA H  
Sbjct: 1028 TTCTT-MVDYETVA--GGDKFGNLWLLRCPTKASEEADEEGSGAHLVHERQYLQGAPHRV 1084

Query: 1296 KFLRLQM---LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQS 1349
              +       + TS  +T    G     R  LL+  L G+I  + P    +++ F   Q+
Sbjct: 1085 ALMAHNFANDIPTSIQKTNLVAG----GRDCLLWSGLQGTIAIMIPFVSREDVDF--FQT 1138

Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
            L++ L      +AG +   +R ++   K        ++D +L   Y +LP ++++ IA +
Sbjct: 1139 LEQHLRTEDAPLAGRDHLIYRSYYVPVKG-------VIDGDLCERYTLLPTDKKMMIAGE 1191

Query: 1410 TGTTRSQILSNLNDLALGTSF 1430
               +  +I   ++D+   +++
Sbjct: 1192 FDRSVREIERKISDMRTRSAY 1212


>gi|308477185|ref|XP_003100807.1| CRE-DDB-1 protein [Caenorhabditis remanei]
 gi|308264619|gb|EFP08572.1| CRE-DDB-1 protein [Caenorhabditis remanei]
          Length = 1154

 Score = 51.2 bits (121), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 53/283 (18%), Positives = 121/283 (42%), Gaps = 21/283 (7%)

Query: 1104 TKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            T +++    +GT  +  E+   + GR+++F      +  ++ +  V+    +G+  AL  
Sbjct: 834  TNDSKVYYIVGTGLIYPEETDTKFGRIVVFEVD---EVERSKLRRVHDLVCRGSPLALRI 890

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            L G L+ A    + L +WT  +   +   +   +  + L ++   + + D+ +S+  LS+
Sbjct: 891  LNGKLVAAINSSVRLFEWTMDKELRLECSNFNHIMALDLKVMNEEVAVADVMRSVSLLSY 950

Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE--SWKGQ 1280
            +        +AKD+ S      EF+     L       + ++ +F      S   +  G+
Sbjct: 951  RMLEGNFEEVAKDWNSEWMVTCEFITAEQILG-----GEAHLNLFTVEVDKSRPITDDGR 1005

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAP 1338
             +L    ++    + + +    L    D        D + +++  ++FGT  GSIG +  
Sbjct: 1006 YVLEPTGYYYLGELPRVMVRSSLVAQPD--------DCSIQYSQPIMFGTNQGSIGMVVQ 1057

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
            +D+   + L +++K + DSV +   +   ++R F    +   P
Sbjct: 1058 IDDKWKKFLIAVEKAIADSVKNCMHIEHTTYRSFIFQKRLESP 1100



 Score = 45.4 bits (106), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 87/358 (24%), Positives = 147/358 (41%), Gaps = 82/358 (22%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  DA  L+ VP+PI GVLV+ AN+I Y S          N  V   +S  L  + F+ 
Sbjct: 206 SIAADASVLIPVPAPISGVLVLAANSILYKSSDV-------NGDVVPYASPLLDNTVFTC 258

Query: 363 E--LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITT 417
              +D +   ++ +D     T+   L+L+  + +GR    V+ + +     + +   I  
Sbjct: 259 HGLVDPSGERFILSD-----TEGRLLMLILNIGEGRSGITVKDMRIEYLGETSIADSINY 313

Query: 418 IGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ 476
           I   + F+GSRLGDS L++     SG S   S + E + +I                 ++
Sbjct: 314 IDAGVVFVGSRLGDSQLIRLMPTPSGGSY--SVVLETYSNI---------------GPIR 356

Query: 477 DMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
           DM+  E         ++ ++   T S A +D     G L+    G+ I   AS       
Sbjct: 357 DMIMVE---------SDGQAQLVTCSGAEKD-----GSLRVIRNGIGIEELAS------- 395

Query: 537 SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
                VEL G  GI+ +   S+  +                Y+I+SL   T VL+     
Sbjct: 396 -----VELAGVIGIFPIRLNSTTDN----------------YVIVSLAEETHVLQINGEE 434

Query: 597 TEVTESVDYFVQGRTIAAGNLFG---RRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
            E  + +    +  TI A  +FG      ++QV E+  R +  S +++   + P N E
Sbjct: 435 LEDVQLLQICTEMPTIFASTIFGPDNSEVLLQVTEKHVRFMAFSGLSK--IWEPPNGE 490


>gi|440300137|gb|ELP92626.1| DNA damage-binding protein, putative [Entamoeba invadens IP1]
          Length = 1086

 Score = 51.2 bits (121), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 79/337 (23%), Positives = 154/337 (45%), Gaps = 39/337 (11%)

Query: 1106 ENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL---- 1160
            E + L  +GTAY + GE   +RGR ++F         +  + EV ++ + GA+ ++    
Sbjct: 766  EEKELYIVGTAYAKLGEVEPSRGRFIIFEI------HEEKIIEVSNRYVDGAVYSVKRFE 819

Query: 1161 --------ASLQGHLLIAS-GPKIILHKWTGT-ELNGIAFYDAPPLYVVSLNIVKNFILL 1210
                    A++Q  +++     KI+  K+  T E  G A      L+V +L    + IL+
Sbjct: 820  NDVGNYIAATIQKKVVVYQIERKIVDGKFAVTIEEKGGANVKLIGLFVKTLG---HEILV 876

Query: 1211 GDIHKSIYFLSWKEQGAQLNLL--AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            GD+ KSI    + E+  +  ++   +DF +    A EF+ +   +S   SD Q N+ +F 
Sbjct: 877  GDLMKSISVFKFDEKATRNAVVETCRDFYASYTTAVEFMDEHCFMS---SDSQGNLLVFT 933

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                 +   +  KL + A  HVG  +    +  +   ++          +T +  +LFG 
Sbjct: 934  ENTTTTNENEKFKLQNEAHIHVGECINVMCKGSIAVMNN-------AMWETQKKCMLFGG 986

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN-GKAHRPGPDSIV 1387
            + GSIG I  ++  T+++L +L+ +++  +  V  +   SF Q+        R    +++
Sbjct: 987  ICGSIGGITEINLETYKKLFALESEMLREMKGV--IECESFGQWKMVFDDWKRMEAQNVI 1044

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            D  ++  +  LP E Q  IA + G    ++++ L  +
Sbjct: 1045 DGNVVELFLDLPKESQKHIAEKIGYAGEELVTVLESM 1081


>gi|121700262|ref|XP_001268396.1| nuclear mRNA splicing factor, putative [Aspergillus clavatus NRRL 1]
 gi|119396538|gb|EAW06970.1| nuclear mRNA splicing factor, putative [Aspergillus clavatus NRRL 1]
          Length = 1209

 Score = 51.2 bits (121), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 97/477 (20%), Positives = 204/477 (42%), Gaps = 48/477 (10%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + +  Q +    ++ S    DN    + IPL  TP ++    E+ L+ +I S  
Sbjct: 759  QCVEGMVGIQGQNL----RIFSIEKLDNNMLQESIPLSYTPRRLLKHPEQPLFYVIGSDN 814

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
             VL P  +  + LI+       +   L   +         +   +++++P  A       
Sbjct: 815  NVLSPATR--ARLIEDSKARNGEADTLPPEEFGYPRATGHWASCIQVVDPVNAKA---VI 869

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
            +TI ++ +E A+++  V     +++++ET L +GTA         +A G + ++   R  
Sbjct: 870  STIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDLTVNPPSSAGGFIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL + QG L+   G  + ++     +L         P  +
Sbjct: 924  EDGKEL-EFIHKTKVEEPPLALLAFQGRLVAGIGSILRIYDLGMKQLLRKCQAPVVPKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    +T  ++D  T++    
Sbjct: 983  VGLQTQGSRIIVSDVRESVTYVVYKYQENVLIPFVDDTVSRWMTSTT-MVDYETVA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
            D+  N+ +     K+SE       G  L+  R   H   +  + +     Q + TS  +T
Sbjct: 1040 DKFGNLWLVRCPKKISEEADEDGSGAHLIHERGYLHGTPNRLELMIHTYTQDIPTSVHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFMSREDVDF--FQNLEMQLASQCPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +R +++  K        ++D +L   Y +LP + ++ IA +   +  +I   ++D+
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203


>gi|317031116|ref|XP_001392900.2| UV-damaged DNA binding protein [Aspergillus niger CBS 513.88]
          Length = 1124

 Score = 51.2 bits (121), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 77/322 (23%), Positives = 133/322 (41%), Gaps = 52/322 (16%)

Query: 1111 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              +GTAY+  E+  + RGR+L+F      DN + L T+V    +KGA  ALA L G  ++
Sbjct: 816  FVVGTAYLDDENEESIRGRILVFEI----DNGRKL-TKVAELPVKGACRALAML-GEKIV 869

Query: 1170 ASGPKIIL------HKWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
            A+  K ++      + +   +L  +A Y    AP    V + +  N I + D+ KS+  +
Sbjct: 870  AALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAP----VDVTVTGNVIAVADLMKSVCLV 925

Query: 1221 SWKE--QGA--QLNLLAKDFGSL-----DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
             + E   G+   L  +A+ F ++      C A +  ++        +D + N+ +     
Sbjct: 926  EYSEGENGSPDSLTEVARHFQTVWATGVSCIAKDTFLE--------TDAEGNLIVLRRNL 977

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
               E    ++L    E  +G  V +   + +Q LA+ +    A              GT+
Sbjct: 978  TGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 1025

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
            +GSI   A ++      L  LQ  +   V  +  +    FR F S  +  +  P   VD 
Sbjct: 1026 EGSIYLFAIINPEHQDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVREAKE-PYRFVDG 1084

Query: 1390 ELLSHYEMLPLEEQLEIAHQTG 1411
            EL+  +       Q EI    G
Sbjct: 1085 ELIERFLTCEPSLQEEIVDSVG 1106



 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R      LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
                W Q D    LL+   G L  L +V D    VQ   L     +   S +  +G  +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++   GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347


>gi|326432370|gb|EGD77940.1| splicing factor 3b subunit 3 [Salpingoeca sp. ATCC 50818]
          Length = 1232

 Score = 51.2 bits (121), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 60/304 (19%), Positives = 120/304 (39%), Gaps = 47/304 (15%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            V+  E++    AL    G L+   G  + ++     +L         P  VV + ++   
Sbjct: 955  VHRTEVEAMPCALTPFAGRLIAGVGNIVRIYDMGRKKLLRKCENKHLPSRVVDIEVMGTR 1014

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
            +++ D  +S++FL +K     L++   D     C A   ++D ST  + V+D+  N+ + 
Sbjct: 1015 VVVADQRESVFFLKYKPTENVLSVFCDDTTPRWCTAM-LMVDYST--VCVADKFGNVSVL 1071

Query: 1268 YYAPKMSESWK------------------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
                 ++++ +                   QKL+  A F++G  V    +  +  + ++ 
Sbjct: 1072 RCPDDVTDTLQEDPSGAKAFWARGYLNGAPQKLVQVANFYIGEIVQSLHKTTLTPSGTE- 1130

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1366
                          + + TL GSIG + P    ++  F   Q+L+  L    P + G + 
Sbjct: 1131 -------------CIAYTTLSGSIGALMPFSHKEDAEF--FQTLELHLRQEHPPICGRDH 1175

Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
             +FR  +   K       S++D +L   Y ML    + +IA     T  ++   L +   
Sbjct: 1176 LAFRSAYVPCK-------SVIDGDLCEEYNMLSASLKSDIADGLERTPQEVAKKLEEFRT 1228

Query: 1427 GTSF 1430
              +F
Sbjct: 1229 RYAF 1232


>gi|393212467|gb|EJC97967.1| hypothetical protein FOMMEDRAFT_162310 [Fomitiporia mediterranea
            MF3/22]
          Length = 1161

 Score = 50.8 bits (120), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 72/136 (52%), Gaps = 10/136 (7%)

Query: 1108 ETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGH 1166
            +T  A+G+ Y  + E   +RGR+L+ STG   +   +++    S E+KGA++AL  +QG 
Sbjct: 849  DTFFAVGSVYFDETEREPSRGRILIISTGSKRNQTPHILA---STEVKGAVNALTCIQGK 905

Query: 1167 LLIASGPKIILHKWT---GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
            L++A    + + +      T L  +  ++   L + ++ ++ + I++GD   S+  L  K
Sbjct: 906  LVVAINTSVDVFRLKHGDNTVLTAVTSWNHNYLVITAV-VMDDLIVIGDAVSSLAVL--K 962

Query: 1224 EQGAQLNLLAKDFGSL 1239
             +  +L   A+D+  L
Sbjct: 963  LEDDKLTTFARDYSPL 978



 Score = 42.0 bits (97), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 72/331 (21%), Positives = 128/331 (38%), Gaps = 73/331 (22%)

Query: 306 HDAYKLLAVPSPI-------GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
            D+  L+ VP  I       GGVLV+G +TI ++S         ++      S+ ++P++
Sbjct: 224 EDSNLLIPVPPQIKSSWNVNGGVLVLGGSTIAFYSIDRKQKKKNSSSQSKS-STSKIPQA 282

Query: 359 SFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
             +       A W Q D      LL    G L LL +      +  + L + +P    + 
Sbjct: 283 EVNWPYFDITA-WAQIDEDGLRYLLGDSFGRLALLAINPQYAYLDIVLLGEVSPP---TS 338

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           +T + +   ++GS  GDS L++ T    ++     + + F +I   AP    +   + D+
Sbjct: 339 LTPLASQYIYVGSHFGDSQLIRVTSERSSNGSYLEISDTFKNI---APIMDAVFEDTDDS 395

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
            Q  +    ++  G  S                     G L+    G   N DA   GI+
Sbjct: 396 GQPTI----ITCSGGEST--------------------GSLRVIRNGANFNEDARIEGIA 431

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--T 592
                         G+W +  +              YDD +H Y++++ +  T +LE   
Sbjct: 432 N-----------ITGMWPIRRQ--------------YDDTFHHYMLVTTDTNTHLLELPN 466

Query: 593 ADLLTEVTESVDY---FVQGRTIAAGNLFGR 620
           +   T V+ S D+    +  RT+ AGN+  R
Sbjct: 467 SQQETAVSRSNDFSDLTIDSRTLVAGNMLTR 497


>gi|350629921|gb|EHA18294.1| damage-specific DNA binding protein [Aspergillus niger ATCC 1015]
          Length = 1140

 Score = 50.8 bits (120), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 76/322 (23%), Positives = 131/322 (40%), Gaps = 52/322 (16%)

Query: 1111 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              +GTAY+  E+  + RGR+L+F      DN + L T+V    +KGA  ALA L G  ++
Sbjct: 832  FVVGTAYLDDENEESIRGRILVFEI----DNGRKL-TKVAELPVKGACRALAML-GEKIV 885

Query: 1170 ASGPKIIL------HKWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
            A+  K ++      + +   +L  +A Y    AP    V + +  N I + D+ KS+  +
Sbjct: 886  AALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAP----VDVTVTGNVIAVADLMKSVCLV 941

Query: 1221 SWKE----QGAQLNLLAKDFGSL-----DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
             + E        L  +A+ F ++      C A +  ++        +D + N+ +     
Sbjct: 942  EYSEGENGMPDSLTEVARHFQTVWATGVSCIAKDTFLE--------TDAEGNLIVLRRNL 993

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
               E    ++L    E  +G  V +   + +Q LA+ +    A              GT+
Sbjct: 994  TGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 1041

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
            +GSI   A ++      L  LQ  +   V  +  +    FR F S  +  +  P   VD 
Sbjct: 1042 EGSIYLFAIINPEHQDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVRETKE-PYRFVDG 1100

Query: 1390 ELLSHYEMLPLEEQLEIAHQTG 1411
            EL+  +       Q EI    G
Sbjct: 1101 ELIERFLTCEPSLQEEIVDSVG 1122



 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R      LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 284

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
                W Q D    LL+   G L  L +V D    VQ   L     +   S +  +G  +
Sbjct: 285 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 344

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++   GS
Sbjct: 345 IFVGSHQGDSQVLRIGNGS 363


>gi|346327528|gb|EGX97124.1| pre-mRNA splicing factor RSE1 [Cordyceps militaris CM01]
          Length = 1206

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 79/368 (21%), Positives = 160/368 (43%), Gaps = 50/368 (13%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGR--------VLLFST 1134
            +  +++E A++V VV+     +++NE+ L +GT    G+D+    R        +  F  
Sbjct: 869  VDFENNEAAVSVAVVSF---ASQDNESFLVVGT----GKDIVLNPRSSSEAYIYIYRFQQ 921

Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
            G         +  ++  +++   +AL   QG LL   G  + ++     +L   A  +  
Sbjct: 922  GGRE------LEFIHKTKIEEPATALLPFQGKLLAGIGKTLRMYDLGMRQLLRKAQAEVV 975

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTL 1253
            P  +VSLN   + I++ D+ + +  + +K    +L     D  S+  ++T   ++D    
Sbjct: 976  PQQIVSLNTQGSRIVVSDVQQGVTLVVYKSASNKLIPFVDD--SIARWSTCTTMVDYE-- 1031

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWK----GQKLL-SRAEFHVGAHVTKFL---RLQMLAT 1305
            S+   D+  N+ I     K SE       G  L+ +R   H   H  + +     Q + T
Sbjct: 1032 SVAGGDKFGNMFIVRSPAKASEEADEDAAGLHLVNARDYLHGTQHRLELMCHFFTQDILT 1091

Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
            S ++TG   G        LL+  + G+IG   P    ++  F   QSL++ L      +A
Sbjct: 1092 SINKTGLVVGGQDV----LLWSGIMGTIGVFIPFVSREDTDF--FQSLEQHLRTEDGPLA 1145

Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
            G +   +R +++  K        ++D +L   + +LP +++  IA +   +  +I   ++
Sbjct: 1146 GRDHLMYRSYYAPVKG-------VIDGDLCERFSILPNDKKQMIAGELDRSVREIERKIS 1198

Query: 1423 DLALGTSF 1430
            D+   ++F
Sbjct: 1199 DIRARSAF 1206



 Score = 44.7 bits (104), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 144/639 (22%), Positives = 231/639 (36%), Gaps = 116/639 (18%)

Query: 74  QEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
           Q  G+KE      SG     +  D      + L+ H  + G + S+A+    G++    +
Sbjct: 21  QFAGTKEQLIITGSGSQLTLLRPDPAQGKVIALLSH-DIFGILRSIAVFRLAGSN----K 75

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
           D IILA +  +I++LE+    +      M  F        K G      G  +  DP+GR
Sbjct: 76  DYIILATDSGRITILEYLPGPNRFNRLHMETFG-------KSGIRRVVPGEYLACDPKGR 128

Query: 191 C---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
                 V    L  ++ + SQ        E T  S     A      VI +  LD+    
Sbjct: 129 ACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIAMVALDV---- 175

Query: 248 DFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
                 GY  PV   L     E +    G    +  T ++    +   L      WS   
Sbjct: 176 ------GYANPVFAALEIEYTEVDQDITGEALSEVETQLV-YYELDLGLNHVVRKWSD-- 226

Query: 304 LPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPR 357
            P D    L    P G     GVLV G   I Y HS   +  + +         + E P 
Sbjct: 227 -PVDPTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQDALRVPIPRRR----GATEDPS 281

Query: 358 SSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKT 406
              ++     H   L+        LL +  GDL  +T+  V D        VQR+ +   
Sbjct: 282 RKRNIVAGVMHK--LKGSAGAFFFLLQSDDGDLFKITIDMVEDEEGAPTGEVQRMKIKYF 339

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF--GDIEADAPST 464
           +   + + +  + +   ++ S+ G+    QF              E+F    + A  P  
Sbjct: 340 DTVPVATSLCILKSGFLYVASQFGNYAFYQFEKLGDDDDEVEFSSEDFPVDPLAAYEPVY 399

Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLVNIGPLKDFSYGLR 523
              R + + AL D +      L    +N T E A + F+     +      LK   +GL 
Sbjct: 400 FYPRLAENLALVDSIPAMNPLLDCKVANLTGEDAPQIFTICGNGARSTFRTLK---HGLE 456

Query: 524 INADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
           +N   ++            ELPG    +WT+   S              D++Y AY+++S
Sbjct: 457 VNEIVAS------------ELPGVPSAVWTLKLNS--------------DEQYDAYIVLS 490

Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQ 641
               T+VL   + + EV++S  +     TIAA  L G   +IQV  RG R I +G     
Sbjct: 491 FTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLLGTDGLIQVHPRGIRHIRNG----- 543

Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
                  N    S  ++ ++++ S     V + +S G I
Sbjct: 544 -------NVNEWSAPQHRSIVAASTNSHQVAIALSSGEI 575


>gi|58258783|ref|XP_566804.1| U2 snRNA binding protein [Cryptococcus neoformans var. neoformans
            JEC21]
 gi|338819361|sp|P0CR23.1|RSE1_CRYNB RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|338819362|sp|P0CR22.1|RSE1_CRYNJ RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|57222941|gb|AAW40985.1| U2 snRNA binding protein, putative [Cryptococcus neoformans var.
            neoformans JEC21]
          Length = 1217

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 32/285 (11%)

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            LA  QG LL   G  + L++     L      +  P  VV++N+    I++GD+ +S ++
Sbjct: 951  LAGFQGFLLAGIGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1010

Query: 1220 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
              ++     QL + A D     + C  +   +D  T++    D+  NI I    P +SE 
Sbjct: 1011 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1065

Query: 1277 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                  G  +L    F +GA H T+ +    +  + TS  +     G     R  L++ T
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1121

Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
            + G++G + P    D++ F  + +L+  +      + G +  ++R ++   K        
Sbjct: 1122 ISGAVGALVPFVSSDDIEF--MSTLEMHMRTQDISLVGRDHIAYRGYYVPIKG------- 1172

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            +VD +L   + +LP  +Q  IA     +   +L  L  +   ++F
Sbjct: 1173 VVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQMRTSSAF 1217


>gi|391867503|gb|EIT76749.1| splicing factor 3b, subunit 3 [Aspergillus oryzae 3.042]
          Length = 1034

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 77/357 (21%), Positives = 159/357 (44%), Gaps = 36/357 (10%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1138
            +TI ++ +E A++V  V     T++++ET L +GTA     +   +A G + ++   R  
Sbjct: 695  STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 748

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL   QG L+   GP + ++     +L         P  +
Sbjct: 749  EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 807

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    +T  ++D  T +    
Sbjct: 808  VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 864

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
            D+  NI +     K+SE       G  L+  R   H   +  + +     Q + T+  +T
Sbjct: 865  DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 924

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 925  QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 978

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +R +++  K        ++D +L   Y +LP + ++ IA +   +  +I   ++D+
Sbjct: 979  IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAELDRSVREIERKISDM 1028


>gi|261329035|emb|CBH12013.1| damage-specific DNA binding protein, putative [Trypanosoma brucei
            gambiense DAL972]
          Length = 1270

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 61/292 (20%), Positives = 122/292 (41%), Gaps = 28/292 (9%)

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNAD--NPQNLVTEVYSKELKGAISALA---SLQ 1164
            ++ IGT +V  ++  +R    ++ T   A     + L+    SK+++GA+       +  
Sbjct: 852  VVLIGTTFVFPDEQLSRSSRFMWCTVEVAKLRTEKTLLRLQGSKDVEGALQCCCIVPNYA 911

Query: 1165 GHLLIASGPKIILHKWTGTELNGIA--FYDAPPLYVVSLNIVK---NFILLGDIHKSIYF 1219
            G + +  G  ++L+ W   +   +A        L V  + +++   ++I+  D   S +F
Sbjct: 912  GRVALGIGGCVVLYSWNAADATFVAEETIQIGTLIVRLIPVMQKEVSYIVASDARHSCFF 971

Query: 1220 LSWKEQGAQLNLLAKD---FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +        LN++A+D    G +DC   ++    S   + + D+  N     +   ++ S
Sbjct: 972  VRIDTIQGSLNIVARDPELRGVMDCAILQY---ESRHDVCLGDDLFNFFCVSHVEPLANS 1028

Query: 1277 -------WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA----LL 1325
                      +KL + A++H+G  +T  +     A  S      P      R      ++
Sbjct: 1029 SGVSAPAMPTKKLQTSAQYHMGDLIT-VMHQGSFAPCSVLNDVVPIPATLVRGVCGPQIV 1087

Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            +GT  G+ G I P+   TF  L+ L+  +   VP + G    SFR+    G+
Sbjct: 1088 YGTSHGAFGAITPISSETFILLKGLEVSVASVVPPLGGFTHASFREVLRVGQ 1139


>gi|134106833|ref|XP_777958.1| hypothetical protein CNBA4270 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|50260658|gb|EAL23311.1| hypothetical protein CNBA4270 [Cryptococcus neoformans var.
            neoformans B-3501A]
          Length = 1218

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 32/285 (11%)

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            LA  QG LL   G  + L++     L      +  P  VV++N+    I++GD+ +S ++
Sbjct: 952  LAGFQGFLLAGIGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1011

Query: 1220 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
              ++     QL + A D     + C  +   +D  T++    D+  NI I    P +SE 
Sbjct: 1012 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1066

Query: 1277 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                  G  +L    F +GA H T+ +    +  + TS  +     G     R  L++ T
Sbjct: 1067 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1122

Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
            + G++G + P    D++ F  + +L+  +      + G +  ++R ++   K        
Sbjct: 1123 ISGAVGALVPFVSSDDIEF--MSTLEMHMRTQDISLVGRDHIAYRGYYVPIKG------- 1173

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            +VD +L   + +LP  +Q  IA     +   +L  L  +   ++F
Sbjct: 1174 VVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQMRTSSAF 1218


>gi|119473054|ref|XP_001258481.1| nuclear mRNA splicing factor, putative [Neosartorya fischeri NRRL
            181]
 gi|119406633|gb|EAW16584.1| nuclear mRNA splicing factor, putative [Neosartorya fischeri NRRL
            181]
          Length = 1209

 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 101/479 (21%), Positives = 208/479 (43%), Gaps = 52/479 (10%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + + +Q +    ++ S    DN    + IPL  TP ++    E+ L+ +I S  
Sbjct: 759  QCVEGMVGIQAQNL----RIFSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDN 814

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
             VL P  +  + LI+       + + L   D         +   ++I++P  A       
Sbjct: 815  NVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---VI 869

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
            +TI ++ +E A+++  V     +++++ET L +GTA   +     +A G + ++   R  
Sbjct: 870  STIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL-- 1196
            ++ + L   ++  +++    AL   QG LL   G  + ++     +L  +    AP +  
Sbjct: 924  EDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRVYDLGMKQL--LRKCQAPVVSK 980

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
             +V L    + I++ D+ +S+ ++ +K Q   L     D  S    +T  ++D  T++  
Sbjct: 981  TIVGLQTQGSRIIVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETVA-- 1037

Query: 1257 VSDEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSD 1308
              D+  N+ +     K+SE       G  L+  R   H   +    +     Q + TS  
Sbjct: 1038 GGDKFGNLWLVRCPKKVSEEADEDGSGAHLIHERGYLHGTPNRLDLMIHTYTQDIPTSLH 1097

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
            +T    G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +
Sbjct: 1098 KTQLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGRD 1151

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
               +R +++  K        ++D +L   Y +LP + ++ IA +   +  +I   ++D+
Sbjct: 1152 HLIYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203


>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
 gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
          Length = 1230

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 127/607 (20%), Positives = 220/607 (36%), Gaps = 120/607 (19%)

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
           L+       L L+    + G +  L      G++    +D +++  +  ++++L+F +  
Sbjct: 41  LLRADKQGKLNLIASKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLTILQFSNEK 96

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
           +      +HC       + K G      G  + VDP+GR   +        VY L     
Sbjct: 97  NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149

Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
            Q+ I              D  G   GF   + +S   N   LD K V +   +  Y   
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEALD-KQVTNTSEIDSYTRK 208

Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            ++ L      W   +   H                   +      P DA   L +P P 
Sbjct: 209 TLLSL------WEMDLGLNH-------------------VIRKYTFPIDASAHLLIPIPG 243

Query: 319 G-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           G     GV+V   N + Y         CA     Y   L++ QE   S     L      
Sbjct: 244 GQQGPSGVIVCCDNFLVYKKVDHADVYCA-----YPRRLETGQEKNLSIVCSTLHRIRKF 298

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
           +      L+ ++ GDL  + + ++  VV+ +     +   + + I  + +   F+ +  G
Sbjct: 299 FF----ILIQSELGDLYKIEMEHEDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354

Query: 431 DSLLVQFTCGSG----TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--L 484
           +    QF+ G G     +M +S  K   G     A  TK+L   ++  L D V      L
Sbjct: 355 NHFFYQFS-GIGDEDNEAMCTS--KHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPIL 408

Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYEL 541
            +    + N  S Q         +L   GP   L+   +GL I   A             
Sbjct: 409 DMKVIDAKNASSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN----------- 450

Query: 542 VELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
            ELPG  K IWT+     +  NA          +Y  Y+I+S E  T++LE  + + EV 
Sbjct: 451 -ELPGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVV 495

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
           +S+   +   T    N+     +IQV + G R ++G  + + +   P N +  + + N  
Sbjct: 496 DSL--LLTNVTTIHVNILYDNSLIQVHDAGIRHINGKVIHEWVP--PKNKQIKAATSNCA 551

Query: 661 VLSVSIA 667
            + +S++
Sbjct: 552 QIVISLS 558



 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 63/286 (22%), Positives = 111/286 (38%), Gaps = 45/286 (15%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
                  G LL + G K+ ++     +L     Y   P  ++S+ +  + I   DI +S+ 
Sbjct: 965  CFCPFNGRLLASIGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVL 1024

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
               +      L L++ D        +E L   + ++         L V +E K  +    
Sbjct: 1025 IFFYDANMNTLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFVLRVPEEAKQEEYGIS 1084

Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
               +Y    M+ S K ++L     FHVG  VT   ++++  TSS+               
Sbjct: 1085 NKCWYGGEIMAGSNKNRRLEHIMSFHVGEIVTSLQKVKLSPTSSE--------------C 1130

Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
            +++ T+ G+IG   P D     ELT    Q L+  L    P + G     FR ++   + 
Sbjct: 1131 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTENPPLCGREHIFFRSYYHPVQ- 1185

Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                   ++D +L   +  LP + Q ++A     T   IL  L D+
Sbjct: 1186 ------HVIDGDLCEQFSSLPYDVQRKVAADLERTPDDILRKLEDI 1225


>gi|72390667|ref|XP_845628.1| damage-specific DNA binding protein [Trypanosoma brucei TREU927]
 gi|62359843|gb|AAX80271.1| damage-specific DNA binding protein, putative [Trypanosoma brucei]
 gi|70802163|gb|AAZ12069.1| damage-specific DNA binding protein, putative [Trypanosoma brucei
            brucei strain 927/4 GUTat10.1]
          Length = 1270

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 61/292 (20%), Positives = 122/292 (41%), Gaps = 28/292 (9%)

Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNAD--NPQNLVTEVYSKELKGAISALA---SLQ 1164
            ++ IGT +V  ++  +R    ++ T   A     + L+    SK+++GA+       +  
Sbjct: 852  VVLIGTTFVFPDEQLSRSSRFMWCTVEVAKLRTEKTLLRLQGSKDVEGALQCCCIVPNYA 911

Query: 1165 GHLLIASGPKIILHKWTGTELNGIA--FYDAPPLYVVSLNIVK---NFILLGDIHKSIYF 1219
            G + +  G  ++L+ W   +   +A        L V  + +++   ++I+  D   S +F
Sbjct: 912  GRVALGIGGCVVLYSWNAADATFVAEETIQIGTLIVRLIPVMQKEVSYIVASDARHSCFF 971

Query: 1220 LSWKEQGAQLNLLAKD---FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +        LN++A+D    G +DC   ++    S   + + D+  N     +   ++ S
Sbjct: 972  VRIDTIQGSLNIVARDPELRGVMDCAILQY---ESRHDVCLGDDLFNFFCVSHVEPLANS 1028

Query: 1277 -------WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA----LL 1325
                      +KL + A++H+G  +T  +     A  S      P      R      ++
Sbjct: 1029 SGVSAPAMPTKKLQTTAQYHMGDLIT-VMHQGSFAPCSVLNDVVPIPATLVRGVCGPQIV 1087

Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            +GT  G+ G I P+   TF  L+ L+  +   VP + G    SFR+    G+
Sbjct: 1088 YGTSHGAFGAITPISSETFILLKGLEVSVASVVPPLGGFTHASFREVLRVGQ 1139


>gi|430813298|emb|CCJ29330.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 1197

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 119/531 (22%), Positives = 211/531 (39%), Gaps = 82/531 (15%)

Query: 931  SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
            SRP    +    L + P + D S+       +  C+ G + +  Q  LKI  +      D
Sbjct: 718  SRPWLSYIINASLHLVPLIYD-SLEYCWGFSSEQCSEGIVGIQGQD-LKIFMV---ERLD 772

Query: 991  NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
            N      I L  TP +     +++++ +I S   + PL++       +  G Q  +H L 
Sbjct: 773  NVLKQDSISLMYTPRRFIKHPDEHIFYIIESDHNVLPLSERQK----RVEGLQNGDHILL 828

Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATI------------PMQSSENALTVRVVT 1098
              D+               P    G W +  TI             +  +E A ++ +VT
Sbjct: 829  PEDIGL-------------PRGLSGNWASCITILDPLSKKILTRIELDDNEAAFSIAMVT 875

Query: 1099 LFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKG 1155
              N   + +E  LAIG+    G++V    +       S  R  D  ++ +  V+  E+  
Sbjct: 876  FKN---QNDEIFLAIGS----GKNVILAPKSFSAAYISIYRFIDQGKS-IELVHKTEVDD 927

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
               AL   QG LL   G  + +++    +        A P  +V L+   + I++ DI +
Sbjct: 928  IPLALLGFQGRLLAGLGKMLRIYEMGMKKCLRKCEVRAVPNCIVQLHTQGSRIIIADIQE 987

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            SI+F  +K    +L + A D        T  ++D  T++    D+  N  I     ++SE
Sbjct: 988  SIHFAVYKYLENRLIVFADDVIP-RWTTTSTMLDYETVA--AGDKFGNFWINRCPLEVSE 1044

Query: 1276 SW----KGQKLLSRAEFHVGAHVTKFLRLQMLA--------TSSDRTGAAPGSDKTNRFA 1323
            S      G +L+    +  GA      RL+MLA        TS  +     G     R  
Sbjct: 1045 SADEDPSGAQLIHEKSYLFGAAK----RLKMLAHFYIGDTFTSMHKVQLIAGG----RDI 1096

Query: 1324 LLFGTLDGSIGCIAPL----DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
            +++  + GSIG   P     D   F++L++L +    S+    G +   +R ++   K  
Sbjct: 1097 IVYTGMMGSIGIFLPFVGREDVDFFQQLEALMRTEDLSL---IGRDHLMYRGYYVPVK-- 1151

Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
                 S+VD +L   + MLP  ++  IA++     S+I   + D+ +  +F
Sbjct: 1152 -----SVVDGDLCERFLMLPYNKKQVIANELDREISEIAKKIEDMRVRVAF 1197



 Score = 45.4 bits (106), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 118/544 (21%), Positives = 206/544 (37%), Gaps = 92/544 (16%)

Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
           +HG + +L      G +    +D +I+  +  +I++LE+    +         +      
Sbjct: 65  VHGIIRTLVGFRLAGTN----KDHLIVGSDSGRITILEYKPDSNAFSKVHQETYG----- 115

Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
             K G      G  + VDP+GR   +       ++   ++  +                A
Sbjct: 116 --KSGVRRVVPGQYLAVDPKGRATMIASIEKNKLVYVLNRDSA------TNLTISSPLEA 167

Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMIS 284
               S V +L  +D+          GY  PV   L     E E   +G+ +++    +++
Sbjct: 168 HKSCSLVFHLIGMDV----------GYENPVFAALEVDYTEAESDPSGK-AYREIQKVLT 216

Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASC 338
              +   L      WS    P D    L V  P G     G LV    +I Y H    + 
Sbjct: 217 YYELDLGLNHVVRKWSD---PVDRKANLLVTVPGGSDGPSGALVCTEGSIFYKHKGKKTH 273

Query: 339 ALALNNYAVSLDSSQ--ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
            + +     SL++SQ  ++  SS   ++  A    LQN+        GDL  +T+  +  
Sbjct: 274 RIPIPTRIGSLENSQKKQIIVSSVVHKMRGAFFFLLQNE-------DGDLFKVTIDSNDG 326

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF-TCGSGTSMLS-SGLKEEF 454
            V+ L +   +   +++ ++ + +   F+ S  G+  L QF   G   + +  S +    
Sbjct: 327 EVESLKIKYFDTVPVSTGLSILKSGFLFVASEYGNHHLYQFEKLGDDNNEIEFSSVDFPV 386

Query: 455 GDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLVNI 512
            D+ E   PS  R R   +  L D +N     +     N T E A + ++   R      
Sbjct: 387 LDLNEGYEPSYFRPRSLENLLLVDDLNSMNPLMDSKILNLTDEDAPQIYALCGR------ 440

Query: 513 GPLKDFS---YGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRM 568
           GP   F    YGL +N +  A+G           LPG    +WT    SS          
Sbjct: 441 GPRSTFRTLRYGLEVN-EIVASG-----------LPGSPTAVWTTKLTSS---------- 478

Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
               D+Y AY+++S    T+VL   + + EV+++  +     T+A   L G   +IQV  
Sbjct: 479 ----DQYDAYIVLSFVNGTLVLSIGETVEEVSDT-GFLSSSPTLAVQQL-GDDALIQVHP 532

Query: 629 RGAR 632
           +G R
Sbjct: 533 KGIR 536


>gi|154342093|ref|XP_001566998.1| putative CPSF-domain protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
 gi|134064323|emb|CAM40524.1| putative CPSF-domain protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
          Length = 1347

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 67/328 (20%), Positives = 132/328 (40%), Gaps = 43/328 (13%)

Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQ 1142
            P +     + V  +T    + ++ + LL +G+++   ++  AR GRV+ F+   + +   
Sbjct: 896  PAERCGEQIDVPAITSAGVSEEDWQHLLLVGSSFTFPDEQRARSGRVMWFAL--HEERQG 953

Query: 1143 NLVTEVYSKELKGAISALASL---QGHLLIASGPKIILHKWTGTELN---------GIAF 1190
              +  + SK++ GA+   A +   +G + +     + L+KW   +           G+  
Sbjct: 954  QRLRLIASKDIGGALQCCAEVPYYKGRIALGVNGCVCLYKWNTEDQTFVAEERCRVGLTV 1013

Query: 1191 YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD---FGSLDCFATEFL 1247
                PLY  +L    + ++  D+  S +F+        L +L ++    G +D +     
Sbjct: 1014 TRLIPLYNTAL--AASVLVALDVRHSAFFIEVDLLQGSLKVLCREGNLRGVMDGY----- 1066

Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ-----------KLLSRAEFHVGAHVTK 1296
            +     +L + D+  N       P   E   G            +   RA++HVG  VT 
Sbjct: 1067 VGSDAENLCLFDDNLNFTALKVVPLPVEPGDGDAAAAASGTPQCRFEVRAQYHVGDLVTC 1126

Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFA-------LLFGTLDGSIGCIAPLDELTFRRLQS 1349
                   ATS  +      S  +           L+F T  G  G + PL   T+  L++
Sbjct: 1127 VRPGSFAATSLMKAPTPSSSVPSPLLLPGIAGPQLVFATAHGGFGVVTPLHAATYLVLRA 1186

Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            L+  L  ++P + GL+ ++FR+    G+
Sbjct: 1187 LEASLERTLPPLGGLSHQAFREVLRAGQ 1214


>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
 gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
            strain H]
          Length = 1276

 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 63/286 (22%), Positives = 112/286 (39%), Gaps = 45/286 (15%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
              +   G LL + G K+ ++     +L     Y   P  ++S+ +  + I   DI +S+ 
Sbjct: 1011 CFSPFNGRLLASVGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVL 1070

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
               +      L L++ D        +E L   + ++         L V +E K  +    
Sbjct: 1071 VFFYDANMNALRLISDDIIPRWITCSEILDHHTIMAADKFDSVFVLRVPEEAKQEEYGIS 1130

Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
               +Y    M+ S K ++L     FHVG  VT   ++++  TSS+               
Sbjct: 1131 NKCWYGGEMMAGSNKNRRLEHIMNFHVGEIVTSLQKVKLSPTSSE--------------C 1176

Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
            +++ T+ G+IG   P D     ELT    Q L+  L    P + G     FR ++   + 
Sbjct: 1177 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTENPPLCGREHIFFRSYYHPVQ- 1231

Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                   ++D +L   +  LP + Q ++A     T   IL  L D+
Sbjct: 1232 ------HVIDGDLCEQFSSLPYDIQRKVAADLERTPDDILRKLEDI 1271



 Score = 47.8 bits (112), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 123/604 (20%), Positives = 222/604 (36%), Gaps = 114/604 (18%)

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
           L+       L L+    + G +  L      G++    +D +++  +  ++ +L+F +  
Sbjct: 41  LLRADKQGKLNLIVSKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
           +      +HC       + K G      G  + VDP+GR   +        VY L     
Sbjct: 97  NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDNK 149

Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
            Q+ I              D  G   GF   + +S   N    D K V +   +      
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEMYD-KQVTNTTEIDACTRK 208

Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            ++ L E +L                   ++  +++H        LP D    L +P P 
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDMSAHLLIPIPG 243

Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           G     GV+V   N + Y         CA     Y   L++ QE    + S+     H  
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSTVHRI 295

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
             +    L+ ++ GDL  + + +   VV+ +     +   + + I  + +   F+ +  G
Sbjct: 296 R-KFFFILIQSEYGDLYKIEMDHQDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354

Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
           +    QF+  G   +      K   G     A  TK+L   ++  L D V      L + 
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411

Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
              + N  S Q  ++   R      GP   L+   +GL I   A              EL
Sbjct: 412 ILDAKNANSPQ-IYALCGR------GPRSSLRILQHGLSIEELADN------------EL 452

Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG  K IWT+     +  NA          +Y  Y+I+S E  T++LE  + + EV +++
Sbjct: 453 PGRPKYIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDTL 498

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
              +   T    N+     +IQV + G R ++G  + + +   P N +  + + N+T + 
Sbjct: 499 --LLTNVTTIHVNILYDNSLIQVHDTGIRHINGKVINEWVP--PKNKQVKAATSNATQIV 554

Query: 664 VSIA 667
           +S++
Sbjct: 555 ISLS 558


>gi|225558618|gb|EEH06902.1| DNA damage-binding protein 1a [Ajellomyces capsulatus G186AR]
          Length = 1201

 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 44/315 (13%)

Query: 1110 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            L  +GT+Y+   GE  + RGR+L F    N       + +V    +KGA  ALA +Q  +
Sbjct: 885  LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 938

Query: 1168 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            + A    ++++     ++    L+  A Y    AP    V + +  N I + D+ KS+  
Sbjct: 939  VAALMKTVVVYTLSKGQFADYTLSKTASYRTSTAP----VDIAVTGNLIAVADLMKSVSI 994

Query: 1220 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            + + +QGA      L  +A+ F +L   A   + + + L    SD + N+ + +      
Sbjct: 995  VEY-QQGANGLPDSLTEVARHFQTLWSTAVAPVAEDTWLE---SDAEGNLVMLHRNVNGV 1050

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1333
                 ++L   +E  +G  V +   + +  +       +P +          GT++GSI 
Sbjct: 1051 TDDDRRRLEVTSEISLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1102

Query: 1334 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
              G I P  +    RLQS    +V +     G+    FR F  N       P   VD EL
Sbjct: 1103 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQAEEPYRFVDGEL 1158

Query: 1392 LSHYEMLPLEEQLEI 1406
            +  +    +E Q EI
Sbjct: 1159 IERFLGCSVELQEEI 1173



 Score = 44.7 bits (104), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            L   A  L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341

Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
                   W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +G  + F+GS  GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416


>gi|317143715|ref|XP_001819645.2| pre-mRNA-splicing factor rse1 [Aspergillus oryzae RIB40]
          Length = 1209

 Score = 50.1 bits (118), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 77/357 (21%), Positives = 159/357 (44%), Gaps = 36/357 (10%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1138
            +TI ++ +E A++V  V     T++++ET L +GTA     +   +A G + ++   R  
Sbjct: 870  STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL   QG L+   GP + ++     +L         P  +
Sbjct: 924  EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    +T  ++D  T +    
Sbjct: 983  VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
            D+  NI +     K+SE       G  L+  R   H   +  + +     Q + T+  +T
Sbjct: 1040 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +R +++  K        ++D +L   Y +LP + ++ IA +   +  +I   ++D+
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAELDRSVREIERKISDM 1203


>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
            18224]
 gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
            18224]
          Length = 1139

 Score = 50.1 bits (118), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 73/296 (24%), Positives = 121/296 (40%), Gaps = 36/296 (12%)

Query: 1113 IGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            +GTAY+  E   + RGR+LLF    N       ++      +KGA  ALA +  +++ A 
Sbjct: 833  VGTAYLDDETAESIRGRILLFEVDSNRK-----LSLFLEHPVKGACRALAMMGDYIVAAL 887

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLY-----VVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
               +++ + TG    G        +Y      V + +    I++ D+ KSI  +   +  
Sbjct: 888  VKTVVIFEVTGQPQTGKYSLQKAAVYRTSTAPVDIAVTDKTIVVADLMKSISIVESNKTD 947

Query: 1227 AQLNLLAKDFGSLDCFATEF---LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            A L + AK+      FAT +   + D  +   +VSD + N+ +             ++L 
Sbjct: 948  A-LTMEAKEVAR--HFATVWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGMTEEDRRRLE 1004

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI---GCIAPLD 1340
              +E  +G  V +   + +  TS+      P +          GT++GSI     I P  
Sbjct: 1005 VTSELLLGEMVNRIRPVNIPQTST--MAVTPKA--------FLGTVEGSIYLFALINPEH 1054

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNP-RSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
            +    RLQ+     VDS     GL P   FR F S  +     P   VD EL+  +
Sbjct: 1055 QDFLMRLQTAISAYVDS----PGLMPFNKFRAFRSTVREAEE-PFRFVDGELIERF 1105



 Score = 40.4 bits (93), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G   I Y                 +D ++     + S  LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKY-----------------IDDAK---NETISNPLDEA 284

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSL 422
                W+Q D    LL+   G L  L +V D +  V+   L     +   S +  +G  +
Sbjct: 285 TIFVAWVQVDGQRWLLADDYGRLFFLMLVLDSQNEVEGWKLDYLGEASRASVLIYLGAGM 344

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++ + GS
Sbjct: 345 TFIGSHQGDSQVIRISEGS 363


>gi|115397303|ref|XP_001214243.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114192434|gb|EAU34134.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 1140

 Score = 50.1 bits (118), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 79/322 (24%), Positives = 132/322 (40%), Gaps = 51/322 (15%)

Query: 1111 LAIGTAYV-QGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
              +GTAY+ + ED  + RGR+L+F      DN + L T+V    +KGA  ALA L   ++
Sbjct: 831  FVVGTAYLDEDEDRDSIRGRILMF----EVDNGRKL-TKVAELAVKGACRALAMLGDKVV 885

Query: 1169 IASGPKIILHKWTGT-----ELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
             A    ++++K TG      +L  +A Y    AP    V + +  N I + D+ KS   +
Sbjct: 886  AALVKTVVIYKVTGNNFGAMKLEKLASYRTSTAP----VDITVTDNVIAVSDLMKSSCLV 941

Query: 1221 SW--KEQGA--QLNLLAKDFGS-----LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
             +   E G    L  +A+ F +     + C A    ++        SD + N+ I     
Sbjct: 942  EYIEGEDGLPDSLKEVARHFQTVWATGIACIAPHTYLE--------SDAEGNLIILRRNL 993

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
               E    ++L    E  +G  V +   + +Q LA+ +    A              GT+
Sbjct: 994  SGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 1041

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
            +GSI   A ++      L  LQ  +   +  +  +    FR F S  +  +  P   VD 
Sbjct: 1042 EGSIYLYAIINPEHQDFLMRLQATMAGKIESLGDMPFNEFRGFRSMVREAKE-PYRFVDG 1100

Query: 1390 ELLSHYEMLPLEEQLEIAHQTG 1411
            EL+  +       Q +I +  G
Sbjct: 1101 ELIERFLTCEPSVQEDIVNSVG 1122



 Score = 40.8 bits (94), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 60/146 (41%), Gaps = 25/146 (17%)

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS--LDSSQELPRS 358
           A  L   A  L+ VP+P+GG+L++G  +I Y           NN  VS  LD +      
Sbjct: 237 AQELDLGASHLIPVPAPLGGLLILGETSIKYVDDD-------NNEIVSRLLDEA------ 283

Query: 359 SFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
                       W Q D    LL+   G L  L +V D    VQ   L     +   S +
Sbjct: 284 -------TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSENQVQGWQLDHLGNTSRASTL 336

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGS 441
             +G  + F+GS  GDS +++   GS
Sbjct: 337 VYLGGGVIFVGSHQGDSQVLRVGDGS 362


>gi|70945139|ref|XP_742421.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56521397|emb|CAH76894.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 435

 Score = 50.1 bits (118), Expect = 0.010,   Method: Composition-based stats.
 Identities = 52/214 (24%), Positives = 99/214 (46%), Gaps = 10/214 (4%)

Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
            P  +++SL++++N+I++GDI  S+  LS+    + L  + +D+ ++ C    F+   S  
Sbjct: 197  PSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSILTEVCRDYSNVWC---TFVCALSKS 253

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
              +VSD + N  +F  +          KL   A F+ G  V K L + +   SS      
Sbjct: 254  HFLVSDMESNFLVFQKSSIKYNDEDSFKLSRVALFNHGHVVNKMLPVSL---SSLIEEEE 310

Query: 1314 PGSD-KTNRFALLFGTLDGSIGCIAPLDEL-TFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
            P ++    + ++L  + +GSI  I P   L  F++   ++  L DS+  +  +N  S   
Sbjct: 311  PQNEILRKKESILCASSEGSISSIIPFSNLANFKKALCIELALNDSLSSIGNINDNSNNT 370

Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
            +  N          +VD E+   +  +P E+Q +
Sbjct: 371  YKMN--LSEKSCKGVVDGEVFKMFFSMPFEKQFK 402


>gi|321260749|ref|XP_003195094.1| hypothetical protein CGB_G1120W [Cryptococcus gattii WM276]
 gi|317461567|gb|ADV23307.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 1276

 Score = 50.1 bits (118), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 112/266 (42%), Gaps = 42/266 (15%)

Query: 1111 LAIGTAYV---QGED---------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1157
            LA+GTA++    GED         V   GRVLL    +  D       ++ ++    GA+
Sbjct: 945  LAVGTAFLPPDDGEDSSWDEGNLAVVKEGRVLLLEI-KEGDAGGGWDVKIKAELTTVGAV 1003

Query: 1158 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1208
             AL  + G L +A+G K+ +H+      EL   + + A    + SL+ +       +  +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTMHRLDHNSVELEETSSW-ASAYVISSLSALHPSHTRPEGAL 1062

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD  +S+  L+  E    +    ++  +    A   L D    ++VVSD   N+  + 
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-AVVVSDAHSNLLTYR 1121

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                       QKL   A F +   VT+F    ++ T++      P         +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQNGSLVPTTTAPEIIIPD--------VLFAT 1164

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKL 1354
             +G +G I  L  ++ R L  LQ+ +
Sbjct: 1165 REGRLGVIGELGTMSSRTLDDLQRNM 1190


>gi|398391687|ref|XP_003849303.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
 gi|339469180|gb|EGP84279.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
          Length = 1143

 Score = 50.1 bits (118), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 133/323 (41%), Gaps = 52/323 (16%)

Query: 1111 LAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              IGTAY+  +D + A+GR+L+       D    LVTE+    ++GA   LA   G ++ 
Sbjct: 834  FVIGTAYLDDQDASNAKGRILVLEV--TEDRRLKLVTEI---SVRGACRCLAVSHGRIVA 888

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAP--PLYV-----------VSLNIVKNFILLGDIHKS 1216
            A    +I++ +          Y+ P  P  V           + + +  + I + D+ KS
Sbjct: 889  ALIKTVIIYSFE---------YETPSSPAMVKKAAYRTSTAPIDMCVTGDIIAVTDLMKS 939

Query: 1217 IYFL--SWKEQGAQLNL--LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
            +  +  +  + G   NL  +A+ F +L   A   + +   L    SD + N+ +  +  K
Sbjct: 940  MSLVQHTLGQAGGPDNLTEVARHFDTLWGTAVANVDENIYLE---SDAEGNLVVLEHDVK 996

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
                   ++L   +E  +G  V +  R+ +          +P  + T        T++GS
Sbjct: 997  GFSEEDRRRLRVTSEILLGEMVNRIRRIDV----------SPTPNATVIPRAFLATVEGS 1046

Query: 1333 I---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-QFHSNGKAHRPGPDSIVD 1388
            I     IA   +    R+Q+   ++V S  HV     R F+ Q    G+    GP   VD
Sbjct: 1047 IYLFALIAEGKQDLLIRMQNKMAEMVQSPGHVPFAKFRGFKTQVRDMGEE---GPSRFVD 1103

Query: 1389 CELLSHYEMLPLEEQLEIAHQTG 1411
             EL+  +     + Q E+A + G
Sbjct: 1104 GELIERFLDCDEDVQAEVAKELG 1126


>gi|358366432|dbj|GAA83053.1| UV-damaged DNA binding protein [Aspergillus kawachii IFO 4308]
          Length = 1643

 Score = 49.7 bits (117), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 132/325 (40%), Gaps = 58/325 (17%)

Query: 1111 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              +GTAY+  E+  + RGR+L+F      DN + L T+V    +KGA  ALA L G  ++
Sbjct: 772  FVVGTAYLDDENEESIRGRILVFEI----DNGRKL-TKVAELPVKGACRALAML-GEKIV 825

Query: 1170 ASGPKIIL------HKWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
            A+  K ++      + +   +L  +A Y    AP    V + +  N I + D+ KS+  +
Sbjct: 826  AALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAP----VDVTVTGNVIAIADLMKSVCLV 881

Query: 1221 SWKE----QGAQLNLLAKDFGSL-----DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
             + E        L  +A+ F ++      C A +  ++        +D + N+ +     
Sbjct: 882  EYSEGENGMPDSLTEVARHFQTVWATGVVCIAKDTFLE--------TDAEGNLIVLRRNL 933

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
               E    ++L    E  +G  V +   + +Q LA+ +    A              GT+
Sbjct: 934  TGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 981

Query: 1330 DGSI---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1386
            +GSI     I P  +    RLQ+     V+S+ ++     R FR      K     P   
Sbjct: 982  EGSIYLFAIINPEHQDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVREAKE----PYRF 1037

Query: 1387 VDCELLSHYEMLPLEEQLEIAHQTG 1411
            VD EL+  +       Q EI    G
Sbjct: 1038 VDGELIERFLTCEPSLQEEIVDSVG 1062



 Score = 43.9 bits (102), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 105/264 (39%), Gaps = 29/264 (10%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP GR   + VY   + ++   Q  S   G +    SG       E       R +D  
Sbjct: 62  IDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTTAR-IDEL 120

Query: 245 HVKDFIFVHGYIEP--VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
            V+   F+H    P  + ++  + +     +V   H++   S+       ++  L   + 
Sbjct: 121 FVRSSAFLHVQSGPPRLALLYEDNQKKVRLKVRALHYSAATSSTGADAAFEES-LDGFSQ 179

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            L   A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R     
Sbjct: 180 ELDLGASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP---- 220

Query: 363 ELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITT 417
            LD A     W Q D    LL+   G L  L +V D    VQ   L     +   S +  
Sbjct: 221 -LDEATIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIY 279

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +G  + F+GS  GDS +++   GS
Sbjct: 280 LGGGVIFVGSHQGDSQVLRIGNGS 303


>gi|325096432|gb|EGC49742.1| pre-mRNA-splicing factor Rse1 [Ajellomyces capsulatus H88]
          Length = 1209

 Score = 49.7 bits (117), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 76/356 (21%), Positives = 151/356 (42%), Gaps = 38/356 (10%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNAD 1139
            I ++ +E A++V  V     +++++ET L +GT    G+D+    R          R  +
Sbjct: 872  IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVNPRSCTAGFIHIYRFQE 924

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
              + L   ++  +++    AL   QG LL   G  + ++     ++         P  VV
Sbjct: 925  EGKEL-EFIHKTKVEQPPMALLGFQGRLLAGIGTDLRIYDLGMKQMLRKCQASVVPHLVV 983

Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
             L    + I++ D+ +S+ ++ +K Q  +L     D  S     T  ++D  T++    D
Sbjct: 984  GLQTQGSRIIVSDVQESLTYVVYKYQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGD 1040

Query: 1260 EQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTG 1311
            +  N+ +     K SE       G  L+   ++  GA    ++      Q L TS  +  
Sbjct: 1041 KFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQ 1100

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
               G     R  L++  L G++  + P    +E+ F   QSL+ +L    P +AG +   
Sbjct: 1101 LVTG----GRDILVWTGLQGTVSMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLI 1154

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +R +++  K         +D +L   Y +LP +++ +IA +   +  +I   + D+
Sbjct: 1155 YRSYYAPAKG-------TIDGDLCETYLLLPNDKKQQIAGELDRSVREIERKIADM 1203


>gi|330935579|ref|XP_003305038.1| hypothetical protein PTT_17772 [Pyrenophora teres f. teres 0-1]
 gi|311318228|gb|EFQ86975.1| hypothetical protein PTT_17772 [Pyrenophora teres f. teres 0-1]
          Length = 1115

 Score = 49.7 bits (117), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 68/297 (22%), Positives = 122/297 (41%), Gaps = 35/297 (11%)

Query: 1111 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              IGTAY+  ++  + RGR+L+         P+ ++  V    +KG    LA+ +G ++ 
Sbjct: 809  FVIGTAYLDDQNTTSERGRILILEV-----TPERILKLVTEIAVKGGCRCLATCEGKIVA 863

Query: 1170 ASGPKIILHKWTG-------TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            A    I+++           T+L       AP    + + +  + I + D+ KS+  + +
Sbjct: 864  ALIKTIVIYDVEYPTQTPFLTKLATFRCSTAP----IDITVNGSKIAIADLMKSLVVVEY 919

Query: 1223 KEQGAQL-NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
             +  A L + L +            + +  T   + SD + N+ + Y  P        ++
Sbjct: 920  TKGEAGLPDKLVEVARHYQITWATAVAEVDTNMYLESDAEGNLMVLYRDPNGVTDDDKRR 979

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI---GCIAP 1338
            L   +E  +G  V +  R+ +L T+SD     P +          GT++GSI   G I+P
Sbjct: 980  LNVSSEMLLGEMVNRIRRIDVL-TASDAV-VIPRA--------FVGTVEGSIYLFGLISP 1029

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
              +     L +LQ  L   VP    ++   FR F  NG      P   VD E +  +
Sbjct: 1030 AHQ---NLLMTLQSNLGALVPAPGDMDFAKFRAFK-NGVREEEEPMRFVDGEFVERF 1082


>gi|358366518|dbj|GAA83139.1| nuclear mRNA splicing factor [Aspergillus kawachii IFO 4308]
          Length = 1209

 Score = 49.7 bits (117), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 100/477 (20%), Positives = 199/477 (41%), Gaps = 48/477 (10%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + +  Q +    ++ S    DN    Q IPL  TP +     E+ L+ +I S  
Sbjct: 759  QCVEGMVGIQGQNL----RIFSIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDN 814

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
             VL P  +  + L++       D   L   D         +   +++++P  A       
Sbjct: 815  NVLSPSTR--AKLLEDSKSRGGDETVLPPEDFGYPRATGHWASCIQVVDPLDAKA---VV 869

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1138
             TI ++ +E A+++  V     T++++ET L +GTA     +   +A G + ++   R  
Sbjct: 870  HTIELEENEAAISIAAVPF---TSQDDETFLVVGTAKDMSVNPPKSAGGYIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL   QG L+   G  + ++     +L         P  +
Sbjct: 924  EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGSLLRIYDLGMKQLLRKCQAPVVPKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    AT  ++D  T +    
Sbjct: 983  VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTATT-MVDYETTA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRT 1310
            D+  N+ +     K SE       G  L+    +  G      +   +  Q + TS  +T
Sbjct: 1040 DKFGNLWLLRCPKKTSEEADEDGSGAHLIHERGYLQGTPNRLELMIHVYTQDIPTSLHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFIGREDVDF--FQNLEMQLAAQHPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +R +++  K        ++D +L   Y +LP + ++ IA +   +  +I   ++D+
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203


>gi|68531971|ref|XP_723667.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23478038|gb|EAA15232.1| Drosophila melanogaster CG13900 gene product [Plasmodium yoelii
           yoelii]
          Length = 1235

 Score = 49.7 bits (117), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 62/295 (21%), Positives = 121/295 (41%), Gaps = 43/295 (14%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L+ ++ GDL  + V ++  +V+ +     +   + + I  + +   F+ +  G+    QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361

Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
           +     S  +M +S      G     A  T++L+     D +  +    ++ +  + ++N
Sbjct: 362 SGIGNDSNDAMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 419

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWT 552
                       R SL      +   +GL I   A+             ELPG  + IWT
Sbjct: 420 LPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 461

Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
           V   +S               EY  Y+I+S E  T++LE  + + EV +S+   +   T 
Sbjct: 462 VKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 505

Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
              NL      IQV++ G R ++G  + + +   P N +  + + N + + VS++
Sbjct: 506 IHINLLYDNSFIQVYDTGIRHINGKIVQEWIP--PKNKQINAATSNGSQIVVSLS 558



 Score = 46.2 bits (108), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 60/300 (20%), Positives = 115/300 (38%), Gaps = 59/300 (19%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
                  G ++++ G K+ ++     +L     Y   P  +VS+ +  + I   DI +S+ 
Sbjct: 956  CFCPFNGRVIVSVGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKVSGDRIFASDIRESVL 1015

Query: 1219 FLSWKEQGAQLNLLAKDFG----------------SLDCFATEFLIDGSTLSLVVS---- 1258
               +      + L++ D                  + D F + F++  S L+  ++    
Sbjct: 1016 IFFYDSNQNLIRLISDDIIPRWITCSEILDHHTIIAADKFDSVFILRVSLLTFFITPFCH 1075

Query: 1259 -------DEQKNI--QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
                    E+  I  + +Y    ++ S K +K+     FH+G  VT   ++++  TSS+ 
Sbjct: 1076 LVPEEAKQEEYGIANKCWYGGEVINSSTKNRKMEHIMSFHIGEIVTSLQKVKLSPTSSE- 1134

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGL 1364
                          +++ T+ G+IG   P D     ELT    Q L+  L      + G 
Sbjct: 1135 -------------CIIYSTIMGTIGAFIPYDSKEELELT----QHLEIILRTEKHSLCGR 1177

Query: 1365 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                FR ++   +        ++D +L   +  LP E Q +I      T  +IL  L D+
Sbjct: 1178 EHIFFRSYYHPVQ-------HVIDGDLCEQFSSLPFEVQRKIGSDLEKTPDEILRKLEDI 1230


>gi|116207186|ref|XP_001229402.1| conserved hypothetical protein [Chaetomium globosum CBS 148.51]
 gi|88183483|gb|EAQ90951.1| conserved hypothetical protein [Chaetomium globosum CBS 148.51]
          Length = 1211

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 99/455 (21%), Positives = 192/455 (42%), Gaps = 48/455 (10%)

Query: 995  VQK-IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
            +QK IPL  TP ++    E+  +  I S     P  ++ + L++Q      D   L   D
Sbjct: 786  IQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTLP-PELRAQLLEQSGAVNGDAAILPPED 844

Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1113
                     +   I   D  G        I  + +E A++  VV     +++E E+ L +
Sbjct: 845  FGYPRATGRWASCISVVDPLGEEPSVLQRIDFEGNEAAVSAAVVPF---SSQEGESFLIV 901

Query: 1114 GTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
            GT    G+D+    R   FS G     R  ++ ++L   ++  +++    AL   QG LL
Sbjct: 902  GT----GKDMVLNPRK--FSEGYIHVYRFHEDGRDL-EFIHKTKVEEPPMALIPFQGRLL 954

Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
               G  + ++     +L   A  +  P  +VSL    + I++GD+ + I ++ +K +  +
Sbjct: 955  AGIGKTLRVYDLGLRQLLRKAQGEVAPQLIVSLQTQGSRIIVGDVQQGITYVVYKPESNK 1014

Query: 1229 LNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG-----QKL 1282
            L   A D  +++ + T   ++D    S+   D+  NI I   + + S+         Q L
Sbjct: 1015 LLPFADD--TINRWTTCTTMVDYE--SVAGGDKFGNIWILRCSERASQESDEPGSEIQLL 1070

Query: 1283 LSRAEFHVGAH----VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
             +R   H GA            Q L TS  +T    G        L++  + G++G + P
Sbjct: 1071 HARNYLH-GAQSRLSAMAHFYTQDLPTSIVKTNLVVGGQD----VLVWSGIQGTVGVLIP 1125

Query: 1339 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
                +++ F   Q+L+  +    P +AG +   +R ++   K        ++D +L   +
Sbjct: 1126 FVSREDVDF--FQNLESHMRAEDPPLAGRDHLIYRGYYVPVKG-------VIDGDLCERF 1176

Query: 1396 EMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
             +LP +++  IA +   +  +I   ++D+   ++F
Sbjct: 1177 SLLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1211


>gi|325094412|gb|EGC47722.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H88]
          Length = 1201

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 44/315 (13%)

Query: 1110 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            L  +GT+Y+   GE  + RGR+L F    N       + +V    +KGA  ALA +Q  +
Sbjct: 885  LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 938

Query: 1168 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            + A    ++++     ++    L+  A Y    AP    V + +  N I + D+ KS+  
Sbjct: 939  VAALMKTVVVYTLSKGQFADYTLSKTASYRTSTAP----VDIAVTGNLIAVADLMKSVSI 994

Query: 1220 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            + + +QGA      L  +A+ F +L   A   + + + L    SD + N+ + +      
Sbjct: 995  VEY-QQGANGLPDSLTEVARHFQTLWSTAVAPVAEDTWLE---SDAEGNLVMLHRNVNGV 1050

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1333
                 ++L   +E  +G  V +   + +  +       +P +          GT++GSI 
Sbjct: 1051 TDDDRRRLEVTSEILLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1102

Query: 1334 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
              G I P  +    RLQS    +V +     G+    FR F  N       P   VD EL
Sbjct: 1103 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQTEEPYRFVDGEL 1158

Query: 1392 LSHYEMLPLEEQLEI 1406
            +  +    +E Q EI
Sbjct: 1159 IERFLNCGVELQEEI 1173



 Score = 44.7 bits (104), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            L   A  L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341

Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
                   W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +G  + F+GS  GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416


>gi|145240731|ref|XP_001393012.1| pre-mRNA-splicing factor rse1 [Aspergillus niger CBS 513.88]
 gi|134077536|emb|CAK96680.1| unnamed protein product [Aspergillus niger]
          Length = 1209

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 100/477 (20%), Positives = 198/477 (41%), Gaps = 48/477 (10%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + +  Q +    ++ S    DN    Q IPL  TP +     E+ L+ +I S  
Sbjct: 759  QCVEGMVGIQGQNL----RIFSIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDN 814

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
             VL P  +  + L++       D   L   D         +   +++++P  A       
Sbjct: 815  NVLAPSTR--AKLLEDSKSRGGDETVLPPEDFGYPRATGHWASCIQVVDPLDAKA---VV 869

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
             TI ++ +E A+++  V     T++++ET L +GTA         +A G + ++   R  
Sbjct: 870  HTIELEENEAAISIAAVPF---TSQDDETFLVVGTAKDMTVNPPGSAGGYIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL   QG L+   G  + ++     +L         P  +
Sbjct: 924  EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGSLLRIYDLGMKQLLRKCQAPVVPKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    AT  ++D  T +    
Sbjct: 983  VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTATT-MVDYETTA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRT 1310
            D+  N+ +     K SE       G  L+    +  G      +   +  Q + TS  +T
Sbjct: 1040 DKFGNLWLLRCPKKTSEEADEDGSGAHLIHERGYLQGTPNRLELMIHVYTQDIPTSLHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFIGREDVDF--FQNLEMQLAAQHPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             +R +++  K        ++D +L   Y +LP + ++ IA +   +  +I   ++D+
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203


>gi|66361481|ref|XP_627314.1| possible spliceosome factor [Cryptosporidium parvum Iowa II]
 gi|46228697|gb|EAK89567.1| possible spliceosome factor [Cryptosporidium parvum Iowa II]
          Length = 1317

 Score = 49.3 bits (116), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 63/295 (21%), Positives = 123/295 (41%), Gaps = 42/295 (14%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            V+   ++ + +AL   +G LL+     + ++      L   + Y   P  +  + +V + 
Sbjct: 1040 VHITPIENSATALTGWRGRLLVGINKTLRVYSLGKKRLLRKSEYRNIPQGLTWIKVVNDR 1099

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI- 1266
            I  GDI   +    +     Q  L+AKD       +   ++D  T++  VSD+  NI + 
Sbjct: 1100 IFAGDISNGVLVFKFNNTSNQFILVAKDPMPRWLTSACEVLDYHTIA--VSDKFDNIIVS 1157

Query: 1267 ---------FYYAPKMSESWKGQ--------KLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
                     F +    +++   Q        ++ + A+FH+G  VT   + Q+  TS++ 
Sbjct: 1158 RVPAEASDDFSFVTSFTDNNNSQSSALMRTHQINTVAQFHLGDIVTCLQKSQLTPTSAE- 1216

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                         A+++GT+ GSIG ++P L+      L  L+  L      +   +   
Sbjct: 1217 -------------AIIYGTVLGSIGSLSPILNNEDIELLSKLEILLRKQKSTLLSRDHLM 1263

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
            FR ++S    H     +++D +    + +L  + Q EIA +   T  +I   L+D
Sbjct: 1264 FRSYYS--PVH-----NVIDGDFCQTFTILDSQIQSEIASKLDVTVEEIYKKLDD 1311


>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1158

 Score = 49.3 bits (116), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 74/315 (23%), Positives = 131/315 (41%), Gaps = 44/315 (13%)

Query: 1110 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            L  +GT+Y+   GE  + RGR+L F    N       + +V    +KGA  ALA +Q  +
Sbjct: 842  LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 895

Query: 1168 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            + A    ++++     ++    L+  A Y    AP    + + +  N I + D+ KS+  
Sbjct: 896  VAALMKTVVVYTISKGQFADYTLSKTASYRTSTAP----IDIAVTGNLIAVADLMKSVSI 951

Query: 1220 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            + + +QG+      L  +A+ F +L   A   + + + L    SD + N+ + +      
Sbjct: 952  VEY-QQGSNGLPDSLTEVARHFQTLWSTAVAHVAEDTWLE---SDAEGNLVMLHRNVNGV 1007

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1333
                 ++L   +E  +G  V +   + +  +       +P +          GT++GSI 
Sbjct: 1008 TDDDRRRLEVTSEILLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1059

Query: 1334 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
              G I P  +    RLQS    +V +     G+    FR F  N       P   VD EL
Sbjct: 1060 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQAEEPYRFVDGEL 1115

Query: 1392 LSHYEMLPLEEQLEI 1406
            +  +    +E Q EI
Sbjct: 1116 IERFLSCSVELQEEI 1130



 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 63/136 (46%), Gaps = 25/136 (18%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 258 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 299

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
           W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  +G  + F+
Sbjct: 300 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 357

Query: 426 GSRLGDSLLVQFTCGS 441
           GS  GD  L++ T GS
Sbjct: 358 GSHQGDPELIRITEGS 373


>gi|240275059|gb|EER38574.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H143]
          Length = 1134

 Score = 49.3 bits (116), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 44/315 (13%)

Query: 1110 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            L  +GT+Y+   GE  + RGR+L F    N       + +V    +KGA  ALA +Q  +
Sbjct: 818  LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 871

Query: 1168 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            + A    ++++     ++    L+  A Y    AP    V + +  N I + D+ KS+  
Sbjct: 872  VAALMKTVVVYTLSKGQFADYTLSKTASYRTSTAP----VDIAVTGNLIAVADLMKSVSI 927

Query: 1220 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            + + +QGA      L  +A+ F +L   A   + + + L    SD + N+ + +      
Sbjct: 928  VEY-QQGANGLPDSLTEVARHFQTLWSTAVAPVAEDTWLE---SDAEGNLVMLHRNVNGV 983

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1333
                 ++L   +E  +G  V +   + +  +       +P +          GT++GSI 
Sbjct: 984  TDDDRRRLEVTSEILLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1035

Query: 1334 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
              G I P  +    RLQS    +V +     G+    FR F  N       P   VD EL
Sbjct: 1036 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQTEEPYRFVDGEL 1091

Query: 1392 LSHYEMLPLEEQLEI 1406
            +  +    +E Q EI
Sbjct: 1092 IERFLNCGVELQEEI 1106



 Score = 44.7 bits (104), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            L   A  L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341

Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
                   W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +G  + F+GS  GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416


>gi|242018509|ref|XP_002429717.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
 gi|212514723|gb|EEB16979.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
          Length = 1218

 Score = 49.3 bits (116), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 77/388 (19%), Positives = 149/388 (38%), Gaps = 58/388 (14%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RI++P       +T   + ++ +E AL++ +V  FN   +    ++ +   Y       
Sbjct: 867  IRIIDPVEG----RTDKIVRLEQNEAALSIALVK-FNNHPESLFLVVGVVKEYQLSPRQV 921

Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
            + G +  F    +  +    +  V+   +  A +A+    G LL+  G  + L+     +
Sbjct: 922  SFGYLYTFRINEDVTD----LELVHKTTVDEAPAAVCPYHGRLLVGVGRMLRLYDLGKKK 977

Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
            L         P  +VS+      + + D+ +S+Y + +K Q  QL + A D        T
Sbjct: 978  LLRKCENKYIPNQIVSICATGQRVFVSDVQESVYMVRYKRQENQLIIFADDTHPRWITCT 1037

Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLSRA 1286
              L D  T++   +D+  NI I   +  +++            W         QK    A
Sbjct: 1038 TIL-DYDTVA--TADKFGNIAIIRLSSIITDDVDEDPTGNKALWDRGLLNGASQKADVLA 1094

Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
             FHVG       +  ++          PG  ++    L++ +L G++G + P    T R 
Sbjct: 1095 NFHVGETCMSLQKATLI----------PGGSES----LVYTSLSGTVGVLVP---FTSRE 1137

Query: 1347 ----LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
                 Q L+  +    P + G +  SFR ++   K       +++D +L   Y  +   +
Sbjct: 1138 DHDFFQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK-------NVIDGDLCEQYNSIEPAK 1190

Query: 1403 QLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            Q  IA       S +   L D+    +F
Sbjct: 1191 QKSIAEDLDRNPSDVSKKLEDIRTRYAF 1218



 Score = 41.6 bits (96), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 68/323 (21%), Positives = 122/323 (37%), Gaps = 65/323 (20%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVATSMCVMKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E GD    AP           AL+++V  +E+      S + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPR----------ALRNLVQVDEMD-----SLSP 406

Query: 495 ESAQKTFSFAVRDS-----LVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG 546
             A +    A  D+     L   GP   L+   +GL +            S   + ELPG
Sbjct: 407 IMACQVADLANEDTPQLYMLCGRGPRSTLRVLRHGLEV------------SEMAVSELPG 454

Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
               +WTV  +               ++EY AY+I+S    T+VL   + + EVT+S   
Sbjct: 455 NPNAVWTVKRR--------------VEEEYDAYIIVSFVNATLVLSIGETVEEVTDS--G 498

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
           F+      + +  G   ++QV+  G R +              N     G +  T++  +
Sbjct: 499 FLGTTPTLSCSALGDDALVQVYPDGIRHIRADKRV--------NEWKAPGKK--TIMKCA 548

Query: 666 IADPYVLLGMSDGSIRLLVGDPS 688
           +    V++ ++ G +     DP+
Sbjct: 549 VNQRQVVIALTAGELVYFEMDPT 571


>gi|171685748|ref|XP_001907815.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942835|emb|CAP68488.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1235

 Score = 49.3 bits (116), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 77/365 (21%), Positives = 162/365 (44%), Gaps = 44/365 (12%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
            + + ++E A++  VV+     +++ E+ L +GT    G+D+    R   FS G     R 
Sbjct: 898  VDLDNNEAAISAAVVSF---ASQDGESFLIVGT----GKDMILSPR--QFSEGYIHVYRF 948

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
             D+ ++L   ++  +++    AL   QG LL   G  + ++     +L   A  +  P  
Sbjct: 949  HDDGRDL-EFIHKTKIEEPPMALIPFQGRLLAGIGKTLRIYDLGLKQLLRKAQAEIAPQL 1007

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLV 1256
            +VSL    N I++GD+ + I +  +K +  +L   A D  +++ + T   ++D    S+ 
Sbjct: 1008 IVSLQTQGNRIVVGDVQQGITYAVYKPESNKLLAWADD--TINRWTTCTAMVDYE--SVA 1063

Query: 1257 VSDEQKNIQIFYYAPKMSESWKG-----QKLLSRAEFHVGAHVTKFLR---LQMLATSSD 1308
              D+  N+ I     + S+         Q + +++  H   + T  +     Q L TS  
Sbjct: 1064 GGDKFGNVWILRAPERASQESDEPGSEIQLVHAKSYLHGAPNRTALMAHFYTQDLPTSIT 1123

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
            +T    G        LL+  + G++G + P    +++ F   QSL+  +    P +AG +
Sbjct: 1124 KTNLVVGGQDV----LLWSGIQGTVGVLIPFVSREDVDF--FQSLESHMRAEDPPLAGRD 1177

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               +R ++   K        ++D +L   + +L  +++  IA +   +  +I   ++D+ 
Sbjct: 1178 HLIYRGYYVPVKG-------VIDGDLCERFALLANDKKQMIAGELDRSVREIERKISDIR 1230

Query: 1426 LGTSF 1430
              ++F
Sbjct: 1231 TRSAF 1235


>gi|449019082|dbj|BAM82484.1| UV-damaged DNA binding protein [Cyanidioschyzon merolae strain 10D]
          Length = 1372

 Score = 49.3 bits (116), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 119/302 (39%), Gaps = 58/302 (19%)

Query: 1110 LLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS------ 1162
            +  +GTAYV   E   +RGR+L+FS          L+ E+Y+      +SALA       
Sbjct: 981  VFVVGTAYVLPSEMEPSRGRILVFSR-----EELLLLNELYTPGAVYTMSALADPSDRTC 1035

Query: 1163 ---LQGHLLIASGPK--IILHKWTGT------ELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
                     +A+G    +IL+ W  +      EL  +A +    L V+ L    + +L+G
Sbjct: 1036 RFPASAARFLAAGVNNVVILYDWGQSGHGDDYELREVARHLGHVL-VLRLEARGDQLLVG 1094

Query: 1212 DIHKSIYFLSW------KEQGAQ--LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
            D+ KS+  L           GA   L  +A D+ +    A  FL + + L+   +D   N
Sbjct: 1095 DLMKSLCVLQLVLPEGETSDGASPCLKAVAWDYETAWITACAFLNEDTYLA---ADNSYN 1151

Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG-------- 1315
            +      P  + S     L     FH+G  V  F R +++  +S    A  G        
Sbjct: 1152 LLSLQRNPHETRSEFRHALNRAGAFHLGDLVNVFRRGKLVTEASGNEEAGTGNGHSTIDT 1211

Query: 1316 ---------------SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
                           +D  +R  LLF T  G+IG I PLD    R L  ++K L     H
Sbjct: 1212 ESTRDVARASTGTTTADNVSRQTLLFATTAGAIGIIVPLDPAQHRMLSRVEKALRSLTDH 1271

Query: 1361 VA 1362
             A
Sbjct: 1272 PA 1273


>gi|189205943|ref|XP_001939306.1| DNA damage-binding protein 1 [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187975399|gb|EDU42025.1| DNA damage-binding protein 1 [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 1115

 Score = 49.3 bits (116), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 67/297 (22%), Positives = 121/297 (40%), Gaps = 35/297 (11%)

Query: 1111 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              IGTAY+  +   + RGR+L+         P+ ++  V    +KG    LA+ +G ++ 
Sbjct: 809  FVIGTAYLDDQSTTSERGRILILEV-----TPERILKLVMEIAVKGGCRCLATCEGKIVA 863

Query: 1170 ASGPKIILHKWTG-------TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            A    I+++           T+L       AP    + + +    I++ D+ KS+  + +
Sbjct: 864  ALIKTIVIYDVEYPTQTPFLTKLATFRCSTAP----IDITVNGPKIVIADLMKSLVVVEY 919

Query: 1223 KEQGAQL-NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
             +  A L + L +            + +  T   + SD + N+ + Y  P        ++
Sbjct: 920  TKGEAGLPDKLVEVARHYQITWATAVAEVDTNMYLESDAEGNLMVLYRDPNGVTDDDKRR 979

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI---GCIAP 1338
            L   +E  +G  V +  R+ +L T+SD     P +          GT++GSI   G I+P
Sbjct: 980  LNVSSEMLLGEMVNRIRRIDVL-TASDAV-VIPRA--------FVGTVEGSIYLFGLISP 1029

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
              +     L +LQ  L   +P    ++   FR F  NG      P   VD E +  +
Sbjct: 1030 AHQ---NLLMTLQSNLGALIPAPGDMDFAKFRAFK-NGVRQEEEPMRFVDGEFVERF 1082


>gi|67600754|ref|XP_666354.1| CG13900 gene product [Cryptosporidium hominis TU502]
 gi|54657334|gb|EAL36124.1| CG13900 gene product [Cryptosporidium hominis]
          Length = 1318

 Score = 48.9 bits (115), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 63/295 (21%), Positives = 123/295 (41%), Gaps = 42/295 (14%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            V+   ++ + +AL   +G LL+     + ++      L   + Y   P  +  + +V + 
Sbjct: 1041 VHITPIENSATALTGWRGRLLVGINKTLRVYSLGKKRLLRKSEYRNIPQGLTWIKVVNDR 1100

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI- 1266
            I  GDI   +    +     Q  L+AKD       +   ++D  T++  VSD+  NI + 
Sbjct: 1101 IFAGDISNGVLVFKFNNTSNQFILVAKDPMPRWLTSACEVLDYHTIA--VSDKFDNIIVS 1158

Query: 1267 ---------FYYAPKMSESWKGQ--------KLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
                     F +    +++   Q        ++ + A+FH+G  VT   + Q+  TS++ 
Sbjct: 1159 RVPVEASDDFSFVTSFTDNNNSQSSALMRTHQINTVAQFHLGDIVTCLQKSQLTPTSAE- 1217

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                         A+++GT+ GSIG ++P L+      L  L+  L      +   +   
Sbjct: 1218 -------------AIIYGTVLGSIGSLSPILNNEDIELLSKLEILLRKQKSTLLSRDHLM 1264

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
            FR ++S    H     +++D +    + +L  + Q EIA +   T  +I   L+D
Sbjct: 1265 FRSYYS--PVH-----NVIDGDFCQTFTILDSKIQSEIASKLDVTVEEIYKKLDD 1312


>gi|156086042|ref|XP_001610430.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797683|gb|EDO06862.1| conserved hypothetical protein [Babesia bovis]
          Length = 1450

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 5/103 (4%)

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
            YVVSL+  ++ I++GD+  S+  L W  QG +L  + KDF S+ C A    ID +  S V
Sbjct: 1171 YVVSLDAYQDVIVIGDLMNSMRMLQW--QGTELREVCKDFNSVYCTAAA-AIDQT--SCV 1225

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1299
            V+D   N  +F     ++   +  K      FH G  + +  R
Sbjct: 1226 VADSSGNFYVFAKRQVVTNDAEAIKAEDVGLFHHGELINRIRR 1268


>gi|70992737|ref|XP_751217.1| nuclear mRNA splicing factor [Aspergillus fumigatus Af293]
 gi|74670386|sp|Q4WLI5.1|RSE1_ASPFU RecName: Full=Pre-mRNA-splicing factor rse1
 gi|66848850|gb|EAL89179.1| nuclear mRNA splicing factor, putative [Aspergillus fumigatus Af293]
          Length = 1225

 Score = 48.9 bits (115), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 98/462 (21%), Positives = 197/462 (42%), Gaps = 48/462 (10%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + + +Q +    ++ S    DN    + IPL  TP ++    E+ L+ +I S  
Sbjct: 759  QCVEGMVGIQAQNL----RIFSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDN 814

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
             VL P  +  + LI+       + + L   D         +   ++I++P  A       
Sbjct: 815  NVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---VI 869

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
            +TI ++ +E A+++  V     +++++ET L +GTA   +     +A G + ++   R  
Sbjct: 870  STIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL   QG LL   G  + ++     +L            +
Sbjct: 924  EDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRIYDLGMKQLLRKCQAQVVSKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    +T  ++D  T++    
Sbjct: 983  VGLQTQGSRIVVSDVRESVTYVVYKYQDNILIPFVDDSVSRWTTSTT-MVDYETVA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDRT 1310
            D+  N+ +     K SE       G  L+    +  GA     L +    Q + TS  +T
Sbjct: 1040 DKFGNLWLVRCPKKASEEADEDGSGAHLIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
             +R +++  K        ++D +L   Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188


>gi|400597418|gb|EJP65151.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1212

 Score = 48.9 bits (115), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 87/430 (20%), Positives = 173/430 (40%), Gaps = 72/430 (16%)

Query: 1043 QIDNHNLSSVDLHRTYTVEEYEV----RILEPDRAGGPWQTR------------------ 1080
            + DNH L   DL      +   V    ++L P+  G P   R                  
Sbjct: 813  EADNHTLPP-DLQAKLLADPAAVNGDAKVLPPEEFGHPRGNRRWASCISVVDPVSEEPSV 871

Query: 1081 -ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGR--------VLL 1131
               +  +++E A++  VV+     +++NE+ L +GT    G+D+    R        +  
Sbjct: 872  LQKVDFENNEAAVSAAVVSF---ASQDNESFLVVGT----GKDMILNPRSSSEAYIYIYR 924

Query: 1132 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1191
            F  G         +  ++  +++    AL + QG LL   G  + ++     +L   A  
Sbjct: 925  FQEGGRE------LEFIHKTKIEEPAMALLAFQGKLLAGIGKTLRMYDLGMRQLLRKAQA 978

Query: 1192 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
            +  P  +VSLN   + I++GD+ + +  + +K    +L   A D  +     T  ++D  
Sbjct: 979  EVVPQQIVSLNTQGSRIVVGDVQQGVTLVVYKPASNKLIPFADDTIARWTTCTT-MVDYE 1037

Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQML 1303
              S+   D+  N+ I     K SE    ++     + +R   H   H  + +     Q +
Sbjct: 1038 --SVAGGDKFGNMFIVRSPAKASEEADEEQAGLHLVNARDYLHGAQHRLELMCHFFTQDV 1095

Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1360
             TS ++T    G        LL+  + G+IG   P    ++  F   QSL++ L      
Sbjct: 1096 PTSINKTSLVVGGQDV----LLWSGIMGTIGVFIPFVSREDADF--FQSLEQHLRTEDAP 1149

Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
            +AG +   +R +++  K        ++D +L   +  LP +++  +A +   +  +I   
Sbjct: 1150 LAGRDHLMYRSYYAPVKG-------VIDGDLCERFAALPNDKKQMMAGELDRSVREIERK 1202

Query: 1421 LNDLALGTSF 1430
            ++D+   ++F
Sbjct: 1203 ISDIRTRSAF 1212



 Score = 47.8 bits (112), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 143/614 (23%), Positives = 217/614 (35%), Gaps = 133/614 (21%)

Query: 64  NVIEIYVVRVQEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
           NV++   V  Q  G+KE      SG     +  D      + L+ H  + G + S+A+  
Sbjct: 19  NVVQ--AVLGQFAGTKEQLIITGSGSQLTILRPDPAQGKVIPLLSH-DIFGVLRSIAVFR 75

Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             G+     +D IILA +  +I+VLE+  S +      M  F        K G      G
Sbjct: 76  LAGSS----KDYIILATDSGRITVLEYLPSPNRFSRLHMETFG-------KTGIRRVVPG 124

Query: 181 PLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             +  DP+GR      V    L  ++ + SQ        E T  S     A      VI 
Sbjct: 125 EYLACDPKGRACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIA 175

Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMISALSISTTLK 293
           L  LD+          GY  PV   L     E +    G    +  T ++    +   L 
Sbjct: 176 LTALDV----------GYANPVFAALEIDYTEVDQDNTGEALSEVETHLVY-YELDLGLN 224

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAV 347
                WS    P D    L    P G     GVLV G   + Y HS   +  + +     
Sbjct: 225 HVVRKWSD---PVDPTASLLFQVPGGNDGPSGVLVCGEENVTYRHSNQDALRVPIPRRR- 280

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYDGR----- 396
               + E P    ++     H   L+        LL T  GDL  +T+  V D       
Sbjct: 281 ---GATEDPSRKRNIVAGVMHK--LKGSAGAFFFLLQTDDGDLFKITIDMVEDEEGAPTG 335

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
            VQR+ +   +   + + +  + +   ++ S+ G+    QF              E+ GD
Sbjct: 336 EVQRMKIKYFDTVPVATSLCILKSGFLYVASQFGNYAFYQF--------------EKLGD 381

Query: 457 IEADAPSTKRLRRSSSDALQD-MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
                     L  SS D   D +   E +  Y   + N          A+ DS+  + PL
Sbjct: 382 ------DDDELEFSSDDFPVDPLAAYEPVYFYPRPAEN---------LALVDSIPAMNPL 426

Query: 516 KDFSYGLRINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSS 558
            D         DA    S  G   +S +  +            ELPG    +WT+   S 
Sbjct: 427 LDCKVANLTGEDAPQIYSICGNGARSTFRTIKHGLEVNEIVASELPGVPSAVWTLKLNS- 485

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
                        D++Y  Y+++S    T+VL   + + EV++S  +     TIAA  L 
Sbjct: 486 -------------DEQYDTYIVLSFTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLL 530

Query: 619 GRRRVIQVFERGAR 632
           G   +IQV  RG R
Sbjct: 531 GTDGLIQVHPRGIR 544


>gi|159130328|gb|EDP55441.1| nuclear mRNA splicing factor, putative [Aspergillus fumigatus A1163]
          Length = 1225

 Score = 48.9 bits (115), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 98/462 (21%), Positives = 197/462 (42%), Gaps = 48/462 (10%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + + +Q +    ++ S    DN    + IPL  TP ++    E+ L+ +I S  
Sbjct: 759  QCVEGMVGIQAQNL----RIFSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDN 814

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
             VL P  +  + LI+       + + L   D         +   ++I++P  A       
Sbjct: 815  NVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---VI 869

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
            +TI ++ +E A+++  V     +++++ET L +GTA   +     +A G + ++   R  
Sbjct: 870  STIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL   QG LL   G  + ++     +L            +
Sbjct: 924  EDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRIYDLGMKQLLRKCQAQVVSKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    +T  ++D  T++    
Sbjct: 983  VGLQTQGSRIVVSDVRESVTYVVYKYQDNILIPFVDDSVSRWTTSTT-MVDYETVA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDRT 1310
            D+  N+ +     K SE       G  L+    +  GA     L +    Q + TS  +T
Sbjct: 1040 DKFGNLWLVRCPKKASEEADEDGSGAHLIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
             +R +++  K        ++D +L   Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188


>gi|341886298|gb|EGT42233.1| CBN-TAG-203 protein [Caenorhabditis brenneri]
          Length = 108

 Score = 48.5 bits (114), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 30/120 (25%), Positives = 62/120 (51%), Gaps = 16/120 (13%)

Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
            PG+++    AL++ T+ G+IGC+      DE+ F    +L+  +    P + G +  ++R
Sbjct: 2    PGANE----ALVYTTIGGAIGCLVSFMSKDEVDF--FTNLEMHVRSEYPPLCGRDHLAYR 55

Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
             +++  K       S++D ++   + ++ L +Q E+A + G T S+I   L D+    +F
Sbjct: 56   SYYAPCK-------SVIDGDICEQFSLMDLPKQKEVAEELGKTVSEISKKLEDIRTRYAF 108


>gi|167380951|ref|XP_001733297.1| DNA repair protein xp-E [Entamoeba dispar SAW760]
 gi|165902459|gb|EDR28278.1| DNA repair protein xp-E, putative [Entamoeba dispar SAW760]
          Length = 349

 Score = 48.1 bits (113), Expect = 0.032,   Method: Composition-based stats.
 Identities = 59/226 (26%), Positives = 101/226 (44%), Gaps = 20/226 (8%)

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL---LAKDFGSLDCFATEFLIDGST 1252
            LYV ++    N IL+GD+ KSI   S+   G   N    +++DF +    A EF+ +   
Sbjct: 128  LYVKTMG---NKILVGDLMKSISVYSFDNNGNNKNCLNEVSRDFYASYTTAIEFVDENCY 184

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
            LS   SD   N+ +F      +ES +  +L + A  HVG  +    +  +  T S     
Sbjct: 185  LS---SDSNSNLLVFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHSTY--- 237

Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH-VAGLNPRSFRQ 1371
                +   +  +LFG + G IG I  +    +  L  +Q +++  +   V    P  +++
Sbjct: 238  ----ETIQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDEWKK 293

Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
               + K  R    +I+D  ++  Y  +  E+Q EIAH +G    QI
Sbjct: 294  VIDDWK--RMPSSNIIDGNIVESYLEMSKEKQCEIAHLSGVNEEQI 337


>gi|1399512|gb|AAC47162.1| repE [Dictyostelium discoideum]
          Length = 1139

 Score = 48.1 bits (113), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 57/270 (21%), Positives = 115/270 (42%), Gaps = 25/270 (9%)

Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----N 1206
            + + ++  L S  G L+ A   ++   ++T ++        +  ++     I+K     +
Sbjct: 871  KFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVISSESVHKGHTMILKLASRGH 930

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            FIL+GD+ KS+  L  +  G+ L  +A++   +   +   + D      + ++   N  +
Sbjct: 931  FILVGDMMKSMSLLVEQSDGS-LEQIARNPQPIWIRSVAMINDDY---FIGAEASNNFIV 986

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA---PGSDKTNRFA 1323
                   +   + + L S   +H+G  +           +S R G+    P SD+     
Sbjct: 987  VKKNNDSTNELERELLDSVGHYHIGESI-----------NSMRHGSLVRLPDSDQPIIPT 1035

Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
            +L+ +++GSIG +A + E  F     LQK L   V  V G +  ++R F ++   H    
Sbjct: 1036 ILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAFSND--HHTIDS 1093

Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
             + +D +L+  +  L  E QL+     G T
Sbjct: 1094 KNFIDGDLIETFLDLKYESQLKAVADLGIT 1123



 Score = 40.8 bits (94), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 52/204 (25%), Positives = 84/204 (41%), Gaps = 29/204 (14%)

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V N+R L+   V D  F++G   P + +L +           + H       S  T L 
Sbjct: 150 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 201

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             P  WS  N+    Y  L VP P+GGVLVV  N I Y +   + ++     AVS     
Sbjct: 202 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSV-----AVSYTKFL 252

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
              R    V+ D +          L     G L +L +++  + V  L   +     + S
Sbjct: 253 AFTR----VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 300

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            I+ + + + ++GS  GDS L++ 
Sbjct: 301 SISYLDSGVVYIGSSSGDSQLIRL 324


>gi|166240328|ref|XP_637896.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
 gi|238064940|sp|B0M0P5.1|DDB1_DICDI RecName: Full=DNA damage-binding protein 1; AltName: Full=DNA repair
            protein E; AltName: Full=UV-damaged DNA-binding protein 1
 gi|165988543|gb|EAL64385.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
          Length = 1181

 Score = 48.1 bits (113), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 57/270 (21%), Positives = 115/270 (42%), Gaps = 25/270 (9%)

Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----N 1206
            + + ++  L S  G L+ A   ++   ++T ++        +  ++     I+K     +
Sbjct: 913  KFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVISSESVHKGHTMILKLASRGH 972

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            FIL+GD+ KS+  L  +  G+ L  +A++   +   +   + D      + ++   N  +
Sbjct: 973  FILVGDMMKSMSLLVEQSDGS-LEQIARNPQPIWIRSVAMINDDY---FIGAEASNNFIV 1028

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA---PGSDKTNRFA 1323
                   +   + + L S   +H+G  +           +S R G+    P SD+     
Sbjct: 1029 VKKNNDSTNELERELLDSVGHYHIGESI-----------NSMRHGSLVRLPDSDQPIIPT 1077

Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
            +L+ +++GSIG +A + E  F     LQK L   V  V G +  ++R F ++   H    
Sbjct: 1078 ILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAFSND--HHTIDS 1135

Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
             + +D +L+  +  L  E QL+     G T
Sbjct: 1136 KNFIDGDLIETFLDLKYESQLKAVADLGIT 1165



 Score = 40.8 bits (94), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V N+R L+   V D  F++G   P + +L +           + H       S  T L 
Sbjct: 192 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 243

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             P  WS  N+    Y  L VP P+GGVLVV  N I Y +   + ++A+ +Y   L  ++
Sbjct: 244 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 298

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
                   V+ D +          L     G L +L +++  + V  L   +     + S
Sbjct: 299 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 342

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            I+ + + + ++GS  GDS L++ 
Sbjct: 343 SISYLDSGVVYIGSSSGDSQLIRL 366


>gi|68075683|ref|XP_679761.1| splicing factor 3b, subunit 3, 130kD [Plasmodium berghei strain
           ANKA]
 gi|56500578|emb|CAH95367.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium berghei]
          Length = 1216

 Score = 48.1 bits (113), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 62/297 (20%), Positives = 120/297 (40%), Gaps = 48/297 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L+ ++ GDL  + V ++  +V+ +     +   + + I  + +   F+ +  G+    QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361

Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
           +     S  SM +S      G     A  T++L+      L D +    +          
Sbjct: 362 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNL---YLVDQIYSLPIVDMKILDAKN 416

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +  + ++   R      GP   L+   +GL I   A+             ELPG  + I
Sbjct: 417 SNIPQIYALCGR------GPRSSLRILQHGLSIEELANN------------ELPGKPRYI 458

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WT+   +S               EY  Y+I+S E  T++LE  + + EV +S+   +   
Sbjct: 459 WTIKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNV 502

Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           T    NL      IQV++ G R ++G  + + +   P N +  + + N + + +S++
Sbjct: 503 TTIHINLLYDNSFIQVYDTGIRHINGKIVQEWVP--PKNKQINAATSNGSQIVISLS 557



 Score = 43.9 bits (102), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 60/286 (20%), Positives = 110/286 (38%), Gaps = 46/286 (16%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
                  G ++++ G K+ ++     +L     Y   P  +VS+  V N I   DI +S+ 
Sbjct: 952  CFCPFNGKVIVSVGNKLRIYALGKKKLLKKCEYKDIPEAIVSIK-VSNRIFASDIRESVL 1010

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
               +      + L++ D        +E L   + ++         L V +E K  +    
Sbjct: 1011 IFFYDSNQNVIRLISDDIIPRWITCSEILDHHTIIAADKFDSVFILRVPEEAKQEEYGIA 1070

Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
               +Y    ++ S K +K+     FH+G  VT   ++++   SS+               
Sbjct: 1071 NKCWYGGEVINSSTKNRKMEHIMSFHIGEIVTSLQKVKLSPVSSE--------------C 1116

Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
            +++ T+ G+IG   P D     ELT    Q L+  L      + G     FR ++   + 
Sbjct: 1117 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTEKHALCGREHIFFRSYYHPVQ- 1171

Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                   ++D +L   +  LP E Q +I      T  +IL  L D+
Sbjct: 1172 ------HVIDGDLCEQFSSLPFEVQRKIGSDLEKTPDEILRKLEDI 1211


>gi|389744702|gb|EIM85884.1| hypothetical protein STEHIDRAFT_121882 [Stereum hirsutum FP-91666
            SS1]
          Length = 1255

 Score = 48.1 bits (113), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 69/277 (24%), Positives = 115/277 (41%), Gaps = 48/277 (17%)

Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVT----EVYSKELKGAISALASLQGHLLIASG 1172
            Y   E   + G + +FS G     P   V     EV  +E+ G + ALAS +G+++ A  
Sbjct: 936  YQLDEKEPSHGELHIFSKG--GTEPDGTVKADLLEVVKQEVSGCVYALASFEGYIVAAIN 993

Query: 1173 PKIILHKW--TGTELNGIAFYDAPPLYVV-SLNIVKNFILLGDIHKSIYFLSW---KEQG 1226
              +   K   +GTE   +  ++    Y+V SL +  +++L+GD   S+  L      E G
Sbjct: 994  STVSFFKLDTSGTEATLVKKHEWNHNYLVTSLVVSGSYLLIGDAISSVSVLQVIQVDENG 1053

Query: 1227 A---QLNLLAKDFG-----SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
                +L  +A+D+G     SL  +  E +I  ++          N+  F   P   +   
Sbjct: 1054 EITEKLKTVARDYGPLWPVSLQGWGKEGVIGANS--------DCNLFSFTLQPVTPQ--- 1102

Query: 1279 GQKLLSR-AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL----LFGTLDGSI 1333
             + +L R   FH+  HV KFL            G    S+K     +    LF T  G I
Sbjct: 1103 -KTVLDRDGHFHLDDHVNKFLH-----------GTVHSSEKAEDLDIEARSLFFTASGRI 1150

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
            G +  + +     + +LQ+ L   V  V+G   + +R
Sbjct: 1151 GLVLDMGKELSLHMTALQRNLNGVVKDVSGTTHKRWR 1187


>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1133

 Score = 48.1 bits (113), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 68/302 (22%), Positives = 123/302 (40%), Gaps = 32/302 (10%)

Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
              +GTA     D +  GR+L    G  AD    L+TE+   EL GA  +LA ++G++L  
Sbjct: 818  FVVGTAIGNDSDESEHGRLLFLELG--ADKMLRLITEL---ELPGACHSLAIVKGYILAG 872

Query: 1171 SGPKIILHKWT------GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
                I L++++      G  +  I+   A  L  VSL++    + +GD+ K +  L   E
Sbjct: 873  LSKSIDLYRFSYTRGSLGASIQQISSIRAATL-PVSLSVYGKRVFVGDLVKGVMVLEVVE 931

Query: 1225 QGAQLN----LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
             G + N     + + +G     A E L + + +S   +D   N+ +       +     +
Sbjct: 932  GGGEGNDKLVEVCRQYGVSWVTALEALDEDTCIS---ADSDGNLVLLRRESTGATDEDTR 988

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
            ++   +E  +G  V    R+    T        P +          GT+DG +  +  + 
Sbjct: 989  RMRPLSEIRLGEMVNCIRRVNDPITQG--YVVQPKA--------YLGTVDGGLFMLGLIH 1038

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
               F  L   Q  +   +  +  L+   +R +++ G      P   VD EL+  +  L L
Sbjct: 1039 PDYFDILMKCQVNMAKVIKGIGDLDFNRYRAYNTKG-IQPEEPFRFVDGELVEKF--LDL 1095

Query: 1401 EE 1402
            +E
Sbjct: 1096 DE 1097


>gi|367027320|ref|XP_003662944.1| hypothetical protein MYCTH_2304190 [Myceliophthora thermophila ATCC
            42464]
 gi|347010213|gb|AEO57699.1| hypothetical protein MYCTH_2304190 [Myceliophthora thermophila ATCC
            42464]
          Length = 1211

 Score = 48.1 bits (113), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 75/365 (20%), Positives = 161/365 (44%), Gaps = 44/365 (12%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
            I ++ +E A++  VV      ++E E+ L +GT    G+D+    R   F+ G     R 
Sbjct: 874  IDLEGNEAAVSAAVVPF---ASQEGESFLVVGT----GKDMVLNPRK--FTEGYIHVYRF 924

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
             ++ + L   ++  +++    AL   QG LL   G  + ++     +L   A  +  P  
Sbjct: 925  HEDGREL-EFIHKTKVEEPPLALIPFQGRLLAGIGKMLRVYDLGLRQLLRKAQGEVAPQL 983

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLV 1256
            +V+L    + I++GD+ + + ++ +K +  +L + A D  +++ + T   ++D    S+ 
Sbjct: 984  IVTLQTQGSRIIVGDVQQGVTYVVYKPESNKLLVFADD--TINRWTTCTTMVDYE--SVA 1039

Query: 1257 VSDEQKNIQIFYYAPKMSESWKG-----QKLLSRAEFHVGAHVTKFL---RLQMLATSSD 1308
              D+  N+ I     + S+         Q L +R   H   +    +     Q L TS  
Sbjct: 1040 GGDKFGNVWILRCPERASQESDEPGSEIQLLHARKYLHGAPNRLDLMVHFYTQDLPTSIV 1099

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
            +T    G        L++  + G++G + P    +++ F   QSL+  +    P +AG +
Sbjct: 1100 KTNLVVGGQD----VLVWSGIQGTVGVLIPFVSREDVDF--FQSLESHMRAEDPPLAGRD 1153

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               +R ++   K        ++D +L   + +LP +++  IA +   +  +I   ++D+ 
Sbjct: 1154 HLIYRGYYVPVKG-------VIDGDLCERFSLLPNDKKQMIAGELDRSVREIERKISDIR 1206

Query: 1426 LGTSF 1430
              ++F
Sbjct: 1207 TRSAF 1211


>gi|328858656|gb|EGG07768.1| hypothetical protein MELLADRAFT_105631 [Melampsora larici-populina
            98AG31]
          Length = 1216

 Score = 48.1 bits (113), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 67/323 (20%), Positives = 117/323 (36%), Gaps = 81/323 (25%)

Query: 1113 IGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSK--ELKGAISALASLQGHLLI 1169
            IGT +V   E  +  GR+L        D   NL      +  ++KG +  L  L G   +
Sbjct: 870  IGTGFVNPNESQSNTGRILTIGLSSKHDQEGNLREFKLKRMTKVKGTVHGLGGLPGGKFV 929

Query: 1170 ASGPKII----------------LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
            AS    +                L  W G  ++     +            KN+I++GD+
Sbjct: 930  ASANAFVHAFGINEEEEDEGFEVLDTWGGGFVSQTVLTE------------KNWIIVGDL 977

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
            +KSI  L +  +   L +L +D+ ++       + D      V +D + N+    +  +M
Sbjct: 978  YKSIVVLEFDLKKFSLKVLGRDYSAMSVRPIGMISDR---VFVAADTEFNL----FTVEM 1030

Query: 1274 SESWKGQKLLSRAE--------------------------------------FHVGAHVT 1295
             E  KG K     E                                      FH+G +V 
Sbjct: 1031 RERQKGLKEEDEDEEGLSVEEEKGDDDEWEEEERRMRVEKVFNDDHLDTVGGFHLGENVN 1090

Query: 1296 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT-FRRLQSLQKKL 1354
             F    ++ +         G D      L+F +  G IG I  L++L  ++ L++L+ +L
Sbjct: 1091 HFKAGSLVKSLKHFY----GQDLKYGGKLIFVSSTGGIGVIIKLEDLKIYKHLKALEDRL 1146

Query: 1355 VDSVPHVAGLNPRSFRQFHSNGK 1377
               +  + GL+   FR+F +  K
Sbjct: 1147 KKEILSIGGLDSTEFRKFKNKWK 1169


>gi|384500266|gb|EIE90757.1| hypothetical protein RO3G_15468 [Rhizopus delemar RA 99-880]
          Length = 1057

 Score = 47.8 bits (112), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 58/257 (22%), Positives = 111/257 (43%), Gaps = 22/257 (8%)

Query: 1120 GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK 1179
            G++    GR+L+      +D    L++++ +    G I  +  ++G LL +    + L++
Sbjct: 760  GKENDGLGRILVLQLA--SDRKLRLISQLKTG---GMIDCVRPIEGKLLASIQGTLYLYR 814

Query: 1180 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1239
            W    L  ++    P +    +   +NFI+ GD+  S+    +  Q  QL  +A    + 
Sbjct: 815  WQSQRLVKVSSRRLPSV-TRCMTTHENFIMTGDLAYSVVMFQYDRQSDQLLEVAAHEKTK 873

Query: 1240 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR-AEFHVGAHVTKFL 1298
            +  A +  ID +   LV+  E++   +F       E    + LL   + +H+G  V++F 
Sbjct: 874  EVLAMK-AIDSN---LVIGAEREG-HLFVLEHCQDEVSADEPLLDVISTWHLGDVVSRFR 928

Query: 1299 --RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
               L M     D +  AP        +L+F T  G+IG IA L    ++ L  +Q  +  
Sbjct: 929  FGSLGMNNVDPDSSPIAP--------SLIFATASGAIGVIADLSPERYKLLYQMQCNMCR 980

Query: 1357 SVPHVAGLNPRSFRQFH 1373
             V  +  L+   +R  +
Sbjct: 981  VVKGIGELSHTDWRNVN 997


>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
 gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
          Length = 1170

 Score = 47.8 bits (112), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 72/302 (23%), Positives = 127/302 (42%), Gaps = 21/302 (6%)

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL--NGIAFYDAPPL 1196
            D  +NL   ++  ELK     +   Q  LL+ASG  I L++    +L    +   D    
Sbjct: 880  DKKKNL-QYIHKTELKYVPQTMEVFQDRLLVASGNSISLYELGQRQLLRKSLTRIDFIQT 938

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
             V      ++ ILL D   SI F  + ++  Q   +A D    +  A + L D  T+  +
Sbjct: 939  IVKVTPQPRDRILLADSANSIVFAKFDQEENQFVSMADDTVKRNITAWKQL-DYDTV--I 995

Query: 1257 VSDEQKNIQIFYY----APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
              D+  NI +       + ++ ++W   K  ++   ++ + V K   L         T  
Sbjct: 996  GGDKFGNIFVSRLDREESKQIDQNWTVLKQAAKNSPNLNSCVYKLQNLCEYYIPDIITSF 1055

Query: 1313 APGS-DKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
              GS +      +++  L G+IG + PL    E+       L+    +   +VAG N   
Sbjct: 1056 QLGSFNLGGEECIIYTGLTGTIGILLPLISKSEIELLHDLQLEISAYNDKVNVAGKNHAK 1115

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
             R +++  K       +I D + L  Y  LPL+E+L+IA +   +  ++   LND+   +
Sbjct: 1116 LRSYYNPAK-------NIFDGDFLELYLNLPLDEKLKIAKRLNKSVGEVEKKLNDIRNRS 1168

Query: 1429 SF 1430
            SF
Sbjct: 1169 SF 1170



 Score = 46.2 bits (108), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 81/340 (23%), Positives = 136/340 (40%), Gaps = 48/340 (14%)

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
           +P+DA  L  VP  IGGVLV GAN I Y          L N ++ L   +   ++S  + 
Sbjct: 229 VPNDANYLAPVPGHIGGVLVCGANWIMYDK--------LGNESILLPLLRRKDQTSVIIS 280

Query: 364 LDAAHATWLQND--VALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSDITTIG 419
               HA   +N     LL    GDL  L + YD    +++ ++++  +   +  ++    
Sbjct: 281 -HVTHALKKKNYGFFILLQNDLGDLFRLIIDYDSNRELIKDIEITYFDTIPVCYNLNIFK 339

Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
           N L F        LL QF            L EE    E D    K ++  +    ++  
Sbjct: 340 NGLCFANCINRSQLLYQF----------EKLGEEIS--EEDIRINKTVQMDNIQLTKEKY 387

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
              E  L G  +       ++ S  + DS++N   L   S   ++      T  +     
Sbjct: 388 --FEFKLKGLDNLALIDVVESLS-PITDSILNDDTLVTLSTKSKLKTIVHGTPTTTLVES 444

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII--SLEARTMVLETADLLT 597
           +L   P    I+T             +   A DDE   YL+I  +L  +T+VL   +++ 
Sbjct: 445 QLPIKP--TNIFTT-----------KTSANAVDDE---YLVITSTLSFKTLVLSLGEVIE 488

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
           EV +S   FV  +   A    G+  ++Q++  G R ++G+
Sbjct: 489 EVNDS--EFVLDQPTVAVQQVGKSSIVQIYSNGLRHINGN 526


>gi|390357128|ref|XP_001198237.2| PREDICTED: splicing factor 3B subunit 3-like [Strongylocentrotus
           purpuratus]
          Length = 949

 Score = 47.8 bits (112), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 61/259 (23%), Positives = 100/259 (38%), Gaps = 39/259 (15%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + +   +   + + +  +     F+ S  G+  L Q 
Sbjct: 34  LAQTEQGDIFKITLETDDDMVTEIRMKYFDTVPVATSMNVLKTGFLFIASEYGNHYLYQI 93

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS    E GD    AP T R        L+++   E LS   S     
Sbjct: 94  AHLGDDDDEPEFSSATPLEEGDTFFFAPRTLR-------NLEEVDQLESLSPILSCQIAD 146

Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTV 553
            +++ T    V         ++   +GL +            S   + ELPG    +WTV
Sbjct: 147 LASEDTPQLYVACGRGPRSSMRVLRHGLEV------------SEMAVSELPGNPNAVWTV 194

Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
             KS              DDEY AY+I+S    T+VL   + + EVT+S   F+      
Sbjct: 195 KKKS--------------DDEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTPTL 238

Query: 614 AGNLFGRRRVIQVFERGAR 632
           + +L G   ++Q++  G R
Sbjct: 239 SSSLIGDDALLQIYPDGIR 257


>gi|328869269|gb|EGG17647.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
          Length = 1194

 Score = 47.8 bits (112), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 114/567 (20%), Positives = 211/567 (37%), Gaps = 111/567 (19%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           +  L+ V +    G + S+A     G      +D +I+  +  ++ +LE++ S +     
Sbjct: 46  SGRLDHVLYSEAFGVIRSIAPFRLTGGS----KDYLIVGSDSGRVVILEYNPSKNVFEKV 101

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLV 214
               F        + G      G  +  DP+GR    G +    L  I+ + SQ    + 
Sbjct: 102 HQETFG-------RSGCRRIVPGQYISTDPKGRAFMIGAIEKQKLVYILNRDSQAKLSI- 153

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM--VILHERELTWAG 272
                        A    + V ++  +D+          G+  P+   + +   E T   
Sbjct: 154 --------SSPLEAHKAHTIVFSMCGVDV----------GFENPIFATISVDYSEETNIE 195

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGGVLVVGANT 328
            V   H+T +++   +   L      WS   +   A  +++VP     P GGVLV     
Sbjct: 196 DVEETHNTKVLTFYELDLGLNNVVRKWSE-EVDRSANLVVSVPGGSDGP-GGVLVCAQGR 253

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQNDVA--LLSTK 382
           ++Y +   +            D S  +PR +   E    +  +HA+  Q D+   L+ ++
Sbjct: 254 VYYRNIGHA------------DISVSIPRRNGMTEEKSLMIVSHASHKQRDMFFFLVQSE 301

Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
            GDL  +T+ Y G +V  + ++  +     + IT + N   F+ S  GD  L  F     
Sbjct: 302 YGDLYKITLDYSGEMVSGMQIAYFDTFPTANCITMLKNGFLFVASEFGDHGLYLFK---- 357

Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
               S GL         DAP+      +     + +     L L  + S    S      
Sbjct: 358 ----SLGLD--------DAPTASSAGNTEMVFFEPVFEPRNLVLTATIS----SLSPIVD 401

Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------ELPGC-KG 549
           F V D L   G  + ++           +G+S+++N  ++            +LPG   G
Sbjct: 402 FKVAD-LAQEGTPQMYAL----------SGVSERANLRVLRHGLPITQMVDSQLPGTPAG 450

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE----SVDY 605
           IWT+    +   N     +   +     Y+++S    T+VL   + + EV +    S   
Sbjct: 451 IWTIPQSLTTMRNPQYQGIGTVESPADRYIVVSFVGSTLVLGVGETVEEVQDSGILSTTT 510

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGAR 632
            +  R++ A NL     ++Q+F +G R
Sbjct: 511 TILIRSMGA-NL---DSIVQIFAQGIR 533


>gi|238487250|ref|XP_002374863.1| nuclear mRNA splicing factor, putative [Aspergillus flavus NRRL3357]
 gi|220699742|gb|EED56081.1| nuclear mRNA splicing factor, putative [Aspergillus flavus NRRL3357]
          Length = 1210

 Score = 47.4 bits (111), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 75/342 (21%), Positives = 152/342 (44%), Gaps = 36/342 (10%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1138
            +TI ++ +E A++V  V     T++++ET L +GTA     +   +A G + ++   R  
Sbjct: 870  STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL   QG L+   GP + ++     +L         P  +
Sbjct: 924  EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    +T  ++D  T +    
Sbjct: 983  VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
            D+  NI +     K+SE       G  L+  R   H   +  + +     Q + T+  +T
Sbjct: 1040 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
             +R +++  K        ++D +L   Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAE 1188


>gi|295667673|ref|XP_002794386.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226286492|gb|EEH42058.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1195

 Score = 47.4 bits (111), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 77/305 (25%), Positives = 125/305 (40%), Gaps = 42/305 (13%)

Query: 1110 LLAIGTAYVQGEDV---AARGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQG 1165
            +  +GT+Y+  +DV   + RGR+L F  TG         + +V    +KGA  ALA +Q 
Sbjct: 875  MFVVGTSYL--DDVGEGSIRGRILAFEVTGSRQ------LAKVAELPVKGACRALAVMQD 926

Query: 1166 HLLIASGPKIILHKWTGTE-----LNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSI 1217
             ++ A    ++++     E     LN  A Y    AP    + + +  N I + D+ KS+
Sbjct: 927  KIVAALMKTVVIYSIAKGELSDYTLNKTASYRTSTAP----IDIAVTGNLIAVADLMKSV 982

Query: 1218 YFLSWKE----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
              + +K+    Q   L  +A+ F +L   A   + +   L    SD + N+ +       
Sbjct: 983  SIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLE---SDAEGNLVVLNQNVNG 1039

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
                  ++L   +E  +G  V    R++ ++         P     +  A L GT++GSI
Sbjct: 1040 VTDDDKRRLEVTSEILLGEMVN---RIRPVSIQGSLPATGPREAVISPKAFL-GTVEGSI 1095

Query: 1334 ---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
               G I P  +    RLQS    LV   P     N   FR F  N       P   VD E
Sbjct: 1096 YLFGLINPAYQDLLMRLQSAMAGLV-VTPGAMPFN--KFRAFK-NAVRQAEEPYRFVDGE 1151

Query: 1391 LLSHY 1395
            L+  +
Sbjct: 1152 LIERF 1156



 Score = 44.7 bits (104), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+LV+G  +I Y                  D++ E      S+ LD A   
Sbjct: 292 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 331

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
             W Q D    LL+   G L  L ++ D    VQ  +LDL    P    S +  +G  + 
Sbjct: 332 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 389

Query: 424 FLGSRLGDSLLVQFTCGS 441
           F+GS  GDS L++ T GS
Sbjct: 390 FIGSHQGDSQLIRITEGS 407


>gi|449295711|gb|EMC91732.1| hypothetical protein BAUCODRAFT_116696 [Baudoinia compniacensis UAMH
            10762]
          Length = 1148

 Score = 47.4 bits (111), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 74/330 (22%), Positives = 134/330 (40%), Gaps = 52/330 (15%)

Query: 1104 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            T E      +GTAY+        +GR+L+       +    +V E+    LKGA   LA 
Sbjct: 832  TGETAERFVVGTAYLDDAPQQQTKGRILVLEV--TEERRLKVVAEL---GLKGACRCLAV 886

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAP--PLYV-----------VSLNIVKNFIL 1209
            + G ++ A    ++++            Y  P  P  V           + + +  + I 
Sbjct: 887  VLGRIVAALVKTVVIYALE---------YQTPSHPFLVKKAAYRTSTAPIDICVTGSTIA 937

Query: 1210 LGDIHKSIYFLSWKE-QGA---QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
            + D+ KS+  +S+K  +G     L+ +A+ + +L   A   + + + L    +D + N+ 
Sbjct: 938  VTDLMKSVSLVSYKPGRGGVPDTLSEIARHYETLWGTAIANVAENTYLE---ADAEGNLV 994

Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
            +  +          ++L   +E  +G  V +   + +  T++      P +         
Sbjct: 995  VLQHEVNGYSDEDRRRLRPVSEMLLGEMVNRIRSISVQPTAT--AVVVPRA--------F 1044

Query: 1326 FGTLDGSI---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-QFHSNGKAHRP 1381
              T++GSI     I+P  +    RLQ+L  + V S  HV     R FR Q    G     
Sbjct: 1045 LATVEGSIYLFALISPGKQDLLMRLQALLAERVKSPGHVPFAKWRGFRSQVRDMGGE--- 1101

Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
            GP   VD EL+  Y   P+E Q+++A + G
Sbjct: 1102 GPTRFVDGELVERYLEAPVEVQVDVASELG 1131


>gi|83767504|dbj|BAE57643.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1270

 Score = 47.4 bits (111), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 76/350 (21%), Positives = 155/350 (44%), Gaps = 36/350 (10%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1138
            +TI ++ +E A++V  V     T++++ET L +GTA     +   +A G + ++   R  
Sbjct: 870  STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL   QG L+   GP + ++     +L         P  +
Sbjct: 924  EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    +T  ++D  T +    
Sbjct: 983  VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
            D+  NI +     K+SE       G  L+  R   H   +  + +     Q + T+  +T
Sbjct: 1040 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
             +R +++  K        ++D +L   Y +LP + ++ IA +   +  +I
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAELDRSVREI 1196


>gi|312076588|ref|XP_003140928.1| xeroderma Pigmentosum Group E Complementing protein [Loa loa]
          Length = 516

 Score = 47.4 bits (111), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
           +W   NL  +A  ++ VP P GG L+ G + I YH +    AL    YA    S      
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
            +    +D     +L  D+A      G+L +L + + G+           V+ + +    
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
            + +   +  + N + F+GSR GDS L++ +      GT  +S  L + + ++   AP  
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362

Query: 465 KRLRRSSSDALQDM----VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                     ++DM     NG++  L  S +    + +   +    + L +         
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
                                VEL G K ++T+  +               D E+  YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
           +S ++ T VL       E T+   + V G T+ AG LF    ++QV      ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483


>gi|68071595|ref|XP_677711.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56497932|emb|CAI04454.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 493

 Score = 47.4 bits (111), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 53/214 (24%), Positives = 98/214 (45%), Gaps = 8/214 (3%)

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
             P  +++SL++++N+I++GDI  S+  LS+    + L  + +D+ ++ C    F+   S 
Sbjct: 254  TPSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSTLTEVCRDYSNVWC---TFVCALSK 310

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
               +VSD + N  +F  +          KL   A F+ G  V K L + +  +S      
Sbjct: 311  SHFLVSDMESNFLVFQKSSIRYNDEDSFKLSRVAFFNHGHVVNKMLPVSL--SSLIEEEE 368

Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELT-FRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
            A       + ++L  + +GSI  I P   LT F++   ++  L DS+  +  +N  S   
Sbjct: 369  AQNEILRKKESILCASSEGSISSIIPFSNLTNFKKALCIEIALNDSLSFIGNINNNSNNT 428

Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
            +  N          +VD EL   +  +P E+Q +
Sbjct: 429  YKMN--LSEKSCKGVVDGELFKMFFSMPFEKQFK 460


>gi|392566425|gb|EIW59601.1| hypothetical protein TRAVEDRAFT_167065 [Trametes versicolor FP-101664
            SS1]
          Length = 1263

 Score = 47.4 bits (111), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 62/265 (23%), Positives = 117/265 (44%), Gaps = 27/265 (10%)

Query: 1111 LAIGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
              +GTA ++ E+   + GR+LLFS   +++N    +T V S +++G + AL  + +G + 
Sbjct: 948  FCLGTAVIRPEEREPSNGRILLFSL--SSENGVRSLTTVASHKVRGCVYALQHVSEGVIA 1005

Query: 1169 IASGPKIILHKWTGTELNGIAF---YDAPPL-----YVVSLNIVKNFILLGDIHKSIYFL 1220
             A    ++L+K     L G  F    D         +V SL     F+L+GD   S+  L
Sbjct: 1006 AAINTSVLLYKIREGNL-GEGFDRVLDKAAEWNHNHFVTSLVWDGQFLLVGDAISSVSVL 1064

Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
               +   +L  +A+D+  L   A E   +G  +         +  +F +A +      G 
Sbjct: 1065 RVADDATKLESVARDYAPLWPVAIESTGNGGVIG-----ANSDCNLFSFALQRGPQRNG- 1118

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
             L     +H+   V K ++  +  +S+D +      D+  +   +F T  G IG I  ++
Sbjct: 1119 -LEKNGVYHIDDVVNKLIKGAL--SSADVS-----QDQAVKAGHVFFTSTGRIGAILDMN 1170

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLN 1365
            +     + +LQ+ +  S+    G+N
Sbjct: 1171 DTMSLHMTALQRNMAKSLIGPGGVN 1195


>gi|407919154|gb|EKG12409.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
            phaseolina MS6]
          Length = 1210

 Score = 47.4 bits (111), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 85/393 (21%), Positives = 160/393 (40%), Gaps = 57/393 (14%)

Query: 1064 EVRILEPD-----RAGGPWQT--RATIPMQSSENALTVRV--------VTLFNTTTKENE 1108
            + +IL P+     RA G W +  +   P+ S E   T+ +        V L   T++ +E
Sbjct: 835  DAKILPPEEFGYPRAEGHWASCIQVVDPISSKEVVHTLELEENESAVSVCLAPFTSQNDE 894

Query: 1109 TLLAIGTAYVQGEDVAAR----GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
            T L +GTA  +   VA R    G V ++    N       +  ++  +++    AL   Q
Sbjct: 895  TFLVVGTA--KDLVVAPRSYNCGYVHIYRLQENGRE----LEFIHKTKMEAPPMALLPFQ 948

Query: 1165 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
            G LL+     + L+     + L      +  P  ++ L    + I+  D+ +S+ ++ +K
Sbjct: 949  GKLLVGVEADLRLYDLGLRQLLRKAQALNVVPNILIGLQTQGSRIVCSDVQESVTYVVYK 1008

Query: 1224 EQGAQLNLLAKD--FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW---- 1277
                +L     D       C A   ++D  T +    D+  NI +    PK SE      
Sbjct: 1009 HLENRLIQFCDDSIHRWTSCTA---MVDYETTA--GGDKFGNIWLVRCPPKASEEADEEG 1063

Query: 1278 KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
             G  L++   +  G      +      Q + TS  +T    G     R  LL+  L G++
Sbjct: 1064 SGLHLINERPYLQGTPNRLDLLAHFYTQDIPTSIQKTALVAG----GRELLLWSGLQGTL 1119

Query: 1334 GCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
            G   P    +++ F   QSL+++L    P +AG +  ++R ++   K        ++D +
Sbjct: 1120 GIFIPFVSREDVDF--FQSLEQQLRTEDPPIAGRDHLAYRSYYVPVKG-------VIDGD 1170

Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
            L   +  LP +++  IA +   +  ++   + D
Sbjct: 1171 LCERFLRLPRDKKETIAAELDRSVREVERKIGD 1203


>gi|340367935|ref|XP_003382508.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Amphimedon
            queenslandica]
          Length = 1160

 Score = 47.4 bits (111), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 74/386 (19%), Positives = 151/386 (39%), Gaps = 69/386 (17%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1122
            +R++ P++     +T   +  + +E A ++ V      +  + E  + +GTA   +    
Sbjct: 824  LRVMHPNQG----KTLDIVQFEQNEAAFSLAVCQF--VSKGDLEWFVVVGTAKDMIITPR 877

Query: 1123 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
              + G +++F    +       +  V++ +L     A+A  QG LL+  G  + ++    
Sbjct: 878  AISSGSLIVFRLSPDGSK----LEHVHTTQLDDVPIAMAPFQGRLLVGVGKLLRIYDIGK 933

Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
             ++         P  VV + ++   + +GD+ ++++FL ++    QL + A +     C 
Sbjct: 934  KKMLRKCENKHLPYLVVDIKVMGRRVYVGDVQEAVHFLYYRPHENQLVIFADEVVPRFC- 992

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1284
             T  ++D +T++   +D+  NI I      +++            W         QK   
Sbjct: 993  TTSCILDYNTVA--SADKFGNITILRLPSDVTDQVDEDPSGSRSLWDRGFLNGATQKANV 1050

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
               +HVG  +    ++ ++          PG  +     L++ TL GSIG + P      
Sbjct: 1051 MTSYHVGEGINTLHKVSLI----------PGGSE----VLVYTTLSGSIGILVPFS---- 1092

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
                   K+  D   H+        R   SN         S++D +L   Y  L   ++ 
Sbjct: 1093 ------SKEDSDFFQHLE----MHMRSEWSNL--------SVIDGDLCEVYNSLDPSKRR 1134

Query: 1405 EIAHQTGTTRSQILSNLNDLALGTSF 1430
            EIA     + S++   L DL    +F
Sbjct: 1135 EIALDLDRSPSEVAKKLEDLRTRYAF 1160


>gi|70954357|ref|XP_746229.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56526771|emb|CAH77136.1| hypothetical protein PC000016.02.0 [Plasmodium chabaudi chabaudi]
          Length = 372

 Score = 47.4 bits (111), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 58/286 (20%), Positives = 111/286 (38%), Gaps = 45/286 (15%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
                  G ++++ G K+ ++     +L     Y   P  +VS+ +  + I   DI +S+ 
Sbjct: 107  CFCPFNGRVIVSVGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKVSGDRIFASDIRESVL 166

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
               +      + L++ D        +E L   + ++         L V +E K  +    
Sbjct: 167  IFFYDSNQNVIRLISDDIIPRWITCSEILDHHTIMAADKFDSVFILRVPEEAKQEEYGIA 226

Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
               +Y    +S S K +K+     FH+G  VT   ++++   SS+               
Sbjct: 227  NKCWYGGEVISSSTKNRKMEHIMSFHIGEIVTSLQKVKLSPASSE--------------C 272

Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
            +++ T+ G+IG   P D     ELT    Q L+  L      + G     FR ++   + 
Sbjct: 273  IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTEKHALCGREHIFFRSYYHPVQ- 327

Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
                   ++D +L   +  LP + Q ++A     T  +IL  L D+
Sbjct: 328  ------HVIDGDLCEQFSSLPFDVQRKVASDLEKTPDEILRKLEDI 367


>gi|402586182|gb|EJW80120.1| hypothetical protein WUBG_08972 [Wuchereria bancrofti]
          Length = 162

 Score = 47.0 bits (110), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 36/154 (23%), Positives = 68/154 (44%), Gaps = 26/154 (16%)

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
            QKL + A  ++G  +T          S  +    PG++      L + T+ G IG + P 
Sbjct: 32   QKLEAIAHLYIGDAIT----------SMQKASLVPGAND----CLSYTTISGIIGILVPF 77

Query: 1340 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
               DE  F   Q+L+  +    P + G +  ++R ++   K       S++D +L   Y 
Sbjct: 78   MSRDEFEF--FQNLEMHMRVEYPPLCGRDHLAYRSYYFPVK-------SVIDGDLCEQYS 128

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            ++PL++Q  +  + G   ++I   L D+    +F
Sbjct: 129  LMPLDKQKSVGEELGRKPTEIHKKLEDIRTRYAF 162


>gi|350630003|gb|EHA18376.1| hypothetical protein ASPNIDRAFT_38018 [Aspergillus niger ATCC 1015]
          Length = 1219

 Score = 47.0 bits (110), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 98/462 (21%), Positives = 191/462 (41%), Gaps = 48/462 (10%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + +  Q +    ++ S    DN    Q IPL  TP +     E+ L+ +I S  
Sbjct: 759  QCVEGMVGIQGQNL----RIFSIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDN 814

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
             VL P  +  + L++       D   L   D         +   +++++P  A       
Sbjct: 815  NVLAPSTR--AKLLEDSKSRGGDETVLPPEDFGYPRATGHWASCIQVVDPLDAKA---VV 869

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
             TI ++ +E A+++  V     T++++ET L +GTA         +A G + ++   R  
Sbjct: 870  HTIELEENEAAISIAAVPF---TSQDDETFLVVGTAKDMTVNPPGSAGGYIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL   QG L+   G  + ++     +L         P  +
Sbjct: 924  EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGSLLRIYDLGMKQLLRKCQAPVVPKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    AT  ++D  T +    
Sbjct: 983  VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTATT-MVDYETTA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSES----WKGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRT 1310
            D+  N+ +     K SE       G  L+    +  G      +   +  Q + TS  +T
Sbjct: 1040 DKFGNLWLLRCPKKTSEEADEDGSGAHLIHERGYLQGTPNRLELMIHVYTQDIPTSLHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFIGREDVDF--FQNLEMQLAAQHPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
             +R +++  K        ++D +L   Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188


>gi|68060004|ref|XP_671977.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56488645|emb|CAI04030.1| hypothetical protein PB301494.00.0 [Plasmodium berghei]
          Length = 346

 Score = 47.0 bits (110), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 57/269 (21%), Positives = 108/269 (40%), Gaps = 41/269 (15%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L+ ++ GDL  + V ++  +V+ +     +   + + I  + +   F+ +  G+    QF
Sbjct: 90  LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 149

Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
           +     S  SM +S      G     A  T++L+     D +  +    ++ +  + ++N
Sbjct: 150 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 207

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG-CKGIWT 552
                       R SL      +   +GL I   A+             ELPG  + IWT
Sbjct: 208 IPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 249

Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
           +   +S               EY  Y+I+S E  T++LE  + + EV +S+   +   T 
Sbjct: 250 IKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 293

Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQ 641
              NL      IQV++ G R ++G  + +
Sbjct: 294 IHINLLYDNSFIQVYDTGIRHINGKIVQE 322


>gi|213407660|ref|XP_002174601.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces japonicus
            yFS275]
 gi|212002648|gb|EEB08308.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces japonicus
            yFS275]
          Length = 1078

 Score = 47.0 bits (110), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 70/301 (23%), Positives = 134/301 (44%), Gaps = 29/301 (9%)

Query: 1103 TTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALA 1161
            T  +NE ++ +GT +     D    GR+++F      D  + LVTE   K  +GAI ++ 
Sbjct: 768  TLPDNERVV-VGTGFNYPDRDEPDGGRLIVF----RLDEQEKLVTEAVYKT-QGAIFSVE 821

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV---KNFILLGDIHKSIY 1218
              +G LL+     +   ++    L  +     P      LNI    K+ +++GD+ KS+ 
Sbjct: 822  YQEGKLLVGMNAVLCTFRYENKTLRVVGSTRTP---TYCLNIAASSKDIVVVGDMMKSLT 878

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
              + ++  A+   +A+DFG+L   + + L    TL    + + + + +  +  K  +S +
Sbjct: 879  LYNTEKDTAE--EVARDFGALWVTSVQPL--SETLFFCTTADGEAVTML-WDTKAPQSVE 933

Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
             +KL  ++ + +G  V +  R   + +S  R    P         L+  T++G I  I  
Sbjct: 934  RKKLRWKSCYRLGDMVNRTRRGCFVLSSPSRL-VKP--------ELMCVTVEGGILLIGD 984

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
              +     LQ +Q   +++VP + GL+   + +     +A     D  +D +LL   E L
Sbjct: 985  ASQHADLLLQ-IQHNFLEAVPPLGGLDFYKWHERLFPARASAANKD-FIDGDLLESIEDL 1042

Query: 1399 P 1399
            P
Sbjct: 1043 P 1043


>gi|403415203|emb|CCM01903.1| predicted protein [Fibroporia radiculosa]
          Length = 1267

 Score = 47.0 bits (110), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 80/365 (21%), Positives = 145/365 (39%), Gaps = 61/365 (16%)

Query: 1069 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQ--------- 1119
            +P+R G P +T +++ +   ++    R+ +    + +E  ++L + ++ V          
Sbjct: 889  KPNRIGEPQETTSSLKLL--DDTTFNRIASFACESDEEVTSVLTLSSSDVSSARFCVGTV 946

Query: 1120 ----GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQG---------- 1165
                GE   + GR+LLFS     ++   LV+   S  + G +  L S+QG          
Sbjct: 947  QFKPGETEPSSGRILLFSLNTGPESSFQLVS---STPVSGCVYQLVSIQGMIAAAVNTSV 1003

Query: 1166 ---HLLIASGPKIILHK---WTGTELNGIAFYDAP------PLYVVSLNIVKNFILLGDI 1213
               ++ IA  P + +HK   +   +LN                 V  L +    +++GD 
Sbjct: 1004 RTAYVFIAFDPNMTIHKVILFKPEKLNNSTVVLTKVSEWNHNYSVTGLVVHGCMLIVGDA 1063

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
              SI F+  K     L  +A+D+  L   + E + DG  +    SD      +F +A  +
Sbjct: 1064 ISSISFV--KVDDTTLESIARDYSPLWPVSVEAM-DGDGVIGANSD----CNLFTFA--L 1114

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGS 1332
              S     L     +++G  V KFLR  +     S+R    P          LF T  G 
Sbjct: 1115 QRSGHRSTLERNGSYYLGDMVNKFLRGSLTNIDISERKSIEPKH--------LFFTSTGR 1166

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS--IVDCE 1390
            IG I  +++     +  LQ+ +   +    G++  +FR   +N K H     +   +D +
Sbjct: 1167 IGVILEMNDKISLHMTGLQRNMGKRIIGPGGVHHATFRA-PANSKGHSDAEAAFGFLDGD 1225

Query: 1391 LLSHY 1395
             L  Y
Sbjct: 1226 FLEQY 1230


>gi|226291941|gb|EEH47369.1| DNA damage-binding protein 1a [Paracoccidioides brasiliensis Pb18]
          Length = 1209

 Score = 46.6 bits (109), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 77/305 (25%), Positives = 125/305 (40%), Gaps = 42/305 (13%)

Query: 1110 LLAIGTAYVQGEDV---AARGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQG 1165
            +  +GT+Y+  +DV   + RGR+L F  TG         + +V    +KGA  ALA +Q 
Sbjct: 889  IFVVGTSYL--DDVGEGSIRGRILAFEVTGSRQ------LAKVAELPVKGACRALAVVQD 940

Query: 1166 HLLIASGPKIILHKWTGTE-----LNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSI 1217
             ++ A    ++++     E     LN  A Y    AP    + + +  N I + D+ KS+
Sbjct: 941  KIVAALMKTVVIYSIAKGELSDYTLNKTASYRTSTAP----IDIAVTGNLIAVADLMKSV 996

Query: 1218 YFLSWKE----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
              + +K+    Q   L  +A+ F +L   A   + +   L    SD + N+ +       
Sbjct: 997  SIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLE---SDAEGNLVVLNRNVNG 1053

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
                  ++L   +E  +G  V    R++ ++         P     +  A L GT++GSI
Sbjct: 1054 VTDDDKRRLEVTSEILLGEMVN---RIRPVSIQGSLPATGPREAVISPKAFL-GTVEGSI 1109

Query: 1334 ---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
               G I P  +    RLQS    LV   P     N   FR F  N       P   VD E
Sbjct: 1110 YLFGLINPAYQDLLMRLQSAMAGLV-VTPGAMPFN--KFRAFK-NAVRQAEEPYRFVDGE 1165

Query: 1391 LLSHY 1395
            L+  +
Sbjct: 1166 LIERF 1170



 Score = 44.7 bits (104), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 74/174 (42%), Gaps = 38/174 (21%)

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
           +W+   C I+       LK+         L   A  L+ VP+P+GG+LV+G  +I Y   
Sbjct: 279 AWQDTGC-IAVFKALDLLKEE--------LEMGASFLIPVPAPLGGLLVLGETSIRYLD- 328

Query: 335 SASCALALNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLT 390
                          D++ E      S+ LD A     W Q D    LL+   G L  L 
Sbjct: 329 ---------------DATNE----CISLPLDEATIFVAWEQVDGQRWLLADDYGRLFFLM 369

Query: 391 VVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
           ++ D    VQ  +LDL    P    S +  +G  + F+GS  GDS L++ T GS
Sbjct: 370 LILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVTFIGSHQGDSQLIRITEGS 421


>gi|238491136|ref|XP_002376805.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
            NRRL3357]
 gi|220697218|gb|EED53559.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
            NRRL3357]
          Length = 1117

 Score = 46.6 bits (109), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 34/313 (10%)

Query: 1111 LAIGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              +GTAY+  E + + RGR+L+F      DN + L T+V    +KGA  ALA L   ++ 
Sbjct: 809  FVVGTAYLDDEGEESIRGRILMFEI----DNGRKL-TKVAELPVKGACRALAMLGDKIVA 863

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYV----VSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
            A    I+++K        +              V + +V N I++ D+ KS+  L +KE 
Sbjct: 864  ALVKTIVIYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSDLMKSVCLLEFKEG 923

Query: 1226 GA----QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
                   L  +A+ F ++  +AT    ID  T   + SD + N+ +        E    +
Sbjct: 924  ENGLPDSLTEVARHFQTV--WATGVACIDKDT--FLESDAEGNLIVLRRNLAGVEEDDRR 979

Query: 1281 KLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
            +L   +E  +G  V +   + +Q LA+ +    A              GT++GSI   A 
Sbjct: 980  RLEVTSEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTVEGSIYLFAI 1027

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            ++      L  LQ  +   V  +  +    FR F S  +     P   VD EL+  +   
Sbjct: 1028 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVR-EATEPYRFVDGELIEQFLNC 1086

Query: 1399 PLEEQLEIAHQTG 1411
              E Q EI +  G
Sbjct: 1087 EPELQEEIVNSVG 1099


>gi|118369889|ref|XP_001018147.1| hypothetical protein TTHERM_00279910 [Tetrahymena thermophila]
 gi|89299914|gb|EAR97902.1| hypothetical protein TTHERM_00279910 [Tetrahymena thermophila SB210]
          Length = 1563

 Score = 46.6 bits (109), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 40/179 (22%), Positives = 82/179 (45%), Gaps = 27/179 (15%)

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC-----FATEFLIDGSTLSLVVSD 1259
            K + L+GDI K + F    E  ++  L+A++  +++      F+ +   + + L  +++D
Sbjct: 1343 KQYFLVGDIQKGVQFYEMDEIQSKPRLIAEENININVRQCVLFSIQ---NKNALRALITD 1399

Query: 1260 EQKNIQIFYYAPKMSESWKGQKLLSR---AEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
            E +N+  + +  +  E    ++ +S    A FHVG+ + K        T+  +       
Sbjct: 1400 ESRNVYAYSFIQQQQELPTDKRKISMELVASFHVGSKINK-------ITTDYKVNDTSMQ 1452

Query: 1317 DKTNRFALLFGTLDGSIGCIA----PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
            +  +   LL    DG++  I     P D      L  +Q  + + +P++ GL+PR FR+
Sbjct: 1453 ESVSHLLLL--KQDGNLSDIQLIYEPGDNTN---LFDMQNTIFEELPYIGGLDPREFRE 1506


>gi|170090007|ref|XP_001876226.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164649486|gb|EDR13728.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1275

 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 71/322 (22%), Positives = 138/322 (42%), Gaps = 36/322 (11%)

Query: 1068 LEPDRAGGPWQTRATIPM------------QSSENALTVRVVTLFNTTTKENETLLAIGT 1115
            +EP+R   P  +R++  +                +  T  VV+       +      +GT
Sbjct: 903  MEPNRVNDPEISRSSFKLLDDTSFANLCQFNCDPDEETTAVVSFSQKIAGKPMPFFCVGT 962

Query: 1116 -AYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
              Y  GE   + GR+++F T   + +    ++ + S ++ G + AL  +Q  ++ A    
Sbjct: 963  YVYKAGEVEPSAGRLMIF-TATTSTSSNLALSLMASTKVPGCVYALTVVQNQIVAAVNSS 1021

Query: 1175 IILHKWTGTELNG----IAFYDAPPLYVV-SLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
            ++L +   +  +     I   +    Y+V SL    + +++GD   SI  L   +  ++L
Sbjct: 1022 VMLFRLESSSDSLSPSLIKVSEWHHNYLVTSLGSYADRVVVGDQPSSISLLQVTQ--SKL 1079

Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
               A+D+G L     E L +     ++ +++  N+  F     M  S    +L     +H
Sbjct: 1080 ISQARDYGPLWPVCVEALDERH---IIGANDSLNLFTFSLEKAMGRS----RLERDGCYH 1132

Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQ 1348
            V   VTKFLR  +  +SSD +  +P + +      +F T  G IG +  + DE    +L 
Sbjct: 1133 VADLVTKFLRGSL--SSSDASTTSPLTSEA-----MFFTSSGRIGVVVDVKDEELSLQLT 1185

Query: 1349 SLQKKLVDSVPHVAGLNPRSFR 1370
            ++Q+ L + +  V G +   +R
Sbjct: 1186 NMQRNLANVIQGVGGSSHSKYR 1207


>gi|169773185|ref|XP_001821061.1| UV-damaged DNA binding protein [Aspergillus oryzae RIB40]
 gi|83768922|dbj|BAE59059.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1139

 Score = 46.2 bits (108), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 34/313 (10%)

Query: 1111 LAIGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              +GTAY+  E + + RGR+L+F      DN + L T+V    +KGA  ALA L   ++ 
Sbjct: 831  FVVGTAYLDDEGEESIRGRILMFEI----DNGRKL-TKVAELPVKGACRALAMLGDKIVA 885

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYV----VSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
            A    I+++K        +              V + +V N I++ D+ KS+  L +KE 
Sbjct: 886  ALVKTIVMYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSDLMKSVCLLEFKEG 945

Query: 1226 GA----QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
                   L  +A+ F ++  +AT    ID  T   + SD + N+ +        E    +
Sbjct: 946  ENGLPDSLTEVARHFQTV--WATGVACIDKDT--FLESDAEGNLIVLRRNLAGVEEDDRR 1001

Query: 1281 KLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
            +L   +E  +G  V +   + +Q LA+ +    A              GT++GSI   A 
Sbjct: 1002 RLEVTSEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTVEGSIYLFAI 1049

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            ++      L  LQ  +   V  +  +    FR F S  +     P   VD EL+  +   
Sbjct: 1050 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVR-EATEPYRFVDGELIEQFLNC 1108

Query: 1399 PLEEQLEIAHQTG 1411
              E Q EI +  G
Sbjct: 1109 EPELQEEIVNSVG 1121


>gi|391865638|gb|EIT74917.1| damage-specific DNA binding complex, subunit DDB1 [Aspergillus oryzae
            3.042]
          Length = 1135

 Score = 46.2 bits (108), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 34/313 (10%)

Query: 1111 LAIGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
              +GTAY+  E + + RGR+L+F      DN + L T+V    +KGA  ALA L   ++ 
Sbjct: 827  FVVGTAYLDDEGEESIRGRILMFEI----DNGRKL-TKVAELPVKGACRALAMLGDKIVA 881

Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYV----VSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
            A    I+++K        +              V + +V N I++ D+ KS+  L +KE 
Sbjct: 882  ALVKTIVIYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSDLMKSVCLLEFKEG 941

Query: 1226 GA----QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
                   L  +A+ F ++  +AT    ID  T   + SD + N+ +        E    +
Sbjct: 942  ENGLPDSLTEVARHFQTV--WATGVACIDKDT--FLESDAEGNLIVLRRNLAGVEEDDRR 997

Query: 1281 KLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
            +L   +E  +G  V +   + +Q LA+ +    A              GT++GSI   A 
Sbjct: 998  RLEVTSEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTVEGSIYLFAI 1045

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            ++      L  LQ  +   V  +  +    FR F S  +     P   VD EL+  +   
Sbjct: 1046 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVR-EATEPYRFVDGELIEQFLNC 1104

Query: 1399 PLEEQLEIAHQTG 1411
              E Q EI +  G
Sbjct: 1105 EPELQEEIVNSVG 1117


>gi|291000406|ref|XP_002682770.1| predicted protein [Naegleria gruberi]
 gi|284096398|gb|EFC50026.1| predicted protein [Naegleria gruberi]
          Length = 1216

 Score = 46.2 bits (108), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 67/349 (19%), Positives = 137/349 (39%), Gaps = 64/349 (18%)

Query: 1105 KENETLLAIGTAY---VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALA 1161
            K NE+L+ +GTA    +        G + +F    +       +  ++  E++    AL 
Sbjct: 897  KSNESLIIVGTAKNMKLYPTRTCDCGYINVFQISEDGK-----LQLIHKTEVEDVPYALH 951

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
            + +G LL+     + ++     +L       + P ++ S+ +  N I +GDI +S +F+ 
Sbjct: 952  AFRGRLLVGVKNMLRIYDLGKKKLLRKCENKSFPNFITSIAVDGNRIFVGDITESFHFVK 1011

Query: 1222 WKEQGAQLNLLAKD----------------FGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
            +      L + A +                    D F   F+   S L   VSDE ++  
Sbjct: 1012 FNSSENSLTIFADNTTPRWLTASALVDHNTIAGGDKFGNFFI---SRLPSDVSDELED-- 1066

Query: 1266 IFYYAPKMSESW---KG------QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
                +    E W   +G      QK     +F+VG+ +T   +  ++A         P  
Sbjct: 1067 ----SSTGKEKWIWERGLLNGAPQKATEIVKFYVGSMITSIYKTSLIA-------GGPS- 1114

Query: 1317 DKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
                   L++ T+ G++G   P   +        L+  L +  P + G +  ++R ++  
Sbjct: 1115 ------ILIYTTITGAVGVFFPFTSKKDIEFFTQLEMHLREKNPPLCGRDHLAYRSYYFP 1168

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             K       S+VD +L+  +  + L+ + +I+     T ++I   + D+
Sbjct: 1169 VK-------SVVDGDLIEQFNDVDLQTKTKISEDLQRTINEIAKKIEDM 1210


>gi|430814207|emb|CCJ28534.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 904

 Score = 46.2 bits (108), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 64/307 (20%), Positives = 134/307 (43%), Gaps = 34/307 (11%)

Query: 1097 VTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK-ELK 1154
            V    + T +N+ +  +GT + +  E+ +++GR++LF          N    V+S+ ++ 
Sbjct: 588  VQCITSVTIDNQDIFVVGTGFSLPEEEESSKGRIILFGV-------TNKKIWVFSEIQVN 640

Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGT--ELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
             A+  +  +   ++      + ++ +  +    N IA Y +  L  +SL +    +++GD
Sbjct: 641  DAVYCIGIIDNKIIAGINALVHIYAYDSSLKNFNVIATYRSTTL-CLSLAVHGTHVIIGD 699

Query: 1213 IHKSIYFLSW--KEQGAQLNLLAKDFGSL--DCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            + KS+  L++   E G +L  +AKD   L   C A    +D      + ++ + N+ +F+
Sbjct: 700  LMKSVSLLAFINTENGPRLKEVAKDCNPLWMTCVAA---LDNDLY--IGAEAEGNLSLFW 754

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                 + +++  KL   +E   G  V +     +L   S+ +   P +         F T
Sbjct: 755  --KDFNTTFEENKLQIISEIKWGELVNQIKPGTILY--SENSIIIPKAT--------FVT 802

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
            +DGSIG I  +       L +LQ  +   +  +  LN  ++R F  N +     P   +D
Sbjct: 803  VDGSIGIIFTVKREYLEFLVNLQSNMGKIISGIGCLNHSNWRAF-CNRRKKSNEPKCFID 861

Query: 1389 CELLSHY 1395
             + +  +
Sbjct: 862  GDFVEIF 868


>gi|58269920|ref|XP_572116.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
 gi|57228352|gb|AAW44809.1| conserved hypothetical protein [Cryptococcus neoformans var.
            neoformans JEC21]
          Length = 1276

 Score = 45.8 bits (107), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 110/266 (41%), Gaps = 42/266 (15%)

Query: 1111 LAIGTAYVQGED------------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1157
            LA+GTA++  +D            V   GRVLL    +  D       ++ ++    GA+
Sbjct: 945  LAVGTAFLPADDGEDSSWDEGNLAVVREGRVLLLEF-KEGDAGGGWDIKIKAELATVGAV 1003

Query: 1158 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1208
             AL  + G L +A+G K+ +H+      EL   + + A    + SL+++       +  +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTIHRLDHNPVELEETSSW-ASAYVISSLSVLPPSHIRPEGAL 1062

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD  +S+  L+  E    +    ++  +    A   L D     +V+SD   N+  + 
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-GVVISDAYSNLLTYR 1121

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                       QKL   A F +   VT+F    ++ T++      P         +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQSGSLVPTTTAPEIIIPD--------VLFAT 1164

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKL 1354
             +G +G I  L   + R L  LQ+ +
Sbjct: 1165 REGRLGIIGELGTRSSRTLDDLQRNM 1190


>gi|407034933|gb|EKE37449.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
          Length = 995

 Score = 45.8 bits (107), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 49/197 (24%), Positives = 92/197 (46%), Gaps = 34/197 (17%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNAD 1139
             TI ++S+E AL V  +          + + A+GTA ++  ++  + GR+LL        
Sbjct: 692  TTIELKSNELALCVDSL---------EDNIYAVGTAIIRENEIEPSSGRILLIR-----Q 737

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
            + + L+  V +++  GA+  L   Q  ++      + + +  G +LN        PL  V
Sbjct: 738  DTEGLIYIVGTEDYDGAVYCLKKCQKGIVAFINRNVHVIEKKGKDLNTKQNM-LLPLIGV 796

Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------FGSLDC-FATEFLIDG 1250
            SL+I K++I+ GD+ +S+    ++     L+++ KD         GS++  + T FL   
Sbjct: 797  SLDICKDYIIAGDLARSLSVYRYRNDIEHLDVVGKDNQIVWSSCVGSIESEYGTSFL--- 853

Query: 1251 STLSLVVSDEQKNIQIF 1267
                  V+D   NI+IF
Sbjct: 854  ------VADVSGNIKIF 864


>gi|390603312|gb|EIN12704.1| hypothetical protein PUNSTDRAFT_97523 [Punctularia strigosozonata
            HHB-11173 SS5]
          Length = 1268

 Score = 45.8 bits (107), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 59/237 (24%), Positives = 106/237 (44%), Gaps = 35/237 (14%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV--AARGRVLLFSTGRNADNPQ 1142
            +++ E+  +  V++L   T +   +   IGTA    +D    ++GR+++F      D   
Sbjct: 933  LEADEDITSAVVLSL--GTAEAYTSHFCIGTADFTSDDQLEVSKGRLVVF------DPST 984

Query: 1143 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1202
             +++ V + ++ G + ALAS+QG +  A    +I+++    E +G  F     + + + N
Sbjct: 985  KVLSPVATLDVNGCVYALASIQGLVAAAVNSAVIVYRL---ETDGPTFSSKRLVQLANWN 1041

Query: 1203 ---IVKNF------ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
                V N       I +GD   S+  L     G  L  +A+D+G L   A E        
Sbjct: 1042 HNYFVTNLVTRGSRIFVGDAISSVSILELT--GQALQTVARDYGPLWPVAIE---STGPD 1096

Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRT 1310
            S++ +D + N+  F    K+SE     KL     +H+G  V KF+   +LA     T
Sbjct: 1097 SVIGADGEFNLFTF----KLSEG----KLERDGSYHLGEQVNKFVPGGLLAADPAHT 1145


>gi|448111975|ref|XP_004201977.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
 gi|359464966|emb|CCE88671.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
          Length = 1249

 Score = 45.8 bits (107), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 55/239 (23%), Positives = 98/239 (41%), Gaps = 43/239 (17%)

Query: 95  GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
            I    L+ +C + +   ++SL  +   G+    ++D +++  +  K+++L++D   + L
Sbjct: 52  NIDTGKLDKICVHNVFSVIQSLEKVRLTGS----QKDYLVVTSDSGKLAILQYDTGRNRL 107

Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ--MIILKASQGGSG 212
               +  F+ P   H K G      GP +  DPQ R   +L+  L+   +I K      G
Sbjct: 108 ----VTVFQEP---HSKTGFRRNTPGPYLLTDPQNR--AILIGALERNKLIYKVHSDDKG 158

Query: 213 LVGDEDTFGSGGGFSARIESS--HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW 270
                     G   S+ +ES   H I L    +          GY  PV V +       
Sbjct: 159 ----------GMQISSPLESQIRHTITLAMCALDT--------GYENPVFVAIEAEYGAL 200

Query: 271 AGRV----SWKHHTCMISALSISTTLKQHPLIWSAMN--LPHDAYKLLAVPSPIGGVLV 323
             +     S  H T + ++  +   L    ++   +N  LP  A  L+ +PSP+GGVL+
Sbjct: 201 DSKEYSIDSQAHQTLLFTSYELDQGLNH--VVRRVVNNKLPISATHLIPLPSPVGGVLI 257


>gi|134113697|ref|XP_774433.1| hypothetical protein CNBG0790 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|50257071|gb|EAL19786.1| hypothetical protein CNBG0790 [Cryptococcus neoformans var.
            neoformans B-3501A]
          Length = 1276

 Score = 45.8 bits (107), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 63/266 (23%), Positives = 111/266 (41%), Gaps = 42/266 (15%)

Query: 1111 LAIGTAYV---QGED---------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1157
            LA+GTA++    GED         V   GRVLL    +  D       ++ ++    GA+
Sbjct: 945  LAVGTAFLPPDDGEDSSWDEGNLAVVREGRVLLLEF-KEGDAGGGWDIKIKAELATVGAV 1003

Query: 1158 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1208
             AL  + G L +A+G K+ +H+      EL   + + A    + SL+++       +  +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTIHRLDHNPVELEETSSW-ASAYVISSLSVLPPSHIRPEGAL 1062

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD  +S+  L+  E    +    ++  +    A   L D     +V+SD   N+  + 
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-GVVISDAYSNLLTYR 1121

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                       QKL   A F +   VT+F    ++ T++      P         +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQSGSLVPTTTAPEIIIPD--------VLFAT 1164

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKL 1354
             +G +G I  L   + R L  LQ+ +
Sbjct: 1165 REGRLGIIGELGTRSSRTLDDLQRNM 1190


>gi|322700233|gb|EFY91989.1| Pre-mRNA-splicing factor rse-1 [Metarhizium acridum CQMa 102]
          Length = 1039

 Score = 45.8 bits (107), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 81/365 (22%), Positives = 151/365 (41%), Gaps = 48/365 (13%)

Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNP 1141
            ++ +E  ++  VV     T+++ E+ L IGT    G+D+    R          R  D+ 
Sbjct: 704  LEGNEAGVSAAVVPF---TSQDGESFLIIGT----GKDMIVNPRQSSEGFIHVYRFHDDG 756

Query: 1142 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1201
            ++L   ++  +++   +AL S  G LL   G  + ++     +L   A  D  P ++VSL
Sbjct: 757  RSL-EFIHKTKVEEPPTALLSFHGRLLAGIGKTLRIYDLGMRQLLRKAQADISPQHIVSL 815

Query: 1202 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSD 1259
                  I++GD+   +  + +     +L     D  +    C A          S+   D
Sbjct: 816  QSQGFRIVVGDVQHGVTMVVYNPVSNKLLPFVDDTIARWTTCLAM-----ADYESVAGGD 870

Query: 1260 EQKNIQIFYYAPKMS----ESWKGQKLLS-RAEFHVGAHVTKFLRLQMLA--------TS 1306
            +  NI I     K S    E     +L + ++  H  AH     RLQ++A        TS
Sbjct: 871  KFGNIWIVRCPDKASAEADEPGSDVQLSNGQSYLHGAAH-----RLQLMAHMFVQDIPTS 925

Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLN 1365
              +T    G        LL+  L G+IG + PL    T    Q+L+  + +  P +AG +
Sbjct: 926  ICKTSLVVGGQDV----LLWSGLQGTIGVLIPLVTRETADFFQTLEMHMRNEDPPLAGRD 981

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               +R +H   K        ++D +L   + +L  E++  IA +   +  ++   ++D+ 
Sbjct: 982  HLMYRGYHVPVKG-------VIDGDLCERFSLLSREKKQMIAGELDRSVREVERRISDVR 1034

Query: 1426 LGTSF 1430
            + + F
Sbjct: 1035 IRSVF 1039


>gi|322706594|gb|EFY98174.1| DNA damage-binding protein 1 [Metarhizium anisopliae ARSEF 23]
          Length = 1121

 Score = 45.8 bits (107), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 72/318 (22%), Positives = 136/318 (42%), Gaps = 46/318 (14%)

Query: 1113 IGTAYVQGEDVA----ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
            IGT+++  +D       RGR+L+     N       V ++ S  LKGA   L +L  H++
Sbjct: 794  IGTSFITDDDAIEENDTRGRILVLGVDENRQ-----VYQIVSHNLKGACRCLGTLGEHIV 848

Query: 1169 IASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW- 1222
                  ++++ +         L  +A Y  P  + +SL+I  N I + D+ +S+  + + 
Sbjct: 849  AGLSKTVVVYHYVEETTVFGSLQKLAAY-RPASFPLSLDISGNIIGVVDLMQSLTLVEFI 907

Query: 1223 -KEQGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
              E G++  L          +AT    +DG     + +D Q NI +    P+        
Sbjct: 908  PSEDGSRAKLEETARHYQPGWATSVAHLDGE--RWLEADAQGNIIVLQRNPEAPTEQDRS 965

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP-------GSDKT-----NRFALLFGT 1328
            KL   +E ++G  + +  +L +   S++    +P       G  +T     N+  +L   
Sbjct: 966  KLEVTSEMNIGEQINQIRKLHV--ASNENAVVSPKAFLGSVGLSETIITCWNQLLMLV-Q 1022

Query: 1329 LDGSI---GCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
            ++G++   G IAP   +L    L + Q +L D +     ++   +R F +  +    GP 
Sbjct: 1023 IEGTLYLFGEIAPNYQDL----LLTFQSRLQDYIYAPGNVSFNLWRAFRNKAR-EGDGPF 1077

Query: 1385 SIVDCELLSHYEMLPLEE 1402
              VD E++  +  L L+E
Sbjct: 1078 RFVDGEMVERF--LDLDE 1093


>gi|159486547|ref|XP_001701300.1| nuclear pre-mRNA splicing factor, component of splicing factor 3b
            [Chlamydomonas reinhardtii]
 gi|158271783|gb|EDO97595.1| nuclear pre-mRNA splicing factor, component of splicing factor 3b
            [Chlamydomonas reinhardtii]
          Length = 1078

 Score = 45.8 bits (107), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 79/355 (22%), Positives = 137/355 (38%), Gaps = 84/355 (23%)

Query: 1108 ETLLAIGTA----YVQGEDVAARGRVL-LFSTGRNADNPQNLVTEVYSKELKGAI-SALA 1161
            ET+L +GTA    Y+  +  AA  RV  L   GR  D        ++  ++ G +  ALA
Sbjct: 765  ETVLLVGTARGLRYMPTDCEAAYIRVYRLGDGGRRLD-------LLHKTQVDGGVPGALA 817

Query: 1162 SLQGHLLIASGPKIILH--------------KWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
              +G LL   GP + L+              +WT   L+ + FY   P +  S       
Sbjct: 818  GFKGRLLAGVGPTLRLYDMGKKKMLRKCEYNRWTNIFLH-VFFYR--PYFRSS------- 867

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFG----------SLDCFATEFLIDGSTLSLVV 1257
                   +S++ + +K+      + A D              D  AT        +  + 
Sbjct: 868  ------QESVHMMRYKKADNAFYIFADDVAPRYLSALLPLDYDTIATGDKFGNLVILRLP 921

Query: 1258 SDEQKNIQIFYYAPKMSES-----WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
             +  + ++      KM+ +         KL    +FHVG  +T   R +M A   +    
Sbjct: 922  QEASQQVEDDPTGGKMAAASGKLNGAPHKLEELVKFHVGDTITALQRAEMQAGGQE---- 977

Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
                       L++ T+ G+IG + P    +++ F     L+  L    P +AG +  ++
Sbjct: 978  ----------VLVYSTVMGAIGVVYPFTSREDVDF--FSHLEMHLRQENPPLAGRDHLAY 1025

Query: 1370 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            R       A+ P   + VD +L S Y  +P+++Q  IA     T  ++L  L D+
Sbjct: 1026 RS------AYFP-VRNCVDGDLCSQYASIPMKKQQMIAEAMDRTTGEMLKKLEDI 1073


>gi|239613967|gb|EEQ90954.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ER-3]
 gi|327353314|gb|EGE82171.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ATCC 18188]
          Length = 1199

 Score = 45.8 bits (107), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 73/314 (23%), Positives = 126/314 (40%), Gaps = 42/314 (13%)

Query: 1110 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            L  +GT+Y+   GE  + RGR+L F        P   + +V    +KGA  ALA +Q  +
Sbjct: 883  LFIVGTSYLDDFGEG-SIRGRILAFEV-----TPNRQLGKVAEMPVKGACRALAIVQDKI 936

Query: 1168 LIASGPKIILHKWTGTE-----LNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            + A    ++++  +  +     L   A Y    AP    + + +  N I + D+ KS+  
Sbjct: 937  VAALMKTVVVYTLSKGQFADYILTKTASYRTSTAP----IDIAVTGNLIAVADLMKSVSI 992

Query: 1220 LSWKEQ----GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            + +++        L  +A+ F +L   A   +   + L    SD + N+ +         
Sbjct: 993  VEYQQGTDGLSGSLTEVARHFQTLWSTAVAPVAQDTWLE---SDAEGNLVVLRRNVNGVT 1049

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI-- 1333
                ++L   +E  +G  V +   + + A+       +P +          GT++GSI  
Sbjct: 1050 EDDRRRLEVTSEVLLGEMVNRIRPVNIQASLGTEAAISPRA--------FLGTVEGSIYL 1101

Query: 1334 -GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
             G I P  +    RLQS    +V +     G+    FR F  N       P   VD EL+
Sbjct: 1102 FGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTVRQAEEPYRFVDGELI 1157

Query: 1393 SHYEMLPLEEQLEI 1406
              +     E Q EI
Sbjct: 1158 ERFLGCGAELQEEI 1171



 Score = 45.1 bits (105), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
           W Q D    LL+   G L  L ++ D    VQ   L +       S +  +G  + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400

Query: 428 RLGDSLLVQFTCGS 441
             GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414


>gi|402467441|gb|EJW02742.1| hypothetical protein EDEG_02863 [Edhazardia aedis USNM 41457]
          Length = 1274

 Score = 45.4 bits (106), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 50/78 (64%), Gaps = 2/78 (2%)

Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            +K +  +  S++G+L+I  G +++++K    E L  IAF+D   +  VSL+++KNFIL+G
Sbjct: 988  IKSSAISCDSIRGNLVIGQGTRLMIYKIDRLEGLVAIAFHDLSII-AVSLSVIKNFILVG 1046

Query: 1212 DIHKSIYFLSWKEQGAQL 1229
            D+ + + F  ++ +  ++
Sbjct: 1047 DLLRGVTFFYFQTRPVKI 1064


>gi|147787360|emb|CAN64633.1| hypothetical protein VITISV_043788 [Vitis vinifera]
          Length = 1143

 Score = 45.4 bits (106), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 76/181 (41%), Gaps = 28/181 (15%)

Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
            +RIL+P  A     T   + +Q +E A ++  V   N   KE  TLLA+GTA        
Sbjct: 822  IRILDPRTA----TTTCLLELQDNEAAFSICTV---NFHDKEYGTLLAVGTA-------- 866

Query: 1125 ARGRVLLFSTGRNAD----------NPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
               + L F   R+ D               +  ++  +++G   AL   QG LL   G  
Sbjct: 867  ---KSLQFWPKRSFDAGYIHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSV 923

Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
            + L+      L         P  +VS++  ++ I +GDI +S ++  ++    QL + A 
Sbjct: 924  LRLYDLGKRRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFAD 983

Query: 1235 D 1235
            D
Sbjct: 984  D 984


>gi|183233163|ref|XP_654084.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
 gi|169801703|gb|EAL48698.2| damaged DNA binding protein, putative [Entamoeba histolytica
            HM-1:IMSS]
 gi|449708240|gb|EMD47737.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
          Length = 995

 Score = 45.4 bits (106), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 49/197 (24%), Positives = 92/197 (46%), Gaps = 34/197 (17%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNAD 1139
             TI ++S+E AL V  +          + + A+GTA ++  ++  + GR+LL        
Sbjct: 692  TTIELKSNELALCVDSL---------EDNIYAVGTAIIRENEIEPSSGRILLIR-----Q 737

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
            + + L+  V +++  GA+  L   Q  ++      + + +  G +LN        PL  V
Sbjct: 738  DTEGLIYIVGTEDYDGAVYCLKKCQKGIVAFINRNVHVIEKKGKDLNTKQNM-LLPLIGV 796

Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------FGSLDC-FATEFLIDG 1250
            SL+I K++I+ GD+ +S+    ++     L+++ KD         GS++  + T FL   
Sbjct: 797  SLDICKDYIIAGDLARSLSVYRYRNDIEHLDVVGKDNQIVWSSCVGSIESEYGTSFL--- 853

Query: 1251 STLSLVVSDEQKNIQIF 1267
                  V+D   NI+IF
Sbjct: 854  ------VADVSGNIKIF 864


>gi|125977518|ref|XP_001352792.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
 gi|54641542|gb|EAL30292.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
          Length = 1228

 Score = 45.4 bits (106), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     S +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP T          L+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|195586770|ref|XP_002083143.1| GD13507 [Drosophila simulans]
 gi|194195152|gb|EDX08728.1| GD13507 [Drosophila simulans]
          Length = 1227

 Score = 45.4 bits (106), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G+SD ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLSDNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|353232348|emb|CCD79703.1| putative dna repair protein xp-E [Schistosoma mansoni]
          Length = 1329

 Score = 45.4 bits (106), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)

Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
           R  DS+ L    A ++++E    +DS+  + + S    +        R      +G  V 
Sbjct: 72  RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           +DP      V +Y GL  IIL    G        +  G+    + +IE  +++       
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
               D  F++GY  P   +++E EL          H            L+   L   ++ 
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
              D+  L+ VP P GGV++VG N I YH++       ++ Y     +SQ L  ++   +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271

Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
                D A   +    L  D+ A  +  T +   L+ V  G +   L      P      
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327

Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
           I  + N + F+GS LGDS L++ 
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350


>gi|336263557|ref|XP_003346558.1| hypothetical protein SMAC_04731 [Sordaria macrospora k-hell]
 gi|380090453|emb|CCC11749.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1149

 Score = 45.4 bits (106), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 77/337 (22%), Positives = 138/337 (40%), Gaps = 49/337 (14%)

Query: 1113 IGTAYVQGEDVAA----RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
            +GT++++  D  A    RGR+L+F    N D        V   EL+GA  ALA +   ++
Sbjct: 823  VGTSFLEDPDRGAGTDKRGRILVFGIDSNRDPYL-----VLKHELRGACRALAVMGSKIV 877

Query: 1169 IASGPKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW- 1222
             A    +++ ++     T   L  +A Y     Y + + +  N I + D+ KS   + + 
Sbjct: 878  AALHKTVVISQYEETSSTEARLVKLASYRCTT-YPIDIAVHGNIIAVADMMKSATLVEYV 936

Query: 1223 -------KEQGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
                   K + A+L   A+   S   +AT    ++G   S + +D   N+ +     +  
Sbjct: 937  QAKTEEEKYEPAKLVECARHRHS--AWATAVAHVEGE--SWLEADANGNLVVLQRNVEGV 992

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1333
             +   ++L   +E ++G  V K   +++   +S  T   P +           T +G I 
Sbjct: 993  TAEDQRQLRITSELNLGEQVNKIRPIKV--ETSPNTIIIPRA--------FLATAEGGIY 1042

Query: 1334 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR----PGPDSIV 1387
              G IA   +L  R     Q KL   +  V  L+  S+R F +  +        GP   +
Sbjct: 1043 LFGTIAREQDLLLR----FQDKLAAVIKTVGELDFNSYRAFRNAERGPETDGTTGPVRFL 1098

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            D ELL  +  +    Q EI    G +  Q+ + + +L
Sbjct: 1099 DGELLERFLDVDETTQKEICEGLGPSVEQMRNMVEEL 1135


>gi|256088964|ref|XP_002580590.1| DNA repair protein xp-E [Schistosoma mansoni]
          Length = 1329

 Score = 45.4 bits (106), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)

Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
           R  DS+ L    A ++++E    +DS+  + + S    +        R      +G  V 
Sbjct: 72  RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           +DP      V +Y GL  IIL    G        +  G+    + +IE  +++       
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
               D  F++GY  P   +++E EL          H            L+   L   ++ 
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
              D+  L+ VP P GGV++VG N I YH++       ++ Y     +SQ L  ++   +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271

Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
                D A   +    L  D+ A  +  T +   L+ V  G +   L      P      
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327

Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
           I  + N + F+GS LGDS L++ 
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350


>gi|443896643|dbj|GAC73987.1| predicted DNA methylase [Pseudozyma antarctica T-34]
          Length = 1285

 Score = 45.1 bits (105), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 62/301 (20%), Positives = 121/301 (40%), Gaps = 43/301 (14%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            V+  E+      L + QG LL   G  + +++    +L       + P  +V+L+   + 
Sbjct: 1007 VHKTEVDDVPLVLRAFQGRLLAGVGKVLRIYELGKKKLLRKCENRSFPTAIVALDAQGSR 1066

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDF----------------GSLDCFATEFL--ID 1249
            I++GD+ +S+ F S+K    +L   A D                  + D F   ++  ID
Sbjct: 1067 IVVGDMQESVIFASYKPLENRLVTFADDIMPRYVTRCTMLDYDTVAAADKFGNVYVVRID 1126

Query: 1250 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
              T S  V ++   +   +  P +  +     LL  A + VG  +T   R  M+      
Sbjct: 1127 ADT-SRSVDEDVTGMTTMHEKPLLMGAAHKATLL--AHYFVGDIITSLSRAVMV------ 1177

Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                PG     R  LL+  + G+IG + P + +     + +L+ +L      + G +  +
Sbjct: 1178 ----PG----GREVLLYTGISGTIGALVPFVSKEDVDTMTTLEMQLRQQSDSLVGRDHLA 1229

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
            +R  ++  K        ++D +L   + +LP  +Q  +A +     S++   L  L  G 
Sbjct: 1230 YRSSYAPVK-------HVIDGDLCESFGLLPPAKQSAVAQELDRKPSEVNKKLAQLREGA 1282

Query: 1429 S 1429
            +
Sbjct: 1283 T 1283


>gi|195169735|ref|XP_002025674.1| GL20829 [Drosophila persimilis]
 gi|194109167|gb|EDW31210.1| GL20829 [Drosophila persimilis]
          Length = 1225

 Score = 45.1 bits (105), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     S +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP T          L+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|261193401|ref|XP_002623106.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
 gi|239588711|gb|EEQ71354.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
          Length = 1168

 Score = 45.1 bits (105), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
           W Q D    LL+   G L  L ++ D    VQ   L +       S +  +G  + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400

Query: 428 RLGDSLLVQFTCGS 441
             GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414


>gi|401413996|ref|XP_003886445.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325120865|emb|CBZ56420.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 2869

 Score = 45.1 bits (105), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 61/292 (20%), Positives = 120/292 (41%), Gaps = 59/292 (20%)

Query: 1111 LAIGTAYVQGEDVAARGRVLLF----------STGRNADNPQNLVT-------EVYSK-E 1152
            LA G      E +   GR+ LF          S  R+AD P +          E+++   
Sbjct: 2457 LAAGVGVPLSETIECSGRLYLFKLPESAMRLASPPRSADTPGDQAEYGTPERLELFADIV 2516

Query: 1153 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
            L G ++ + S       + +++ + GP++ +H+   ++    AF D+  + V ++  ++N
Sbjct: 2517 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMESSKFLRGAFSDSS-VCVTAVANLRN 2575

Query: 1207 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1260
            F LL D  K +  ++W+        ++  +++ F   +    A  FL   + L ++ +D 
Sbjct: 2576 FFLLADALKGLNLVAWEYHAEADSRKVTRISRTFPKSNFPVAACSFLAYENLLGMLATDI 2635

Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
              N+++F Y                   +   + + F +L +L   ++   AA    K  
Sbjct: 2636 DGNVRLFCY-------------------NADKNASGFEKLDILQCDAEDRCAAGCVVKLQ 2676

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRR--------LQSLQKKLVDSVPHVAGL 1364
            ++ +   T+  S+G  A    L FR         LQ+LQ +L   +P   GL
Sbjct: 2677 QYVVDSETV-ASLGEAADGSTLAFRLLSVENHAFLQTLQDRLARYLPEPLGL 2727


>gi|322700871|gb|EFY92623.1| DNA damage-binding protein 1 [Metarhizium acridum CQMa 102]
          Length = 1121

 Score = 45.1 bits (105), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 73/318 (22%), Positives = 137/318 (43%), Gaps = 46/318 (14%)

Query: 1113 IGTAYVQGEDVA----ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
            IGT++V  +D       RGR+L+     N       V ++ S  LKGA   L++L  H++
Sbjct: 794  IGTSFVTDDDAIEENDTRGRILVLGVDENRQ-----VYQIVSHNLKGACRCLSTLGEHIV 848

Query: 1169 IASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS---IYFL 1220
                  ++++ +         L  +A Y  P  + + L+I  N I + D+ +S   + F+
Sbjct: 849  AGLSKTVVVYNYVEETTVFGSLQKLAAY-RPASFPLGLDISGNIIGVVDLMQSLTLVEFI 907

Query: 1221 SWKEQG-AQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
              K+   A+L  +A+ +     +AT    +DG     + +D Q NI +    P+      
Sbjct: 908  PSKDGSRAKLEEVARHYQP--GWATSVTNLDGE--RWLEADAQGNIIVLQRNPEAPTEQD 963

Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP-------GSDKTN----RFALLFG 1327
              KL   +E ++G  + +  RL +   S++    +P       G  +T        L+  
Sbjct: 964  RSKLEVTSEINIGEQINQIRRLHV--ASNENAVVSPKAFLGSVGLSETTINCWTQLLILV 1021

Query: 1328 TLDGSI---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
             ++G++   G IAP  +     L + Q +L D +     ++   +R F +  +    GP 
Sbjct: 1022 QIEGTLYLFGEIAPKYQ---DLLLTFQARLQDYIYAPGNVSFNLWRAFRNKAR-EGDGPF 1077

Query: 1385 SIVDCELLSHYEMLPLEE 1402
              VD E++  +  L L+E
Sbjct: 1078 RFVDGEMVERF--LDLDE 1093


>gi|115390120|ref|XP_001212565.1| splicing factor 3B subunit 3 [Aspergillus terreus NIH2624]
 gi|114194961|gb|EAU36661.1| splicing factor 3B subunit 3 [Aspergillus terreus NIH2624]
          Length = 1217

 Score = 45.1 bits (105), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 96/460 (20%), Positives = 193/460 (41%), Gaps = 48/460 (10%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + +  Q +    ++ S    DN    Q IPL  TP +     E+ L+ +I S  
Sbjct: 759  QCLEGMVGIQGQNL----RIFSIEKLDNNMLQQSIPLSYTPRRFVKHPEQPLFYVIESDN 814

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
             VL P  +  + L++       D   L   +         +   +++++P  A       
Sbjct: 815  NVLSPSTR--ARLLEDSKSRNGDTTVLPPEEFGYPRATGHWASCIQVVDPLDAKA---VV 869

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
             T+ ++ +E A++V  V     T++++ET L +GT          +A G + ++   R  
Sbjct: 870  HTVELEDNEAAVSVAAVPF---TSQDDETFLVVGTVKDMTVNPPSSAGGFIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL + QG L +  G  + ++     +L         P  +
Sbjct: 924  EDGREL-EFIHKTKVEEPPLALLAFQGRLAVGLGSLLRIYDLGMKQLLRKCQAHVVPKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    +T  ++D  T++    
Sbjct: 983  VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSISRWTTSTT-MVDYETVA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
            D+  N+ +     K+SE       G  L+  R   H   +  + +     Q + TS  +T
Sbjct: 1040 DKFGNLWLVRCPKKVSEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTSLHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWTGFHGTIGMLVPFVSREDVDF--FQNLEMQLASQHPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
             +R +++  K        ++D +L   Y +LP + ++ IA
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIA 1186


>gi|170589359|ref|XP_001899441.1| Xeroderma Pigmentosum Group E Complementing protein [Brugia malayi]
 gi|158593654|gb|EDP32249.1| Xeroderma Pigmentosum Group E Complementing protein, putative
           [Brugia malayi]
          Length = 521

 Score = 44.7 bits (104), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 67/155 (43%), Gaps = 31/155 (20%)

Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADA---SATGISKQSNY----------EL-- 541
           A  T   ++ DS  N+ P++D +  +R N      + +G  K              EL  
Sbjct: 345 ADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS 403

Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
           VEL G K ++T+  +               DDE+  YLI+S ++ T VL       E TE
Sbjct: 404 VELKGIKNMFTLRTR---------------DDEFDDYLILSFDSETHVLLINGEELEDTE 448

Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
              + V G T+ AG LF  + ++QV      ++DG
Sbjct: 449 ITGFTVDGATLWAGCLFHSKTILQVTHGEVILIDG 483


>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
           putative [Candida dubliniensis CD36]
 gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
          Length = 1187

 Score = 44.7 bits (104), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 93/395 (23%), Positives = 162/395 (41%), Gaps = 62/395 (15%)

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
           +K+ P   ++  LP D   ++ +P  IGG+LV G+N   Y          L+   + L  
Sbjct: 218 VKKKPASLNSDPLPDDVNYMIPLPGHIGGMLVCGSNWCFYD--------KLDGPRIYLPL 269

Query: 352 SQELPRSSFSVELD-AAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLS--KT 406
            +   ++  S+ ++   H    +N   LL    GDL  LTV Y  D   ++ + ++   T
Sbjct: 270 PRRDGQTQESIIVNHVTHVLKKKNFFILLQNTLGDLFKLTVDYDFDKETIKNISITYFDT 329

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
            P  L+ +I    N   F+     D LL QF              E+ GD  A+      
Sbjct: 330 IPPALSLNI--FKNGFLFVNVLNNDKLLYQF--------------EKLGDDLAE----NE 369

Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RI 524
           L  +SSD             Y S  N   +   TF     D+L  I  L+  S  +  RI
Sbjct: 370 LVINSSD-------------YDSLDNVRGTDTTTFKLKGLDNLALIDVLETLSPIIDSRI 416

Query: 525 NADASATGISKQSNYELVE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII- 581
           N D+    +S  S  + +   +P    + +    +          + + +DE   YL+I 
Sbjct: 417 N-DSKLVTLSSHSYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDE---YLVIS 472

Query: 582 -SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG---ARILDGS 637
            SL ++T+VL   +++ +V +S   FV  ++  +    G   V+QV+  G    R ++G 
Sbjct: 473 SSLSSKTLVLSIGEVVEDVEDS--EFVLDQSTISVQQVGIASVVQVYSNGIKHIRTVNGK 530

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
             T D  F P+       S N+  + +++++  V+
Sbjct: 531 KKTTDW-FPPAGITITHASTNNQQVLIALSNLNVV 564


>gi|195490209|ref|XP_002093045.1| GE20993 [Drosophila yakuba]
 gi|194179146|gb|EDW92757.1| GE20993 [Drosophila yakuba]
          Length = 1227

 Score = 44.7 bits (104), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|24654874|ref|NP_728546.1| CG13900, isoform A [Drosophila melanogaster]
 gi|23092721|gb|AAF47416.2| CG13900, isoform A [Drosophila melanogaster]
 gi|60678131|gb|AAX33572.1| LD01809p [Drosophila melanogaster]
 gi|220950356|gb|ACL87721.1| CG13900-PA [synthetic construct]
 gi|289803030|gb|ADD20765.1| FI04459p [Drosophila melanogaster]
          Length = 1227

 Score = 44.7 bits (104), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|195336406|ref|XP_002034829.1| GM14250 [Drosophila sechellia]
 gi|194127922|gb|EDW49965.1| GM14250 [Drosophila sechellia]
          Length = 1227

 Score = 44.7 bits (104), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|60677959|gb|AAX33486.1| RE01065p [Drosophila melanogaster]
          Length = 1227

 Score = 44.7 bits (104), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|225680146|gb|EEH18430.1| DNA damage-binding protein [Paracoccidioides brasiliensis Pb03]
          Length = 1138

 Score = 44.3 bits (103), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+LV+G  +I Y                  D++ E      S+ LD A   
Sbjct: 318 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 357

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
             W Q D    LL+   G L  L ++ D    VQ  +LDL    P    S +  +G  + 
Sbjct: 358 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 415

Query: 424 FLGSRLGDSLLVQFTCGS 441
           F+GS  GDS L++ T GS
Sbjct: 416 FIGSHQGDSQLIRITEGS 433


>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1144

 Score = 44.3 bits (103), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 61/142 (42%), Gaps = 21/142 (14%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           NL   A  L+ VP P+GG+L++G   I Y          ++N  ++L            +
Sbjct: 239 NLELGAEILVPVPLPLGGILILGEKCIKYVD-------TISNETITL-----------PL 280

Query: 363 ELDAAHATW--LQNDVALLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIG 419
           E +     W  L N   LL+   G L  L +V D    V+   +     +   S +  +G
Sbjct: 281 EYNTVFVAWEQLDNQRWLLADDYGRLFFLMLVLDSANAVRTWKVDLLGETSRASVLVHLG 340

Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
             + FLGS  GDS +++ T GS
Sbjct: 341 GGVVFLGSHQGDSHVIRITEGS 362


>gi|194749950|ref|XP_001957397.1| GF24063 [Drosophila ananassae]
 gi|190624679|gb|EDV40203.1| GF24063 [Drosophila ananassae]
          Length = 1228

 Score = 44.3 bits (103), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|194864680|ref|XP_001971056.1| GG14635 [Drosophila erecta]
 gi|190652839|gb|EDV50082.1| GG14635 [Drosophila erecta]
          Length = 1227

 Score = 44.3 bits (103), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPSGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|320581947|gb|EFW96166.1| hypothetical protein HPODL_2449 [Ogataea parapolymorpha DL-1]
          Length = 1203

 Score = 44.3 bits (103), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 60/281 (21%), Positives = 111/281 (39%), Gaps = 42/281 (14%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            AL + QG LL+ +  ++IL+     +L   +        +V L      +++ D+  S+ 
Sbjct: 944  ALTAFQGKLLVGAKNELILYDIGQKQLVKRSSTRLECYEIVDLKTQGFRVIVSDVRDSVR 1003

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE--- 1275
            +  +K           D        T  L+D  T+  VV D+  NI +     ++SE   
Sbjct: 1004 YTVYKPLENSFVDFIDDTMQRHVTRT-LLLDYDTV--VVGDKFGNISVLRCPEQISEMSD 1060

Query: 1276 --------SWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
                      +  KL +   ++VG   T F +       S   G A         ++++G
Sbjct: 1061 EDNHGFLVKMRRTKLDNPVNYYVGDMPTFFQK------GSLTIGGAE--------SIIYG 1106

Query: 1328 TLDGSIGCIAPLDELT----FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
             L G +GC+ P+  L+    F+ LQ L       +     L  R + +F    K +   P
Sbjct: 1107 CLQGQMGCLYPMKSLSEINFFKELQRL------IIHEFTSLTDREYLKF----KGYYNPP 1156

Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             + +D +L+  Y  L  E+++ IA +       I   ++D+
Sbjct: 1157 KNSIDGDLIEEYYRLGPEKRIRIATKMDRLPRDIDRRISDM 1197


>gi|240280498|gb|EER44002.1| pre-mRNA-splicing factor rse1 [Ajellomyces capsulatus H143]
          Length = 305

 Score = 44.3 bits (103), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 61/283 (21%), Positives = 118/283 (41%), Gaps = 27/283 (9%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            AL   QG LL   G  + ++     ++         P  VV L    + I++ D+ +S+ 
Sbjct: 39   ALLGFQGRLLAGIGTDLRIYDLGMKQMLRKCQASVVPHLVVGLQTQGSRIIVSDVQESLT 98

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW- 1277
            ++ +K Q  +L     D  S     T  ++D  T++    D+  N+ +     K SE   
Sbjct: 99   YVVYKYQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGDKFGNLWLLRCPAKASEEAD 155

Query: 1278 ---KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1330
                G  L+   ++  GA    ++      Q L TS  +     G     R  L++  L 
Sbjct: 156  EDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTG----GRDILVWTGLQ 211

Query: 1331 GSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
            G++  + P    +E+ F   QSL+ +L    P +AG +   +R +++  K         +
Sbjct: 212  GTVSMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLIYRSYYAPAKG-------TI 262

Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            D +L   Y +LP +++ +IA +   +  +I   + D+    ++
Sbjct: 263  DGDLCETYLLLPNDKKQQIAGELDRSVREIERKIADMRTQVAY 305


>gi|154303693|ref|XP_001552253.1| hypothetical protein BC1G_08731 [Botryotinia fuckeliana B05.10]
          Length = 1087

 Score = 44.3 bits (103), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 34/143 (23%), Positives = 68/143 (47%), Gaps = 10/143 (6%)

Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
            +GT+++  E+   RGR+L+F  G NAD    ++    S  LKG+   +  L G ++ A  
Sbjct: 118  VGTSFLHEEEANVRGRLLIF--GVNADRAPYMIA---SHNLKGSCRCIGVLDGKIVAALN 172

Query: 1173 PKIILHKWTG-TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
              ++++  T    L G+            L+     I+   + +++ + +W+    +  L
Sbjct: 173  KTVVMNATTAKANLGGLVVLGETKF--TYLDDESKAIVEYALDEAVLWAAWEPIDERTYL 230

Query: 1232 LAKDFGSLDCFATEFLIDGSTLS 1254
            L  D+G L  +    L+DG+T++
Sbjct: 231  LGDDYGFL--YILTILVDGATVT 251



 Score = 42.0 bits (97), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 77/331 (23%), Positives = 134/331 (40%), Gaps = 57/331 (17%)

Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
            +GT+++  E+   RGR+L+F  G NAD    ++    S  LKG+   +  L G ++ A  
Sbjct: 765  VGTSFLHEEEANVRGRLLIF--GVNADRAPYMIA---SHNLKGSCRCIGVLDGKIVAALN 819

Query: 1173 PKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
              ++++ +     T   L  +A Y      +           + DI KSI  + +   GA
Sbjct: 820  KTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIA-VADIMKSIALVEYT-PGA 877

Query: 1228 -----QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
                 +L  +A+   +   F+T  + +  T + + +D   N+ +     +        ++
Sbjct: 878  DGLPDKLEEVARH--AQQVFSTS-VAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRM 934

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL-----FGTLDGSI---G 1334
                E ++G  V +  R+ +               +T++ ALL      GT +GSI    
Sbjct: 935  EVTCEMNLGEMVNRVKRINV---------------ETSKDALLIPRAFLGTTEGSIYLFS 979

Query: 1335 CIAPLDELTFRRLQSLQKKL---------VDSV-PHVAGLNPRS--FRQFHSNGKAHRPG 1382
             I P ++    RLQS    L          DS  PH   L+P +  F ++ S   A R  
Sbjct: 980  LIPPQNQDLLMRLQSRLASLPSASSIRGSSDSTSPHQIELSPGNLDFNKYRSYISATRET 1039

Query: 1383 --PDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
              P   VD EL+  +  L +E Q  +A   G
Sbjct: 1040 SEPFRFVDGELIERFLDLEVEVQEHVAEGLG 1070


>gi|83314897|ref|XP_730560.1| multisubunit cleavage/polyadenylation specificity factor subunit A
            [Plasmodium yoelii yoelii 17XNL]
 gi|23490318|gb|EAA22125.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
          Length = 863

 Score = 44.3 bits (103), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 52/213 (24%), Positives = 97/213 (45%), Gaps = 8/213 (3%)

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
             P  +++SL++++N+I++GDI  S+  LS+    + L  + +D+ ++ C    F+   S 
Sbjct: 624  TPSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSTLTEVCRDYSNVWC---TFVCALSK 680

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
               +VSD + N  +F  +          KL   A F+ G  V K L + +  +S      
Sbjct: 681  SHFLVSDMESNFLVFQKSSIRYNDEDSFKLSRVALFNHGHVVNKMLPVSL--SSLIEEEE 738

Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELT-FRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
            A       + ++L  + +GSI  I P   LT F++   ++  L DS+  +  +N  S   
Sbjct: 739  AQNEILRKKESILCASSEGSISSIIPFSNLTNFKKALCIEIALNDSLSFIXNINNNSNNT 798

Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
            +  N          +VD E+   +  +P E+Q 
Sbjct: 799  YKMN--LSEKSSKGVVDGEVFKMFFSMPFEKQF 829


>gi|405121632|gb|AFR96400.1| hypothetical protein CNAG_03173 [Cryptococcus neoformans var. grubii
            H99]
          Length = 1276

 Score = 44.3 bits (103), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 63/266 (23%), Positives = 109/266 (40%), Gaps = 42/266 (15%)

Query: 1111 LAIGTAYV---QGED---------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1157
            LA+GT  +    GED         V   GRVLL    +  D       +V ++    GA+
Sbjct: 945  LAVGTGILPPDDGEDSSWDEGNLAVVREGRVLLLEF-KEGDAGSGWDIKVKAELATVGAV 1003

Query: 1158 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1208
             AL  + G L +A+G K+ +H+      EL   + + A    + SL+++       +  +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTIHRLDHNPVELEETSSW-ASAYVISSLSVLPPSLMRPEGAL 1062

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD  +S+  L+  E    +    ++  +    A   L D     +V+SD   N+  + 
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-GVVISDAHSNLLTYR 1121

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                       QKL   A F +   VT+F    ++ T++      P         +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQSGSLVPTTTAPEIIIPD--------VLFAT 1164

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKL 1354
             +G +G I  L   + R L  LQ+ +
Sbjct: 1165 REGRLGIIGELGTRSSRTLDDLQRNM 1190


>gi|301091539|ref|XP_002895953.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262096049|gb|EEY54101.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 118

 Score = 43.9 bits (102), Expect = 0.61,   Method: Composition-based stats.
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
            T +  +  + P+ +  FRRL +LQ ++V+++P    L+PR FR   +N K     PD
Sbjct: 6    TSEEGVSALIPVGKGVFRRLFTLQNEMVNTLPQNCALDPREFRMLKTNAKRRCGRPD 62


>gi|145549784|ref|XP_001460571.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428401|emb|CAK93174.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1178

 Score = 43.9 bits (102), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 63/284 (22%), Positives = 120/284 (42%), Gaps = 41/284 (14%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            ALA+ +G LL+ +G  + +++     +   A       ++ S+ + ++ I + ++  SI+
Sbjct: 913  ALAAWRGRLLVGAGCNLRVYEMGNQRILKKAEIKNLNSFITSIMVKEDRIYVAEVADSIH 972

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFL-----IDGSTL-SLVVS------DEQKNIQI 1266
             L +  +      LA D       A+  L     I G    ++ VS      DE++    
Sbjct: 973  LLRYNIRDQTFMELADDILPRYVTASTVLDYHTVIAGDKFENIFVSRVPLDIDEEQEEHP 1032

Query: 1267 FYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
            + Y  KM +        K+     F+VG  +T   ++ +++TSS+               
Sbjct: 1033 YEYKMKMDQGCMNGAPFKMDQICNFYVGEVITSLQKIALVSTSSE--------------V 1078

Query: 1324 LLFGTLDGSIGCIAPLD---ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
            +++GT  GSI  + P D   ++ F     L  ++   V H   L+ R   QF S   A+ 
Sbjct: 1079 VVYGTSMGSIAALYPFDNKEDIDF----FLHLEMYLRVEH-QPLSGRDHMQFRS---AYG 1130

Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            P   SI+D +L   +  +   +Q  +A +   T + I+  L D+
Sbjct: 1131 PC-KSIIDGDLCDQFGNMQYNKQRAVAEEFDRTPADIIKKLEDI 1173


>gi|116191283|ref|XP_001221454.1| hypothetical protein CHGG_05359 [Chaetomium globosum CBS 148.51]
 gi|88181272|gb|EAQ88740.1| hypothetical protein CHGG_05359 [Chaetomium globosum CBS 148.51]
          Length = 979

 Score = 43.9 bits (102), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 71/344 (20%), Positives = 144/344 (41%), Gaps = 42/344 (12%)

Query: 1082 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----R 1136
            TI +Q +E A ++ + +    T++++E+ L + T    G D+    R L  S G     R
Sbjct: 558  TIHLQENEAAASLAIASF---TSQDDESFLVVST----GRDMVLNPRQL--SGGYSYVYR 608

Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPP 1195
              D+ ++L   ++        +AL + +G L+   G  ++++     + L      D  P
Sbjct: 609  FHDDGRDLEL-IHKTGTTEPPTALEAFRGRLVAGIGKTLVVYDLGLKQMLRKTQANDVVP 667

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
              +VSL    N I++GD+   +  ++++ +  QL     D  +     T  ++D    S+
Sbjct: 668  GLIVSLQTQGNRIVVGDVQHGVAMVAYRTESNQLIPFVDDTIARWTTCTT-MVDYD--SV 724

Query: 1256 VVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAH---VTKFLRLQMLATSSD 1308
               D+  N  I     + S    E    + L +R   H   H   +T     Q + T   
Sbjct: 725  AGGDKFGNFWIVRTPQQASLEADEPGAHRLLHAREHLHGAPHRLQLTAHFHTQDIPTGIT 784

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
            +T    G  +     L++    G++G   P    ++  F    +L++ +    P + G +
Sbjct: 785  KTHLVVGGQE----VLVWSGFQGTVGVFVPFVTREDADF--FLALEQHMRGEEPSLIGRD 838

Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
              ++R ++   K        +VD +L   Y++LP +++  IA +
Sbjct: 839  HLAYRGYYEPAKG-------VVDGDLCERYQLLPGDKKQRIAAE 875


>gi|406700450|gb|EKD03620.1| hypothetical protein A1Q2_02097 [Trichosporon asahii var. asahii CBS
            8904]
          Length = 1119

 Score = 43.9 bits (102), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 83/352 (23%), Positives = 141/352 (40%), Gaps = 62/352 (17%)

Query: 1022 VPVLKPLNQVLSLLIDQEVGHQI-DNHNLSSVDLHRTYTVE---EYEVRILEPDRAGGPW 1077
            V  L   N V++  + + + HQ  D    SSV+L    T+E   E+++    P+R     
Sbjct: 727  VAALPGYNLVVAGTVTRSMDHQTGDVLQSSSVELRNATTLELLSEFQL----PEREA--- 779

Query: 1078 QTRATIPMQSSENALTV--RVVTLFNTTTKENETLLAIGTAYVQGEDVAA-----RGRVL 1130
                     +S NA+T+  R   L  T   ENE  L   T     EDV +     RGR+L
Sbjct: 780  --------VASVNAVTLHGRKYILVGTAIFENEDALEDATL----EDVTSFIATNRGRLL 827

Query: 1131 LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-----EL 1185
            LF    +A    +LVT   S    G +     + G L +A+  K+ + + T       E 
Sbjct: 828  LFQINESAGPSLDLVT---SMTFNGPVYDTVVIHGFLAVATSTKVSILRLTTQPPSLEEA 884

Query: 1186 NGIAF-YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCF 1242
               AF ++   L VV ++  K  + +GD  +SI  LS   +   +    +D  +  + C 
Sbjct: 885  ASFAFAFETHHLAVVEIDKEKRLV-VGDAMRSIIVLSVDPESGDIVGDQRDMNAHLVRCL 943

Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
            +    ++   +   ++D   N+  F           G+ +   A   +   VT+     +
Sbjct: 944  SAVHDVEPGVM---IADNYANLLTFRL---------GKGITPAASIGLSEDVTRLQPGTL 991

Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
               S++            R  LL  T++G +G I  L + + R L  LQ+ +
Sbjct: 992  APVSAE--------GDILRADLLCTTVNGRLGVIGELGKGSIRTLDDLQRNM 1035


>gi|145510432|ref|XP_001441149.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408388|emb|CAK73752.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1174

 Score = 43.9 bits (102), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 63/284 (22%), Positives = 120/284 (42%), Gaps = 41/284 (14%)

Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
            ALA+ +G LL+ +G  + +++     +   A       ++ S+ + ++ I + ++  SI+
Sbjct: 909  ALAAWRGRLLVGAGCNLRVYEMGNQRILKKAEIKNLNSFITSIMVKEDRIYVAEVSDSIH 968

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFL-----IDGSTL-SLVVS------DEQKNIQI 1266
             L +  +      LA D       A+  L     I G    ++ VS      DE++    
Sbjct: 969  LLRYNIRDQTFMELADDILPRYVTASTVLDYHTVIAGDKFENIFVSRVPLDIDEEQEEHP 1028

Query: 1267 FYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
            + Y  KM +        K+     F+VG  +T   ++ +++TSS+               
Sbjct: 1029 YEYKMKMDQGCMNGAPFKMDQICNFYVGEVITSLQKIALVSTSSE--------------V 1074

Query: 1324 LLFGTLDGSIGCIAPLD---ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
            +++GT  GSI  + P D   ++ F     L  ++   V H   L+ R   QF S   A+ 
Sbjct: 1075 VVYGTSMGSIAALYPFDNKEDIDF----FLHLEMYLRVEH-QPLSGRDHMQFRS---AYG 1126

Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            P   SI+D +L   +  +   +Q  +A +   T + I+  L D+
Sbjct: 1127 PC-KSIIDGDLCEQFGNMQYNKQRTVAEEFDRTPADIIKKLEDI 1169


>gi|453087531|gb|EMF15572.1| splicing factor 3B subunit 3 [Mycosphaerella populorum SO2202]
          Length = 1223

 Score = 43.9 bits (102), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 74/363 (20%), Positives = 143/363 (39%), Gaps = 36/363 (9%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
            +T+ M  +E AL    V      ++  E  LA+GT      G  + A G V +F    + 
Sbjct: 884  STVEMGDNEAALCCACVAF---ESRNWEVFLAVGTGQHMSPGTGLQAAGYVHIFKLEEDG 940

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
                  +T V+  +    + AL    G L +  G ++ ++      L   A   A P  +
Sbjct: 941  TK----LTFVHKTKFDQPVYALLPFHGRLALGVGNELFIYDIGQKALLRKARGQATPNQI 996

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            VSL      I+ GD+ + + ++ +K    +L     D        T  +ID  T +    
Sbjct: 997  VSLESHGQRIICGDVSEGVTYMVYKPGYNRLIPFVDDVVQRWTTGTT-MIDYETTA--GG 1053

Query: 1259 DEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLR----LQMLATSSDRT 1310
            D+  N+ +     + S    E   G  +++   +  GA     LR     Q +  S  RT
Sbjct: 1054 DKFGNLWVVRCPEQPSQEADEEGAGGFIMNERSYLGGAPYRLDLRAHYYCQDIPMSLQRT 1113

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G  +     L +  L G++G + P    +++ F    SL+++L    P +AG +  
Sbjct: 1114 ALVAGGQEV----LFWSGLQGTLGMLVPFVTREDVEF--FTSLEQQLRIEDPPLAGRDHL 1167

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
             +R ++   K        ++D +L   +  L  + + ++A +   +  +I   + ++   
Sbjct: 1168 MYRSYYVPVKG-------VIDGDLCERFMALSYDSKQKVAAEVDRSVKEIEKKVQEMRTR 1220

Query: 1428 TSF 1430
             +F
Sbjct: 1221 VAF 1223


>gi|384490729|gb|EIE81951.1| hypothetical protein RO3G_06656 [Rhizopus delemar RA 99-880]
          Length = 967

 Score = 43.9 bits (102), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 72/154 (46%), Gaps = 17/154 (11%)

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
           S + +    + L+ VP P+GG+LV+G   I Y        L   N  +S+D ++    ++
Sbjct: 198 STIKVEASTHALVPVPEPLGGLLVIGEYIITYFD-----PLTNTNRELSIDPAR---VTA 249

Query: 360 FSVELDAAHATWLQND-----VALLSTKTGDLVLLTVVYDGRVV---QRLDLSKTNPSV- 410
           +    D ++   L ++     V  + T    +V L+  + G+V    Q ++    +P V 
Sbjct: 250 WEFMKDESNRYLLGDEEGYLYVFSIETSHNKVVNLSSTFIGQVPSFNQNIESKANHPQVS 309

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
             S I  +GN +F++GS  GDS L+Q   G   S
Sbjct: 310 RPSCIVDLGNLMFYIGSTHGDSCLIQLIKGQEKS 343



 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 51/224 (22%), Positives = 108/224 (48%), Gaps = 18/224 (8%)

Query: 1152 ELKGAISALASLQGHLLIASGPKII-LHKWTGTELNG--IAFYDAPPLYVVSLNI---VK 1205
            ++ G +  + S++  ++ A   KI  L+ +    L G  I F       VV+L++     
Sbjct: 746  DMPGVVYRMESIKNTIIAAVDGKIYGLYNFKPDLLKGERIEFKFLLHNNVVALDMDTDNN 805

Query: 1206 NFILLGDIHKSIYFLSWK--EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
            + +L+GD+ +S+  L  +  E+  +L+L A D   +   A +F+ +     L+ +D++ N
Sbjct: 806  DTLLVGDLMESMSLLKVEKDEESLKLSLEAVDNKQVWMTAVKFVNENV---LIGADDRHN 862

Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
            +      P++ +  K  KL     +H+G  V +F R  +L    D   A+   D  +++ 
Sbjct: 863  L-FTMIKPEIRQEGKTCKLELEGGYHLGTLVNRF-RKDIL---RDVENASDNIDSISKYE 917

Query: 1324 --LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
                F T++GSIG +  +   +F   + +Q+ +++ +P+   L+
Sbjct: 918  SEFTFATVNGSIGTVKTISRESFEFFKGIQEGILNILPNNGNLD 961


>gi|156049323|ref|XP_001590628.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980]
 gi|154692767|gb|EDN92505.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1153

 Score = 43.9 bits (102), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 58/252 (23%), Positives = 106/252 (42%), Gaps = 37/252 (14%)

Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
            +GT+++   +V  RGR+L+F  G N+D    ++    S  LKG+   +  L G ++ A  
Sbjct: 835  VGTSFLHDGEVNIRGRLLIF--GVNSDRTPYIIA---SHTLKGSCRCIGVLNGKIVAALN 889

Query: 1173 PKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
              ++++ +     T   L  +A Y       + ++I  N I + DI KS+  + +     
Sbjct: 890  KTVVMYDYEETSRTTANLRKVATYRCATC-PIDIDIRGNIIAVADIMKSVALVEYTPGVD 948

Query: 1228 QLNLLAKDFG--SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
             L    ++ G  +   FAT  + +  T + + SD   N+ +     +        +L   
Sbjct: 949  GLPDKLEEVGRHAQQVFATS-IAEVDTDTYLESDHDGNLIVLKRNREGVTREDKLRLEVL 1007

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLF-----GTLDGSI---GCIA 1337
             E ++G  V K  R+ +               +T++ ALL       T +GSI     I 
Sbjct: 1008 CEMNLGEMVNKIKRINV---------------ETSKDALLIPRAFVATTEGSIYLFSLIP 1052

Query: 1338 PLDELTFRRLQS 1349
            P ++    RLQS
Sbjct: 1053 PQNQDLLMRLQS 1064


>gi|195428692|ref|XP_002062402.1| GK16677 [Drosophila willistoni]
 gi|194158487|gb|EDW73388.1| GK16677 [Drosophila willistoni]
          Length = 1273

 Score = 43.9 bits (102), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 347 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 406

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 407 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIVTSQ 456

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 457 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 504

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 505 WTVKKR--------------VDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 549

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 550 TLCCAAL-GDDALVQVYPDGIR 570


>gi|422294117|gb|EKU21417.1| uv-damaged dna-binding protein [Nannochloropsis gaditana CCMP526]
          Length = 192

 Score = 43.5 bits (101), Expect = 0.79,   Method: Composition-based stats.
 Identities = 31/106 (29%), Positives = 47/106 (44%), Gaps = 5/106 (4%)

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
            R+G            L+FGT  G IG I P+ E  +R   +L K L   V  V GL+   
Sbjct: 29   RSGGEVARGHVQDLGLMFGTQQGMIGSILPISEEDYRFFVALTKCLNKVVKGVGGLSHEE 88

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLP---LEEQLEIAHQTG 1411
            +R+F ++           VD +L+  +  LP   +EE +E+    G
Sbjct: 89   YRRFLTDKAI--SDTQGFVDGDLIESFLELPTQRMEEVVELMRVEG 132


>gi|226480826|emb|CAX73510.1| glyceraldehyde 3-phosphate dehydrogenase [Schistosoma japonicum]
          Length = 332

 Score = 43.5 bits (101), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 34/212 (16%)

Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
           R  DS+ L    A ++++E    +DS+  + + S     S E     R      +G  V 
Sbjct: 72  RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVAS----GSVE----DRSARIIDQGFDVL 123

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFG-SGGGFSARIESSHVINLRDLD 242
           +DP      V +Y GL  IIL    G        DT   +   +S RIE  +++      
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIGDKIGTDFLDTNQWTVNTYSVRIEEGNIV------ 177

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
                D  F++GY  P   +++E EL    + +++ +    +  ++  TL          
Sbjct: 178 -----DMAFIYGYSLPTFAMIYEDELVLHMK-TYEIYGREPALRNVQLTLD--------- 222

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
           ++  D+  L+ VP P GGV++VG N I YH++
Sbjct: 223 SIEPDSKLLIPVPKPYGGVILVGDNIICYHTK 254


>gi|198420618|ref|XP_002125906.1| PREDICTED: similar to Splicing factor 3B subunit 3
           (Spliceosome-associated protein 130) (SAP 130)
           (Pre-mRNA-splicing factor SF3b 130 kDa subunit)
           (SF3b130) (STAF130) [Ciona intestinalis]
          Length = 1216

 Score = 43.5 bits (101), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 97/470 (20%), Positives = 164/470 (34%), Gaps = 129/470 (27%)

Query: 304 LPHDAYKLLAVP---SPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS-- 358
           L   A  L++VP      GGVLV   N I Y            N+    D    +PR   
Sbjct: 228 LEERANHLISVPGGNDGPGGVLVCAENYITY-----------KNFGDQPDIRTPIPRRRN 276

Query: 359 -------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
                     V   A H T       L+ T+ GD+  +T+  D  +V  + L   +   +
Sbjct: 277 DLDDPERGMIVVCSATHKTKSMF-FFLIQTEQGDIFKVTLETDEDMVTEIRLKYFDTVPV 335

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPSTKR-- 466
           +  +  +     F+ + +G+  L Q          +  SS +  E GD    AP   R  
Sbjct: 336 SMAMCVLRTGFLFVAAEMGNHCLYQIAHLGDDDDETEFSSAMPLEEGDTFFYAPRALRNL 395

Query: 467 --------LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
                   L    +  + D+ N +   LY +      S+                 L+  
Sbjct: 396 VLVDELDSLSPIMTCLISDLANEDTPQLYVTCGRGPRSS-----------------LRVL 438

Query: 519 SYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
            +GL +            S   + ELPG    +WTV  K               ++E+ +
Sbjct: 439 RHGLEV------------SEMAVSELPGNPNAVWTVKIKE--------------EEEFDS 472

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
           Y+I+S    T+VL   + + EVT+S   F+      + +L G   ++QV+  G R +   
Sbjct: 473 YIIVSFVNATLVLSIGETVEEVTDS--GFLGTTPTLSCSLLGENALVQVYPDGIRHIRAD 530

Query: 638 ----------------------------------YMTQDLSFGPSNSESGSGSENSTVLS 663
                                             Y   D S G  N  +     NS V+ 
Sbjct: 531 KRVNEWKTPGKKTILRCAVNQRQVVIALTGGELVYFEMDQS-GQLNEYTERKEMNSEVVC 589

Query: 664 VSIAD--------PYVLLGMSDGSIRLLVGDPSTC--TVSVQT-PAAIES 702
           + ++          ++ +G++D ++R++  DP+ C   +S+Q  PA  ES
Sbjct: 590 MDLSKVPPTEQRTRFLAVGLADNTVRIISLDPTDCLQPLSMQALPATPES 639


>gi|195376606|ref|XP_002047087.1| GJ13230 [Drosophila virilis]
 gi|194154245|gb|EDW69429.1| GJ13230 [Drosophila virilis]
          Length = 1229

 Score = 43.5 bits (101), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525


>gi|328700785|ref|XP_001945395.2| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
          Length = 1072

 Score = 43.5 bits (101), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 41/202 (20%), Positives = 88/202 (43%), Gaps = 38/202 (18%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
           ++  +++D  F++G+  P ++I++E                  +A+  +  +K+      
Sbjct: 188 MEETNIQDIGFLYGFTNPTIIIIYE------------------NAMGRTIKIKKIIDSKK 229

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
             ++  +A  ++ VPSP+ G +++G N+I YH  + SC +              LP    
Sbjct: 230 YKSIEKEASMVIPVPSPLCGAIIIGENSIFYH--NGSCNII------------RLPIRQ- 274

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSDI 415
            +E+       L+    LL   +G L++L + Y+  +     V  L L       +   +
Sbjct: 275 KIEIVCYTRVDLEGTRYLLGDHSGCLLMLFLKYEKTLNGKFKVTDLYLRYFGEISIPISL 334

Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
           T + N + ++ S+ GDS L++ 
Sbjct: 335 TYLDNKVIYVASKFGDSQLIKL 356


>gi|302423344|ref|XP_003009502.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
 gi|261352648|gb|EEY15076.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
          Length = 1119

 Score = 43.5 bits (101), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 61/136 (44%), Gaps = 13/136 (9%)

Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVSLDSS----QELPRSSFSV 362
           L +P P    L+    V ++   YH +  + A A L    V+ ++       L +   S 
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280

Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            LD      +W + DV   LL+   G+L LLT+  DG +V  L L     +   S +  +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340

Query: 419 GNSLFFLGSRLGDSLL 434
           GN + FLGS  GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356


>gi|195126264|ref|XP_002007593.1| GI12293 [Drosophila mojavensis]
 gi|193919202|gb|EDW18069.1| GI12293 [Drosophila mojavensis]
          Length = 1227

 Score = 43.5 bits (101), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525


>gi|195012560|ref|XP_001983703.1| GH16029 [Drosophila grimshawi]
 gi|193897185|gb|EDV96051.1| GH16029 [Drosophila grimshawi]
          Length = 1228

 Score = 43.1 bits (100), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525


>gi|449437538|ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-like [Cucumis sativus]
          Length = 1376

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 90/216 (41%), Gaps = 12/216 (5%)

Query: 1148 VYSKELKGAISALAS-LQGHLLIASGPKIILHKWTGTELNGIAFYDA--PPLYVVSLNIV 1204
            VYS  L G + A+   L  + L ++G    +  +       +  +        + SL   
Sbjct: 1076 VYSTSLPGMVLAICPYLDRYFLASAGNAFYVCGFPNDSFQRVKRFAVGRTRFMITSLTAH 1135

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
             N I +GD    I F S++E   +L  +  D  S    A   L+D  T   VVSD + +I
Sbjct: 1136 VNRIAVGDCRDGILFFSYQEDAKKLEQIYSD-PSQRLVADCTLLDVDTA--VVSDRKGSI 1192

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR-----LQMLATSSDRTGAAPGSD-K 1318
             I   + ++ ++   +  L+    +    +   LR      ++ A    R  A PGSD  
Sbjct: 1193 AILSCSDRLEDNASPECNLTLNCAYYMGEIAMTLRKGSFSYKLPADDLLRGCAVPGSDFD 1252

Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
            ++   ++  TL GSI    PL    +  L+++Q KL
Sbjct: 1253 SSHNTIIASTLLGSIVIFTPLSRDEYELLEAVQAKL 1288


>gi|213405251|ref|XP_002173397.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
           yFS275]
 gi|212001444|gb|EEB07104.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
           yFS275]
          Length = 1166

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 121/561 (21%), Positives = 202/561 (36%), Gaps = 99/561 (17%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           + L+     +G V ++A L   G     ++D ++L  +  + ++LE+D   + L      
Sbjct: 56  MNLMISQNCYGIVRNIAPLRLTGF----KKDYLVLTSDSGRFTILEYDIGKNKLVSVYQE 111

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGLQ------MII---L 204
            F        K G      G  + +D +GR   V       LVY L       + I   L
Sbjct: 112 AFG-------KSGIRRIVPGEYLALDAKGRAAMVASTEKNKLVYVLNRDSEANLTISSPL 164

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
           +A + G+      D  G   G+   I ++  +   DLD   + +                
Sbjct: 165 EAHKAGTICF---DLVGLDTGYENPIFAALEVEYSDLDHDPLGEL--------------- 206

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGG 320
                      +KH   +++   +   L      WS + +   AYKL+ VP     P  G
Sbjct: 207 -----------YKHSEKVLTYYELDLGLNHVVKRWSKV-VDRSAYKLIRVPGGNDGP-SG 253

Query: 321 VLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           V+V+    I Y H Q  S  + +        ++  LP     + + A       +   LL
Sbjct: 254 VIVISTGWISYRHLQRQSHFVPIPTRETKATTNTALP-----IIVSAVMHKMRDSFFYLL 308

Query: 380 STKTGDLVLLTVVYDGRV-VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
               GDL+ LT+  D    V+ L +   +     + +  + + L F G   G+  L QF 
Sbjct: 309 QNSDGDLLKLTMELDDHSQVKELRIKYFDTIPFAAILNILKSGLLFAGCEGGNHHLYQFE 368

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ 498
                S+     + EF         +K   +  +  L  + N   L    S    T++  
Sbjct: 369 -----SLAIDDDEPEFSSANFSEEQSKHSPKKLTYKLHPLQNISLLDEIPSLFPLTDAIV 423

Query: 499 KTFSFAVRDSLVNI-GPLKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHK 556
              S      L  + G  K+ S  L +    SAT +       L ELPG    IWTV  K
Sbjct: 424 TRTSTDANSQLYTLCGRHKEASLRL-LKRGVSATEVV------LSELPGAPIAIWTVKQK 476

Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
                          +D Y  Y+++S    T+VL   + + EV +S        T+    
Sbjct: 477 --------------LNDPYDKYMVLSFTNGTLVLSIGETVEEVLDS-GLLSSVSTLNVRQ 521

Query: 617 LFGRRRVIQVFERGARILDGS 637
           L GR  V+Q+  +G R +  +
Sbjct: 522 L-GRSSVVQIHSKGIRCISAN 541


>gi|167539942|ref|XP_001741428.1| DNA repair protein xp-E [Entamoeba dispar SAW760]
 gi|165894130|gb|EDR22214.1| DNA repair protein xp-E, putative [Entamoeba dispar SAW760]
          Length = 1004

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 47/197 (23%), Positives = 92/197 (46%), Gaps = 34/197 (17%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNAD 1139
             T+ ++S+E AL V  +          + + A+GTA ++  ++  + GR+LL        
Sbjct: 701  TTVELKSNELALCVDSL---------EDNIYAVGTAIIRENEIEPSSGRILLIR-----Q 746

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
            + + L+  V +++  GA+  L   Q  ++      + + +  G +L+        PL  V
Sbjct: 747  DSEGLIYIVGTEDYDGAVYCLKKYQKGIVAFINRNVHVIEKKGKDLSTKQNM-LLPLIGV 805

Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------FGSLDC-FATEFLIDG 1250
            SL+I K++I+ GD+ +S+    ++     L+++ KD         GS++  + T FL   
Sbjct: 806  SLDICKDYIIAGDLARSVSVYRYRNDIEHLDIVGKDNQIVWSSCVGSIESEYGTSFL--- 862

Query: 1251 STLSLVVSDEQKNIQIF 1267
                  V+D   NI+IF
Sbjct: 863  ------VADVSGNIKIF 873


>gi|134077422|emb|CAK45676.1| unnamed protein product [Aspergillus niger]
          Length = 1133

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R      LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
                W Q D    LL+   G L  L +V D    VQ   L     +   S +  +G  +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++   GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347


>gi|406868052|gb|EKD21089.1| pre-mRNA-splicing factor rse1 [Marssonina brunnea f. sp.
            'multigermtubi' MB_m1]
          Length = 1236

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 71/339 (20%), Positives = 142/339 (41%), Gaps = 38/339 (11%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGR---VLLFSTGRNAD 1139
            I +  +E A+++  V+    +++++E  L IGT    G+D+    R          R  D
Sbjct: 875  IDLDDNEAAVSMAAVSF---SSQDDEVFLVIGT----GKDMIVSPRSSTAGFIHVYRFHD 927

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
            N +  +  ++  +++    AL   QG LL+  G  + ++     +L   A  +  P  +V
Sbjct: 928  NGKE-IEFIHKTKVEEPPMALLGFQGRLLVGIGKDLRIYDLGMRQLLRKAQAEVAPNLIV 986

Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
             L    + I++ D+ +SI  + +K Q  +L     D  S     T  ++D  T++    D
Sbjct: 987  GLQTQGSRIVVSDVQESIIMIVYKFQENKLIPFVDDTISRWTSCTT-MVDYETVA--GGD 1043

Query: 1260 EQKNIQIFYYAPKMSESWKGQKLLS-----RAEFHVGAHVTKFLR---LQMLATSSDRTG 1311
            +  N+ +     K SE    +   S     R+      H    +     Q +  S  +T 
Sbjct: 1044 KFGNLWLLRCPTKASEEADEEGSASHLVHERSYLQGSPHRLTLMAHFFTQDIPMSIQKTN 1103

Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
               G     R  +L+  + G++G + P    +++ F   Q+L++ L      +AG +   
Sbjct: 1104 LVAG----GRDCILWSGIQGTLGILIPFVSREDVDF--FQTLEQHLRSEDAPLAGRDHLI 1157

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
            +R ++   K        ++D +L   Y +LP +++  IA
Sbjct: 1158 YRSYYVPVKG-------VIDGDLCERYTLLPTDKKQMIA 1189


>gi|392591958|gb|EIW81285.1| hypothetical protein CONPUDRAFT_56293 [Coniophora puteana RWD-64-598
            SS2]
          Length = 1245

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 99/450 (22%), Positives = 193/450 (42%), Gaps = 81/450 (18%)

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            G    F  GSR       ++RLR H  L    + A   LH  +     +  TS+G++   
Sbjct: 775  GRTAVFACGSRTSVLFWEKDRLR-HSPLILKEVAAAASLHTHDYRSSLVLATSEGLV--- 830

Query: 982  QLPSGSTYDNYWPVQKIPLKA------TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1035
                     +   ++K+ +++       P +I++                 P+++ L++ 
Sbjct: 831  -------IGDVQNLEKLHIRSIHTGLDNPRRISH----------------SPVHKALAVG 867

Query: 1036 IDQEVGHQIDNHNLS--SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1093
              +    ++    +S  SV L+   T+++    +L+ D            PM  +  AL+
Sbjct: 868  CVRHTPVRVGEPEISRGSVQLYNDTTLDKLGQVVLDHDEE----------PM--AIKALS 915

Query: 1094 VRVVTLFNTTTKENETLLAIGTAYVQG-EDVAARGRVLLFSTGRNADNPQNLVTEVYSKE 1152
            VRV        +E +    +GT  +   E+ ++ GR+LL     +    ++ V    S++
Sbjct: 916  VRV-------AEEAKDCFVVGTVIIDSLENESSSGRLLLVEP--DYSRGESFVAVSASEK 966

Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTE----LNGIAFYDAPPLYVVSLNIVK--N 1206
            +KG + A+A++ G ++ A    ++++     +    L+ +   +    YVV+ N+V   N
Sbjct: 967  VKGCVYAVAAVDGLVVAAVNSAVVIYSIEADDHTRALSFVKKVEWNHNYVVA-NLVSRGN 1025

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
             +L+GD   S+  L + E+GA  N+ A+D+  L   + E L + + +    +D   N+ +
Sbjct: 1026 LLLVGDAISSVTLLQY-ERGALQNV-ARDYSPLWPTSVEMLDERNVIG---ADNDCNLFM 1080

Query: 1267 FYYAPKMSESWKGQKLLSR-AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
            F       +    +K+L R   ++ G  V KF+  ++    S    +   SD       L
Sbjct: 1081 FTL-----QDGAERKVLERNGHYYFGDMVNKFIPGEIYRALS----SFEASDIEVEPKQL 1131

Query: 1326 FGTLDGSIGCIAPL-DELTFRRLQSLQKKL 1354
            F T  GSIG +  + DEL+   + SLQ+ L
Sbjct: 1132 FFTTTGSIGVVIDMSDELSL-HMSSLQRNL 1160


>gi|452986188|gb|EME85944.1| hypothetical protein MYCFIDRAFT_59215 [Pseudocercospora fijiensis
            CIRAD86]
          Length = 1223

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 76/363 (20%), Positives = 149/363 (41%), Gaps = 36/363 (9%)

Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA-YVQ-GEDVAARGRVLLF 1132
            G  +   TI +  +E AL+   V      +K  E  LA+GT  ++Q G  V   G V ++
Sbjct: 878  GGREVTCTIELGENEAALSCACVAF---ESKNWEVYLAVGTGQHMQPGTGVQTAGYVHIY 934

Query: 1133 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1192
               ++       +  V+  + +  + AL   +G L +  G ++ ++      L   A   
Sbjct: 935  KLLKDGAE----LEFVHKTKFELPVYALMPFRGRLALGVGNELFIYDMGMKALLRKARNI 990

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
            A P  +VSL    N I+ GD+ + + +L +K    +L     D        T  ++D  T
Sbjct: 991  AVPNQIVSLESQGNRIICGDVSEGVTYLVYKPTFNRLIPFVDDTVQ-RWTTTTTMVDYET 1049

Query: 1253 LSLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLR----LQMLA 1304
             +    D+  N+ I     + S    E   G  +++   +  GA     LR     Q + 
Sbjct: 1050 AA--GGDKFGNLWIVRCPEQPSQEADEEGAGGYIMNERSYLNGAPYRLDLRAHYFCQDIP 1107

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
             S  RT    G  +     L +  L G++G + P    +++ F    +L++++    P +
Sbjct: 1108 MSMQRTALVAGGQEV----LFWSGLQGTLGILIPFVTREDVEF--FTALEQQMRTEDPPL 1161

Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            AG +   +R ++   K        ++D +LL  +  L  + + +IA +   +  +I   +
Sbjct: 1162 AGRDHLMYRSYYVPVKG-------VIDGDLLERFMGLSYDTKQKIAAEVDRSVKEIEKKV 1214

Query: 1422 NDL 1424
             ++
Sbjct: 1215 QEM 1217


>gi|281202530|gb|EFA76732.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
          Length = 933

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 46/203 (22%), Positives = 88/203 (43%), Gaps = 33/203 (16%)

Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
            +G QLN    +  S+  +   F  +G TL LV   E + +      P     ++G+ L+ 
Sbjct: 755  KGVQLNPRKVESASIHLY--RFTNNGQTLQLVYKTEVEEV------PYAISHFQGRLLVG 806

Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1341
                     +   LR+  +     +T  + G  +     L++ TL+G+IG + P    ++
Sbjct: 807  ---------IANQLRIYEMVNHISKTSLSVGGPE----VLVYATLNGTIGALVPFVSRED 853

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
            + F    SL+ ++    P + G +  ++R ++   K       +++D +L   Y  L   
Sbjct: 854  VDF--YTSLELQMRQENPPLCGRDHLAYRSYYFPVK-------NVIDGDLCEQYISLDPT 904

Query: 1402 EQLEIAHQTGTTRSQILSNLNDL 1424
            +Q  IA +   + S+IL  L DL
Sbjct: 905  KQQSIAEELSRSPSEILKKLEDL 927


>gi|154320780|ref|XP_001559706.1| hypothetical protein BC1G_01862 [Botryotinia fuckeliana B05.10]
          Length = 238

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 45/182 (24%), Positives = 73/182 (40%), Gaps = 36/182 (19%)

Query: 57  NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
           NLVV  +++++I+  +        + E+ S  +K+      RV  D G+           
Sbjct: 28  NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87

Query: 98  -------AASLELVCHYRLHGNVESL----AILSQGGADNSRRRDSIILAFEDAKISVLE 146
                     L LV  Y L G V SL     I S+ G +      +I++ F+DAK+S++E
Sbjct: 88  QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           +D    G+   S+H +E  E                + VDP  RC  +      + IL  
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201

Query: 207 SQ 208
            Q
Sbjct: 202 KQ 203


>gi|346970653|gb|EGY14105.1| hypothetical protein VDAG_00787 [Verticillium dahliae VdLs.17]
          Length = 1160

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 61/136 (44%), Gaps = 13/136 (9%)

Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVSLDSS----QELPRSSFSV 362
           L +P P    L+    V ++   YH +  + A A L    V+ ++       L +   S 
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280

Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            LD      +W + DV   LL+   G+L LLT+  DG +V  L L     +   S +  +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340

Query: 419 GNSLFFLGSRLGDSLL 434
           GN + FLGS  GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356


>gi|451818558|ref|YP_007454759.1| succinate dehydrogenase/fumarate reductase, flavoprotein subunit
            [Clostridium saccharoperbutylacetonicum N1-4(HMT)]
 gi|451784537|gb|AGF55505.1| succinate dehydrogenase/fumarate reductase, flavoprotein subunit
            [Clostridium saccharoperbutylacetonicum N1-4(HMT)]
          Length = 493

 Score = 42.4 bits (98), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 5/97 (5%)

Query: 1056 RTYTVEEYEVRILEPDRAGGPWQTRAT-IPMQSSENALTVRVVTLFNTTTK----ENETL 1110
            R Y+  E+   +++P+  G P    AT +  + +E A  + V  +F T       EN   
Sbjct: 122  RAYSDGEFTWHVVKPEGGGVPGPRAATTMTKRMTEKARELGVEIIFETPVNKIIMENGEA 181

Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1147
            + +      GE++ ARG+ ++ +TG   DNPQ +  E
Sbjct: 182  VGVIAKNKAGEEIEARGKAVILATGGFGDNPQMIKEE 218


>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
           ATCC 10500]
 gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
           ATCC 10500]
          Length = 1140

 Score = 42.4 bits (98), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 63/137 (45%), Gaps = 21/137 (15%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G   I Y   +       NN  +    S+ L  ++  V     
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKYIDDA-------NNETI----SRPLDEATIFV----- 288

Query: 368 HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSLFF 424
              W+Q D    LL+   G L  L +V D R  V+   +     +   S +  +G  + F
Sbjct: 289 --AWVQVDGQRWLLADDYGRLFFLMLVLDSRNEVEGWKIDYLGSASRASVLIYLGAGMTF 346

Query: 425 LGSRLGDSLLVQFTCGS 441
           +GS  GDS +++ + GS
Sbjct: 347 IGSHQGDSQVIRISEGS 363


>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
           FGSC A4]
          Length = 1132

 Score = 42.0 bits (97), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 65/265 (24%), Positives = 108/265 (40%), Gaps = 31/265 (11%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G    +DP GR   + +Y   ++++   Q  S   G +    +G       E    I  R
Sbjct: 117 GSRCMIDPSGRFMTLEIYDGMIVVIPIIQLPSKRRGRQVALPTGPDAPRIGELGEPIITR 176

Query: 240 DLDMKHVKDFIFVHGYI-EPVMVILHE-RELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            +D   V+   F+H     P + +L+E  +     +V    ++    A S  T++  +  
Sbjct: 177 -IDELFVRSSAFLHVQAGSPRLALLYEDNQKKVKLKVRELKYSTAAGAESEFTSIADY-- 233

Query: 298 IWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
              A  L   A  L+ VP+P+   GG+L++G  +I Y         A NN  VS    Q 
Sbjct: 234 ---AQELDLGASHLIPVPAPLAAAGGLLILGETSIKYVD-------ADNNEIVS----QP 279

Query: 355 LPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT 412
           L  ++  V        W Q D    LL+   G L  L +V     V+R +L     +   
Sbjct: 280 LEEATIFV-------AWEQVDSQRWLLADDYGRLFFLMLVLRNSEVERWELHSLGNTSRA 332

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
           S +  +G  + F+GS  GDS +++ 
Sbjct: 333 SVLVYLGGGVVFVGSHQGDSQVIRI 357


>gi|82541417|ref|XP_724950.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23479780|gb|EAA16515.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
          Length = 2227

 Score = 42.0 bits (97), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 51/219 (23%), Positives = 92/219 (42%), Gaps = 35/219 (15%)

Query: 1167 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY--FLSWKE 1224
            LL  +  KI +H+    +    AF D    Y+  + I +NFI++ D++K IY    S++E
Sbjct: 2005 LLHCTNSKIYIHEIKNNDFIKGAFLDNN-FYISDIKIFRNFIIISDLYKGIYINMYSYEE 2063

Query: 1225 Q--GAQLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
            Q    ++  ++K F   +L+  +  +++  S + ++  D   N  IF +  K        
Sbjct: 2064 QYDSRRIISISKTFYNHNLNILSCHYIVYNSNICIIAMDVYNNFFIFGHKNKQD----ID 2119

Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
             L     F+    + KF  +  L  +     A   S+            DGSI    PL+
Sbjct: 2120 NLYIYNYFNFNRRILKF--INELHQNKHSNSALSISN------------DGSIHIYHPLN 2165

Query: 1341 E---LTFRRLQSLQKKLVDSVPHVA-----GLNPRSFRQ 1371
            +   + F+ +  + KK +   P++A      L P  F Q
Sbjct: 2166 DKAFIFFKHIFKIVKKFI--FPNLALNINSDLKPDIFIQ 2202


>gi|62318969|dbj|BAD94072.1| spliceosomal - like protein [Arabidopsis thaliana]
          Length = 165

 Score = 41.6 bits (96), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 35/141 (24%), Positives = 64/141 (45%), Gaps = 26/141 (18%)

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1342
             +FHVG  VT   +  M+   S+              ++++GT+ GSIG +      D++
Sbjct: 42   VQFHVGDVVTCLQKASMIPGGSE--------------SIMYGTVMGSIGALHAFTSRDDV 87

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
             F     L+  +    P + G +  ++R       A+ P  D ++D +L   +  LP++ 
Sbjct: 88   DF--FSHLEMHMRQEYPPLRGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 138

Query: 1403 QLEIAHQTGTTRSQILSNLND 1423
            Q +IA +   T ++IL  L D
Sbjct: 139  QRKIADELDRTPAEILKKLED 159


>gi|302820387|ref|XP_002991861.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
 gi|300140399|gb|EFJ07123.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
          Length = 1292

 Score = 41.6 bits (96), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 47/216 (21%), Positives = 85/216 (39%), Gaps = 38/216 (17%)

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFS----YGLRINADASATGISKQSNYELVE----- 543
             E  Q +F   V+    NI P+ DFS    YG + +   +  G  ++ +  ++      
Sbjct: 419 KVEDGQLSFQSFVQ----NIAPILDFSLVDYYGEKQDQMFACCGGDEEGSVRIIRNGNSV 474

Query: 544 ---------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                      G  GIWT+ ++              + D YHA+ +IS    T VL    
Sbjct: 475 EKLICTPPVYQGVSGIWTMRYR--------------FKDPYHAFFLISFVEETRVLSVGL 520

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              ++T++V +  Q  T+A G L     V QV+    ++   +          SN  S +
Sbjct: 521 NFVDITDAVGFESQVNTLACG-LVEDGWVAQVWRYEVKLCSPTKAAHPAGVSGSNPLSTT 579

Query: 655 GSENSTVLSV-SIADPYVLLGMSDGSIRLLVGDPST 689
             +    +SV ++    V+L ++   + L++G   T
Sbjct: 580 WRKPGYPISVGAVCRSRVILALARPGLLLMLGATQT 615


>gi|347838030|emb|CCD52602.1| similar to DDB1B (Damaged DNA Binding protein 1 B); damaged DNA
            binding / protein binding [Botryotinia fuckeliana]
          Length = 1157

 Score = 41.6 bits (96), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 77/331 (23%), Positives = 134/331 (40%), Gaps = 57/331 (17%)

Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
            +GT+++  E+   RGR+L+F  G NAD    ++    S  LKG+   +  L G ++ A  
Sbjct: 835  VGTSFLHEEEANVRGRLLIF--GVNADRAPYMIA---SHNLKGSCRCIGVLDGKIVAALN 889

Query: 1173 PKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
              ++++ +     T   L  +A Y      +           + DI KSI  + +   GA
Sbjct: 890  KTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIA-VADIMKSIALVEYT-PGA 947

Query: 1228 -----QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
                 +L  +A+   +   F+T  + +  T + + +D   N+ +     +        ++
Sbjct: 948  DGLPDKLEEVARH--AQQVFSTS-VAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRM 1004

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL-----FGTLDGSI---G 1334
                E ++G  V +  R+ +               +T++ ALL      GT +GSI    
Sbjct: 1005 EVTCEMNLGEMVNRVKRINV---------------ETSKDALLIPRAFLGTTEGSIYLFS 1049

Query: 1335 CIAPLDELTFRRLQSLQKKL---------VDSV-PHVAGLNPRS--FRQFHSNGKAHRPG 1382
             I P ++    RLQS    L          DS  PH   L+P +  F ++ S   A R  
Sbjct: 1050 LIPPQNQDLLMRLQSRLASLPSASSIRGSSDSTSPHQIELSPGNLDFNKYRSYISATRET 1109

Query: 1383 --PDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
              P   VD EL+  +  L +E Q  +A   G
Sbjct: 1110 SEPFRFVDGELIERFLDLEVEVQEHVAEGLG 1140


>gi|313235544|emb|CBY10999.1| unnamed protein product [Oikopleura dioica]
          Length = 1185

 Score = 41.6 bits (96), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 87/428 (20%), Positives = 157/428 (36%), Gaps = 105/428 (24%)

Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE------LDAAHATWL 372
           GGV+V   N + Y            N+    D    +PR    ++      +  AHAT  
Sbjct: 246 GGVIVCAENYLIY-----------KNFGDQPDIRFPIPRRRNDLDDPERGMIIVAHATHK 294

Query: 373 QNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
              +   LL T+ GDL  +T+  +  +V  + L   +   ++S +  +     F+    G
Sbjct: 295 TRSMFFFLLQTEQGDLFKVTLETEEDIVTEIRLKYFDTVPVSSSLCVLRTGFLFVAGEFG 354

Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRR----SSSDALQDMVNGEELS 485
           +  L Q T  G           E   + E    + + LR        D+L  ++N E   
Sbjct: 355 NHNLYQITRLGEDDDEPEFSSAEPLEEGETFFFTPRGLRNLALTDEMDSLSPVLNCEVAD 414

Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
           L   A+ +T     T     R +L      +   +GL +            S   + ELP
Sbjct: 415 L---ANEDTPQLYVTCGRGPRSTL------RVLRHGLEV------------SEMAVSELP 453

Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           G    +WTV                + D ++ +Y+I+S    T+VL   + + E+T+S  
Sbjct: 454 GNPNAVWTV--------------KTSADADHDSYIIVSFVNATLVLSIGETVEEITDS-G 498

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGAR-------------------------------I 633
           +     T+++G L G   ++Q++  G R                                
Sbjct: 499 FLGTTPTLSSG-LMGEDALVQIYPEGIRHIRSDRRVNEWRAPDRKQIVRCACNRQQVVIA 557

Query: 634 LDGS---YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
           L G    Y   D + G  N  +      S ++++ + D         ++ +G+SDG++R+
Sbjct: 558 LTGGEIVYFEMDPT-GQLNEYTERREFGSEIIALDVGDVPAGEQRCRFLAVGLSDGTVRI 616

Query: 683 LVGDPSTC 690
           +  DP+ C
Sbjct: 617 ISLDPNDC 624


>gi|340502545|gb|EGR29224.1| hypothetical protein IMG5_160230 [Ichthyophthirius multifiliis]
          Length = 315

 Score = 41.2 bits (95), Expect = 4.7,   Method: Composition-based stats.
 Identities = 39/175 (22%), Positives = 80/175 (45%), Gaps = 17/175 (9%)

Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD---CFATEFLIDGSTLSLVVSDEQ 1261
            K + L+GDI K +      E+  ++  + ++  +++   C    F+ + +TL  +V+DE 
Sbjct: 93   KQYFLVGDIQKGMQLYEMDERENKVKQIGEENANINIRQCLLF-FIQNTNTLRALVADEY 151

Query: 1262 KNIQIFYYAPKMS-ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
            KN+  +    +      K  ++   A FH+G+ V K +        +D+       D   
Sbjct: 152  KNLYAYSLIQQQDLNEKKNVQMELIANFHLGSKVNKII--------TDQKQIIEKGDNQQ 203

Query: 1321 RFA---LLFGTLDGSIGCIAPLDELTFRRLQ-SLQKKLVDSVPHVAGLNPRSFRQ 1371
            + A   +L  + DG+I  +  + E     L   LQK + + +P++  L+ R +R+
Sbjct: 204  QEAVSHILLLSQDGNISVLKLIYEQGENTLLFDLQKTIYEELPYIGSLDYREYRE 258


>gi|154295205|ref|XP_001548039.1| pre-mRNA splicing factor 3b [Botryotinia fuckeliana B05.10]
          Length = 1020

 Score = 41.2 bits (95), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 129/600 (21%), Positives = 226/600 (37%), Gaps = 94/600 (15%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + + G + ++A     G++    +D II+  +  +I+++EF  + +      +  F    
Sbjct: 62  HDVFGIIRAIAAFRLAGSN----KDYIIITSDSGRITIVEFVPAQNKFNRLHLETFG--- 114

Query: 167 WLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
               K G      G  + VDP+GR      V    L  ++ + SQ        E T  S 
Sbjct: 115 ----KSGVRRVVPGQYLAVDPKGRACLTASVEKNKLVYVLNRNSQA-------ELTISSP 163

Query: 224 GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHH 279
               A    + V  L  LD+          GY  PV   L     E +    G+ ++   
Sbjct: 164 --LEAHKAQTLVFALVALDV----------GYANPVFAALEIDYGESDQDPTGQ-AYDEI 210

Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHY-HSQS 335
              +    +   L      WS   +   A  L  VP       GVLV G + I Y HS  
Sbjct: 211 EKQLVYYELDLGLNHVVRKWSE-PVDRTANILFQVPGGTDGPSGVLVCGEDNITYRHSNQ 269

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV-- 391
            +  +A+     + +  Q        V  +L  A   +      LL T  GDL  +T+  
Sbjct: 270 EAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF----FLLQTDDGDLFKITIEM 325

Query: 392 VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
           V D        V+RL +   +   + + +  + +   F+ S  G+    QF         
Sbjct: 326 VEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEFGNHQFYQFEKLGDDDEE 385

Query: 447 SSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSF 503
           +  + ++F  G  E+  P     R + + +L + ++     +    +N T E A + +S 
Sbjct: 386 TEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMDCKVANLTDEDAPQIYSI 445

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHN 562
               +      LK   +GL ++    +            ELPG    +WT   K +RG  
Sbjct: 446 CGTGARSTFRTLK---HGLEVSEIVES------------ELPGVPSAVWTT--KLTRG-- 486

Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
                     D Y AY+I+S    T+VL   + + EVT++  +     T+A   L G   
Sbjct: 487 ----------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GFLSSAPTLAVQQL-GEDS 534

Query: 623 VIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD-PYVLLGM-SDGSI 680
           +IQV  +G R +   +   + +  P +    + + N   ++V+++    V   M SDGS+
Sbjct: 535 LIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSL 593


>gi|284028391|ref|YP_003378322.1| NAD+ synthetase [Kribbella flavida DSM 17836]
 gi|283807684|gb|ADB29523.1| NAD+ synthetase [Kribbella flavida DSM 17836]
          Length = 271

 Score = 41.2 bits (95), Expect = 4.9,   Method: Composition-based stats.
 Identities = 38/137 (27%), Positives = 59/137 (43%), Gaps = 9/137 (6%)

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
            L  +  DE   + I      M ++ K   L  R EFH G    +   +   A +  R G 
Sbjct: 100  LDFIRPDETLTVDIKDSTDAMVDAMKHVGLGDRVEFHAGNVKARERMIAQYAVAGVRGGL 159

Query: 1313 APGSDKTNRFALLFGTLDGSIGC-IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
              G+D      + F T  G   C + PL  LT RR++++ ++L  S P + G  P     
Sbjct: 160  VVGTDHAAEAVMGFYTKWGDGACDVTPLSGLTKRRVRAIGERLGAS-PEITGKVPT---- 214

Query: 1372 FHSNGKAHRPG-PDSIV 1387
              ++ ++ RPG PD  V
Sbjct: 215  --ADLESDRPGIPDETV 229


>gi|406602265|emb|CCH46158.1| Pre-mRNA-splicing factor [Wickerhamomyces ciferrii]
          Length = 1123

 Score = 40.8 bits (94), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 100/255 (39%), Gaps = 81/255 (31%)

Query: 493 NTESAQKTFSFA-VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE--LPG-CK 548
           N ++  K +S + V+DS      LK   YGL IN              E+VE  LPG   
Sbjct: 406 NDDAFTKIYSLSGVKDS----SSLKILQYGLSIN--------------EIVESDLPGIAN 447

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            +WT                   +DE+  YL+IS    T+VL   + + E+T+S    + 
Sbjct: 448 KVWTTKLNK--------------NDEFDKYLVISFMDTTLVLSIGENVEEITDS-GLALN 492

Query: 609 GRTIAAGNLFGRRRVIQVFERGAR------------------ILDGSYMTQDLSFGPSNS 650
             TI    + G   ++Q+   G R                  IL  S   + ++ G SN 
Sbjct: 493 EETIGIQQI-GINSLVQIHSNGIRNIKNGELINEWQPPAGIKILTTSTTNRQIAIGLSND 551

Query: 651 E---------------SGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDP 687
           E               +      S ++S+S+ D        P++++G  D +IR+L  DP
Sbjct: 552 ELVYFEVDDRDRLIEYNERKELTSRIVSLSLGDIPEGRLRSPFLIVGCQDSTIRVLSTDP 611

Query: 688 STC--TVSVQTPAAI 700
            +    +S+Q  ++I
Sbjct: 612 GSTLELLSLQALSSI 626


>gi|171691144|ref|XP_001910497.1| hypothetical protein [Podospora anserina S mat+]
 gi|170945520|emb|CAP71632.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1158

 Score = 40.8 bits (94), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 43/188 (22%), Positives = 72/188 (38%), Gaps = 53/188 (28%)

Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA 341
           +I    +   +K+H    +    PH           +GGV+VVG   + Y          
Sbjct: 233 LIPVRKVEEEVKRHNFRNTGSAKPH-----------LGGVIVVGETRLLY---------- 271

Query: 342 LNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
                       ++ +++   +LD A     W + +V    L+   G L LLT+  DG  
Sbjct: 272 ----------IDDVTKATVESKLDKASIFVKWAEYNVQTYFLADDYGSLHLLTINTDGAE 321

Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
           V+ + L+K   +   S++  +GN + F+ S  GDS L Q                   D+
Sbjct: 322 VKGMVLTKIGVTSRASELVYLGNEMLFVASHHGDSRLFQL------------------DL 363

Query: 458 EADAPSTK 465
            AD P+ K
Sbjct: 364 SADKPADK 371


>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
          Length = 1504

 Score = 40.8 bits (94), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 21/135 (15%)

Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD 365
           H+        + +GG+LVVG   + Y   +  C +             E+P  + S+   
Sbjct: 562 HNVRNTATATANLGGLLVVGETRLLYIDSTTKCTV-------------EVPLRAASI--- 605

Query: 366 AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN-SL 422
                W + D    LL+ + G L LLT++  G VV  LD+S    +   S +  + +  L
Sbjct: 606 --FVAWARYDATHYLLADEYGTLHLLTILVSGAVVDNLDVSPIGKTSRASCLVYLPDRRL 663

Query: 423 FFLGSRLGDSLLVQF 437
            F+GS  GDS L + 
Sbjct: 664 LFVGSHNGDSQLFRL 678


>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
           phaseolina MS6]
          Length = 1129

 Score = 40.4 bits (93), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 63/262 (24%), Positives = 102/262 (38%), Gaps = 32/262 (12%)

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           +DP GR   + +Y G+  ++    +G     GD +    G    +RIE   V +   L  
Sbjct: 121 LDPTGRFMTLELYEGIVTVVPLTEKGKRK--GDPEVSALGEPVPSRIEEMFVRSSAFLHR 178

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
           K  +         +P++ +L+E +     R+  +      +     +     P+      
Sbjct: 179 KSPESE-------KPLVALLYEEDEDSKIRLRLRQLAFQTAGTEEQSVAALEPVEGLKEE 231

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
           L   A  L+ VP P  GVLV+G   I Y           N+Y  +L   + L  S+  V 
Sbjct: 232 LDLGASHLIPVPGPCYGVLVLGETCITY----------FNDYTKAL-VKKPLQDSTIFV- 279

Query: 364 LDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGR--VVQRLDLSKTNPSVLTSDITTIG 419
                  W Q  N   LL+   G L L  ++ D    VV+   L K   +   S +  + 
Sbjct: 280 ------AWEQIDNQRFLLADDFGGLYLFMLLLDDNSGVVEGWRLDKIGETSRASVLVYLD 333

Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
               F+GS  GDS +++ T GS
Sbjct: 334 AGHVFVGSHEGDSQVIRITEGS 355


>gi|341893349|gb|EGT49284.1| hypothetical protein CAEBREN_30765 [Caenorhabditis brenneri]
          Length = 213

 Score = 40.4 bits (93), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 39/200 (19%), Positives = 82/200 (41%), Gaps = 15/200 (7%)

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            +S+  LS++        +AKD+ S      EF+   S L          +++    P   
Sbjct: 2    RSVSLLSYRTLEGNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPVTD 61

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGS 1332
            +   G+ +L    +     +TK +   +L    D        D + R+   +++GT  GS
Sbjct: 62   D---GRYVLEPTGYWYLGELTKVMIRAVLVPQPD--------DNSIRYTQPIMYGTNQGS 110

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            +G +  +D++  + L S++K + D+  +   +   ++R F  N +   P     +D +L+
Sbjct: 111  LGLVVQIDDMYKKFLLSIEKAISDAEKNCMQIEHSTYRSFTYNKRIEPPS--GFIDGDLI 168

Query: 1393 SHYEMLPLEEQLEIAHQTGT 1412
                 +     +EI  +  T
Sbjct: 169  ESILDMDRSRAIEILEKANT 188


>gi|402083318|gb|EJT78336.1| hypothetical protein GGTG_03437 [Gaeumannomyces graminis var. tritici
            R3-111a-1]
          Length = 1155

 Score = 40.4 bits (93), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 80/373 (21%), Positives = 147/373 (39%), Gaps = 62/373 (16%)

Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-- 1124
            +L+P   G P++      +   E   TV    L +T  +  E  + +GT ++  E++   
Sbjct: 800  VLQP--LGNPFE------LNEGEVVETVIRAQLRDTFGRLAERFI-VGTRFLVDENLVPG 850

Query: 1125 --ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW-- 1180
              ++GRVL+F         Q     + S  LK     LA ++  +++A    +++ ++  
Sbjct: 851  SNSKGRVLVFGVDEERSPFQ-----IVSHPLKSGCRRLAVMEEMIVVALTKTVVVARYEE 905

Query: 1181 ---TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW--------------K 1223
               T  +L  +A Y     Y + + +    I +GDI KS+  + +               
Sbjct: 906  LTSTSGKLIKVASYQTTS-YAIDVAVEGRLIAVGDIMKSMSLVEFVPPTTVAGDGKAGET 964

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            ++ AQL  + + + S    A       S L    +D   N+ +      +  +  G  L 
Sbjct: 965  KKPAQLIEVCRHYQSSWSTAVAHFEGESWLE---ADADGNVMV------LGRNTTGVTLE 1015

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGA-APGSDKTNRFALLFGTLDGSI---GCIAP- 1338
             R    + + +     +  +   S  TG  AP   K         T +GSI   G IAP 
Sbjct: 1016 DRRRMEITSEINLGENINRIQKISVETGPNAPIHPKA-----FLSTTEGSIYLVGAIAPQ 1070

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + +L    L +LQ +L D V  +  +  ++FR F  N +    GP   +D E +  +  +
Sbjct: 1071 MRDL----LLNLQDRLEDYVGTLGNIPFKNFRSFR-NAEREADGPVRFIDGEYIERFLDM 1125

Query: 1399 PLEEQLEIAHQTG 1411
              E Q ++    G
Sbjct: 1126 NEETQSQVCRDLG 1138


>gi|380481704|emb|CCF41690.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
          Length = 932

 Score = 40.4 bits (93), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 14/144 (9%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS-----QELPRSSFS 361
           D Y  + +P PI     V      YH +  + A A     + +  +       L R+   
Sbjct: 179 DPYARIVIPVPI-----VEDEVKRYHKRDTTGAKAQLGGLIVVGETLLVYVDTLTRTVVE 233

Query: 362 VELD--AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
             L+  A    W   D     LS   G+L LLT+  +G VV  L L     +   S +  
Sbjct: 234 SGLNSPAIFVAWAAYDDTNYFLSDDYGNLHLLTIETEGVVVTNLSLRLLGVTSRASCLVH 293

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +GN L FLGS  GDS L+Q    S
Sbjct: 294 MGNGLLFLGSHYGDSQLLQINMES 317


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.393 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 22,122,138,541
Number of Sequences: 23463169
Number of extensions: 940356149
Number of successful extensions: 2150685
Number of sequences better than 100.0: 997
Number of HSP's better than 100.0 without gapping: 461
Number of HSP's successfully gapped in prelim test: 536
Number of HSP's that attempted gapping in prelim test: 2144292
Number of HSP's gapped (non-prelim): 3192
length of query: 1431
length of database: 8,064,228,071
effective HSP length: 156
effective length of query: 1275
effective length of database: 8,698,941,003
effective search space: 11091149778825
effective search space used: 11091149778825
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)