BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 000548
         (1431 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9FGR0|CPSF1_ARATH Cleavage and polyadenylation specificity factor subunit 1
            OS=Arabidopsis thaliana GN=CPSF160 PE=1 SV=2
          Length = 1442

 Score = 2249 bits (5829), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1092/1460 (74%), Positives = 1254/1460 (85%), Gaps = 47/1460 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
            MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TAAN++E+Y+VR QEEG ++E +N    KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR +S D  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
             +  EELSL+GS  NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 538  NYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYELV                          ELPGCKGIWTVYHKSSRGHNADSS+MAA 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600  EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D  DGGP DQGD
Sbjct: 720  VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D  + E     E E+N +SE+ T
Sbjct: 780  IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
                 + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836  S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S+    ++++  +S+LRNL+F R PLD  TRE T  G   QRIT+FKNISGHQGFFLSGS
Sbjct: 892  SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
            YWPVQKIPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+DNHN+SS
Sbjct: 1012 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSS 1071

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
             DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T ENETLL
Sbjct: 1072 DDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLL 1131

Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
            A+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+S
Sbjct: 1132 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1191

Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
            GPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQG+QL+L
Sbjct: 1192 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSL 1251

Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
            LAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLSRAEFHVG
Sbjct: 1252 LAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVG 1311

Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
            AHV+KFLRLQM+++         G+DK NRFALLFGTLDGS GCIAPLDE+TFRRLQSLQ
Sbjct: 1312 AHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQ 1362

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
            KKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQLE+AHQ G
Sbjct: 1363 KKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIG 1422

Query: 1412 TTRSQILSNLNDLALGTSFL 1431
            TTR  IL +L DL++GTSFL
Sbjct: 1423 TTRYSILKDLVDLSVGTSFL 1442


>sp|Q7XWP1|CPSF1_ORYSJ Probable cleavage and polyadenylation specificity factor subunit 1
            OS=Oryza sativa subsp. japonica GN=Os04g0252200 PE=3 SV=2
          Length = 1441

 Score = 1873 bits (4852), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 943/1472 (64%), Positives = 1121/1472 (76%), Gaps = 72/1472 (4%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
            MS+AAYKMMHWPTG+ +C +GF+THS +D                ++DS   + R   +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 54   PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
            P PNLVV AANV+E+Y VR +        G++ S +SG      ++DGIS A LELVC+Y
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RLHGN+ES+ +LS G A+N  RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116  RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
             HLKRGRESFA GP++K DP GRCG  L YGLQMIILKA+Q G  LVG+++   +    +
Sbjct: 173  QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+  KHHTCMISA S
Sbjct: 233  VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++ 
Sbjct: 293  ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
              D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+ 
Sbjct: 353  HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S +T+IGNS FFLGSRLGDSLLVQF+  +  S+L     E   DIE D P +KRL
Sbjct: 413  ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472

Query: 468  RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
            +R  SD LQD+ + EELS     A N+ ESAQK  S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473  KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531

Query: 527  DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
            D +A G +KQSNYEL                          VELP C+GIWTVY+KS RG
Sbjct: 532  DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591

Query: 561  HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              A+       D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ  TIAAGNLFGR
Sbjct: 592  QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
            RRVIQV+ +GAR+LDGS+MTQ+L+F  +++   S SE   V   SIADPYVLL M DGS+
Sbjct: 645  RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LL+GD  TCT+SV  P+   SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704  QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
            G      DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG   +VD + +   +DS 
Sbjct: 764  GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             E    ++      +KE   S+++VELAM RWS   SRPFLF +L DGT+LCY A+ +E 
Sbjct: 824  KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
             E+  K   P+S   S    N S SRLRNLRF R  +D  +RE+ P  G P  RIT F N
Sbjct: 881  SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938  VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997

Query: 980  ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
            ICQLPS   YD+YWPVQK+PL  TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQE
Sbjct: 998  ICQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQE 1057

Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
              H +DN   S+  LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ  ENALTVR+VTL
Sbjct: 1058 SVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVTL 1117

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1159
             NTTTKENETLLAIGTAYV GEDVAARGRVLLFS  + ++N QNLVTEVYSKE KGA+SA
Sbjct: 1118 HNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVSA 1176

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            +ASLQGHLLIASGPKI L+KWTG EL  +AFYDA PL+VVSLNIVKNF+L GDIHKSIYF
Sbjct: 1177 VASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIYF 1235

Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
            LSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD  KN+QIFYYAPKM ESWKG
Sbjct: 1236 LSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKG 1295

Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
            QKLLSRAEFHVGAH+TKFLRLQML T          S+KTNRFALLFG LDG IGCIAP+
Sbjct: 1296 QKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTNRFALLFGNLDGGIGCIAPI 1349

Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
            DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL  YEML 
Sbjct: 1350 DELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLCSYEMLS 1409

Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
            L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1410 LDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 1441


>sp|Q9V726|CPSF1_DROME Cleavage and polyadenylation specificity factor subunit 1
            OS=Drosophila melanogaster GN=Cpsf160 PE=1 SV=1
          Length = 1455

 Score =  516 bits (1329), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 420/1486 (28%), Positives = 698/1486 (46%), Gaps = 190/1486 (12%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E  ++E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430  RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 649  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+   +G   + 
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 828

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D    E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 829  DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R +   N+   +  ++       D     E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+K+PL+ TP Q+ Y  E  +Y LI      
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1054

Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
            +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    A
Sbjct: 1055 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1103

Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
            +I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++        
Sbjct: 1104 SITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1163

Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
            P   +T+   KE+     KG +SA++ + G L+   G KI + +    +L G+AF D   
Sbjct: 1164 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-N 1222

Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
            +YV  +  VK+ I + D++KSI  L ++E+   L+L ++DF  L+ +  EF++D S L  
Sbjct: 1223 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282

Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
            +V+D ++NI ++ Y P+  ES  GQKLL +A++H+G  V    R+Q       +      
Sbjct: 1283 LVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1341

Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
                N+  +++GTLDG++G   PL E  +RR   LQ  L+    H+ GLNP+ +R   S+
Sbjct: 1342 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSS 1399

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
             K        I+D +L+  Y ++   E+ E+A + GT   +IL +L
Sbjct: 1400 KKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445


>sp|Q10569|CPSF1_BOVIN Cleavage and polyadenylation specificity factor subunit 1 OS=Bos
            taurus GN=CPSF1 PE=1 SV=1
          Length = 1444

 Score =  338 bits (868), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 217/683 (31%), Positives = 343/683 (50%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 785  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 841  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 891  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 950

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 951  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1010

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +V  +      G + +   +  
Sbjct: 1011 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIER 1063

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             + +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1064 DERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1119

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1120 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1179

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1180 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1238

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1239 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1298

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1299 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1351

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1352 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYL 1411

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1412 STMERGELAKKIGTTPDIILDDL 1434



 Score =  304 bits (779), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 221/678 (32%), Positives = 345/678 (50%), Gaps = 86/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T  +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 309

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 369

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  D E      KR+
Sbjct: 370 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRV 427

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 428 DATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 484 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 543

Query: 557 SSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
             +         G   +     A DD   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 544 VRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFA 602

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++
Sbjct: 603 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAV 652

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 653 ADPYVVIMSAEGHVTMFL 670


>sp|Q9EPU4|CPSF1_MOUSE Cleavage and polyadenylation specificity factor subunit 1 OS=Mus
            musculus GN=Cpsf1 PE=1 SV=1
          Length = 1441

 Score =  338 bits (868), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 218/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
             WPV+KIPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1060

Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1061 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1116

Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
              +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL   
Sbjct: 1117 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1176

Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
             GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L ++
Sbjct: 1177 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1235

Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
            E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +LL
Sbjct: 1236 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1295

Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
             RA+FHVGAHV  F R       +   GAA G  K      N+    F TLDG IG + P
Sbjct: 1296 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1348

Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
            + E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y  L
Sbjct: 1349 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1408

Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
               E+ E+A + GTT   IL +L
Sbjct: 1409 STMERSELAKKIGTTPDIILDDL 1431



 Score =  307 bits (787), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667


>sp|Q10570|CPSF1_HUMAN Cleavage and polyadenylation specificity factor subunit 1 OS=Homo
            sapiens GN=CPSF1 PE=1 SV=2
          Length = 1443

 Score =  334 bits (856), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 215/685 (31%), Positives = 340/685 (49%), Gaps = 51/685 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 784  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 840  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 890  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 950  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009

Query: 992  YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
             WPV+KIPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   +
Sbjct: 1010 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1060

Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
               + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T   
Sbjct: 1061 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1116

Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
             +  +A GT  +QGE+V  RGR+L+         P   +T+     +Y KE KG ++AL 
Sbjct: 1117 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1176

Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
               GHL+ A G KI L     +EL G+AF D   LY+  +  VKNFIL  D+ KSI  L 
Sbjct: 1177 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1235

Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
            ++E+   L+L+++D   L+ ++ +F++D + L  +VSD  +N+ ++ Y P+  ES+ G +
Sbjct: 1236 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1295

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
            LL RA+FHVGAHV  F R       +   GA  G  K      N+    F TLDG IG +
Sbjct: 1296 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1348

Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
             P+ E T+RRL  LQ  L   +PH AGLNPR+FR  H + +  +    +++D ELL+ Y 
Sbjct: 1349 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1408

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
             L   E+ E+A + GTT   IL +L
Sbjct: 1409 YLSTMERSELAKKIGTTPDIILDDL 1433



 Score =  305 bits (780), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                 +    G   + S     DD+   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669


>sp|Q7SEY2|CFT1_NEUCR Protein cft-1 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A /
            CBS 708.71 / DSM 1257 / FGSC 987) GN=cft-1 PE=3 SV=2
          Length = 1456

 Score =  294 bits (752), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 345/1418 (24%), Positives = 594/1418 (41%), Gaps = 189/1418 (13%)

Query: 94   DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
            D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96   DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154  LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
            L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156  LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214  VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
              D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216  DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255  YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
            Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276  YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315  PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
            P+P+GG L+VGAN  IH      S  +A+N       S   + ++   + L+      L 
Sbjct: 334  PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393

Query: 374  NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
             ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394  AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453

Query: 425  LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
            +GS  GDS+L+ +T   G +      ++    ++              D   D + GEE 
Sbjct: 454  VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485  SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
            +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508  ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567

Query: 530  ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
               A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568  LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574  ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
                +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628  DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624  IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
            +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+ +
Sbjct: 688  LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSVFI 742

Query: 683  LVGDPSTCTVSVQTPA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG 741
                P    +        I +S K ++ C LY D          ++  +    VG+    
Sbjct: 743  AEMSPKLLELEEVEKEDQILTSTKWLAGC-LYTD----------TSGVFADETVGKGT-- 789

Query: 742  ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
                   + +I   +   SG L I+ +P+      V + +S        Y+   L     
Sbjct: 790  -------KDNILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPPGLS---- 830

Query: 802  EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
              + ++ +GT    KE++  + V +L        H  P+L     +  +  YQ Y  +  
Sbjct: 831  -ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-- 880

Query: 862  ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIF 917
               + +  P S S       +   ++ N  F++ P +    ++ PH A    P +R +  
Sbjct: 881  ---ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS-- 928

Query: 918  KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
             NISG+   FL GS P + +   +       L    + A +  H   C HGFIY  + GI
Sbjct: 929  -NISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGI 987

Query: 978  LKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
             ++ Q+P+ S+Y      V+KIP+      + Y      Y  +V    ++P      L  
Sbjct: 988  ARVTQIPTDSSYAELGLSVKKIPIGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPK 1041

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            D +   +    N++   +     V+   +++L    +G  W    T+ M+  E  L V  
Sbjct: 1042 DDDYHKEWARENITFKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVET 1092

Query: 1097 VTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK- 1154
            + L  + +T E + L+A+GTA ++GED+  RGRV +F        P    T   SK+LK 
Sbjct: 1093 LNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKL 1149

Query: 1155 --------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLN 1202
                    GA++AL+ +  QG +L+A G K ++   K  GT L  +AF D    YV S+ 
Sbjct: 1150 VAKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVK 1207

Query: 1203 IVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
             +      L+ D  K ++F  + E+  ++ L  K    ++    +FL DG  L +V SD 
Sbjct: 1208 ELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDA 1267

Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
              +I I  + P+  +S +G  LL R  F+ GAH      L + A   + +  +  S++ +
Sbjct: 1268 DGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENS 1326

Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA-- 1378
               LL  +  G +  + PL E  +RRL SL  +L + +PH AGLNP+ +R    +  A  
Sbjct: 1327 PHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASM 1386

Query: 1379 HRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTG 1411
              PG D+     IVD ++L  +  L   ++ E+A + G
Sbjct: 1387 QLPGVDAGIGRNIVDGKILERFLELGTGKRQEMAGRAG 1424


>sp|O74733|CFT1_SCHPO Protein cft1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
            GN=cft1 PE=3 SV=1
          Length = 1441

 Score =  291 bits (745), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 336/1443 (23%), Positives = 598/1443 (41%), Gaps = 205/1443 (14%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV   ++ G +  ++ L   G++     D +I+  + AK+S LE+D         S+H
Sbjct: 92   LRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTNSLH 148

Query: 161  CFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
             +E      +K      +  P  + VDP   C  +L +   M+ +        L  +E  
Sbjct: 149  YYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDMEEAA 202

Query: 220  F-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
               S    S   + S V+    LD  +  + D  F++GY EP + IL+  E T    +  
Sbjct: 203  IENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVTLPL 262

Query: 277  KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-HSQS 335
            +  T + S +++    +   +I +  +LP+D Y  +++P+P+GG L++G N + Y  S  
Sbjct: 263  RKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSAG 322

Query: 336  ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDLVLLT 390
             +  + +N+Y           +S F++EL+   A  L     +    +L   +G    L 
Sbjct: 323  RTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFFYLD 382

Query: 391  VVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGT 443
             + DG+ V+ L L     + N   L S IT     G +L FLGS+  DS L++++     
Sbjct: 383  FLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWS----- 437

Query: 444  SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
                    EE    E D      L  ++   + DM++  E      +             
Sbjct: 438  ---RRTTNEEVRLDEGD----DTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLRL 490

Query: 504  AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY---ELV------------------ 542
             + D L NIGP+ DF+ G      A +     Q N+   ELV                  
Sbjct: 491  EICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAGADGAGGLVVFRRNI 545

Query: 543  --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETA 593
                    +  GC+ +WTV   S +  N  S   A Y + E   YL++S E  + +    
Sbjct: 546  FPLIAGEFQFDGCEALWTV-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIFLAG 604

Query: 594  DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSES 652
            +   EV  S D+    +T+  G+L    R++Q+     R+ D +  +TQ  +F       
Sbjct: 605  ETFDEVQHS-DFSKDSKTLNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNF------- 656

Query: 653  GSGSENSTVLSVSIADPYVLLGMSDGSI----------RLLVGDPSTCTVSVQTPAAIES 702
               S+   V+S SI DP +++    G I          RL+  D       V+T A++ S
Sbjct: 657  ---SKKQIVVSTSICDPCIIVVFLGGGIALYKMDLKSQRLIKTDLQNRLSDVKT-ASLVS 712

Query: 703  SKKPVSSCTLY----------------HDKGPEPWL-----RKTSTDAWLSTGVGEAIDG 741
                     L+                +D   E  L      KTS +  +  G  +++  
Sbjct: 713  PDSSALFAKLFTYNETLNAKGQIANGMNDSASETDLDIQPNHKTSNNDQM--GYDQSV-S 769

Query: 742  ADGGP--------------LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
            AD  P              LDQ  +          + G L+++++ +F+ +   D F   
Sbjct: 770  ADDVPEVDNTIVTEKNVSNLDQESLEKHPILFALTDEGKLKVYNLADFSLLMECDVFDLP 829

Query: 784  RTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
             T      +   ++   T  N  S             S ++VEL +         P LF 
Sbjct: 830  PT------LFNGMESERTYFNKES-------------SQELVELLVADLGDDFKEPHLFL 870

Query: 844  ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
                  I  Y+A+L+    NT K  + ++ ++   V   + +R        TP DA +  
Sbjct: 871  RSRLNEITVYKAFLYS---NTDKHKNLLAFAK---VPQETMTREFQANVG-TPRDAESTM 923

Query: 904  ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
            E    +     ++T  + +  H   F++G +P   +       +  P   +  I++    
Sbjct: 924  EKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSVAPF 983

Query: 961  HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIV 1020
            H  +   G+IYV     ++IC+      YDN WP +K+ L    + I Y   K +Y +  
Sbjct: 984  HAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKKVSLGKQINGIAYHPTKMVYAVGS 1043

Query: 1021 SVPVLKPL-----NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG 1075
            +VP+   +     N+  ++  D +    +   N  S+DL    T                
Sbjct: 1044 AVPIEFKVTDEDGNEPYAITDDNDY---LPMANTGSLDLVSPLT---------------- 1084

Query: 1076 PWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
             W    +   Q  E  L+V +V L  + TTK  +  +A+GT+  +GED+A RG   LF  
Sbjct: 1085 -WTVIDSYEFQQFEIPLSVALVNLEVSETTKLRKPYIAVGTSITKGEDIAVRGSTYLFEI 1143

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGI 1188
                  P     ++ +  V  +E+KG ++ +  + G+LL   G K+I+      + L G+
Sbjct: 1144 IDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLSGQGQKVIVRALEDEDHLVGV 1203

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            +F D    Y +S   ++N +L GD+ +++ F+ + E+  ++ L +K   +L+  A +FL+
Sbjct: 1204 SFIDLGS-YTLSAKCLRNLLLFGDVRQNVTFVGFAEEPYRMTLFSKGQEALNVSAADFLV 1262

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
             G  L  VV+D   N+++  Y P+  ES  G++L++R +FH+G  +T    + +L     
Sbjct: 1263 QGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDFHIGNVITA---MTILPKEKK 1319

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
               A  G D  + F+ +    DG +  + P+ +  +RRL  +Q  L + V  + GLNP+S
Sbjct: 1320 HQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLNIIQNYLANRVNTIGGLNPKS 1379

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
            +R   S      P    I+D  L+ ++  + +  + E+AH+ G   S I+++L +L    
Sbjct: 1380 YRLITSPSNLTNPT-RRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDEAL 1438

Query: 1429 SFL 1431
            S++
Sbjct: 1439 SYM 1441


>sp|Q2TZ19|CFT1_ASPOR Protein cft1 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
            GN=cft1 PE=3 SV=1
          Length = 1393

 Score =  290 bits (741), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 337/1401 (24%), Positives = 590/1401 (42%), Gaps = 208/1401 (14%)

Query: 131  DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
            ++I+LAF +AK++++E+D   +G+   S+H +E  +        +  + G ++ VDP  R
Sbjct: 88   EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147

Query: 191  CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
            C  V  +G++ + IL   Q G  LV D+      +  GS G                 A 
Sbjct: 148  CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206

Query: 230  IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              SS V+ L  LD  + H     F++ Y EP   IL+ +  T    +  +      +  +
Sbjct: 207  YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
            +    +    + S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++
Sbjct: 267  LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326

Query: 347  VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
              + S     +S  ++ L+      L   N   LL   TG++VL+    DGR V  + + 
Sbjct: 327  RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386

Query: 405  KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
               P           S    +G+   FLGS   DS+L+      G S+ SSG K+   + 
Sbjct: 387  PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440

Query: 455  GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
               E D+       +S  D  +D +     E+ + G   +        ++F   D L+NI
Sbjct: 441  RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500

Query: 513  GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
            GPLKD ++G    +          S  ELV                            +L
Sbjct: 501  GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560

Query: 545  PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
              C  +WT    +S  H  ++   +   A + E H Y+++S +A +   E +++     +
Sbjct: 561  ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613

Query: 602  SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
             +  F        +  TI  G L G+ RV+Q+     R  DG     DL      P   E
Sbjct: 614  ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
              S  E    +S S+ DPYV +   D ++ LL  D S     V+    I +SK   +SC 
Sbjct: 669  DTS--EERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724

Query: 712  LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
            LY DK                TG+  +I  A    L Q  +   +  +   L I+ +P+ 
Sbjct: 725  LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767

Query: 772  NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
              +      + G   +      E  K S T              +E +  + V +L    
Sbjct: 768  KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808

Query: 832  WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
            WS   S P+L        +  Y+ ++      T    +P +    L  +N+   R+    
Sbjct: 809  WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857

Query: 892  FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
             +    D  + EE     P   + I  NISG    F  G  P + +           L  
Sbjct: 858  -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGI------LKICQLPSGSTYDNYWP--VQKIPLKAT 1003
            G   + +      C  GFI + S+ +      L  C L   +   +Y+P  +Q+IP+   
Sbjct: 914  GYAQSLSKFQTSECGEGFILLDSKVLCFILLCLTYCILSFHTGCHSYYPWTIQQIPIGEQ 973

Query: 1004 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY 1063
               + Y +   +Y +  S            L  D E+  +  N   S         V+  
Sbjct: 974  VDHLAYSSSSGMYVIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRS 1022

Query: 1064 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGED 1122
             ++++ P      W    T+    +E+ + V+ ++L  +  T E + ++ +GTA+ +GED
Sbjct: 1023 SLKVVSPKT----W----TVIDSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGED 1074

Query: 1123 VAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPKI 1175
            +A+RG V +F   +   +P+    +     V  + +KGA++AL+ +  QG L++A G K 
Sbjct: 1075 IASRGCVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKC 1134

Query: 1176 ILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQ 1228
            I+   K  G+ L  +AF D        +++VK        ++ D  K ++F  + E+  +
Sbjct: 1135 IVRGLKEDGSLLP-VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYK 1189

Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
            ++L AKD   L+  A +FL DG+ L ++V+D   N+ +  Y P+  +S  G +LLSR++F
Sbjct: 1190 MSLFAKDLDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKF 1249

Query: 1289 HVGAHVTKFLRLQMLATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
            H G  ++    L   + SS++      A     K  R  +L  + +GS+G +  + E ++
Sbjct: 1250 HTGNFISTLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESY 1309

Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
            RRL +LQ +L +++ H  GLNPR+FR   S+G A R     ++D +LL  +  +  + ++
Sbjct: 1310 RRLSALQSQLTNTIEHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRKV 1365

Query: 1405 EIAHQTGTTRSQILSNLNDLA 1425
            EIA + G    +I ++   ++
Sbjct: 1366 EIASRVGANEWEIKADFEAIS 1386


>sp|Q5BDG7|CFT1_EMENI Protein cft1 OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 /
            CBS 112.46 / NRRL 194 / M139) GN=cft1 PE=3 SV=1
          Length = 1339

 Score =  276 bits (706), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 352/1456 (24%), Positives = 603/1456 (41%), Gaps = 234/1456 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V   ++++I+ +R        S ++ +T+ R          L L   Y+L G V  +
Sbjct: 28   NLIVARTSLLQIFSLR------DVSLSALDTEVRPAQHRQETCKLVLEREYQLPGTVTDI 81

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + ++ G D      ++++AF DAK+S++E+D   +GL   S+H +E  +      
Sbjct: 82   CRVKILKTKSGGD------AVLVAFRDAKLSLVEWDPERYGLSTISIHYYERDDMTRSPW 135

Query: 173  GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE- 231
              +    G ++  DP  RC         + I+   Q G  LV D+  FGS   +  R+E 
Sbjct: 136  ASDLSTCGSILSADPGSRCAIFQFGARSLAIIPFHQPGDDLVMDD--FGSEPDYENRVEG 193

Query: 232  --------------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
                                SS V+ L  LD  + H     F++ Y EP   IL+ +  T
Sbjct: 194  NSRSHEAKDKDAAEYQTPYASSFVLPLTALDPSVIHPISLAFLYEYREPTFGILYSQVAT 253

Query: 270  WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
                +  +      + +++    +    + S   LP D +K++A+P P+GG L++G+N  
Sbjct: 254  SHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVALPPPVGGSLLIGSNEL 313

Query: 329  IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
            +H      + A+ +N ++    S     +S  ++ L+        +D    LL+  TG  
Sbjct: 314  VHIDQAGKTNAVGVNEFSRQASSFSMTDQSDLALRLENCVVERFSDDNGDLLLALSTGVF 373

Query: 387  VLLTVVYDGRVVQRLD---LSKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCG 440
             L++   DGR V  +    LS  +   L S  ++   +GN   F GS   DS+L+     
Sbjct: 374  ALVSFKLDGRSVSGISVRPLSGPSKEFLASTASSSAFLGNGKVFFGSESADSVLL----- 428

Query: 441  SGTSMLSSGLKEEF-GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
             G S  SS  K+ F G    D         S  DA +D +     +       N  S   
Sbjct: 429  -GWSSASSATKKSFSGSTSND--------ESEDDAYEDDLYSSAPAAMTDNPQNQPSNSS 479

Query: 500  TFSFA---VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVY 554
              +F    + D L + GP++D   G    A +  T   K    ELV   G    G   + 
Sbjct: 480  VAAFGDLRIHDRLSSPGPIRDIVLGRSSEASSRDT---KDGVLELVAAQGSDEGGTMVIM 536

Query: 555  HK--------SSRGHNADS----SRMAAYDDEYHAYLIISL-------EARTMVLETADL 595
             +        S     A+S    S +   +D+   Y+I+S        E+   VLE  D 
Sbjct: 537  KREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVILSKQEKPDKEESEVFVLE--DK 594

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            L  +T          T+  G L  + RVIQV     R  D  +   D             
Sbjct: 595  LRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVRSYDAVWDEDD------------- 641

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
            S+    ++ ++ DPY+ +   D ++ LL  D S                  +   TL  D
Sbjct: 642  SDERVAVNATLVDPYLAIIRDDSTLLLLQADDS----------------GDLDEVTLSED 685

Query: 716  KGPEPWLRKT--STDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
               + WL     S +A   T    +I                +  +   L ++ +P+F  
Sbjct: 686  VVSQKWLSACFYSDNAGFFTAPFASI--------------LFLLNQDHQLYVYRLPDF-A 730

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
            V +V + V     I+ T   E  K S T              +EN+  + VVEL      
Sbjct: 731  VISVIEGVGCLPPILST---EPPKRSTT--------------RENVLQIAVVELG----D 769

Query: 834  AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
            ++ S PFL     +  ++ Y+ +     E T          R L  +N +  +  N    
Sbjct: 770  SYSSLPFLILRTENDDLVVYKPFFTNSKELTGL--------RFLKEANHTLPKTPNTT-- 819

Query: 894  RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLC 950
                D    E  P       + I  NI+G    F+ G  P    +FR      P   +L 
Sbjct: 820  ----DELQSEMKP-------LRILPNIAGCSSIFMPG--PSAGFIFRAS-TTSPHFIRLR 865

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYF 1010
             G I       +   + GF Y+ S G L + +LP G+     W ++ +P+     ++TY 
Sbjct: 866  GGFIKGLGCFDS--PDKGFAYLDSHG-LHLAKLPEGTQLGYPWIMRTVPIGQQIDKLTYV 922

Query: 1011 AEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
            +  + Y       VL    +    L  D E+  +  N  +S +       V +  ++++ 
Sbjct: 923  SASDTY-------VLGTCQRCEFRLPEDDELHPEWRNEEISFLP-----EVNQSSLKVVS 970

Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
            P      W    + P++ +E+ + ++ ++L  +  T E   ++ +GT+  +GED+ +RG 
Sbjct: 971  PKT----WSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGC 1026

Query: 1129 VLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--K 1179
            + +F       +P+    N   ++  KE +KGA++AL+ +  QG L+ A G K ++   K
Sbjct: 1027 IYVFEVIEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRGLK 1086

Query: 1180 WTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
              G+ L  +AF D    +V  +  +K     + GD  K ++F  + E+  +++L AKD  
Sbjct: 1087 EDGSLLP-VAFMDMQ-CFVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKDLD 1144

Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
             L+  A +FL DG+ L +VV+D   N+ +  Y P+   S  G KLL+R++FH G   +  
Sbjct: 1145 YLEVLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFASTV 1204

Query: 1298 LRLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
              L     SS+R  A  GSDK +         +L  + +GSIG +  + E ++RRL +LQ
Sbjct: 1205 TLLPRTLVSSER--AMSGSDKMDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSALQ 1262

Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             +L +++ H  GLNPR++R   S+  A R     ++D  LL  Y  +  + + EIA + G
Sbjct: 1263 SQLTNTLEHPCGLNPRAYRAVESDASAGR----GMLDSNLLLQYLDMSKQRKAEIAGRVG 1318

Query: 1412 TTRSQILSNLNDLALG 1427
             T  +I ++L  ++ G
Sbjct: 1319 ATEWEIRADLEAISGG 1334


>sp|A2R919|CFT1_ASPNC Protein cft1 OS=Aspergillus niger (strain CBS 513.88 / FGSC A1513)
            GN=cft1 PE=3 SV=1
          Length = 1383

 Score =  271 bits (693), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 331/1464 (22%), Positives = 618/1464 (42%), Gaps = 210/1464 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +L+V   ++++IY +  +     E  ++ +   ++L++            Y L G V  L
Sbjct: 28   DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++AF +AK+S++E+D    G+   S+H +E  +      
Sbjct: 77   CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130

Query: 173  GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
              +    G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS         
Sbjct: 131  VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187

Query: 223  ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
               GGG           +      S V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 188  HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247

Query: 267  ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
              T +  +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+
Sbjct: 248  VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307

Query: 327  NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
            N  +H      + A+ +N ++  + S     +S  ++ L+      L +     LL   T
Sbjct: 308  NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367

Query: 384  GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
            G++ ++    DGR V  + +         + I       T IG+   FLGS  GDS+L+ 
Sbjct: 368  GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427

Query: 437  FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
            ++  S ++       ++  D  AD     +   S  D  +D +     + +L G   +  
Sbjct: 428  YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484

Query: 495  ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
             SA   + F + D L+NIGPL+D + G R++ +   TG    S    +++   +G     
Sbjct: 485  SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544

Query: 551  -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
               V  +    H   S  + + D  + A L    EA             R  V+ T    
Sbjct: 545  GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604

Query: 597  TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
            ++  ES+ + V G                 TI+ G    R+RV+QV +   R  D    +
Sbjct: 605  SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664

Query: 640  TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
            TQ       ++     ++    +S S+AD  + +   D ++  L  D S     V     
Sbjct: 665  TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719

Query: 700  IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
            + S K    SC LY DK                TG+  +ID     P+ + D++  +   
Sbjct: 720  VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760

Query: 760  SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
               L ++ V +   + ++ +   G + ++                 SSE     G +EN+
Sbjct: 761  DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
                V +L  + WSA    P+L        ++ Y+ ++             VST     +
Sbjct: 803  IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
             ++  S+  N    R P    + + +      + + I  +ISG    F+ G+   + +  
Sbjct: 846  HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFII-- 903

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
                         S   F  L   N        +    ++ C+LP  + +D  W ++++ 
Sbjct: 904  ---------RTSASAPHFLRLRGEN--------SRSSTVRFCKLPPMTRFDYQWTLKRVH 946

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHR-T 1057
            L      + Y     +Y       VL   +     L  D E+  +  N  +S     R +
Sbjct: 947  LGEQVDHLAYSTSSGMY-------VLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGS 999

Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
            +    ++  +   D     +    +  + + E  + ++ ++L  +  T E + ++ +GTA
Sbjct: 1000 FIKLVWDHHLQRQDSVILIFHLH-SFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTA 1058

Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLI 1169
            + +GED+ +RG + +F   +   +P +  T+     +  + +KGA++AL+ +  QG +L+
Sbjct: 1059 FARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLV 1118

Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1225
            A G K ++   K  G+ L  +AF D    YV  +  +K     +LGD  K ++F  + E+
Sbjct: 1119 AQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGVWFAGYSEE 1176

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
              +++L AKD   L+  A EFL DG  L +VV+D   NI +  Y P+  +S  G +LLSR
Sbjct: 1177 PYKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSR 1236

Query: 1286 AEFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LFGTLDGSIGCIAPLDE 1341
            ++FH+G   +    L     SS++   ++ G D  N+  L   L  T +GS+G I  + E
Sbjct: 1237 SKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPE 1296

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
             ++RRL +LQ +L +++ H  GLNPR+FR   S+G A R     ++D  LL  +  +  +
Sbjct: 1297 ESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSKQ 1352

Query: 1402 EQLEIAHQTGTTRSQILSNLNDLA 1425
             + EIA + G    +I ++L  ++
Sbjct: 1353 RKTEIAGRVGAREWEIKADLEAIS 1376


>sp|Q6C740|CFT1_YARLI Protein CFT1 OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=CFT1
            PE=3 SV=1
          Length = 1269

 Score =  218 bits (554), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 311/1370 (22%), Positives = 548/1370 (40%), Gaps = 215/1370 (15%)

Query: 98   AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
            A  LEL+  Y L G V  +  +     DN    DS+ ++ + AK  ++ ++ S   +   
Sbjct: 51   APRLELITEYYLDGTVTGVTRIKT--IDN-YDLDSLYISVKHAKAVIVAWNASSFTIDTK 107

Query: 158  SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
            S+H +E  + L      E       V  +       +L    +M  L   + G   + D+
Sbjct: 108  SLHYYE--KGLVESNFFEPECSSVAVSDEANSFYTCLLFQNDRMAFLPIIEKG---LDDD 162

Query: 218  DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
            +   SG  F    + S ++    LD  +++V D  F+H Y E  M IL + +  W G  +
Sbjct: 163  EMPESGQVF----DPSFIVKASRLDKRIENVMDICFLHEYRETTMGILFQPKRAWVGMKN 218

Query: 276  WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
                T   + +S+    K   +I +   LP DA K++ +P+P+GG L++ ANTI Y   S
Sbjct: 219  ILKDTVSYAIVSVDVHQKNSTVIGTLNGLPVDAQKVIPLPAPLGGSLIICANTILYIDSS 278

Query: 336  ASCALALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQN--DVALLSTKTGDLVLLTV 391
            AS    + N     +S   + R  S+  + L+ A   ++Q   + ALL T+ G    L  
Sbjct: 279  ASYTGVMVNNTHRQNSDLIVSRDQSTLDLRLEGAEVCFIQELGNTALLVTEDGQFFSLLF 338

Query: 392  VYDGRVVQRLDLSKTNPS--VLT--SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
              DGR V  L+L    P   +L+  S +    +   FLGSR GDSLLV++  G   S   
Sbjct: 339  NKDGRRVASLELRPIEPDNFILSQPSSVAAGPDGTIFLGSRAGDSLLVKWYHGEPESQPE 398

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVR 506
              L                          D  N  +  LYG  +  TE +  +     + 
Sbjct: 399  ETL--------------------------DDGNESDDDLYGGDTAQTEDTTNRPLKLRLA 432

Query: 507  DSLVNIGPLKDFSYGLRINAD----ASATGISKQSNYELV--------------ELPGCK 548
            D ++ +GP++  + G    +      + TG+   S   ++              ++PG +
Sbjct: 433  DRMLGMGPMQSLALGKNRGSQGVEFVTTTGVGANSALAILTSALMPYKRKSLYKDMPGGQ 492

Query: 549  GIWTVYHK-SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
              W+V  +    G  A S       D  ++YL     A   V+E   L T+  ++  +FV
Sbjct: 493  -FWSVPVRFEEEGEVAKSRTYVVSSDSENSYLYYVDAAG--VIEDVSLSTKKKKTKKHFV 549

Query: 608  QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
               T    +      ++QV      I D                  S  + +T +   + 
Sbjct: 550  SNVTTIFSSSMLDSALLQVCLETVNIYDAKI---------GQPHKYSLPQGTTAVEARVL 600

Query: 668  DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
              YVL+ +SDG +++L        VS+     +++++  +   +     G        +T
Sbjct: 601  GNYVLVLLSDGQVKILEA------VSINKRPFLKAAQVSIEPASESKAIG------IYAT 648

Query: 728  DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
            D+ L+ G         G P        VVCY  G+L             +    S    I
Sbjct: 649  DSSLTFGAPSKKRTRQGSPAQDSRPVVVVCYADGSL------------LLQGLNSDDRLI 696

Query: 788  VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
            +D           ++++   +E  GQ        +++V++A+      H       +LT 
Sbjct: 697  LDA----------SDLSGFIKEKDGQLYDA---PLELVDIALSPLGDDHILRDYLVLLTP 743

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
              ++ Y+ Y +                               LRF +  L     E TP 
Sbjct: 744  QQLVVYEPYHYND----------------------------KLRFRKIFL-----ERTPT 770

Query: 908  GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD----GSIVAFTVLHNV 963
                +R+T    I+G     ++G       +  + L   P+L +       VAFT     
Sbjct: 771  INSDRRLTQVPLINGKHTLGVTGET---AYILVKTLHTSPRLIEFGETKGAVAFT----- 822

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL-KATPHQITYFAEKNLYPLIVSV 1022
            + +  F Y+T  G +  C+     + +  WPV+ + L   T  ++TY    ++Y      
Sbjct: 823  SWDGKFAYLTQAGEVAECRFDPSFSLETNWPVKHVQLCGETISKVTYHETMDVY------ 876

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSV--DLHRTYTVEEYEVRILEPDRAGGPWQTR 1080
             V+     V  ++ D+      D+  + S+  D+    T +   +RI+ P      W   
Sbjct: 877  -VIATHKTVPHVVRDE------DDEVIESLTPDIMPATTYQG-AIRIVNP----YSWTVI 924

Query: 1081 ATIPMQ-SSENALTVRVVTLFNTTTK-ENETLLAIGTAYVQGEDVAARGRVLLFSTGR-- 1136
             +   +  +E AL    V L  +  K +   ++A+GT+ ++GED+AARG + LF      
Sbjct: 925  DSYEFEMPAEAALCCESVKLSISDRKSQKREVVAVGTSILRGEDLAARGALYLFDVIEIV 984

Query: 1137 -NADNPQN--LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-ELNGIAFYD 1192
               + P+    + ++    ++GA +A+  + G LL   G K+++        L  +AF D
Sbjct: 985  PEKERPETNRRLKKLVQDRVRGAFTAVCEVSGRLLAVQGQKLLVQALQDDLTLVPVAFLD 1044

Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
                YV     + + +LLGD  +S+ F+ +     Q+   A+D   +     +F I+G  
Sbjct: 1045 MQ-TYVAVAKSLNSMLLLGDATRSVQFVGFSMDPYQMIPFARDLQRVLVTTCDFAIEGEN 1103

Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
            L+ VV+D QK + I  Y P   +S+ G +LL R+ F+ G  +             D +  
Sbjct: 1104 LTFVVADLQKRLHILEYDPDDPQSYSGARLLRRSVFYSGKVI-------------DSSAM 1150

Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
             P ++  +RF ++    DGS+  + P  E  +RRL ++Q ++ D   HV GL+PR++R  
Sbjct: 1151 VPINE--DRFMVIGVCSDGSVTDVVPCPEDAYRRLYAIQTQITDKEAHVCGLHPRAYRYD 1208

Query: 1373 H----SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
                 +    HRP    I+D   L  +  LP  +Q   A++ G    Q++
Sbjct: 1209 PILPGTGNSPHRP----ILDGHTLIRFANLPRNKQNVYANRLGQRYQQLI 1254


>sp|A8XPU7|CPSF1_CAEBR Probable cleavage and polyadenylation specificity factor subunit 1
            OS=Caenorhabditis briggsae GN=cpsf-1 PE=3 SV=1
          Length = 1454

 Score =  213 bits (542), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 193/748 (25%), Positives = 344/748 (45%), Gaps = 83/748 (11%)

Query: 723  RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
            ++   DA +S+  GE  D      +D    YS      VV +++G + I  +P+   V+ 
Sbjct: 737  KRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLVVAHDNGRITIHSLPDLELVYQ 791

Query: 777  VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT--------GQGRKENIHSM------ 822
            + +F +    +VD  + E  K+ + +  ++ E+           +  ++ ++S       
Sbjct: 792  IGRFSNVPELLVDMTVEEEEKEKKAKQTAAQEKEKETEKKKDDAKNEEDQVNSEMKKLCE 851

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
            KVVE  +     + + P L AI+ D  ++ Y+ +    P+        V+  +   +  +
Sbjct: 852  KVVEAQIVGMGINQAHPVLIAII-DEEVVLYEMFASYNPQPGHLG---VAFRKLPHLIGL 907

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE 941
              S   N+   R P +     E  HG     I  F+ IS  + G  + G+ P   +V+  
Sbjct: 908  RTSPYVNIDGKRAPFEM----EMEHGKRYTLIHPFERISSINNGVMIGGAVPTL-LVYGA 962

Query: 942  --RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKI 998
               ++ H    DGSI AFT  +N N  HGF+Y+T Q   L+I ++     YD  +PV+KI
Sbjct: 963  WGGMQTHQMTIDGSIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYDMPYPVKKI 1022

Query: 999  PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHR 1056
             +  T H + Y    ++Y ++ SVP  KP N++  ++ D  QE  H+ D + +  +    
Sbjct: 1023 EVGKTVHNVRYLMNSDIYAVVSSVP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPP 1078

Query: 1057 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN------ETL 1110
             YT+  +              Q  A +P    E      V  + +   K        +T 
Sbjct: 1079 KYTLNLFSS------------QDWAAVPNTEFEFEDMEAVTAMEDVPLKSESRYGGLDTY 1126

Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQG 1165
            LA+ T    GE+V  RGR++L         P        +  +Y KE KG ++ L ++ G
Sbjct: 1127 LALATVNNYGEEVLVRGRIILCEVIEVVPEPGQPTSNRKIKVLYDKEQKGPVTGLCAING 1186

Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
             LL   G K+ + ++   +L GI+F D    YV  L+ ++   L  D  +S+  + ++E+
Sbjct: 1187 LLLSGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSIRTIALALDARESMSLIRFQEE 1245

Query: 1226 GAQLNLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
               +++ ++D     C     A+EFL+DG  +  ++SDE  NI +F Y+P+  ES  G++
Sbjct: 1246 NKAMSIASRD--DRKCAQAPMASEFLVDGMHIGFLLSDEHGNITLFSYSPEAPESNGGER 1303

Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
            L  +A  ++G ++  FLR++   +  D +      +   R   +FG+LDGS G I PL E
Sbjct: 1304 LTVKAAINIGTNINAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGSFGYIRPLTE 1363

Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYE 1396
             ++RRL  LQ  +    P +AGL+ +  R     Q   NG+  R    +++D +++  Y 
Sbjct: 1364 KSYRRLHFLQTFIGSVTPQIAGLHIKGARSSKPSQPIVNGRNAR----NLIDGDVVEQYL 1419

Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             L + ++ ++A + G  R  IL +L  L
Sbjct: 1420 HLSVYDKTDLARRLGVGRYHILDDLMQL 1447



 Score =  188 bits (478), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 150/576 (26%), Positives = 266/576 (46%), Gaps = 81/576 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++ F+DAK+S++  ++    ++  S+H FE+    +L+ G  ++   P+V+ DP  
Sbjct: 92  QDSILMTFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFTTYFNPPIVRTDPAN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +    ++                  S++I L+ +D  + +V 
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLEGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
              LL++P P+GG +V G+NTI Y +Q+   C + LN+     D   + P        + 
Sbjct: 251 CNSLLSIPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + ++ GDL LL +V    G  V+ L+ SK   + +   +T     
Sbjct: 308 LDCSTSVYMEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG 481
             F+GSRLGDS L+++T                  ++    S K+ R    +  +  ++ 
Sbjct: 368 HLFVGSRLGDSQLLEYTL-----------------LKVTKESAKKQRLEQQNPSEIELDE 410

Query: 482 EELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
           +++ LYG A     +++ E   ++  F   D L+N+GP+K   +G R N  ++    +K+
Sbjct: 411 DDIELYGGAIEMQQNDDDEQISESLQFRELDRLLNVGPVKSMCFG-RPNYMSNDLIDAKR 469

Query: 537 SN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLIISL 583
            +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+S 
Sbjct: 470 KDPVFDLVTASGHGKNGALCVHQRSMRPEIITSSLLEGAEQLWAVGRKENESHKYLIVS- 528

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMTQD 642
             R+ ++          E   +     T+AAG L      +QV     A + DG  M Q+
Sbjct: 529 RVRSTLILELGEELVELEEQLFVTNEPTVAAGELLQGALAVQVTSTCIALVTDGQQM-QE 587

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           +              N  V+  SI DPYV +   +G
Sbjct: 588 VHI----------DSNFPVVQASIVDPYVAVLTQNG 613


>sp|Q1E5B0|CFT1_COCIM Protein CFT1 OS=Coccidioides immitis (strain RS) GN=CFT1 PE=3 SV=1
          Length = 1387

 Score =  208 bits (530), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 160/559 (28%), Positives = 275/559 (49%), Gaps = 49/559 (8%)

Query: 891  RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
            RF  +P  AY     PH    + +  + +I G++  F+SGS PC+ M          +L 
Sbjct: 860  RFDPSP-KAYM----PHS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLR 911

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYF 1010
              ++ + +  H   C  GF YV +  ++++C+LPS + +DN W  +K+ +      + YF
Sbjct: 912  GEAVSSLSSFHIPACEKGFAYVDASNMVRMCRLPSNTRFDNSWVTRKVHVGDQIDCVEYF 971

Query: 1011 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP 1070
            A   +Y L  S  V   L +      D E+  +  +  +S +       +E   +++L P
Sbjct: 972  AHSEIYALGSSHKVDFKLPE------DDEIHPEWRSEVISFMP-----QLERGCIKLLSP 1020

Query: 1071 DRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRV 1129
                  W    +  +  +E  + ++ + +  +  T E + +L +GTA V+GED+  RG +
Sbjct: 1021 RT----WSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSI 1076

Query: 1130 LLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KW 1180
             +F     A +P     N   ++++K+ +KGA++A++ +  QG L++A G K ++   K 
Sbjct: 1077 YVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKE 1136

Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
             G+ L  +AF D    YV  L  ++     ++GD  K I+F  + E+  +L L  KD   
Sbjct: 1137 DGSLLP-VAFMDMQ-CYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEY 1194

Query: 1239 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1298
            L   A +FL DG  L ++V+D+   I +  Y P+   S KG +LL R+ FH G H T  +
Sbjct: 1195 LQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTG-HFTSTM 1253

Query: 1299 RLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
             L +   SS  +   P  D  +       + +L  + +GSIG + PL E ++RRL +LQ 
Sbjct: 1254 TL-LPEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQS 1312

Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
            +LV S+ H  GLNP+++R   S+G   R     IVD  LL  +  + ++ + EIA + G 
Sbjct: 1313 QLVTSMEHPCGLNPKAYRAVESDGFGGR----GIVDGNLLLRWLDMGVQRKAEIAGRVGA 1368

Query: 1413 TRSQILSNLNDLALGTSFL 1431
                I  +L  ++ G  FL
Sbjct: 1369 DIESIRVDLETISGGLDFL 1387



 Score =  113 bits (282), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 172/731 (23%), Positives = 294/731 (40%), Gaps = 94/731 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     G+    N+ +  R   ++      L LV  Y L G +  L
Sbjct: 28  NLIVAKTSILQVFSLVNVAYGTSAPPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    ++++++  +AK+S++E+D   HG+   S+H +E  E +H       
Sbjct: 85  GRVKI--LDSRSGGEALLVSTRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE-----DTFGSGGG---- 225
               P L+ VDP  RC  +L +G+  + IL   Q G  LV DE     D    G      
Sbjct: 142 LRLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDEDLDEKPEGASNIPA 200

Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                     +     SS V+ L  LD  + H     F++ Y EP   IL+    T +  
Sbjct: 201 QAAVANDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +  +      +  ++    +    + +   LP D +K++ +P PIGG L++G+N  IH  
Sbjct: 261 LHDRKDIVSYAVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L  D    LL    G + +L 
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380

Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           +  DGR V     +L   K   S+L +  +   ++G    F GS   DSLL+ ++  S  
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGWSRPS-Q 439

Query: 444 SMLSSGLK---EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
           SM    ++   + FG  +              D     VN   LS   S +N     +  
Sbjct: 440 SMRKPKVESADDVFG--DHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495

Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
           F F   D L N+GP+ D + G               R +AD        + N        
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555

Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                   + +++    G+W++   +      DS+        Y  YL+ S      + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610

Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
           ++V        E  ++ ++   +  T+  G L G  RV+QV +   R  D +     +  
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669

Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
            P   E    S+  +V+S S A+PYVL+   D S+ LL  D S     V     I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725

Query: 706 PVSSCTLYHDK 716
            +S C LY DK
Sbjct: 726 WLSGC-LYLDK 735


>sp|Q9N4C2|CPSF1_CAEEL Probable cleavage and polyadenylation specificity factor subunit 1
            OS=Caenorhabditis elegans GN=cpsf-1 PE=3 SV=2
          Length = 1454

 Score =  190 bits (482), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 176/705 (24%), Positives = 323/705 (45%), Gaps = 69/705 (9%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD-TYMREALKDSETEINSSSEEGTGQ 813
            +V +E+G L I  +P    V+ + +F +    +VD T   E  +       ++ E     
Sbjct: 777  IVSHENGRLSIHSLPEMEVVYQIGRFSNVPELLVDLTVEEEEKERKAKAQQAAKEASVPT 836

Query: 814  GRKENIHSM------KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
               E +++       +V+E  +     + + P L AI+ D  ++ Y+ +          S
Sbjct: 837  DEAEQLNTEMKQLCERVLEAQIVGMGINQAHPILMAIV-DEQVVLYEMF---------SS 886

Query: 868  DDPVSTSRSLSVSNV-------SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
             +P+     +S   +       ++S L N    R P +     +  +G     I  F+ +
Sbjct: 887  SNPIPGHLGISFRKLPHFICLRTSSHL-NSDGKRAPFEM----KINNGKRFSLIHPFERV 941

Query: 921  SG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QG 976
            S  + G  + G+ P   +V+     ++ H    DG I AFT  +N N  HG +Y+T  + 
Sbjct: 942  SSVNNGVMIVGAVPTL-LVYGAWGGMQTHQMTVDGPIKAFTPFNNENVLHGIVYMTQHKS 1000

Query: 977  ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
             L+I ++     Y+  +PV+KI +  T H + Y    ++Y ++ S+P  KP N++  ++ 
Sbjct: 1001 ELRIARMHPDFDYEMPYPVKKIEVGRTIHHVRYLMNSDVYAVVSSIP--KPSNKIWVVMN 1058

Query: 1037 D--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            D  QE  H+ D + +  +     YT+  +  +    D A  P      I  +  E     
Sbjct: 1059 DDKQEEIHEKDENFV--LPAPPKYTLNLFSSQ----DWAAVP---NTEISFEDMEAVTAC 1109

Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1148
              V L + +T    ETLLA+GT    GE+V  RGR++L         P    +      +
Sbjct: 1110 EDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIEVVPEPDQPTSNRKIKVL 1169

Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
            + KE KG ++ L ++ G LL   G K+ + ++   +L GI+F D    YV  L+ ++   
Sbjct: 1170 FDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSLRTIA 1228

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNI 1264
            +  D  +S+  + ++E    +++ ++D     C     A++ ++DG+ +  ++SDE  NI
Sbjct: 1229 IACDARESMSLIRFQEDNKAMSIASRD--DRKCAQPPMASQLVVDGAHVGFLLSDETGNI 1286

Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
             +F YAP+  ES  G++L  RA  ++G ++  F+RL+   +               R   
Sbjct: 1287 TMFNYAPEAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNEDEKEAIEQRMTT 1346

Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAH 1379
            +F +LDGS G + PL E ++RRL  LQ  +    P +AGL+ +  R     Q   NG+  
Sbjct: 1347 VFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRNA 1406

Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            R    +++D +++  Y  L L ++ ++A + G  R  I+ +L  L
Sbjct: 1407 R----NLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQL 1447



 Score =  186 bits (473), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 162/589 (27%), Positives = 273/589 (46%), Gaps = 84/589 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++ F+DAK+S++  ++    ++  S+H FE+    +L+ G  +  + PLV+ DP  
Sbjct: 92  QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +                  S RI S +VI L+ +D  + ++ 
Sbjct: 149 RCAACLVYGKHIAILPFHEN-----------------SKRIHS-YVIPLKQIDPRLDNIA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSS---FSVE 363
             +LL +P P+GG LV G+NT+ Y +Q+   C L LN+     D   + P        + 
Sbjct: 251 CSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS---CYDGFTKFPLKDLKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + ++ GDL LL ++    G  V+ L+ SK   + +   +T     
Sbjct: 308 LDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD--ALQDMV 479
             F+GSRLGDS L+++T    T                   + KRL+  + D  A +  +
Sbjct: 368 HLFVGSRLGDSQLLEYTLLKTTRDC----------------AVKRLKIDNKDPAAAEIEL 411

Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + +++ LYG A     +++ E   ++  F   D L N+GP+K    G R N  ++    +
Sbjct: 412 DEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVGPVKSMCVG-RPNYMSNDLVDA 470

Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
           K+ +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+
Sbjct: 471 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLEGAEQLWAVGRKENESHKYLIV 530

Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
           S   R+ ++          E   +     T+AAG L      +QV     A + DG  M 
Sbjct: 531 S-RVRSTLILELGEELVELEEQLFVTGEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 588

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDP 687
           Q++              N  V+  SI DPYV L   +G + L  LV +P
Sbjct: 589 QEVHI----------DSNFPVIQASIVDPYVALLTQNGRLLLYELVMEP 627


>sp|Q4WCL1|CFT1_ASPFU Protein cft1 OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 /
            CBS 101355 / FGSC A1100) GN=cft1 PE=3 SV=2
          Length = 1401

 Score =  189 bits (479), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 152/538 (28%), Positives = 260/538 (48%), Gaps = 40/538 (7%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
            + I  NIS     F+ G RP   ++   +   H     G  V   +   L + + + GFI
Sbjct: 884  LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942

Query: 971  YVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1030
            YV S+ +L+IC+ PS + +D  W ++KI +      + Y      Y L  S       + 
Sbjct: 943  YVDSKDVLRICRFPSETLFDYTWALRKISIGEQVDHLAYATSSETYVLGTS------HSA 996

Query: 1031 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1090
               L  D E+     N  L    L     + +  ++++ P      W    +  +   E 
Sbjct: 997  DFKLPDDDELHPDWRNEGLVISFLPE---LRQCSLKVVSPRT----WTVIDSYSLGPDEY 1049

Query: 1091 ALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
             + V+ + L  +  T E   ++ +GTA+ +GED+ +RG + +F   +   +P+   T+  
Sbjct: 1050 VMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIYVFEVIKVVPDPEKPETDRK 1109

Query: 1148 --VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVS 1200
              +  KEL KGA++AL+ +  QG L+ A G K ++   K  G+ L  +AF D    YV  
Sbjct: 1110 LKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVNV 1167

Query: 1201 LNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            L  +K     ++GD  K ++F  + E+  +++L  KD G L+  A EFL DG  L ++V+
Sbjct: 1168 LKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLFILVA 1227

Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS-- 1316
            D   N+ +  Y P+  +S  G +LL+R++FH+G   T    L     SS++  A P S  
Sbjct: 1228 DSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSME 1287

Query: 1317 --DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
               +T    +L  +  GS+G +  + E ++RRL +LQ +L +S+ H  GLNPR++R   S
Sbjct: 1288 IDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVES 1347

Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTSFL 1431
            +G A R     ++D  LL  +  +    ++EIA + G    +I ++L  + A G  +L
Sbjct: 1348 DGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLGYL 1401



 Score =  136 bits (342), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)

Query: 57  NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV   +V++I+ +++VQ     E+  +   +     D +    L L   Y L G V  
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83

Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
           +     + S+ G +      +++LAF +AK+S++E+D   HG+   S+H +E  +     
Sbjct: 84  ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
              +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D+  F          
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196

Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
               G G  ++          SS V+ L  LD  + H     F++ Y EP   IL+ +  
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
           T    +S +  +   +  ++    +    + S   LP D +K++A+P P+GG L++G+N 
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316

Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
            +H      + A+ +N +A  + +   + +S  ++ L+      + +     LL   +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376

Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
           +VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
                       +    ++  D        +S  DA + D+   E E    G   +   +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492

Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
               + F   D L NIGPL+D + G   +   +   + K +  EL EL   +G       
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551

Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
                                 +WT       G     D  ++   + EY  Y+I+S + 
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607

Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
            T   ET+++    T+ +  F           TI  G L  ++RV+QV     R    SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663

Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
              D+  G     P   E    S+    +S S+ADPY+ +   D ++ +L  D S     
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718

Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
           V+   A  + K    SC LY DK 
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740


>sp|A1DB13|CFT1_NEOFI Protein cft1 OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 /
            FGSC A1164 / NRRL 181) GN=cft1 PE=3 SV=1
          Length = 1400

 Score =  187 bits (474), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 150/542 (27%), Positives = 260/542 (47%), Gaps = 50/542 (9%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRER----LRVHPQLCDGSIVAFTVLHNVNCNHGF 969
            + I  NIS     F+ G    + +   +      R+  +   G  ++   L + + + GF
Sbjct: 885  LRILPNISDLSAVFMPGPSASFILKTAKSCPHVFRLRGEFVRG--LSIFDLASPSLDKGF 942

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
            IYV S+ +L+IC+ PS + +D  W ++KI +      + Y      Y L  S       +
Sbjct: 943  IYVDSKDVLRICRFPSETLFDYTWALRKIGIGEQVDHLAYATSSETYVLGTS------HS 996

Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
                L  D E+     N  +S +   R  +++    R          W    +  +  +E
Sbjct: 997  ADFKLPDDDELHPDWRNEVISFLPELRQCSLKVVSPRT---------WTVIDSYSLGPAE 1047

Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE- 1147
              + V+ + L  +  T E   ++ +GTA+  GED+ +RG + +F   +   +P+   T+ 
Sbjct: 1048 YVMAVKNMDLEVSENTHERRNMIVVGTAFAWGEDIPSRGCIYVFEVIKVVPDPEKPETDR 1107

Query: 1148 ---VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1199
               +  KEL KGA++AL+ +  QG L+ A G K ++   K  G+ L  +AF D    YV 
Sbjct: 1108 KLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYV- 1164

Query: 1200 SLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
              N+VK        ++GD  K ++F  + E+  +++L  KD G L+  A EFL DG  L 
Sbjct: 1165 --NVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLF 1222

Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1314
            ++V+D   N+ +  Y P+  +S  G +LL+R++FH+G   T    L     SS++  A P
Sbjct: 1223 ILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMADP 1282

Query: 1315 GS----DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
             S     +T    +L  +  GS+G +  + E ++RRL +LQ +L +S+ H  GLNPR++R
Sbjct: 1283 DSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLTNSLEHPCGLNPRAYR 1342

Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTS 1429
               S+G A R     ++D  LL  +  +    ++EIA + G    +I ++L  + A G  
Sbjct: 1343 AVESDGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLG 1398

Query: 1430 FL 1431
            +L
Sbjct: 1399 YL 1400



 Score =  131 bits (330), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 181/751 (24%), Positives = 310/751 (41%), Gaps = 127/751 (16%)

Query: 57  NLVVTAANVIEIY-VVRVQEE---GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           NLVV   +V++I+ +++VQ     G+ E K++         D +    L L   Y L G 
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHLRGGTIEGKSARP-------DRVETTKLVLEREYPLSGT 80

Query: 113 VESLA---ILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
           V  +    IL+   GG       ++++LAF +AK+S++E+D   HG+   S+H +E  + 
Sbjct: 81  VVDICRVKILNPKSGG-------EALLLAFRNAKLSLVEWDPERHGISTLSIHYYERDDL 133

Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGD-------EDT 219
                  +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D       +D 
Sbjct: 134 TRSPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDD 192

Query: 220 FGS-----GGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
           F       G    ++          SS V+ L  LD  + H     F++ Y EP   +L+
Sbjct: 193 FNQVSDHVGNDLKSKDRTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGVLY 252

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
            +  T    +  +  +   +  ++    +    + S   LP D +K++A+P P+GG L++
Sbjct: 253 SQIATSHALLPERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLI 312

Query: 325 GANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
           G+N  +H      + A+ +N +A  + +   + +S  ++ L+      L +     LL  
Sbjct: 313 GSNELVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHLSDSTGDLLLVL 372

Query: 382 KTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLL 434
            +G++VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L
Sbjct: 373 SSGNMVLVHFQLDGRSVSGISLRPLPAQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVL 432

Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASN 492
           + ++  S         +    ++  D        +S  D  + D+   E E    G   +
Sbjct: 433 LSWSSMSSN---PKKPRPRMSNVAEDREEASVDSQSEEDVYEDDLYTAEPETPALGRRPS 489

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG-----------LRINADASATGISKQS---N 538
              S    + F + D L NIGPL+D + G           L  NA +    I+ Q    N
Sbjct: 490 AETSGVGVYIFQILDRLPNIGPLRDITLGKPASTVENTGRLIENACSELELIAAQGSGRN 549

Query: 539 YELV--------------ELPGCKGIWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIIS 582
             LV              +    +G+WT       G     D  R+   + EY  Y+I+S
Sbjct: 550 GGLVLMKREIEPDVAASFDAQSVQGVWTAVVALGSGAPLVPDEQRI---NQEYRQYVILS 606

Query: 583 L-------EARTMVLETADL----LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
                   ++   + +  DL      E   + D      TI  G L  +RRV+QV     
Sbjct: 607 KPEAPDKEQSEVFIADKQDLKPFKAPEFNPNNDV-----TIEIGTLSCKRRVVQVLRNEV 661

Query: 632 RILDGSYMTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
           R    SY   D+  G     P   E    S+    +S S+ADPY+ +   D ++ LL  D
Sbjct: 662 R----SY---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMLLQAD 712

Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
            S     V+   +  + K    SC LY DK 
Sbjct: 713 DSGDLDEVELDDSTRAGK--WRSCCLYWDKA 741


>sp|A1C3U1|CFT1_ASPCL Protein cft1 OS=Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 /
            DSM 816 / NCTC 3887 / NRRL 1) GN=cft1 PE=3 SV=1
          Length = 1401

 Score =  185 bits (469), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 158/559 (28%), Positives = 262/559 (46%), Gaps = 55/559 (9%)

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG---------SRPCWCMVF 939
            N    R P D+ T       +  + + I  +ISG+   F+ G         SR C   + 
Sbjct: 861  NHVLPRIPPDSDTNISDKEPSNHRPLCILPDISGYSAVFMPGTSASFIFKTSRSC-PHIL 919

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
            R R  V   L D     FT   + +   GFIYV S+ +++ICQLP  + YD  W ++K+ 
Sbjct: 920  RLRGGVVRSLSD---FDFT---DPSLGRGFIYVDSKDVVRICQLPPETIYDYSWTLKKVA 973

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT 1059
            +      + Y      Y L  S       +    L  D E+  +  N  +S +   R   
Sbjct: 974  IGEHVDHLAYSISSETYVLGTS------HSADFKLPEDDELHPEWRNEAISFLPELRQCC 1027

Query: 1060 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYV 1118
            +     +++ P      W    +  +   E  + V+ + L  +  T E + ++ +GTA  
Sbjct: 1028 L-----KVVHPKT----WTVIDSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTALA 1078

Query: 1119 QGEDVAARGRVLLFSTGRNADNPQNLVTE----VYSKEL-KGAISALASL--QGHLLIAS 1171
            +GED+ ARG + +F   +   +P+   T+    +  KEL KGA++AL+ +  QG L+ A 
Sbjct: 1079 RGEDIPARGCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAAQ 1138

Query: 1172 GPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGA 1227
            G K ++   K  G+ L  +AF D    YV  L  +K     ++GD  K I+F  + E+  
Sbjct: 1139 GQKCMVRGLKEDGSLLP-VAFMDVQ-CYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEPY 1196

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
            +++L  KD    +  A +FL DG  L ++V+D   N+ +  Y P+   S  G KLL R++
Sbjct: 1197 KMSLFGKDLEYPEVVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRSK 1256

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSD-----KTNRFALLFGTLDGSIGCIAPLDEL 1342
            FH+G H T  L L    T+S    +A         +     +L  +  GSIG +  + E 
Sbjct: 1257 FHMG-HFTSTLTLLPRTTASYEIPSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPEE 1315

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
            ++RRL +LQ +L ++V H  GLNPR++R   S+G A R     ++D  LL  +  +  + 
Sbjct: 1316 SYRRLSALQSQLANTVEHPCGLNPRAYRAIESDGTAGR----GMLDGNLLYQWLSMSKQR 1371

Query: 1403 QLEIAHQTGTTRSQILSNL 1421
            ++EIA + G    +I ++L
Sbjct: 1372 RMEIAARVGAHEWEIKADL 1390



 Score =  119 bits (297), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 167/723 (23%), Positives = 294/723 (40%), Gaps = 127/723 (17%)

Query: 57  NLVVTAANVIEIYV---VRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
           NLVV   +V++I+    V    EG   +  S         D + +  L L   Y L G V
Sbjct: 28  NLVVVKTSVLQIFSLLNVSCSAEGEIIAAKSARP------DQLQSTKLILEREYSLSGTV 81

Query: 114 ESLA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
             L     + ++ G D      +I+LAF +AK+S++E+D   +G+   S+H +E  +   
Sbjct: 82  SDLCRVKLLKTKSGGD------AILLAFRNAKLSLVEWDPERYGISTISIHYYERDDITR 135

Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GD----------- 216
                +  + G ++ VDP  RC  V  +G++ + IL   Q G  LV GD           
Sbjct: 136 SPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLVMGDYESDSQKQSHE 194

Query: 217 ---EDTFGS-----GGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
              +D+ G+     G        SS V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 195 HEMDDSAGNSKSKEGAVHQTPYASSFVLPLTALDSAILHPVSLAFLYEYREPTFGILYSQ 254

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
             T    +  +      +  ++    +   ++ S   LP D +K++A+P P+GG L++G 
Sbjct: 255 IATSNSLLHERKDAIFYTVFTLDLEQRASTMLLSVTRLPSDLFKVVALPPPVGGALLIGY 314

Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
           N  +H      + A+ +N ++  + +     +S  ++ L+      L N     LL+  +
Sbjct: 315 NELVHVDQAGKTNAVGVNEFSRQVSTFSMADQSELALRLEGCVVELLGNSSGDLLLALSS 374

Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLV 435
           G +VL+    DGR V  + + +  P     +I         ++G+   F GS   +S+L+
Sbjct: 375 GTMVLVHFKLDGRSVSGISI-RPLPGHAGGNILKAAASASASLGSDKVFFGSEDAESVLL 433

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN-- 493
            ++  S  +  S   + E   IE D         S  D  +D        LY +A +   
Sbjct: 434 GWSLSSSNARKS---RSESKRIEKDHEEGSDDSESEEDVYED-------DLYSAAPDTPA 483

Query: 494 -------TESAQKTFSFAVRDSLVNIGPLKDFSYG-------------------LRINAD 527
                    S   ++ F V D L N  PL+D + G                   L + A 
Sbjct: 484 LGHRLSVAPSTFASYKFKVHDVLPNTAPLRDIALGQPAMPVEDTGSHLDNICSELELVAA 543

Query: 528 ASATG-----ISKQSNYELVE----LPGCKGIWT---VYHKSSRGHNADSSRMAAYDDEY 575
             + G     + K+    +V+    +    G+WT       +++  + D + +    +E+
Sbjct: 544 YGSNGNGGLVVMKRELEPVVKASLNVGPIHGVWTASIALGSAAKPMSGDQTNI----EEW 599

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFE 628
             Y+I++ + +T+  E +++      ++  F           +I  G L  R+RV+QV  
Sbjct: 600 RQYVILT-KPQTIDKEESEVFIVDGLNLKPFKAPEFNPNNDISIQVGTLSNRKRVVQVLR 658

Query: 629 RGARILDGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
              R  D      DL      P   E    S+    LS S+ADPY+ +   D ++ LL  
Sbjct: 659 NEVRSYD-----SDLELAQIYPVWDE--DTSDERMALSASLADPYIAILRDDSTLLLLQA 711

Query: 686 DPS 688
           D S
Sbjct: 712 DDS 714


>sp|Q6BHK3|CFT1_DEBHA Protein CFT1 OS=Debaryomyces hansenii (strain ATCC 36239 / CBS 767 /
            JCM 1990 / NBRC 0083 / IGC 2968) GN=CFT1 PE=3 SV=2
          Length = 1342

 Score =  155 bits (392), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 121/500 (24%), Positives = 231/500 (46%), Gaps = 46/500 (9%)

Query: 881  NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
            N    + ++L  +  P +AY+   T      +R+  F N++G    F++G  P +     
Sbjct: 810  NFKLVKEKDLIITGAPDNAYSLGTTIE----RRLVYFPNVNGFTSIFVTGITPYYISKTT 865

Query: 941  ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL 1000
              +    +      V+F    +    +G IY+ +    +IC++P    Y+N WP++KIP+
Sbjct: 866  HSVPRIFKFTKLPAVSFAPYSDDKIKNGLIYLDNSKNARICEIPVDFNYENNWPIKKIPI 925

Query: 1001 KATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS--SVDLHRTY 1058
            K +   +TY    N +       V+    ++    +D+E G  I   + S  S + ++ Y
Sbjct: 926  KESIKSVTYHELSNTF-------VISTYEEIPYDCLDEE-GKPIVGVDKSKPSANSYKGY 977

Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAIG 1114
                  ++++ P      W    TI +   E  + V+ + L   ++TK+    + L+ IG
Sbjct: 978  ------IKLISPYN----WSVIDTIELVDGEIGMNVQSMVLDVGSSTKKFKNKKELIVIG 1027

Query: 1115 TAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
            T   + ED++A G   +F         G+   N +    E++ ++ KGA++++  + G  
Sbjct: 1028 TGKYRMEDLSANGSFKIFEIIDIIPEPGKPETNHK--FKEIHQEDTKGAVTSICEISGRF 1085

Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
            L++ G KII+       +  +AF D   +YV       N ++LGD  KSI+   +  +  
Sbjct: 1086 LVSQGQKIIIRDLQDDGVVPVAFLDTS-VYVSEAKSFGNLLILGDSLKSIWLAGFDAEPF 1144

Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
            ++ +L KD  SLD    +F+I    + ++++D    + +  Y P+   S  GQ+L+ +A 
Sbjct: 1145 RMVMLGKDLQSLDVNCADFIIKDEEIFILIADNNSTLHLVKYDPEDPTSSNGQRLIHKAS 1204

Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
            F++ +  T       + +        P S  T  F  +  T+DGS   + P++E ++RR+
Sbjct: 1205 FNINSTPT------CIRSIPKNEEINPSS--TEVFQSIGSTIDGSFYTVFPINEASYRRM 1256

Query: 1348 QSLQKKLVDSVPHVAGLNPR 1367
              LQ+++ D   H  GLNPR
Sbjct: 1257 YILQQQITDKEYHFCGLNPR 1276



 Score = 92.0 bits (227), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 91/414 (21%), Positives = 178/414 (42%), Gaps = 64/414 (15%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           L+V  A V++++ +   E  +++ K                  L+LV  ++LHG +  + 
Sbjct: 29  LIVGKATVLQVFEIITTETKTQQYK------------------LKLVEQFKLHGLITDIK 70

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +     +NS+  D ++++ + AK+S++++D  ++ +   S+H +E+          E  
Sbjct: 71  AIRT--VENSQL-DYLLVSSKGAKMSLIKWDHHLNSISTVSLHYYENSIQ---SSTYEKL 124

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMII-----------------LKASQGGSGLVGDEDTF 220
               LV V+P   C  +    L   +                 +  S G      +++  
Sbjct: 125 TTTDLV-VEPNNNCTCLRFKNLLTFLPFETLDEEEEDDDDDEEMNGSSGSDKKATNKENG 183

Query: 221 GSGGG-FSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
            S G   S   ESS +I+ R LD +   + D  F++ Y EP + I+  +   WAG +   
Sbjct: 184 NSNGEEVSELFESSFMIDGRTLDSRIGDIIDMQFLYNYREPTIAIIFSKAHAWAGNLPKV 243

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSA 336
                   LS+    K    +    NLP D  K++ +P P+ G L++G N  IH  +   
Sbjct: 244 KDNINFIVLSLDLVTKASTTVLKIDNLPFDIDKIIPLPQPLNGSLLMGCNEIIHVDNGGI 303

Query: 337 SCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVY 393
           +  LALN +  S+ +S +    +S  +++L+      + ND   L+    GD   +    
Sbjct: 304 TRRLALNQFTSSITTSLKNYHDQSDLNLKLENCSVKPIPNDNKVLMILNNGDFYYINFKI 363

Query: 394 DGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
           DG+ +++            D+  T P     +I T+ N+L F+ ++ G++ L++
Sbjct: 364 DGKTIKKFFVEKVSDLNYDDIQLTYP----GEIATLDNNLMFISNKNGNNPLLE 413


>sp|P0CM62|CFT1_CRYNJ Protein CFT1 OS=Cryptococcus neoformans var. neoformans serotype D
            (strain JEC21 / ATCC MYA-565) GN=CFT1 PE=3 SV=1
          Length = 1431

 Score =  150 bits (379), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 129/529 (24%), Positives = 228/529 (43%), Gaps = 48/529 (9%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
            I  F NI G  G F++G +P W +        HP     L   ++      H       F
Sbjct: 931  IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
            I +       IC LP     D   P  +  ++     IT+      Y    S+ V  P  
Sbjct: 987  IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEV--PFQ 1042

Query: 1030 QVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
                   D+E   Q+  D  +L      R+ T+E +        +   PW+         
Sbjct: 1043 AY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWKVIDGYEFDQ 1089

Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1144
            +E  +++  V L +          +A+GT +  GED A RG   +F   +          
Sbjct: 1090 NEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGP 1149

Query: 1145 -------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPL 1196
                   + +      +  ++A+  + G+LL  +GPK+ +      ++L G+AF D   L
Sbjct: 1150 GSVPGWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDSQLMGLAFLDIQ-L 1208

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
            Y  ++ + KNF+L+GD+ KS +F+S +E   +   ++KD   +     +FL+    ++ +
Sbjct: 1209 YATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTFI 1268

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
             SD   ++++  + P   +S  G++L+ R E+H G+  T    +    T+ +    AP +
Sbjct: 1269 SSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAATVSKVIARRKTAEEE--FAPQT 1326

Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
                   +++ T DG++  +  + +  F+RLQ +  +LV +  HVAGLNPR+FR    N 
Sbjct: 1327 Q------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-ND 1379

Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               RP    I+D +LL+ + + P+  Q E+  Q GT    + S+L  L 
Sbjct: 1380 LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428



 Score =  112 bits (279), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 161/716 (22%), Positives = 300/716 (41%), Gaps = 108/716 (15%)

Query: 57  NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---------------- 96
           NLVV  A V+ ++ +R +     E  K  ++  E ++ V M+ +                
Sbjct: 48  NLVVAGAEVLRVFEIREESVPIIENVKLEEDVAEGEKDVQMEEVGDGFFDDGHAERAPLK 107

Query: 97  --SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
             +   L L+  + L+G +  LA  ++         D +I++F+DAK+++LE+  S   +
Sbjct: 108 YQTTRRLHLLTQHELNGTITGLAA-TRTLESTIDGLDRLIVSFKDAKMALLEW--SRGDI 164

Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
              S+H +E    ++     +S+   PL++ DP  R   + +    + +L   Q  S L 
Sbjct: 165 ATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDSLAVLPLIQEQSEL- 220

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDM--KHVKDFIFVHGYIEPVMVILHERELTWAG 272
              D    G    A    S V++L D+ +  K+++D +F+ G+  P + +L     TW+G
Sbjct: 221 ---DPLSEGFSRDAPYSPSFVLSLSDMSITIKNIQDLLFLPGFHSPTIALLFSPMHTWSG 277

Query: 273 RV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
           R+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS +GG+++V +  I 
Sbjct: 278 RLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSELGGIVLVTSTGIV 336

Query: 331 YHSQ----SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
           +  Q    +A+C  A  +   SL  S  +   S  + L+ +   ++     LL  + G +
Sbjct: 337 HVDQGGRVTAACVNAWWSRITSLKCS--MASVSQKLTLEGSRCVFVTPHDMLLVLQNGAV 394

Query: 387 VLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
             +    +GR   V++ LD     P    SD+T  G+   F+GS  GDS L +       
Sbjct: 395 HQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSWLAKVNVVRQV 452

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
              S   K+E  +++ D    + L    +DA  D    E   L+G A+          + 
Sbjct: 453 VERSEKKKDEM-EVDWD----EDLYGDINDAALDEKAQE---LFGPAA---------ITL 495

Query: 504 AVRDSLVNIGPLKDFSYGL-----------------------RINADASATGISKQSNYE 540
           +  D L  +G + D  +G+                        IN       I+K+  + 
Sbjct: 496 SPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTINVFRRGIPITKRRRFN 555

Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
             EL   +G+W +      G      +     +   A +++S E          L ++ T
Sbjct: 556 --ELLNAEGVWFLPIDRQTGQ-----KFKDIPEAERATILLSSEGNAT--RVFALFSKPT 606

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENS 659
                 + G+T++A   F R  +++V      +LD +  + Q +         G G +  
Sbjct: 607 PQQIGRLDGKTLSAAPFFQRSCILRVSPLEVVLLDNNGKIIQTV------CPRGDGPK-- 658

Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
            +++ SI+DP+V++  +D S+   VGD    TV+ + P   E       +  ++ D
Sbjct: 659 -IVNASISDPFVIIRRADDSVTFFVGDTVARTVA-EAPIVSEGESPVCQAVEVFTD 712


>sp|P0CM63|CFT1_CRYNB Protein CFT1 OS=Cryptococcus neoformans var. neoformans serotype D
            (strain B-3501A) GN=CFT1 PE=3 SV=1
          Length = 1431

 Score =  150 bits (379), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 129/529 (24%), Positives = 228/529 (43%), Gaps = 48/529 (9%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
            I  F NI G  G F++G +P W +        HP     L   ++      H       F
Sbjct: 931  IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
            I +       IC LP     D   P  +  ++     IT+      Y    S+ V  P  
Sbjct: 987  IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEV--PFQ 1042

Query: 1030 QVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
                   D+E   Q+  D  +L      R+ T+E +        +   PW+         
Sbjct: 1043 AY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWKVIDGYEFDQ 1089

Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1144
            +E  +++  V L +          +A+GT +  GED A RG   +F   +          
Sbjct: 1090 NEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGP 1149

Query: 1145 -------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPL 1196
                   + +      +  ++A+  + G+LL  +GPK+ +      ++L G+AF D   L
Sbjct: 1150 GSVPGWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDSQLMGLAFLDIQ-L 1208

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
            Y  ++ + KNF+L+GD+ KS +F+S +E   +   ++KD   +     +FL+    ++ +
Sbjct: 1209 YATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTFI 1268

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
             SD   ++++  + P   +S  G++L+ R E+H G+  T    +    T+ +    AP +
Sbjct: 1269 SSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAATVSKVIARRKTAEEE--FAPQT 1326

Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
                   +++ T DG++  +  + +  F+RLQ +  +LV +  HVAGLNPR+FR    N 
Sbjct: 1327 Q------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-ND 1379

Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
               RP    I+D +LL+ + + P+  Q E+  Q GT    + S+L  L 
Sbjct: 1380 LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428



 Score =  112 bits (279), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 161/716 (22%), Positives = 300/716 (41%), Gaps = 108/716 (15%)

Query: 57  NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---------------- 96
           NLVV  A V+ ++ +R +     E  K  ++  E ++ V M+ +                
Sbjct: 48  NLVVAGAEVLRVFEIREESVPIIENVKLEEDVAEGEKDVQMEEVGDGFFDDGHAERAPLK 107

Query: 97  --SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
             +   L L+  + L+G +  LA  ++         D +I++F+DAK+++LE+  S   +
Sbjct: 108 YQTTRRLHLLTQHELNGTITGLAA-TRTLESTIDGLDRLIVSFKDAKMALLEW--SRGDI 164

Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
              S+H +E    ++     +S+   PL++ DP  R   + +    + +L   Q  S L 
Sbjct: 165 ATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDSLAVLPLIQEQSEL- 220

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDM--KHVKDFIFVHGYIEPVMVILHERELTWAG 272
              D    G    A    S V++L D+ +  K+++D +F+ G+  P + +L     TW+G
Sbjct: 221 ---DPLSEGFSRDAPYSPSFVLSLSDMSITIKNIQDLLFLPGFHSPTIALLFSPMHTWSG 277

Query: 273 RV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
           R+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS +GG+++V +  I 
Sbjct: 278 RLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSELGGIVLVTSTGIV 336

Query: 331 YHSQ----SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
           +  Q    +A+C  A  +   SL  S  +   S  + L+ +   ++     LL  + G +
Sbjct: 337 HVDQGGRVTAACVNAWWSRITSLKCS--MASVSQKLTLEGSRCVFVTPHDMLLVLQNGAV 394

Query: 387 VLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
             +    +GR   V++ LD     P    SD+T  G+   F+GS  GDS L +       
Sbjct: 395 HQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSWLAKVNVVRQV 452

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
              S   K+E  +++ D    + L    +DA  D    E   L+G A+          + 
Sbjct: 453 VERSEKKKDEM-EVDWD----EDLYGDINDAALDEKAQE---LFGPAA---------ITL 495

Query: 504 AVRDSLVNIGPLKDFSYGL-----------------------RINADASATGISKQSNYE 540
           +  D L  +G + D  +G+                        IN       I+K+  + 
Sbjct: 496 SPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTINVFRRGIPITKRRRFN 555

Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
             EL   +G+W +      G      +     +   A +++S E          L ++ T
Sbjct: 556 --ELLNAEGVWFLPIDRQTGQ-----KFKDIPEAERATILLSSEGNAT--RVFALFSKPT 606

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENS 659
                 + G+T++A   F R  +++V      +LD +  + Q +         G G +  
Sbjct: 607 PQQIGRLDGKTLSAAPFFQRSCILRVSPLEVVLLDNNGKIIQTV------CPRGDGPK-- 658

Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
            +++ SI+DP+V++  +D S+   VGD    TV+ + P   E       +  ++ D
Sbjct: 659 -IVNASISDPFVIIRRADDSVTFFVGDTVARTVA-EAPIVSEGESPVCQAVEVFTD 712


>sp|Q0UUE2|CFT1_PHANO Protein CFT1 OS=Phaeosphaeria nodorum (strain SN15 / ATCC MYA-4574 /
            FGSC 10173) GN=CFT1 PE=3 SV=1
          Length = 1375

 Score =  145 bits (365), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 133/521 (25%), Positives = 226/521 (43%), Gaps = 44/521 (8%)

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
            P  +S  L   N+   +L      R   D    E          +    NI+G+      
Sbjct: 836  PSRSSSDLWTHNLRWVKLSQQHVPRYMEDGAQEEAADEPGFESTLLALDNINGYSTVIQR 895

Query: 930  GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            G  P + +           L    + + T  H  +C  GF Y+ S   L+I QLP  + Y
Sbjct: 896  GRSPAFILKESSSAPRVIGLSGNPVKSLTRFHTSSCQRGFAYLDSTDTLRISQLPPSTHY 955

Query: 990  DNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVLSLLIDQEVGHQID 1045
             +  W  +++P+ A  H + Y     LY +    P    L P +     L  +E   +  
Sbjct: 956  GHLGWAARRMPMDAEVHALAYHP-SGLYVIGTGQPEEYTLDPNDTFHYELPKEETSFKPK 1014

Query: 1046 -NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTT 1103
              H +  V   +T+TV   +  +L+P                  E  L ++ + L  + T
Sbjct: 1015 VEHGIIKVMDEKTWTV--IDTHVLDP-----------------QEVILCIKTLNLEVSET 1055

Query: 1104 TKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAIS 1158
            T + + ++A+GTA V GED+A +G + +F        P +  T      +   E+KG +S
Sbjct: 1056 THQRKDVIAVGTAIVLGEDLATKGNIRIFEVITVVPEPDHPETNKRLKLIVKDEVKGTVS 1115

Query: 1159 ALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGD 1212
            A++ L  QG L++A G K ++   K  GT L  +AF D    YV +L  + N   +L+GD
Sbjct: 1116 AISDLGTQGFLIMAQGQKSMVRGLKEDGTLLP-VAFMDMQ-CYVTTLKTLPNTGMLLMGD 1173

Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
             +K  +F  + E+  ++ L  +    L+C   +FL     L ++V+D   N+Q+  + P 
Sbjct: 1174 AYKGAWFTGYTEEPYKMMLFGRSKHHLECITADFLPFEEQLHIIVADADMNLQVLQFDPD 1233

Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGAAPGSDKTNRFALLFGTL 1329
              +S  G +LL ++ FH G   +    LQ    + T+S+ T +   S   ++  +L  + 
Sbjct: 1234 HPKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSSLPLHQ--ILCTSQ 1291

Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
             G++  I PL E ++RRL  L   L   +    GLN ++FR
Sbjct: 1292 SGTLALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFR 1332



 Score =  132 bits (331), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 167/727 (22%), Positives = 295/727 (40%), Gaps = 142/727 (19%)

Query: 57  NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NL+V   ++++++     V  V   G  E+ N+      E     L    + A L LV  
Sbjct: 28  NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + L G V SLA +     +   R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88  FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145

Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
                 W    +   +F     +  DP  RC  +      + IL                
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
           +A+Q G      E   G+ G  + +    SS V+ L +LD  + H     F+H Y EP  
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            ++   + T A  ++ +      +  ++    K    + S   LP+D  +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316

Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
            L+VG+N  IH      +  +A+N  A +  S     ++  ++ L+      L  D    
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI---TTIGNSLFFLGSRLG 430
           ++    G + +LT   DGR V  + +    +    ++L +     T +G    F+GS  G
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPVPADNGGNILKTRASCSTNLGRGRLFIGSEDG 436

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
           +S+L+ +T                        ++ +LRR  S+  Q   + E++S     
Sbjct: 437 ESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEEE 472

Query: 491 S---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD----------- 517
                     N+T +  K             T++F V D L +I P++D           
Sbjct: 473 EVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTES 532

Query: 518 FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------GH 561
            + G + ++    A G     N EL        ELP   G+W V+ K           G 
Sbjct: 533 LTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFGQ 592

Query: 562 NADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAAG 615
           +A+++  A+ D +Y  YL++S         T+V E        TE  D+   +G T++ G
Sbjct: 593 DAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSVG 650

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
            L    +V+QV     R  D     + +   P   E      N  +++ S ADPY+L+  
Sbjct: 651 VLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQR 706

Query: 676 SDGSIRL 682
            D S+++
Sbjct: 707 EDSSVKI 713


>sp|Q5AFT3|CFT1_CANAL Protein CFT1 OS=Candida albicans (strain SC5314 / ATCC MYA-2876)
            GN=CFT1 PE=3 SV=1
          Length = 1420

 Score =  132 bits (332), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 120/572 (20%), Positives = 246/572 (43%), Gaps = 73/572 (12%)

Query: 881  NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
            N    + ++L  +  P +A+     P+G   +R +  F N++G    F++G  P   +  
Sbjct: 855  NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 909

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
               +    Q    + ++ +   +    +G I++ +Q   +IC+LP    Y+   P++ + 
Sbjct: 910  VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVD 969

Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLH 1055
            +  +   I Y    +         VL    Q+    +D+E     G   D  +  ++   
Sbjct: 970  IGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEEGKPIAGIIKDIKDTPAMSFK 1022

Query: 1056 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV-----------------T 1098
             +       ++++ P      W    TI +  +E  +T++ +                 +
Sbjct: 1023 GS-------IKLVSPYN----WTVIETIELGDNEVGMTLKSMILDVGSESGSTLGSDPNS 1071

Query: 1099 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSK 1151
            L     K+    + IG    + ED+AA G   ++         G+   N +    E++ +
Sbjct: 1072 LIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKE 1129

Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
            E +GAI+++  L G  L++ G K+I+          +AF D P +YV       N ++LG
Sbjct: 1130 ETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILG 1188

Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
            D+ K  + + +  +  ++ +L KD   +     +F+I+   + ++V+D    + +  Y P
Sbjct: 1189 DLLKGCWLVGFDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDP 1248

Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTNR 1321
               +S  G KLL++A F + + ++    L ++    S +T A          P +  +N 
Sbjct: 1249 DDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDALTNIAVPPPLPPNTTSNY 1308

Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGK 1377
            F ++  T DGS   + P++E  +RR+  LQ++L+D   H  GLNPR  R    +  +N  
Sbjct: 1309 FQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNET 1368

Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
              +P    I+D +L+  +  L  + +  +A++
Sbjct: 1369 NTKP----ILDYDLIRSFTKLSDDRKRNLANK 1396



 Score = 77.0 bits (188), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 104/221 (47%), Gaps = 17/221 (7%)

Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
           +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WAG +           L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY-- 345
              LK    ++   NLP++  +++ +PSP+ G L+VG N  IH  +      +A+N +  
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336

Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
               S  S Q+  +S  +++L+      + +D   LL  +TG+   +    DG+ ++R+ 
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394

Query: 403 LSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
           +   +             ++  +  ++ F+ +  G+S L+Q
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 435


>sp|Q6FSD2|CFT1_CANGA Protein CFT1 OS=Candida glabrata (strain ATCC 2001 / CBS 138 / JCM
            3761 / NBRC 0622 / NRRL Y-65) GN=CFT1 PE=3 SV=1
          Length = 1361

 Score =  120 bits (301), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 82/337 (24%), Positives = 157/337 (46%), Gaps = 23/337 (6%)

Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKG 1155
            +T TK     + +G  Y   EDV   G   ++        P    T     E++ ++++G
Sbjct: 1026 DTRTKRKREYIIVGIGYATMEDVPPTGEFHIYDITEVVPEPGKPNTNFKLKEIFKEDIRG 1085

Query: 1156 AISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
             +S +  + G  LI+   KI++        +  +AF D P ++V SL    N I++GD  
Sbjct: 1086 IVSVVNGISGRFLISQSQKIMVRDVQQDNSVIPVAFLDVP-VFVTSLKTFGNLIVIGDAM 1144

Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
            + I F+ +  +  ++  L       +  + EFL++   +  +V+D    + +  YAP   
Sbjct: 1145 QGIQFVGFDAEPYRMITLGSSITKFEVISVEFLVNNGDIYFLVTDRDSIMHVLKYAPDQP 1204

Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR-FALLFGTLDGSI 1333
             +  GQ+L+  + F++ +         ML   +D     P   + +R F  +   +DGSI
Sbjct: 1205 NTLSGQRLVHCSSFNLHS----LNNCTMLLPKNDE---FPRDQRYSRSFQTITAQVDGSI 1257

Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ---FHSNGKAHRPGPDSIVDCE 1390
              I P+ E T+RRL  +Q++++D  P +AGLNPR  RQ   ++  G + RP    ++D  
Sbjct: 1258 SKIVPVKEETYRRLYFIQQQIIDKEPQLAGLNPRMERQDNKYYHLGHSLRP----MLDFN 1313

Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRS-QILSNLNDLAL 1426
            ++  ++ + +  +  I  + G   + ++  +L DL  
Sbjct: 1314 IIKRFKDMSMNRRSHIVQKLGKNSNLEVWRDLIDLEF 1350



 Score = 57.4 bits (137), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 132/674 (19%), Positives = 265/674 (39%), Gaps = 122/674 (18%)

Query: 96  ISAASLELVCHYRLHGNVESLAIL---SQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
           I +  L L+  ++L G +  +A++   S G   N      ++L+   AK+S+L +++   
Sbjct: 43  IRSGRLYLMEEHKLSGRINDVALIPKHSNGSNGNGINLSYLLLSTGVAKLSLLMYNNMTS 102

Query: 153 GLRITSMHC----FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
            +   S+H     FES   L L       AR   ++++P G     +++   ++ +    
Sbjct: 103 SIETISLHFYEDKFESATMLDL-------ARNSQLRIEPNGNYA--MLFNNDVLAILPFY 153

Query: 209 GGSGLVGDED----------------TFGSGGGFSARIESSH---VINLRDL--DMKHVK 247
            G     DED                 F    G +   + +H   +IN  +L   +K++K
Sbjct: 154 TGINEDEDEDYINNDKSKINDNSKKSLFKRKKGKTQNNKVTHPSIIINCSELGPQIKNIK 213

Query: 248 DFIFVHGYIEPVMVILHERELTWAGR---VSWKHHTCMIS---ALSISTTLKQHPLIWSA 301
           D  F+ G+ +  + +L++ +L W G    V    +  +IS     SI  T     +I   
Sbjct: 214 DIQFLCGFTKSTIGVLYQPQLAWCGNSQLVPLPTNYAIISLDMKFSIDATTFDKAIISEI 273

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVS-LDSSQELPRS 358
             LP D +    +   + G L++G N I +   +      L LN+Y+   L   + + +S
Sbjct: 274 SQLPSDWH---TIAPTLSGSLILGVNEIAFLDNTGVLQSILTLNSYSDKVLPKVRVIDKS 330

Query: 359 SFSVELDAAHATWL----QNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLS------ 404
           S  V  +      L    +N+ +    LL  + G +  + +  +GR++ + +++      
Sbjct: 331 SHEVFFNTGSKFALIPSNENERSVENILLFDENGCIFNVDLKSEGRLLTQFNITKLPLGE 390

Query: 405 -----KTNP---SVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFG 455
                K+NP   S++ +D   +     F+G + GD+ +++     S   +      +++ 
Sbjct: 391 DVLSQKSNPSSVSIIWAD-GRLDTYTIFIGFQSGDATMLKLNHLHSAIEVEEPTFMKDYV 449

Query: 456 DIEADAPSTKRLRRSS-------SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
           + +A A                 SD   D VN +    +G+  SN   +AQ+        
Sbjct: 450 NKQASAAYNNEDDDDDDDDFNLYSDEENDQVNNKNDRTFGTNESNEPFTAQELM------ 503

Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
            L NIGP+     G   + + +  G+   +  E+        + T  +      NA  + 
Sbjct: 504 ELRNIGPINSMCVGKVSSIEDNVKGLPNPNKQEI------SIVCTSGYGDGSHLNAILAS 557

Query: 568 MAAYDDEYHAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR----- 610
           +    ++   ++ I+      ++ +   L T D       + E+  +     QGR     
Sbjct: 558 VQPRVEKALKFISITKIWNLHIKGKDKFLITTDSTQSQSNIYEIDNNFSQHKQGRLRRDA 617

Query: 611 -TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            TI    +   +R++QV      + D ++               +   +  V+ VS+ DP
Sbjct: 618 TTIHIATIGDNKRIVQVTTNHLYLYDLTF-----------RRFSTIKFDYEVVHVSVMDP 666

Query: 670 YVLLGMSDGSIRLL 683
           YVL+ +S G I++ 
Sbjct: 667 YVLITLSRGDIKVF 680


>sp|Q6CTT2|CFT1_KLULA Protein CFT1 OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 /
            DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=CFT1 PE=3
            SV=1
          Length = 1300

 Score =  112 bits (281), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 164/338 (48%), Gaps = 30/338 (8%)

Query: 1088 SENALTVRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP--- 1141
            SEN++   + T+    N+ T+    L+ IG+++V+ ED  + G +L+         P   
Sbjct: 954  SENSMVNDIKTMLIQLNSKTRRKRELVIIGSSFVKEEDQPSTGCLLVLDITEVVAEPGKP 1013

Query: 1142 -QNL-VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNG---IAFYDAPPL 1196
              N    +++ +E++G+++A+  + G  +I    K ++      E N    +AF D P +
Sbjct: 1014 DSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQSSKALVRDMQ--EDNSAVPVAFLDMP-V 1070

Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
            ++       N +++GD  +   F+ +  +  ++ +L K          EFL++   ++ +
Sbjct: 1071 FITDAKSFSNLMIIGDSMQGFTFVGFDAEPYRMIVLGKSTSKFQVMNLEFLVNNGNINFI 1130

Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
            V+D Q ++ +  YAP  + S  GQ+L+    F++      +++L        R     GS
Sbjct: 1131 VTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFT-TNNYMKLV-------RKHVEFGS 1182

Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFH 1373
              +N  AL   T DGSI  + PL+E ++RR   +Q++L+D    +AG N +  R   +++
Sbjct: 1183 KTSNYIALGCQT-DGSIFRMIPLNEASYRRFYLVQQQLLDHEIPLAGFNTKMERLDNEYY 1241

Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
              G + RP  DS    ++L  Y  LP+ ++  I ++ G
Sbjct: 1242 HKGHSLRPTLDS----QVLKKYIHLPITKRTTIENRVG 1275



 Score = 76.3 bits (186), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 136/639 (21%), Positives = 257/639 (40%), Gaps = 111/639 (17%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           A  L L   ++L G +  + +L Q G   S  +   IL+   +K+S++ FD     L   
Sbjct: 45  AQKLVLAYEWKLAGKIIDMQLLPQIG---SPLKMLAILS-SKSKVSLVRFDPVAESLETL 100

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD 216
           S+H +   ++++L     S     ++ VDP  RC  +LV+   ++ IL        +  D
Sbjct: 101 SLHYYHD-KFVNL--STSSLKTESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEID 155

Query: 217 EDTFGSGGGFSARIESSHVINLRDLDM---------KHVKDFIFVHGYIEPVMVILHERE 267
           ED  G     + R++ +  I    + M         KHV D  +++ + +P + IL++  
Sbjct: 156 EDENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPV 215

Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
           L W G      +T     LS+    ++  +I    +LP+D + L  VP   G VL +G N
Sbjct: 216 LAWCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLPNDLHTL--VPLKRGYVL-IGVN 272

Query: 328 TIHYHSQSA---SCALALNNYAVSLDSSQELPRSSFSVELDAA----HATWLQNDVALLS 380
            + Y S S    SC + LN +A S  +++    S  ++ L  +    +    ++D+ +L 
Sbjct: 273 ELLYISASGALQSC-IRLNTFATSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILI 331

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL-----GD---- 431
            +   +  +    +G ++ + D  +            I N + F  SRL     GD    
Sbjct: 332 DENCRMYNIITESEGNLLTKFDCVQ----------VPIVNEI-FKNSRLPLSVCGDLNLE 380

Query: 432 --SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
              +L+ F  G    +    LK  F        + ++L  +  D        E  +LYG 
Sbjct: 381 TGRVLIGFLSGDAMFLQLKNLKVAFA-------AKRQLVETVDDDDD-----EYSALYGE 428

Query: 490 ASNNTES----AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
           + NNT +     Q+ F  ++ DS+ NIGPL   + G   + + +   +   +  E   + 
Sbjct: 429 SQNNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEF-SIV 487

Query: 546 GCKGI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
              G+      T  H + + H   + +  +    ++    + ++ +   L T D   E +
Sbjct: 488 ATSGVGRGSHLTALHSTVQPHIEQALKFTSATRIWN----LKIKGKDKYLVTTDADKEKS 543

Query: 601 E------------SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-----MTQDL 643
           +            + D+    RTI    +   +R++QV   G  + D  +     +T D+
Sbjct: 544 DVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTIDI 603

Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
                            ++   I DPY+L   + G+I++
Sbjct: 604 E----------------IVHACIIDPYILFTDARGNIKI 626


>sp|Q06632|CFT1_YEAST Protein CFT1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
            GN=CFT1 PE=1 SV=1
          Length = 1357

 Score =  106 bits (264), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)

Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
            W+   +   P  S  N +   ++ + + T ++ E ++A G A    ED    G   ++  
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059

Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
                  P        + E++ +E+ G +S +  + G  +I+   K+++        +  +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119

Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
            AF D P ++V       N +++GD  +   F+ +  +  ++  L +        + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178

Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
            +G  +    +D  +N+ +  YAP    S  GQ+L+  + F +  H T      ML   ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                  GS +   F  + G +DGS+  I PL E  +RRL  +Q++++D    + GLNPR 
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290

Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
             R    F+  G + RP    ++D  ++  +  L ++ +  IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>sp|Q75EY8|CFT1_ASHGO Protein CFT1 OS=Ashbya gossypii (strain ATCC 10895 / CBS 109.51 /
            FGSC 9923 / NRRL Y-1056) GN=CFT1 PE=3 SV=1
          Length = 1305

 Score =  105 bits (262), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 79/343 (23%), Positives = 150/343 (43%), Gaps = 33/343 (9%)

Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1154
             N+ TK     L +G  YV+ ED+   G   L+        P    T     +++ ++++
Sbjct: 967  LNSNTKRRREYLVVGNTYVRDEDIGGTGSFYLYDITEVVPEPGKPDTNYKFKDIFQEDIR 1026

Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
            G +S +  + G  +I+   K ++        +  +AF D P +++       N +++GD 
Sbjct: 1027 GTVSTVCEISGRFMISQSSKAMVRDIQEDNSVVPVAFLDMP-VFITDAKSFGNLMIIGDS 1085

Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
             +   FL +  +  ++  L K    L+    EFL++   +  +V+D    + +  YAP  
Sbjct: 1086 MQGFSFLGFDAEPYRMLTLGKSVSKLETMCVEFLVNNGDVYFLVTDRNNLMHVLKYAPDE 1145

Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR--------FALL 1325
              S  GQ+L+    F++ +  T       L   +D  G      K +R        F  +
Sbjct: 1146 PNSLSGQRLVHCTSFNLHSTNT----CMRLIKKNDEFG------KVSRGFGIYMPSFQCI 1195

Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPG 1382
                DG+I  + PL E ++R L  +Q++L+D    + GLNPR  R    F+  G   RP 
Sbjct: 1196 GSQADGTIFKVVPLSEASYRSLYLIQQQLIDKEVQLCGLNPRMERLENPFYQMGHILRP- 1254

Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTG-TTRSQILSNLNDL 1424
               ++D  +L  +  L +  ++ +A + G    ++I  +L D+
Sbjct: 1255 ---MLDFTVLKRFATLSIPTRMTMASKAGRQAHAEIWRDLIDI 1294



 Score = 55.5 bits (132), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 120/624 (19%), Positives = 248/624 (39%), Gaps = 124/624 (19%)

Query: 140 AKISVLEFDDSIHGLRITSMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
            ++S++ FD     L   S+H +++   E   L  G       P ++ +P  RC  +LV+
Sbjct: 82  GRVSIVRFDAENQTLETESLHYYDAKFEELSALTVGA-----APRLEQEPAARC--LLVH 134

Query: 198 GLQMIILKASQGGSGLV-------------GDEDTFGSGGGFSARIESSHVINLRDLDMK 244
               + +   +G                     D  G   G S  + +SH+ +    D+K
Sbjct: 135 NGDCLAVLPLRGHEEEGEEAEEEEEHPAKRARTDADGRLVGASTVMPASHLHS----DIK 190

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
           +VKD  F+ G  +  + +L++ +L+W G       T     LS+    ++  +I     L
Sbjct: 191 NVKDMRFLRGLNKSAVGVLYQPQLSWCGNEKLTRQTMKFIILSLDLDDEKSTVINMLQGL 250

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC--ALALNNYAVS-----------LDS 351
           P+  + ++ + +   G ++ G N + Y   + +   A++LN ++ S           L +
Sbjct: 251 PNTLHTIIPLSN---GCVLAGVNELLYVDNTGALQGAISLNAFSNSGLNTRIQDNSKLQA 307

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL---------D 402
             E P   F+ + +         D+ LL  +   +  + +  +GR++            +
Sbjct: 308 FFEQPLCYFATQSNG-------RDILLLMDEKARMYNVIIEAEGRLLTTFNCVQLPIVNE 360

Query: 403 LSKTN--PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
           + K N  P+ +  ++     SL F+G + GD++ V+       + L S L+         
Sbjct: 361 IFKRNMMPTSICGNMNLETGSL-FIGFQSGDAMHVRL------NNLKSSLEH-------- 405

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGP 514
                  + + S+ L+   + + + LYG   NN E  +K       F     D L+NIGP
Sbjct: 406 -------KGTVSETLE--TDEDYMELYG---NNAEKEKKNLETESPFDIECLDRLLNIGP 453

Query: 515 LKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADSSRMAA 570
           +   + G   + + +   ++  +  EL  +     G     T+   +       + +  +
Sbjct: 454 VTSLAVGKASSIEHTVAKLANPNKDELSIVATSGNGTGSHLTILENTIVPTVQQALKFIS 513

Query: 571 YDDEYH-------AYLIISLEARTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
               ++        YL+ +  ++T   + + D   +  ++ D+     T++     G +R
Sbjct: 514 VTQIWNLKIKGKDKYLVTTDSSQTRSDIYSIDRDFKPFKAADFRKNDTTVSTAVTGGGKR 573

Query: 623 VIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
           ++QV  +G  + D ++   MT +  F               V+ V I DP++LL  S G 
Sbjct: 574 IVQVTSKGVHLFDINFKRMMTMNFDF--------------EVVHVCIKDPFLLLTNSKGD 619

Query: 680 IRLLVGDPSTCTVSVQT--PAAIE 701
           I++   +P      V+T  P A++
Sbjct: 620 IKIYELEPKHKKKFVKTVLPDALK 643


>sp|Q6E7D1|DDB1_SOLCE DNA damage-binding protein 1 OS=Solanum cheesmanii GN=DDB1 PE=3 SV=1
          Length = 1095

 Score = 87.8 bits (216), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 40/356 (11%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1139
            +T P+   E   ++    L  + + ++     IGTAYV  E+    +GR+L+F      D
Sbjct: 764  STYPLDQFEYGCSI----LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIV---ED 816

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP----- 1194
                L+ E   KE KGA+ +L +  G LL A   KI L+KW   E  G            
Sbjct: 817  GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTECGHHG 873

Query: 1195 ---PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
                LYV +     +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D  
Sbjct: 874  HILALYVQTRG---DFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDDDI 930

Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
             L       + N  +F    K SE    +   +L    E+H+G  V +F    ++     
Sbjct: 931  YLG-----AENNFNLFT-VRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMR--- 981

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                 P SD      ++FGT++G IG IA L    +  L+ LQ  L   +  V GL+   
Sbjct: 982  ----LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQ 1037

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +R F++  K       + +D +L+  +  L      EI+        +++  + +L
Sbjct: 1038 WRSFYNEKKT--VDAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEEL 1091



 Score = 68.2 bits (165), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 113/502 (22%), Positives = 193/502 (38%), Gaps = 123/502 (24%)

Query: 95  GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
           G+    L+ +    ++G + +L +    G      +D + +A E  K  VL++D     +
Sbjct: 49  GLQCICLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDTEASEV 104

Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGL 213
              +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++G    
Sbjct: 105 ITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK- 156

Query: 214 VGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                       F+ R+E   V++++           F++G  +P +V+L++        
Sbjct: 157 ----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ------DN 189

Query: 274 VSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
              +H        +   +LK    I   W+  NL + A  L+ VP P+ GVL++G  TI 
Sbjct: 190 KDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIGEETIV 243

Query: 331 YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
           Y S SA  A+ +            + R+   V+ D +          LL    G L LL 
Sbjct: 244 YCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGLLHLLV 286

Query: 391 VVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
           + ++   V  L +     + + S I+ + N+  F+GS  GDS LV+              
Sbjct: 287 ITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL----------- 335

Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDS 508
                      P TK    S  + L+  VN   +  +       +   +  T S A +D 
Sbjct: 336 ----------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD- 381

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
               G L+    G+ IN  AS            VEL G KG+W++               
Sbjct: 382 ----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--------------R 411

Query: 569 AAYDDEYHAYLIISLEARTMVL 590
           +A DD Y  +L++S  + T VL
Sbjct: 412 SATDDPYDTFLVVSFISETRVL 433


>sp|Q6QNU4|DDB1_SOLLC DNA damage-binding protein 1 OS=Solanum lycopersicum GN=DDB1 PE=1
            SV=1
          Length = 1090

 Score = 87.8 bits (216), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 40/356 (11%)

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1139
            +T P+   E   ++    L  + + ++     IGTAYV  E+    +GR+L+F      D
Sbjct: 759  STYPLDQFEYGCSI----LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIV---ED 811

Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP----- 1194
                L+ E   KE KGA+ +L +  G LL A   KI L+KW   E  G            
Sbjct: 812  GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTECGHHG 868

Query: 1195 ---PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
                LYV +     +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D  
Sbjct: 869  HILALYVQTRG---DFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDDDI 925

Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
             L       + N  +F    K SE    +   +L    E+H+G  V +F    ++     
Sbjct: 926  YLG-----AENNFNLFT-VRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMR--- 976

Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
                 P SD      ++FGT++G IG IA L    +  L+ LQ  L   +  V GL+   
Sbjct: 977  ----LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQ 1032

Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
            +R F++  K       + +D +L+  +  L      EI+        +++  + +L
Sbjct: 1033 WRSFYNEKKT--VDAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEEL 1086



 Score = 67.0 bits (162), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 113/507 (22%), Positives = 196/507 (38%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               +   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EASEVITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    I   W+  NL + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S SA  A+ +            + R+   V+ D +          LL    G 
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L LL + ++   V  L +     + + S I+ + N+  F+GS  GDS LV+         
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL------ 330

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                           P TK    S  + L+  VN   +  +       +   +  T S 
Sbjct: 331 ---------------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++          
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                +A DD Y  +L++S  + T VL
Sbjct: 406 ----RSATDDPYDTFLVVSFISETRVL 428


>sp|Q9M0V3|DDB1A_ARATH DNA damage-binding protein 1a OS=Arabidopsis thaliana GN=DDB1A PE=1
            SV=1
          Length = 1088

 Score = 85.1 bits (209), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ          L      EE E   VR+L+       ++  +T P+ S E   ++
Sbjct: 716  RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 771

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + T++      +GTAYV   E+   +GR+L+F      D    L+ E   KE 
Sbjct: 772  ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 821

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL A   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 822  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
            ++GD+ KSI  L +K +   +   A+D+ +    A E L D   L    ++   N+    
Sbjct: 881  VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 937

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
               + +   +  +L    E+H+G  V +F    ++    D   G  P         ++FG
Sbjct: 938  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989

Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            T++G IG IA L +  +  L+ LQ  L   +  V GL+   +R F++  +
Sbjct: 990  TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039



 Score = 56.2 bits (134), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDP 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+ G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    +   WS  +L + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S SA  A+ +            + ++   V++D +          LL    G 
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGM 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+         
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA      + S  + L+  +N   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++  KSS     
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                   D+ +  +L++S  + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428


>sp|O49552|DDB1B_ARATH DNA damage-binding protein 1b OS=Arabidopsis thaliana GN=DDB1B PE=2
            SV=2
          Length = 1088

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)

Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
            + + HQ      +   L    + EE E   VR+L+       ++  ++ P+ + E   ++
Sbjct: 716  RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 771

Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
                L  + T +      +GTAYV   E+   +GR+L+F      +    L+TE   KE 
Sbjct: 772  ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821

Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
            KGA+ +L +  G LL +   KI L+KW     GT EL     +    L  + +    +FI
Sbjct: 822  KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880

Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
             +GD+ KSI  L +K +   +   A+D+ +    A E L D   L    +D   NI    
Sbjct: 881  AVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLG---TDNCFNIFTVK 937

Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
               + +   +  ++    E+H+G  V +F    ++          P SD      ++FGT
Sbjct: 938  KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 990

Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            + G IG IA L +  +  L+ LQ  L   +  V GL+   +R F++  +
Sbjct: 991  VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1039



 Score = 64.7 bits (156), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    L+G + ++ +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
                       +ES E +    G  S         G +  +DP  R  G+ +Y GL  +
Sbjct: 94  ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I   ++G                F+ R+E   V++++           F++G  +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK    +   WS  NL + A  L+ VPSP+ 
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S +A  A+ +            + ++   V+LD +          LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
               G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++   
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           +++ DA      + S  + L+  VN   +  +       +   +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428


>sp|A1A4K3|DDB1_BOVIN DNA damage-binding protein 1 OS=Bos taurus GN=DDB1 PE=2 SV=1
          Length = 1140

 Score = 81.3 bits (199), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081



 Score = 48.9 bits (115), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 95/461 (20%), Positives = 171/461 (37%), Gaps = 116/461 (25%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP+ R  G+ +Y     ++   +    L            F+ R+E  HVI+++     
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 168

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
                 F++G   P +  +++      GR     H              + P  W   N+
Sbjct: 169 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 212

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
             +A  ++AVP P GG +++G  +I YH+     A+A             + + S  V  
Sbjct: 213 EAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 261

Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
             +D   + +L  D+       G L +L +      DG V ++ L +     + +   +T
Sbjct: 262 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 315

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            + N + F+GSRLGDS LV+    S   G+ +++       G I  D       R+    
Sbjct: 316 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 374

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            +                        T S A ++     G L+    G+ I+  AS    
Sbjct: 375 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +    +R              E    L++S   +T VL   
Sbjct: 402 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE + +    +T   GN+   +++IQ+     R++
Sbjct: 440 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479


>sp|P33194|DDB1_CHLAE DNA damage-binding protein 1 OS=Chlorocebus aethiops GN=DDB1 PE=1
            SV=1
          Length = 1140

 Score = 81.3 bits (199), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081



 Score = 48.9 bits (115), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 95/461 (20%), Positives = 171/461 (37%), Gaps = 116/461 (25%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP+ R  G+ +Y     ++   +    L            F+ R+E  HVI+++     
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 168

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
                 F++G   P +  +++      GR     H              + P  W   N+
Sbjct: 169 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 212

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
             +A  ++AVP P GG +++G  +I YH+     A+A             + + S  V  
Sbjct: 213 EAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 261

Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
             +D   + +L  D+       G L +L +      DG V ++ L +     + +   +T
Sbjct: 262 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 315

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            + N + F+GSRLGDS LV+    S   G+ +++       G I  D       R+    
Sbjct: 316 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 374

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            +                        T S A ++     G L+    G+ I+  AS    
Sbjct: 375 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +    +R              E    L++S   +T VL   
Sbjct: 402 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE + +    +T   GN+   +++IQ+     R++
Sbjct: 440 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479


>sp|Q16531|DDB1_HUMAN DNA damage-binding protein 1 OS=Homo sapiens GN=DDB1 PE=1 SV=1
          Length = 1140

 Score = 81.3 bits (199), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081



 Score = 48.9 bits (115), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 95/461 (20%), Positives = 171/461 (37%), Gaps = 116/461 (25%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP+ R  G+ +Y     ++   +    L            F+ R+E  HVI+++     
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 168

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
                 F++G   P +  +++      GR     H              + P  W   N+
Sbjct: 169 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 212

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
             +A  ++AVP P GG +++G  +I YH+     A+A             + + S  V  
Sbjct: 213 EAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 261

Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
             +D   + +L  D+       G L +L +      DG V ++ L +     + +   +T
Sbjct: 262 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 315

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            + N + F+GSRLGDS LV+    S   G+ +++       G I  D       R+    
Sbjct: 316 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 374

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            +                        T S A ++     G L+    G+ I+  AS    
Sbjct: 375 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +    +R              E    L++S   +T VL   
Sbjct: 402 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE + +    +T   GN+   +++IQ+     R++
Sbjct: 440 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479


>sp|Q3U1J4|DDB1_MOUSE DNA damage-binding protein 1 OS=Mus musculus GN=Ddb1 PE=1 SV=2
          Length = 1140

 Score = 80.9 bits (198), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+ ++   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081



 Score = 50.8 bits (120), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 96/461 (20%), Positives = 173/461 (37%), Gaps = 116/461 (25%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP+ R  G+ +Y     ++   +    L            F+ R+E  HVI+++     
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 168

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
                 F++G   P +  +++      GR     H              + P  W   N+
Sbjct: 169 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 212

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
             +A  ++AVP P GG +++G  +I YH+     A+A             + + S  V  
Sbjct: 213 EAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 261

Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
             +D   + +L  D+       G L +L +      DG V ++ L +     + +   +T
Sbjct: 262 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 315

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            + N + F+GSRLGDS LV+    S   G+ +++       G I  D       R+    
Sbjct: 316 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 374

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            +                        T S A ++     G L+    G+ I+  AS    
Sbjct: 375 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D +            L++S   +T VL   
Sbjct: 402 --------IDLPGIKGLWPL--RSDPGRETDDT------------LVLSFVGQTRVLMLN 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE + +    +T   GN+   +++IQ+     R++
Sbjct: 440 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479


>sp|Q5R649|DDB1_PONAB DNA damage-binding protein 1 OS=Pongo abelii GN=DDB1 PE=2 SV=1
          Length = 1140

 Score = 80.5 bits (197), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  E+   + GR+++F   + +D     V E   KE+KGA+  +   
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYPMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081



 Score = 50.1 bits (118), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 109/550 (19%), Positives = 204/550 (37%), Gaps = 125/550 (22%)

Query: 96  ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
           ++A  L  V    ++G +  + +    G      +D + +      + +LE+  S   + 
Sbjct: 44  VTAEGLRPVKEVGMYGKIAVMELFRPKG----ESKDLLFILTAKYNVCILEYKQSGESID 99

Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
           I +     +   +  + GR S   G +  +DP+ R  G+ +Y     ++   +    L  
Sbjct: 100 IIT----RAHGNVQDRIGRPS-ETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKEL-- 152

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
                     F+ R+E  HVI+++           F++G   P +  +++      GR  
Sbjct: 153 --------KAFNIRLEELHVIDVK-----------FLYGCQAPTICFVYQDP---QGR-- 188

Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
              H              + P  W   N+  +A  ++AVP P GG +++G  +I YH+  
Sbjct: 189 ---HVKTYEVSLREKEFNKGP--WKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGD 243

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSV---ELDAAHATWLQNDVALLSTKTGDLVLLTV- 391
              A+A             + + S  V    +D   + +L  D+       G L +L + 
Sbjct: 244 KYLAIA-----------PPIIKQSTIVCHNRVDPNGSRYLLGDME------GRLFMLLLE 286

Query: 392 ---VYDGRV-VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS---GTS 444
                DG V ++ L +     + +   +T + N + F+GSRLGDS LV+    S   G+ 
Sbjct: 287 KEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSY 346

Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
           +++       G I  D       R+     +                        T S A
Sbjct: 347 VVAMETFTNLGPI-VDMCVVDLERQGQGQLV------------------------TCSGA 381

Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
            ++     G L+    G+ I+  AS            ++LPG KG+W +    +R     
Sbjct: 382 FKE-----GSLRIIRNGIGIHEHAS------------IDLPGIKGLWPLRSDPNR----- 419

Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
                    E    L++S   +T VL       E TE + +    +T   GN+   +++I
Sbjct: 420 ---------ETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNV-AHQQLI 469

Query: 625 QVFERGARIL 634
           Q+     R++
Sbjct: 470 QITSASVRLV 479


>sp|Q6P6Z0|DDB1_XENLA DNA damage-binding protein 1 OS=Xenopus laevis GN=ddb1 PE=2 SV=1
          Length = 1140

 Score = 80.5 bits (197), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 144/333 (43%), Gaps = 42/333 (12%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
            K+  T   +GTA V  ++   + GR+++F       N   L T V  KE+KGA+ ++   
Sbjct: 823  KDPTTYFVVGTAMVYPDEAEPKQGRIVVFQY-----NDGKLQT-VAEKEVKGAVYSMVEF 876

Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
             G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S+ 
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931

Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
             L++K        +A+DF      A E L D + L    ++   N+ +       +   +
Sbjct: 932  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988

Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
             Q L     FH+G  V  F    L +Q L  +S  T  +          +LFGT++G IG
Sbjct: 989  RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSPPTQGS----------VLFGTVNGMIG 1038

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  L E  +  L  +Q +L   +  V  +    +R FH+  K   P     +D +L+  
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIES 1096

Query: 1395 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
            +      +M  +   L+I   +G  R   + +L
Sbjct: 1097 FLDISRPKMQEVIANLQIDDGSGMKRETTVDDL 1129



 Score = 51.2 bits (121), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 97/455 (21%), Positives = 172/455 (37%), Gaps = 110/455 (24%)

Query: 193 GVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFV 252
           G++    +MI L+   G   ++  E        F+ R+E  HVI+++ L         FV
Sbjct: 122 GIIDPDCRMIGLRLYDGLFKVIPLERDNKELKAFNIRLEELHVIDVKFLYSCQAPTICFV 181

Query: 253 HG-----YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           +      +++   V L E+E +                        + P  W   N+  +
Sbjct: 182 YQDPQGRHVKTYEVSLREKEFS------------------------KGP--WKQENVEAE 215

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV---EL 364
           A  ++AVP P GG +++G  +I YH+     A+A             + + S  V    +
Sbjct: 216 ASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCHNRV 264

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDITTIG 419
           D   + +L  D+       G L +L +      DG V ++ L +     + +   +T + 
Sbjct: 265 DVNGSRYLLGDME------GRLFMLLLEKEEQMDGSVTLKDLRVELLGETSIAECLTYLD 318

Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
           N + F+GSRLGDS LV+ T  S        + E F ++                 + DM 
Sbjct: 319 NGVVFVGSRLGDSQLVKLTTESNEQGSYVVVMETFTNL---------------GPIVDMC 363

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
                          +    T S A ++     G L+    G+ I+  AS          
Sbjct: 364 -------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---------- 401

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
             ++LPG KG+W +             R+AA D +    L++S   +T VL       E 
Sbjct: 402 --IDLPGIKGLWPL-------------RVAA-DRDTDDTLVLSFVGQTRVLTLTGEEVEE 445

Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           T+   +    +T   GN+   +++IQ+     R++
Sbjct: 446 TDLAGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479


>sp|Q9ESW0|DDB1_RAT DNA damage-binding protein 1 OS=Rattus norvegicus GN=Ddb1 PE=2 SV=1
          Length = 1140

 Score = 78.2 bits (191), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 125/285 (43%), Gaps = 38/285 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
            K+  T   +GTA V  E+   + GR+++F  S G+        +  V  KE+KGA+ ++ 
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSGGK--------LQTVAEKEVKGAVYSMV 874

Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
               G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S
Sbjct: 875  EFNGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  L++K        +A+DF      A E L D + L    ++   N+ +       +  
Sbjct: 930  VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986

Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             + Q L     FH+G  V  F    L +Q L  +S  T            ++L GT++G 
Sbjct: 987  EERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLLGTVNGM 1036

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
            IG +  L E  +  L  +Q +L   +  V  +    +R FH+  K
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081



 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 94/461 (20%), Positives = 170/461 (36%), Gaps = 116/461 (25%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP+ R  G+ +Y     ++   +    L            F+ R+E  HVI+++     
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 168

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
                 F++G   P +  +++      GR     H              + P  W   N+
Sbjct: 169 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 212

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
             +A  ++AVP P GG +++G  +I YH+     A+A             + + S  V  
Sbjct: 213 EAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 261

Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
             +D   + +L  D+       G L +L +      DG V ++ L +     + +   +T
Sbjct: 262 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 315

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            + N + F+GSRLGDS  V+    S   G+ +++       G I  D       R+    
Sbjct: 316 YLDNGVVFVGSRLGDSQPVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 374

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            +                        T S A ++     G L+    G+ I+  AS    
Sbjct: 375 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +    +R              E    L++S   +T VL   
Sbjct: 402 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE + +    +T   GN+   +++IQ+     R++
Sbjct: 440 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479


>sp|Q805F9|DDB1_CHICK DNA damage-binding protein 1 OS=Gallus gallus GN=DDB1 PE=2 SV=1
          Length = 1140

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)

Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
            K+  T   +GTA V  E+   + GR+++F  S G+        +  +  KE+KGA+ ++ 
Sbjct: 823  KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 874

Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
               G LL +    + L++WT      TE N   + +   LY   L    +FIL+GD+ +S
Sbjct: 875  EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929

Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
            +  L++K        +A+DF      A E L D + L    ++   N+ +       +  
Sbjct: 930  VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986

Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
             + Q L      H+G  V  F    L +Q L  +S  T  +          +LFGT++G 
Sbjct: 987  EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 1036

Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
            IG +  L E  +  L  +Q +L   +  V  +    +R FH+  K   P     +D +L+
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLI 1094

Query: 1393 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
              +      +M  +   L+I   +G  R   + +L
Sbjct: 1095 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129



 Score = 48.9 bits (115), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 77/347 (22%), Positives = 132/347 (38%), Gaps = 85/347 (24%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++AVP P GG +++G  +I YH+     A+A             + + 
Sbjct: 207 WKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQ 255

Query: 359 SFSV---ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSV 410
           S  V    +D   + +L  D+       G L +L +      DG V ++ L +     + 
Sbjct: 256 STIVCHNRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETS 309

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRL 467
           +   +T + N + F+GSRLGDS LV+    S   G+ +++       G I  D       
Sbjct: 310 IAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLE 368

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           R+     +                        T S A ++     G L+    G+ I+  
Sbjct: 369 RQGQGQLV------------------------TCSGAFKE-----GSLRIIRNGIGIHEH 399

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           AS            ++LPG KG+W +   S R              E    L++S   +T
Sbjct: 400 AS------------IDLPGIKGLWPLRSDSHR--------------EMDNMLVLSFVGQT 433

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
            VL       E TE   +    +T   GN+   +++IQ+     R++
Sbjct: 434 RVLMLNGEEVEETELTGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479


>sp|Q9XYZ5|DDB1_DROME DNA damage-binding protein 1 OS=Drosophila melanogaster GN=pic PE=1
            SV=1
          Length = 1140

 Score = 65.1 bits (157), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)

Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
            + EVG +ID HNL  +D        +    +L   +   P    + +  +  ++  T  V
Sbjct: 780  NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVAPETISSLMSAKLGDDPNTYYV 831

Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
            V                 T+ V  E+   + GR+++F    N       +T+V   ++ G
Sbjct: 832  V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869

Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
               AL    G +L   G  + L++WT  +   +       +  + L    +FIL+GD+ +
Sbjct: 870  TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929

Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
            SI  L  K+       +A+D       A E L D + L    S+   N+ +       + 
Sbjct: 930  SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986

Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
              + Q L   A FH+G  V  F    ++  +  +RT    G        +L+GT +G+IG
Sbjct: 987  DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039

Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
             +  + +  +  L  L+++L   +  V  +    +R F  N K      +  +D +L+  
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINSKVE--PSEGFIDGDLIES 1097

Query: 1395 Y 1395
            +
Sbjct: 1098 F 1098



 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>sp|Q52E49|RSE1_MAGO7 Pre-mRNA-splicing factor RSE1 OS=Magnaporthe oryzae (strain 70-15 /
            ATCC MYA-4617 / FGSC 8958) GN=RSE1 PE=3 SV=2
          Length = 1216

 Score = 58.2 bits (139), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 84/369 (22%), Positives = 164/369 (44%), Gaps = 52/369 (14%)

Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
            I + ++E AL++ VV+     +++ E+ L +GT    G+D+    R   F+ G     R 
Sbjct: 879  IDLDNNEAALSMAVVSF---ASQDGESFLVVGT----GKDMVVNPR--RFTEGYIHVYRF 929

Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
            +++ + L   ++  +++   +AL   QG L+   G  + ++     +L   A  +  P  
Sbjct: 930  SEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGIGRMLRIYDLGLRQLLRKAQAEVAPQL 988

Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
            +VSLN   + I++GD+   + ++++K +  +L   A D  +     T  +   ST     
Sbjct: 989  IVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAG--- 1045

Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL-----RLQMLA-------- 1304
            +D+  N+ I     K S+          +E H+  H   +L     RL ++A        
Sbjct: 1046 ADKFGNLWILRCPEKASQESDEPG----SEVHL-VHSRDYLHGTSNRLALMAHVYTQDIP 1100

Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
            TS  +T    G  +     LL+G   G+IG + P    ++  F   QSL++ L    P +
Sbjct: 1101 TSICKTNLVVGGQE----VLLWGGFQGTIGVLIPFVSREDADF--FQSLEQHLRSEDPPL 1154

Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
            AG +   +R  +   K        ++D +L   Y MLP +++  IA +   +  +I   +
Sbjct: 1155 AGRDHLMYRGCYVPVKG-------VIDGDLCERYTMLPNDKKQMIAGELDRSVREIERKI 1207

Query: 1422 NDLALGTSF 1430
            +D+   ++F
Sbjct: 1208 SDIRTRSAF 1216



 Score = 41.6 bits (96), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 62/280 (22%), Positives = 108/280 (38%), Gaps = 70/280 (25%)

Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
           LL T+ GDL  +T+  V D        V+RL +   +   +++++  + +   F+ S  G
Sbjct: 310 LLQTEDGDLFKVTIDMVEDAEGNPTGEVRRLKIKYFDTIPVSNNLCILKSGFLFVASEFG 369

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG-EELSLYGS 489
           + L  QF              E+ GD        + L   SSD   D     E +  Y  
Sbjct: 370 NHLFYQF--------------EKLGD------DDEELEFFSSDFPVDPKEPYEPVYFYPR 409

Query: 490 ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV--- 542
            + N          A+ +S+ ++ PL D         DA    + +G   +S + ++   
Sbjct: 410 PTEN---------LALVESIDSMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRMLKHG 460

Query: 543 ---------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
                    +LPG    +WT   +               DDEY AY+++S    T+VL  
Sbjct: 461 LEVNEIVASQLPGTPSAVWTTKLRR--------------DDEYDAYIVLSFTNGTLVLSI 506

Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            + + EV+++   F+      A    G   ++QV  +G R
Sbjct: 507 GETVEEVSDT--GFLSSVPTLAVQQLGDDGLVQVHPKGIR 544


>sp|Q4PGM6|RSE1_USTMA Pre-mRNA-splicing factor RSE1 OS=Ustilago maydis (strain 521 / FGSC
            9021) GN=RSE1 PE=3 SV=1
          Length = 1221

 Score = 56.2 bits (134), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 88/374 (23%), Positives = 150/374 (40%), Gaps = 54/374 (14%)

Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFST 1134
            QT   + M  +E A ++ VV      + E E +L +G+A     DV    R       +T
Sbjct: 878  QTTHRLEMDDNEAAFSIAVVPF---ASAEKEVMLVVGSAV----DVVLSPRSCKKAYLTT 930

Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
             R  DN + L   ++  E+      L + QG LL   G  + ++     +L       + 
Sbjct: 931  YRLLDNGRELEL-LHKTEVDDIPLVLRAFQGRLLAGIGKALRIYDLGKKKLLRKCENRSF 989

Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----------------FGS 1238
            P  VVSL+   + I++GD+ +SI F S+K    +L   A D                  +
Sbjct: 990  PTAVVSLDAQGSRIVVGDMQESIIFASYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAA 1049

Query: 1239 LDCFATEFL--IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
             D F   ++  +DG+T S  V ++   + I +  P +  +     L+  A F VG  +T 
Sbjct: 1050 ADKFGNIYVLRLDGNT-SRSVDEDPTGMTIVHEKPVLMGAAHKASLV--AHFFVGDIITS 1106

Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLV 1355
              R  M+A                R  LL+  L GSIG + P + +     L +L+  L 
Sbjct: 1107 LHRTAMVA--------------GGREVLLYTGLSGSIGALVPFVSKEDVDTLSTLESHLR 1152

Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
                 + G +  ++R  ++  K       S++D +L   + +L   +Q  IA +      
Sbjct: 1153 QENNSIVGRDHLAYRSSYAPVK-------SVIDGDLCETFGLLSPAKQNAIAGELDRKPG 1205

Query: 1416 QILSNLNDLALGTS 1429
            +I   L  L  G +
Sbjct: 1206 EINKKLAQLREGAT 1219


>sp|Q21554|DDB1_CAEEL DNA damage-binding protein 1 OS=Caenorhabditis elegans GN=ddb-1 PE=1
            SV=2
          Length = 1134

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 56/281 (19%), Positives = 119/281 (42%), Gaps = 17/281 (6%)

Query: 1104 TKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
            T ++ T   +GT  +  ++   + GR+++F      D  ++ +  V+   ++G+  A+  
Sbjct: 814  TNDSSTYYVVGTGLIYPDETETKIGRIVVFEVD---DVERSKLRRVHELVVRGSPLAIRI 870

Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
            L G L+ A    I L +WT  +   +       +  + L ++   + + D+ +S+  LS+
Sbjct: 871  LNGKLVAAINSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSY 930

Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
            +        +AKD+ S      EF+   S L          +++    P   +   G+ +
Sbjct: 931  RMLEGNFEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDD---GRYV 987

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLD 1340
            L    +     + K +    L    +        D   +++  ++FGT  G+IG I  +D
Sbjct: 988  LEPTGYWYLGELPKVMTRSTLVIQPE--------DSIIQYSQPIMFGTNQGTIGMIVQID 1039

Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
            +   + L +++K + DSV +   +   S+R F    +A  P
Sbjct: 1040 DKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQKRAEPP 1080



 Score = 47.8 bits (112), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 100/396 (25%), Positives = 166/396 (41%), Gaps = 96/396 (24%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           D+  L+ VP  IGGV+V+G+N++ Y        +    Y  SL     L  ++F+    +
Sbjct: 210 DSSVLIPVPHAIGGVIVLGSNSVLYKPNDNLGEVV--PYTCSL-----LENTTFTCHGIV 262

Query: 365 DAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
           DA+   +L      LS   G L++L    T    G  V+ + +     + +   I  I N
Sbjct: 263 DASGERFL------LSDTDGRLLMLLLNVTESQSGYTVKEMRIDYLGETSIADSINYIDN 316

Query: 421 SLFFLGSRLGDSLLVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
            + F+GSRLGDS L++  T  +G S   S + E + +I                 ++DMV
Sbjct: 317 GVVFVGSRLGDSQLIRLMTEPNGGSY--SVILETYSNI---------------GPIRDMV 359

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
             E         ++ +    T + A +D     G L+    G+ I+  AS          
Sbjct: 360 MVE---------SDGQPQLVTCTGADKD-----GSLRVIRNGIGIDELAS---------- 395

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
             V+L G  GI+ +   S    NAD+            Y+I+SL   T VL+      E 
Sbjct: 396 --VDLAGVVGIFPIRLDS----NADN------------YVIVSLSDETHVLQITGEELED 437

Query: 600 TESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
            + ++      TI A  LFG      ++Q  E+  R++  S +++   + P+N E  S  
Sbjct: 438 VKLLEINTDLPTIFASTLFGPNDSGIILQATEKQIRLMSSSGLSK--FWEPTNGEIISK- 494

Query: 657 ENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
                +SV+ A+  ++L   D ++ LL     TC V
Sbjct: 495 -----VSVNAANGQIVLAARD-TVYLL-----TCIV 519


>sp|Q7RYR4|RSE1_NEUCR Pre-mRNA-splicing factor rse-1 OS=Neurospora crassa (strain ATCC
            24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
            GN=rse-1 PE=3 SV=2
          Length = 1209

 Score = 55.5 bits (132), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 89/392 (22%), Positives = 168/392 (42%), Gaps = 62/392 (15%)

Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQG 1120
            RA G W +  +I    SE    ++ + L N             ++E E+ L +GT    G
Sbjct: 847  RAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEGESFLVVGT----G 902

Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
            +D+    R   F+ G     R  ++ ++L   ++   ++    AL   QG LL   G  +
Sbjct: 903  KDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPLALIPFQGRLLAGVGKTL 959

Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
             ++     +L   A  D  P  +VSL    N I++GD+ + I ++ +K +G +L   A D
Sbjct: 960  RIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGITYVVYKAEGNRLIPFADD 1019

Query: 1236 FGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
              +L+ + T   ++D    S+   D+  NI I     ++S+          +E H+  H 
Sbjct: 1020 --TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPERVSQETDEPG----SEIHL-MHA 1070

Query: 1295 TKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
              +L     RL        Q L TS  +T    G        LL+  L G++G   P   
Sbjct: 1071 RNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVS 1126

Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
             +++ F   Q+L+  +    P +AG +   +R +++  K        ++D +L   + +L
Sbjct: 1127 REDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLL 1177

Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            P +++  IA +   +  +I   ++D+   ++F
Sbjct: 1178 PNDKKQMIAGELDRSVREIERKISDIRTRSAF 1209



 Score = 37.0 bits (84), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 34/139 (24%), Positives = 65/139 (46%), Gaps = 17/139 (12%)

Query: 543 ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
           ELPG    +WT             +++  YD +Y AY+++S    T+VL   + + EV++
Sbjct: 470 ELPGTPSAVWT-------------TKLTKYD-QYDAYIVLSFTNGTLVLSIGETVEEVSD 515

Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 661
           S  +     T+A   + G   +IQV  +G R +    + +  +    +  + + +EN  V
Sbjct: 516 S-GFLTTAPTLAVQQM-GEDGLIQVHPKGIRHIVQGRVNEWPAPQHRSIVAATANENQVV 573

Query: 662 LSVSIADPYVLLGMSDGSI 680
           +++S  +       SDGS+
Sbjct: 574 IALSSGEIVYFEMDSDGSL 592


>sp|O13807|DDB1_SCHPO DNA damage-binding protein 1 OS=Schizosaccharomyces pombe (strain
           972 / ATCC 24843) GN=ddb1 PE=1 SV=1
          Length = 1072

 Score = 54.3 bits (129), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 95/492 (19%), Positives = 190/492 (38%), Gaps = 97/492 (19%)

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
           RES   GPL+ VDP  R   + VY   + I+   +     +   +       FS RI+  
Sbjct: 111 RES-QSGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQEL 169

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V+           D   ++    P + +L++   +     ++K              ++
Sbjct: 170 NVV-----------DIAMLYNSSRPSLAVLYKDSKSIVHLSTYK------------INVR 206

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
           +  +    + + HD  +   +PS  GGV V G   ++Y S+    +  L  Y        
Sbjct: 207 EQEIDEDDV-VCHDIEEGKLIPSENGGVFVFGEMYVYYISKDIQVSKLLLTY-------- 257

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
             P ++FS  +     T L + + +++ ++G L     ++    V  ++L K   S + S
Sbjct: 258 --PITAFSPSISNDPETGLDSSIYIVADESGMLYKFKALFTDETVS-MELEKLGESSIAS 314

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +  + ++  F+GS   +S+L+Q                         PS  +      +
Sbjct: 315 CLIALPDNHLFVGSHFNNSVLLQL------------------------PSITK-NNHKLE 349

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            LQ+ VN   +S +    + T S+  T S A +D     G L+     + I         
Sbjct: 350 ILQNFVNIAPISDFIIDDDQTGSSIITCSGAYKD-----GTLRIIRNSINI--------- 395

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
               N  L+E+ G K  ++V            S  A YD+  + +L +  E R +++   
Sbjct: 396 ---ENVALIEMEGIKDFFSV------------SFRANYDN--YIFLSLICETRAIIVSPE 438

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
            +    + + D   +  TI    ++G  +++Q+  +  R+ DG  +   +S  P +   G
Sbjct: 439 GVF---SANHDLSCEESTIFVSTIYGNSQILQITTKEIRLFDGKKLHSWIS--PMSITCG 493

Query: 654 SGSENSTVLSVS 665
           S   ++  ++V+
Sbjct: 494 SSFADNVCVAVA 505



 Score = 42.0 bits (97), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 64/290 (22%), Positives = 124/290 (42%), Gaps = 38/290 (13%)

Query: 1111 LAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
            + +GT +    +D    GR+++F     +DN   +  E    +++G+++ L  L  HL++
Sbjct: 770  VVVGTGFNFPDQDAPDSGRLMVFEM--TSDNNIEMQAE---HKVQGSVNTLV-LYKHLIV 823

Query: 1170 A--SGPKIILHKWTGTE--LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
            A  +    I     GT    N I      P Y + +++ ++ I+  D+ KSI  L + + 
Sbjct: 824  AGINASVCIFEYEHGTMHVRNSIR----TPTYTIDISVNQDEIIAADLMKSITVLQFIDD 879

Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY---YAPKMSESWKGQKL 1282
              QL  +A+D+  L   + E L   S     V++   N  I      +P++S+    +KL
Sbjct: 880  --QLIEVARDYHPLWATSVEIL---SERKYFVTEADGNAVILLRDNVSPQLSDR---KKL 931

Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
                +F++G  + K  R        D++   P         LL  T+DGS+  +      
Sbjct: 932  RWYKKFYLGELINK-TRHCTFIEPQDKSLVTP--------QLLCATVDGSLMIVGDAGMS 982

Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
                L  LQ  +   +P   GL+ + ++++    +     P  ++D  L+
Sbjct: 983  NTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET---SPSDLIDGSLI 1029


>sp|Q54SA7|SF3B3_DICDI Probable splicing factor 3B subunit 3 OS=Dictyostelium discoideum
            GN=sf3b3 PE=3 SV=1
          Length = 1256

 Score = 54.3 bits (129), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 54/289 (18%), Positives = 127/289 (43%), Gaps = 29/289 (10%)

Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
            +Y  E++  + A+A  QG L+   G  I ++     +L         P  +V+++ + + 
Sbjct: 979  LYKTEVEEPVYAMAQFQGKLVCGVGKSIRIYDMGKKKLLRKCETKNLPNTIVNIHSLGDR 1038

Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
            +++GDI +SI+F+ +K     L + A D        +  ++D  T++   +D+  NI + 
Sbjct: 1039 LVVGDIQESIHFIKYKRSENMLYVFADDLAP-RWMTSSVMLDYDTVA--GADKFGNIFVL 1095

Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGA---------HVTKFLRLQMLATSSDRTGAAPGSDK 1318
                 +S+  +     ++ +F  G          H+  F     + T +  +    G + 
Sbjct: 1096 RLPLLISDEVEEDPTGTKLKFESGTLNGAPHKLDHIANFFVGDTVTTLNKTSLVVGGPE- 1154

Query: 1319 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
                 +L+ T+ G+IG + P    +++ F    +L+  +      + G +  ++R ++  
Sbjct: 1155 ----VILYTTISGAIGALIPFTSREDVDF--FSTLEMNMRSDCLPLCGRDHLAYRSYYFP 1208

Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
             K       +I+D +L   +  L  ++QL I+ +   + S+++  L ++
Sbjct: 1209 VK-------NIIDGDLCEQFSTLNYQKQLSISEELSRSPSEVIKKLEEI 1250



 Score = 43.9 bits (102), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 124/338 (36%), Gaps = 75/338 (22%)

Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQN 374
           GGVLV   + I Y +Q  +            +    +PR   S      L  +H++  Q 
Sbjct: 256 GGVLVASEDYIVYRNQDHA------------EVRSRIPRRYGSDPNKGVLIISHSSHKQK 303

Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
            +   L+ ++ GDL  +T+ Y G  V  ++++  +  VL + +T + N   F  S  GD 
Sbjct: 304 GMFFFLVQSEHGDLYKITLDYQGDQVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDH 363

Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL----RRSSSDALQDMVNGEELSLYG 488
            L  F         S G +EE G  +        L    R S    ++++ N E  S   
Sbjct: 364 TLYFFK--------SIGDEEEEGQAKRLEDKDGHLWFTPRNSCGTKMEELKNLEPTSHLS 415

Query: 489 SASNNTESAQKTFSFAVRDSLVNIGP-------------LKDFSYGLRINADASATGISK 535
           S S           F V D +    P             LK   +GL +    +A     
Sbjct: 416 SLS-------PIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTAN---- 464

Query: 536 QSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                   LPG   GIWTV   +S   NA         D+   Y+++S    T VL   D
Sbjct: 465 --------LPGVPSGIWTVPKSTS--PNA--------IDQTDKYIVVSFVGTTSVLSVGD 506

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            + E  ES    ++  T       G   +IQVF  G R
Sbjct: 507 TIQENHES--GILETTTTLLVKSMGDDAIIQVFPTGFR 542


>sp|P0CR22|RSE1_CRYNJ Pre-mRNA-splicing factor RSE1 OS=Cryptococcus neoformans var.
            neoformans serotype D (strain JEC21 / ATCC MYA-565)
            GN=RSE1 PE=3 SV=1
          Length = 1217

 Score = 50.8 bits (120), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 32/285 (11%)

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            LA  QG LL   G  + L++     L      +  P  VV++N+    I++GD+ +S ++
Sbjct: 951  LAGFQGFLLAGIGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1010

Query: 1220 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
              ++     QL + A D     + C  +   +D  T++    D+  NI I    P +SE 
Sbjct: 1011 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1065

Query: 1277 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                  G  +L    F +GA H T+ +    +  + TS  +     G     R  L++ T
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1121

Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
            + G++G + P    D++ F  + +L+  +      + G +  ++R ++   K        
Sbjct: 1122 ISGAVGALVPFVSSDDIEF--MSTLEMHMRTQDISLVGRDHIAYRGYYVPIKG------- 1172

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            +VD +L   + +LP  +Q  IA     +   +L  L  +   ++F
Sbjct: 1173 VVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQMRTSSAF 1217



 Score = 35.4 bits (80), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 57/267 (21%), Positives = 102/267 (38%), Gaps = 47/267 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  + + ++G  V  L +   +   + + +  +     ++ S   D  L QF
Sbjct: 313 LLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCILKRGYIYVASEFSDQNLYQF 372

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAP----STKRLRR----SSSDALQDMVNGEELSL 486
                  G    SS    E G+I+   P      + LR      +  +L  + +   ++L
Sbjct: 373 QSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLLLVDTVPSLDPITDAHVVNL 432

Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
            G AS++T           R +       +   +GL +    S+             LPG
Sbjct: 433 LG-ASSDTPQIYAACGRGARSTF------RTLKHGLDVAEMVSS------------PLPG 473

Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
               +WT+                  DDEY +Y+++S    T+VL   + + EV ++  +
Sbjct: 474 VPTNVWTL--------------KLTEDDEYDSYIVLSFPNGTLVLSIGETIEEVNDT-GF 518

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGAR 632
              G T+A   L G   ++QV   G R
Sbjct: 519 LSSGPTLAVQQL-GNAGLLQVHPYGLR 544


>sp|P0CR23|RSE1_CRYNB Pre-mRNA-splicing factor RSE1 OS=Cryptococcus neoformans var.
            neoformans serotype D (strain B-3501A) GN=RSE1 PE=3 SV=1
          Length = 1217

 Score = 50.8 bits (120), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 32/285 (11%)

Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
            LA  QG LL   G  + L++     L      +  P  VV++N+    I++GD+ +S ++
Sbjct: 951  LAGFQGFLLAGIGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1010

Query: 1220 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
              ++     QL + A D     + C  +   +D  T++    D+  NI I    P +SE 
Sbjct: 1011 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1065

Query: 1277 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
                  G  +L    F +GA H T+ +    +  + TS  +     G     R  L++ T
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1121

Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
            + G++G + P    D++ F  + +L+  +      + G +  ++R ++   K        
Sbjct: 1122 ISGAVGALVPFVSSDDIEF--MSTLEMHMRTQDISLVGRDHIAYRGYYVPIKG------- 1172

Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
            +VD +L   + +LP  +Q  IA     +   +L  L  +   ++F
Sbjct: 1173 VVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQMRTSSAF 1217



 Score = 35.4 bits (80), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 57/267 (21%), Positives = 102/267 (38%), Gaps = 47/267 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  + + ++G  V  L +   +   + + +  +     ++ S   D  L QF
Sbjct: 313 LLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCILKRGYIYVASEFSDQNLYQF 372

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAP----STKRLRR----SSSDALQDMVNGEELSL 486
                  G    SS    E G+I+   P      + LR      +  +L  + +   ++L
Sbjct: 373 QSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLLLVDTVPSLDPITDAHVVNL 432

Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
            G AS++T           R +       +   +GL +    S+             LPG
Sbjct: 433 LG-ASSDTPQIYAACGRGARSTF------RTLKHGLDVAEMVSS------------PLPG 473

Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
               +WT+                  DDEY +Y+++S    T+VL   + + EV ++  +
Sbjct: 474 VPTNVWTL--------------KLTEDDEYDSYIVLSFPNGTLVLSIGETIEEVNDT-GF 518

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGAR 632
              G T+A   L G   ++QV   G R
Sbjct: 519 LSSGPTLAVQQL-GNAGLLQVHPYGLR 544


>sp|Q4WLI5|RSE1_ASPFU Pre-mRNA-splicing factor rse1 OS=Neosartorya fumigata (strain ATCC
            MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=rse1 PE=3
            SV=1
          Length = 1225

 Score = 48.9 bits (115), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 98/462 (21%), Positives = 197/462 (42%), Gaps = 48/462 (10%)

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
             C  G + + +Q +    ++ S    DN    + IPL  TP ++    E+ L+ +I S  
Sbjct: 759  QCVEGMVGIQAQNL----RIFSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDN 814

Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
             VL P  +  + LI+       + + L   D         +   ++I++P  A       
Sbjct: 815  NVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---VI 869

Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
            +TI ++ +E A+++  V     +++++ET L +GTA   +     +A G + ++   R  
Sbjct: 870  STIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RFQ 923

Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
            ++ + L   ++  +++    AL   QG LL   G  + ++     +L            +
Sbjct: 924  EDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRIYDLGMKQLLRKCQAQVVSKTI 982

Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
            V L    + I++ D+ +S+ ++ +K Q   L     D  S    +T  ++D  T++    
Sbjct: 983  VGLQTQGSRIVVSDVRESVTYVVYKYQDNILIPFVDDSVSRWTTSTT-MVDYETVA--GG 1039

Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDRT 1310
            D+  N+ +     K SE       G  L+    +  GA     L +    Q + TS  +T
Sbjct: 1040 DKFGNLWLVRCPKKASEEADEDGSGAHLIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKT 1099

Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
                G     R  L++    G+IG + P    +++ F   Q+L+ +L    P +AG +  
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGRDHL 1153

Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
             +R +++  K        ++D +L   Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188



 Score = 35.4 bits (80), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 2/60 (3%)

Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           DE+ AY+I+S    T+VL   + + EVT++  +     T+A   L G   +IQV  RG R
Sbjct: 484 DEFDAYIILSFANGTLVLSIGETVEEVTDT-GFLSTAPTLAVQQL-GEDSLIQVHPRGIR 541


>sp|B0M0P5|DDB1_DICDI DNA damage-binding protein 1 OS=Dictyostelium discoideum GN=repE PE=1
            SV=1
          Length = 1181

 Score = 48.1 bits (113), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 57/270 (21%), Positives = 115/270 (42%), Gaps = 25/270 (9%)

Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----N 1206
            + + ++  L S  G L+ A   ++   ++T ++        +  ++     I+K     +
Sbjct: 913  KFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVISSESVHKGHTMILKLASRGH 972

Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
            FIL+GD+ KS+  L  +  G+ L  +A++   +   +   + D      + ++   N  +
Sbjct: 973  FILVGDMMKSMSLLVEQSDGS-LEQIARNPQPIWIRSVAMINDDY---FIGAEASNNFIV 1028

Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA---PGSDKTNRFA 1323
                   +   + + L S   +H+G  +           +S R G+    P SD+     
Sbjct: 1029 VKKNNDSTNELERELLDSVGHYHIGESI-----------NSMRHGSLVRLPDSDQPIIPT 1077

Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
            +L+ +++GSIG +A + E  F     LQK L   V  V G +  ++R F ++   H    
Sbjct: 1078 ILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAFSND--HHTIDS 1135

Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
             + +D +L+  +  L  E QL+     G T
Sbjct: 1136 KNFIDGDLIETFLDLKYESQLKAVADLGIT 1165



 Score = 40.8 bits (94), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V N+R L+   V D  F++G   P + +L +           + H       S  T L 
Sbjct: 192 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 243

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             P  WS  N+    Y  L VP P+GGVLVV  N I Y +   + ++A+ +Y   L  ++
Sbjct: 244 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 298

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
                   V+ D +          L     G L +L +++  + V  L   +     + S
Sbjct: 299 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 342

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            I+ + + + ++GS  GDS L++ 
Sbjct: 343 SISYLDSGVVYIGSSSGDSQLIRL 366


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.318    0.134    0.393 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 521,901,764
Number of Sequences: 539616
Number of extensions: 22279362
Number of successful extensions: 51710
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 42
Number of HSP's that attempted gapping in prelim test: 51273
Number of HSP's gapped (non-prelim): 220
length of query: 1431
length of database: 191,569,459
effective HSP length: 130
effective length of query: 1301
effective length of database: 121,419,379
effective search space: 157966612079
effective search space used: 157966612079
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 68 (30.8 bits)