BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000548
(1431 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9FGR0|CPSF1_ARATH Cleavage and polyadenylation specificity factor subunit 1
OS=Arabidopsis thaliana GN=CPSF160 PE=1 SV=2
Length = 1442
Score = 2249 bits (5829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1092/1460 (74%), Positives = 1254/1460 (85%), Gaps = 47/1460 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR QEEG ++E +N KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +S D QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYELV ELPGCKGIWTVYHKSSRGHNADSS+MAA
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600 EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660 RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DGGP DQGD
Sbjct: 720 VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D + E E E+N +SE+ T
Sbjct: 780 IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836 S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S+ ++++ +S+LRNL+F R PLD TRE T G QRIT+FKNISGHQGFFLSGS
Sbjct: 892 SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
YWPVQKIPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+DNHN+SS
Sbjct: 1012 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSS 1071
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T ENETLL
Sbjct: 1072 DDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLL 1131
Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
A+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+S
Sbjct: 1132 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1191
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
GPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQG+QL+L
Sbjct: 1192 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSL 1251
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
LAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLSRAEFHVG
Sbjct: 1252 LAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVG 1311
Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
AHV+KFLRLQM+++ G+DK NRFALLFGTLDGS GCIAPLDE+TFRRLQSLQ
Sbjct: 1312 AHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQ 1362
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
KKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQLE+AHQ G
Sbjct: 1363 KKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIG 1422
Query: 1412 TTRSQILSNLNDLALGTSFL 1431
TTR IL +L DL++GTSFL
Sbjct: 1423 TTRYSILKDLVDLSVGTSFL 1442
>sp|Q7XWP1|CPSF1_ORYSJ Probable cleavage and polyadenylation specificity factor subunit 1
OS=Oryza sativa subsp. japonica GN=Os04g0252200 PE=3 SV=2
Length = 1441
Score = 1873 bits (4852), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 943/1472 (64%), Positives = 1121/1472 (76%), Gaps = 72/1472 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
MS+AAYKMMHWPTG+ +C +GF+THS +D ++DS + R +G
Sbjct: 1 MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60
Query: 54 PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
P PNLVV AANV+E+Y VR + G++ S +SG ++DGIS A LELVC+Y
Sbjct: 61 PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RLHGN+ES+ +LS G A+N RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
HLKRGRESFA GP++K DP GRCG L YGLQMIILKA+Q G LVG+++ + +
Sbjct: 173 QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+ KHHTCMISA S
Sbjct: 233 VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++
Sbjct: 293 ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+
Sbjct: 353 HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S +T+IGNS FFLGSRLGDSLLVQF+ + S+L E DIE D P +KRL
Sbjct: 413 ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472
Query: 468 RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
+R SD LQD+ + EELS A N+ ESAQK S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473 KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531
Query: 527 DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
D +A G +KQSNYEL VELP C+GIWTVY+KS RG
Sbjct: 532 DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
A+ D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ TIAAGNLFGR
Sbjct: 592 QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
RRVIQV+ +GAR+LDGS+MTQ+L+F +++ S SE V SIADPYVLL M DGS+
Sbjct: 645 RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LL+GD TCT+SV P+ SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704 QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
G DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG +VD + + +DS
Sbjct: 764 GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E ++ +KE S+++VELAM RWS SRPFLF +L DGT+LCY A+ +E
Sbjct: 824 KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
E+ K P+S S N S SRLRNLRF R +D +RE+ P G P RIT F N
Sbjct: 881 SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938 VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997
Query: 980 ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
ICQLPS YD+YWPVQK+PL TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQE
Sbjct: 998 ICQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQE 1057
Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
H +DN S+ LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ ENALTVR+VTL
Sbjct: 1058 SVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVTL 1117
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1159
NTTTKENETLLAIGTAYV GEDVAARGRVLLFS + ++N QNLVTEVYSKE KGA+SA
Sbjct: 1118 HNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVSA 1176
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
+ASLQGHLLIASGPKI L+KWTG EL +AFYDA PL+VVSLNIVKNF+L GDIHKSIYF
Sbjct: 1177 VASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIYF 1235
Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
LSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD KN+QIFYYAPKM ESWKG
Sbjct: 1236 LSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKG 1295
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
QKLLSRAEFHVGAH+TKFLRLQML T S+KTNRFALLFG LDG IGCIAP+
Sbjct: 1296 QKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTNRFALLFGNLDGGIGCIAPI 1349
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL YEML
Sbjct: 1350 DELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLCSYEMLS 1409
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1410 LDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 1441
>sp|Q9V726|CPSF1_DROME Cleavage and polyadenylation specificity factor subunit 1
OS=Drosophila melanogaster GN=Cpsf160 PE=1 SV=1
Length = 1455
Score = 516 bits (1329), Expect = e-145, Method: Compositional matrix adjust.
Identities = 420/1486 (28%), Positives = 698/1486 (46%), Gaps = 190/1486 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 649 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ +G +
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 828
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 829 DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R + N+ + ++ D E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
+GF+Y + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y E +Y LI
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1054
Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+ A
Sbjct: 1055 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1103
Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1104 SITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1163
Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1164 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-N 1222
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1223 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
+V+D ++NI ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1283 LVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1341
Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S+
Sbjct: 1342 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSS 1399
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
K I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 1400 KKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445
>sp|Q10569|CPSF1_BOVIN Cleavage and polyadenylation specificity factor subunit 1 OS=Bos
taurus GN=CPSF1 PE=1 SV=1
Length = 1444
Score = 338 bits (868), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 217/683 (31%), Positives = 343/683 (50%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 785 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 841 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 891 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 950
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 951 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1010
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P +V + G + + +
Sbjct: 1011 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIER 1063
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
+ + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1064 DERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1119
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1120 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1179
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1180 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1238
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1239 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1298
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1299 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1351
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1352 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYL 1411
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1412 STMERGELAKKIGTTPDIILDDL 1434
Score = 304 bits (779), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 221/678 (32%), Positives = 345/678 (50%), Gaps = 86/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 309
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 369
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T S+ E D E KR+
Sbjct: 370 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRV 427
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 428 DATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
L I + + + + K ++V ELPGC +WTV
Sbjct: 484 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 543
Query: 557 SSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 544 VRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFA 602
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + S ++ ++
Sbjct: 603 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAV 652
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 653 ADPYVVIMSAEGHVTMFL 670
>sp|Q9EPU4|CPSF1_MOUSE Cleavage and polyadenylation specificity factor subunit 1 OS=Mus
musculus GN=Cpsf1 PE=1 SV=1
Length = 1441
Score = 338 bits (868), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 218/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1060
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1061 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1116
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1117 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1176
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1177 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1235
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1236 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1295
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1296 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1348
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1349 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1408
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1409 STMERSELAKKIGTTPDIILDDL 1431
Score = 307 bits (787), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>sp|Q10570|CPSF1_HUMAN Cleavage and polyadenylation specificity factor subunit 1 OS=Homo
sapiens GN=CPSF1 PE=1 SV=2
Length = 1443
Score = 334 bits (856), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 215/685 (31%), Positives = 340/685 (49%), Gaps = 51/685 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 784 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 840 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 890 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 950 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1010 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1060
Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1061 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1116
Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1117 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1176
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1177 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1235
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +
Sbjct: 1236 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1295
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
LL RA+FHVGAHV F R + GA G K N+ F TLDG IG +
Sbjct: 1296 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1348
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1349 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1408
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
L E+ E+A + GTT IL +L
Sbjct: 1409 YLSTMERSELAKKIGTTPDIILDDL 1433
Score = 305 bits (780), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S DD+ H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669
>sp|Q7SEY2|CFT1_NEUCR Protein cft-1 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A /
CBS 708.71 / DSM 1257 / FGSC 987) GN=cft-1 PE=3 SV=2
Length = 1456
Score = 294 bits (752), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 345/1418 (24%), Positives = 594/1418 (41%), Gaps = 189/1418 (13%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + ++ + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ ++ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+ +
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSVFI 742
Query: 683 LVGDPSTCTVSVQTPA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG 741
P + I +S K ++ C LY D ++ + VG+
Sbjct: 743 AEMSPKLLELEEVEKEDQILTSTKWLAGC-LYTD----------TSGVFADETVGKGT-- 789
Query: 742 ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
+ +I + SG L I+ +P+ V + +S Y+ L
Sbjct: 790 -------KDNILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPPGLS---- 830
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
+ ++ +GT KE++ + V +L H P+L + + YQ Y +
Sbjct: 831 -ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-- 880
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIF 917
+ + P S S + ++ N F++ P + ++ PH A P +R +
Sbjct: 881 ---ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS-- 928
Query: 918 KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
NISG+ FL GS P + + + L + A + H C HGFIY + GI
Sbjct: 929 -NISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGI 987
Query: 978 LKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
++ Q+P+ S+Y V+KIP+ + Y Y +V ++P L
Sbjct: 988 ARVTQIPTDSSYAELGLSVKKIPIGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPK 1041
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
D + + N++ + V+ +++L +G W T+ M+ E L V
Sbjct: 1042 DDDYHKEWARENITFKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVET 1092
Query: 1097 VTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK- 1154
+ L + +T E + L+A+GTA ++GED+ RGRV +F P T SK+LK
Sbjct: 1093 LNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKL 1149
Query: 1155 --------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLN 1202
GA++AL+ + QG +L+A G K ++ K GT L +AF D YV S+
Sbjct: 1150 VAKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVK 1207
Query: 1203 IVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
+ L+ D K ++F + E+ ++ L K ++ +FL DG L +V SD
Sbjct: 1208 ELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDA 1267
Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
+I I + P+ +S +G LL R F+ GAH L + A + + + S++ +
Sbjct: 1268 DGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENS 1326
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA-- 1378
LL + G + + PL E +RRL SL +L + +PH AGLNP+ +R + A
Sbjct: 1327 PHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASM 1386
Query: 1379 HRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTG 1411
PG D+ IVD ++L + L ++ E+A + G
Sbjct: 1387 QLPGVDAGIGRNIVDGKILERFLELGTGKRQEMAGRAG 1424
>sp|O74733|CFT1_SCHPO Protein cft1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
GN=cft1 PE=3 SV=1
Length = 1441
Score = 291 bits (745), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 336/1443 (23%), Positives = 598/1443 (41%), Gaps = 205/1443 (14%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV ++ G + ++ L G++ D +I+ + AK+S LE+D S+H
Sbjct: 92 LRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTNSLH 148
Query: 161 CFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
+E +K + P + VDP C +L + M+ + L +E
Sbjct: 149 YYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDMEEAA 202
Query: 220 F-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
S S + S V+ LD + + D F++GY EP + IL+ E T +
Sbjct: 203 IENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVTLPL 262
Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-HSQS 335
+ T + S +++ + +I + +LP+D Y +++P+P+GG L++G N + Y S
Sbjct: 263 RKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSAG 322
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDLVLLT 390
+ + +N+Y +S F++EL+ A L + +L +G L
Sbjct: 323 RTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFFYLD 382
Query: 391 VVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DG+ V+ L L + N L S IT G +L FLGS+ DS L++++
Sbjct: 383 FLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWS----- 437
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
EE E D L ++ + DM++ E +
Sbjct: 438 ---RRTTNEEVRLDEGD----DTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLRL 490
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY---ELV------------------ 542
+ D L NIGP+ DF+ G A + Q N+ ELV
Sbjct: 491 EICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAGADGAGGLVVFRRNI 545
Query: 543 --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETA 593
+ GC+ +WTV S + N S A Y + E YL++S E + +
Sbjct: 546 FPLIAGEFQFDGCEALWTV-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIFLAG 604
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSES 652
+ EV S D+ +T+ G+L R++Q+ R+ D + +TQ +F
Sbjct: 605 ETFDEVQHS-DFSKDSKTLNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNF------- 656
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSI----------RLLVGDPSTCTVSVQTPAAIES 702
S+ V+S SI DP +++ G I RL+ D V+T A++ S
Sbjct: 657 ---SKKQIVVSTSICDPCIIVVFLGGGIALYKMDLKSQRLIKTDLQNRLSDVKT-ASLVS 712
Query: 703 SKKPVSSCTLY----------------HDKGPEPWL-----RKTSTDAWLSTGVGEAIDG 741
L+ +D E L KTS + + G +++
Sbjct: 713 PDSSALFAKLFTYNETLNAKGQIANGMNDSASETDLDIQPNHKTSNNDQM--GYDQSV-S 769
Query: 742 ADGGP--------------LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
AD P LDQ + + G L+++++ +F+ + D F
Sbjct: 770 ADDVPEVDNTIVTEKNVSNLDQESLEKHPILFALTDEGKLKVYNLADFSLLMECDVFDLP 829
Query: 784 RTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
T + ++ T N S S ++VEL + P LF
Sbjct: 830 PT------LFNGMESERTYFNKES-------------SQELVELLVADLGDDFKEPHLFL 870
Query: 844 ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
I Y+A+L+ NT K + ++ ++ V + +R TP DA +
Sbjct: 871 RSRLNEITVYKAFLYS---NTDKHKNLLAFAK---VPQETMTREFQANVG-TPRDAESTM 923
Query: 904 ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
E + ++T + + H F++G +P + + P + I++
Sbjct: 924 EKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSVAPF 983
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIV 1020
H + G+IYV ++IC+ YDN WP +K+ L + I Y K +Y +
Sbjct: 984 HAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKKVSLGKQINGIAYHPTKMVYAVGS 1043
Query: 1021 SVPVLKPL-----NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG 1075
+VP+ + N+ ++ D + + N S+DL T
Sbjct: 1044 AVPIEFKVTDEDGNEPYAITDDNDY---LPMANTGSLDLVSPLT---------------- 1084
Query: 1076 PWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W + Q E L+V +V L + TTK + +A+GT+ +GED+A RG LF
Sbjct: 1085 -WTVIDSYEFQQFEIPLSVALVNLEVSETTKLRKPYIAVGTSITKGEDIAVRGSTYLFEI 1143
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGI 1188
P ++ + V +E+KG ++ + + G+LL G K+I+ + L G+
Sbjct: 1144 IDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLSGQGQKVIVRALEDEDHLVGV 1203
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
+F D Y +S ++N +L GD+ +++ F+ + E+ ++ L +K +L+ A +FL+
Sbjct: 1204 SFIDLGS-YTLSAKCLRNLLLFGDVRQNVTFVGFAEEPYRMTLFSKGQEALNVSAADFLV 1262
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
G L VV+D N+++ Y P+ ES G++L++R +FH+G +T + +L
Sbjct: 1263 QGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDFHIGNVITA---MTILPKEKK 1319
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
A G D + F+ + DG + + P+ + +RRL +Q L + V + GLNP+S
Sbjct: 1320 HQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLNIIQNYLANRVNTIGGLNPKS 1379
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
+R S P I+D L+ ++ + + + E+AH+ G S I+++L +L
Sbjct: 1380 YRLITSPSNLTNPT-RRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDEAL 1438
Query: 1429 SFL 1431
S++
Sbjct: 1439 SYM 1441
>sp|Q2TZ19|CFT1_ASPOR Protein cft1 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
GN=cft1 PE=3 SV=1
Length = 1393
Score = 290 bits (741), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 337/1401 (24%), Positives = 590/1401 (42%), Gaps = 208/1401 (14%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++I+LAF +AK++++E+D +G+ S+H +E + + + G ++ VDP R
Sbjct: 88 EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
C V +G++ + IL Q G LV D+ + GS G A
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
SS V+ L LD + H F++ Y EP IL+ + T + + + +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + + S LP D +K++A+P P+GG L++G+N +H + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S +S ++ L+ L N LL TG++VL+ DGR V + +
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386
Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
P S +G+ FLGS DS+L+ G S+ SSG K+ +
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
E D+ +S D +D + E+ + G + ++F D L+NI
Sbjct: 441 RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
GPLKD ++G + S ELV +L
Sbjct: 501 GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560
Query: 545 PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
C +WT +S H ++ + A + E H Y+++S +A + E +++ +
Sbjct: 561 ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613
Query: 602 SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
+ F + TI G L G+ RV+Q+ R DG DL P E
Sbjct: 614 ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
S E +S S+ DPYV + D ++ LL D S V+ I +SK +SC
Sbjct: 669 DTS--EERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
LY DK TG+ +I A L Q + + + L I+ +P+
Sbjct: 725 LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767
Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
+ + G + E K S T +E + + V +L
Sbjct: 768 KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808
Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
WS S P+L + Y+ ++ T +P + L +N+ R+
Sbjct: 809 WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857
Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
+ D + EE P + I NISG F G P + + L
Sbjct: 858 -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGI------LKICQLPSGSTYDNYWP--VQKIPLKAT 1003
G + + C GFI + S+ + L C L + +Y+P +Q+IP+
Sbjct: 914 GYAQSLSKFQTSECGEGFILLDSKVLCFILLCLTYCILSFHTGCHSYYPWTIQQIPIGEQ 973
Query: 1004 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY 1063
+ Y + +Y + S L D E+ + N S V+
Sbjct: 974 VDHLAYSSSSGMYVIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRS 1022
Query: 1064 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGED 1122
++++ P W T+ +E+ + V+ ++L + T E + ++ +GTA+ +GED
Sbjct: 1023 SLKVVSPKT----W----TVIDSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGED 1074
Query: 1123 VAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPKI 1175
+A+RG V +F + +P+ + V + +KGA++AL+ + QG L++A G K
Sbjct: 1075 IASRGCVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKC 1134
Query: 1176 ILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQ 1228
I+ K G+ L +AF D +++VK ++ D K ++F + E+ +
Sbjct: 1135 IVRGLKEDGSLLP-VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYK 1189
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
++L AKD L+ A +FL DG+ L ++V+D N+ + Y P+ +S G +LLSR++F
Sbjct: 1190 MSLFAKDLDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKF 1249
Query: 1289 HVGAHVTKFLRLQMLATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
H G ++ L + SS++ A K R +L + +GS+G + + E ++
Sbjct: 1250 HTGNFISTLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESY 1309
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
RRL +LQ +L +++ H GLNPR+FR S+G A R ++D +LL + + + ++
Sbjct: 1310 RRLSALQSQLTNTIEHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRKV 1365
Query: 1405 EIAHQTGTTRSQILSNLNDLA 1425
EIA + G +I ++ ++
Sbjct: 1366 EIASRVGANEWEIKADFEAIS 1386
>sp|Q5BDG7|CFT1_EMENI Protein cft1 OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 /
CBS 112.46 / NRRL 194 / M139) GN=cft1 PE=3 SV=1
Length = 1339
Score = 276 bits (706), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 352/1456 (24%), Positives = 603/1456 (41%), Gaps = 234/1456 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++I+ +R S ++ +T+ R L L Y+L G V +
Sbjct: 28 NLIVARTSLLQIFSLR------DVSLSALDTEVRPAQHRQETCKLVLEREYQLPGTVTDI 81
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ ++ G D ++++AF DAK+S++E+D +GL S+H +E +
Sbjct: 82 CRVKILKTKSGGD------AVLVAFRDAKLSLVEWDPERYGLSTISIHYYERDDMTRSPW 135
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE- 231
+ G ++ DP RC + I+ Q G LV D+ FGS + R+E
Sbjct: 136 ASDLSTCGSILSADPGSRCAIFQFGARSLAIIPFHQPGDDLVMDD--FGSEPDYENRVEG 193
Query: 232 --------------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
SS V+ L LD + H F++ Y EP IL+ + T
Sbjct: 194 NSRSHEAKDKDAAEYQTPYASSFVLPLTALDPSVIHPISLAFLYEYREPTFGILYSQVAT 253
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
+ + + +++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 254 SHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVALPPPVGGSLLIGSNEL 313
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
+H + A+ +N ++ S +S ++ L+ +D LL+ TG
Sbjct: 314 VHIDQAGKTNAVGVNEFSRQASSFSMTDQSDLALRLENCVVERFSDDNGDLLLALSTGVF 373
Query: 387 VLLTVVYDGRVVQRLD---LSKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCG 440
L++ DGR V + LS + L S ++ +GN F GS DS+L+
Sbjct: 374 ALVSFKLDGRSVSGISVRPLSGPSKEFLASTASSSAFLGNGKVFFGSESADSVLL----- 428
Query: 441 SGTSMLSSGLKEEF-GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
G S SS K+ F G D S DA +D + + N S
Sbjct: 429 -GWSSASSATKKSFSGSTSND--------ESEDDAYEDDLYSSAPAAMTDNPQNQPSNSS 479
Query: 500 TFSFA---VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVY 554
+F + D L + GP++D G A + T K ELV G G +
Sbjct: 480 VAAFGDLRIHDRLSSPGPIRDIVLGRSSEASSRDT---KDGVLELVAAQGSDEGGTMVIM 536
Query: 555 HK--------SSRGHNADS----SRMAAYDDEYHAYLIISL-------EARTMVLETADL 595
+ S A+S S + +D+ Y+I+S E+ VLE D
Sbjct: 537 KREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVILSKQEKPDKEESEVFVLE--DK 594
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
L +T T+ G L + RVIQV R D + D
Sbjct: 595 LRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVRSYDAVWDEDD------------- 641
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S+ ++ ++ DPY+ + D ++ LL D S + TL D
Sbjct: 642 SDERVAVNATLVDPYLAIIRDDSTLLLLQADDS----------------GDLDEVTLSED 685
Query: 716 KGPEPWLRKT--STDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
+ WL S +A T +I + + L ++ +P+F
Sbjct: 686 VVSQKWLSACFYSDNAGFFTAPFASI--------------LFLLNQDHQLYVYRLPDF-A 730
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
V +V + V I+ T E K S T +EN+ + VVEL
Sbjct: 731 VISVIEGVGCLPPILST---EPPKRSTT--------------RENVLQIAVVELG----D 769
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
++ S PFL + ++ Y+ + E T R L +N + + N
Sbjct: 770 SYSSLPFLILRTENDDLVVYKPFFTNSKELTGL--------RFLKEANHTLPKTPNTT-- 819
Query: 894 RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLC 950
D E P + I NI+G F+ G P +FR P +L
Sbjct: 820 ----DELQSEMKP-------LRILPNIAGCSSIFMPG--PSAGFIFRAS-TTSPHFIRLR 865
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYF 1010
G I + + GF Y+ S G L + +LP G+ W ++ +P+ ++TY
Sbjct: 866 GGFIKGLGCFDS--PDKGFAYLDSHG-LHLAKLPEGTQLGYPWIMRTVPIGQQIDKLTYV 922
Query: 1011 AEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
+ + Y VL + L D E+ + N +S + V + ++++
Sbjct: 923 SASDTY-------VLGTCQRCEFRLPEDDELHPEWRNEEISFLP-----EVNQSSLKVVS 970
Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
P W + P++ +E+ + ++ ++L + T E ++ +GT+ +GED+ +RG
Sbjct: 971 PKT----WSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGC 1026
Query: 1129 VLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--K 1179
+ +F +P+ N ++ KE +KGA++AL+ + QG L+ A G K ++ K
Sbjct: 1027 IYVFEVIEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRGLK 1086
Query: 1180 WTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
G+ L +AF D +V + +K + GD K ++F + E+ +++L AKD
Sbjct: 1087 EDGSLLP-VAFMDMQ-CFVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKDLD 1144
Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
L+ A +FL DG+ L +VV+D N+ + Y P+ S G KLL+R++FH G +
Sbjct: 1145 YLEVLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFASTV 1204
Query: 1298 LRLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
L SS+R A GSDK + +L + +GSIG + + E ++RRL +LQ
Sbjct: 1205 TLLPRTLVSSER--AMSGSDKMDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSALQ 1262
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
+L +++ H GLNPR++R S+ A R ++D LL Y + + + EIA + G
Sbjct: 1263 SQLTNTLEHPCGLNPRAYRAVESDASAGR----GMLDSNLLLQYLDMSKQRKAEIAGRVG 1318
Query: 1412 TTRSQILSNLNDLALG 1427
T +I ++L ++ G
Sbjct: 1319 ATEWEIRADLEAISGG 1334
>sp|A2R919|CFT1_ASPNC Protein cft1 OS=Aspergillus niger (strain CBS 513.88 / FGSC A1513)
GN=cft1 PE=3 SV=1
Length = 1383
Score = 271 bits (693), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 331/1464 (22%), Positives = 618/1464 (42%), Gaps = 210/1464 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++AF +AK+S++E+D G+ S+H +E +
Sbjct: 77 CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
+ G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187
Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
GGG + S V+ L LD + H F++ Y EP IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + + ++ + ++ S LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + S +S ++ L+ L + LL T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
G++ ++ DGR V + + + I T IG+ FLGS GDS+L+
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
++ S ++ ++ D AD + S D +D + + +L G +
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
SA + F + D L+NIGPL+D + G R++ + TG S +++ +G
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
V + H S + + D + A L EA R V+ T
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604
Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
++ ES+ + V G TI+ G R+RV+QV + R D +
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
TQ ++ ++ +S S+AD + + D ++ L D S V
Sbjct: 665 TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719
Query: 700 IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
+ S K SC LY DK TG+ +ID P+ + D++ +
Sbjct: 720 VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
L ++ V + + ++ + G + ++ SSE G +EN+
Sbjct: 761 DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
V +L + WSA P+L ++ Y+ ++ VST +
Sbjct: 803 IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ S+ N R P + + + + + I +ISG F+ G+ + +
Sbjct: 846 HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFII-- 903
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
S F L N + ++ C+LP + +D W ++++
Sbjct: 904 ---------RTSASAPHFLRLRGEN--------SRSSTVRFCKLPPMTRFDYQWTLKRVH 946
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHR-T 1057
L + Y +Y VL + L D E+ + N +S R +
Sbjct: 947 LGEQVDHLAYSTSSGMY-------VLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGS 999
Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
+ ++ + D + + + + E + ++ ++L + T E + ++ +GTA
Sbjct: 1000 FIKLVWDHHLQRQDSVILIFHLH-SFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTA 1058
Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLI 1169
+ +GED+ +RG + +F + +P + T+ + + +KGA++AL+ + QG +L+
Sbjct: 1059 FARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLV 1118
Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1225
A G K ++ K G+ L +AF D YV + +K +LGD K ++F + E+
Sbjct: 1119 AQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGVWFAGYSEE 1176
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
+++L AKD L+ A EFL DG L +VV+D NI + Y P+ +S G +LLSR
Sbjct: 1177 PYKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSR 1236
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LFGTLDGSIGCIAPLDE 1341
++FH+G + L SS++ ++ G D N+ L L T +GS+G I + E
Sbjct: 1237 SKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPE 1296
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
++RRL +LQ +L +++ H GLNPR+FR S+G A R ++D LL + + +
Sbjct: 1297 ESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSKQ 1352
Query: 1402 EQLEIAHQTGTTRSQILSNLNDLA 1425
+ EIA + G +I ++L ++
Sbjct: 1353 RKTEIAGRVGAREWEIKADLEAIS 1376
>sp|Q6C740|CFT1_YARLI Protein CFT1 OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=CFT1
PE=3 SV=1
Length = 1269
Score = 218 bits (554), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 311/1370 (22%), Positives = 548/1370 (40%), Gaps = 215/1370 (15%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
A LEL+ Y L G V + + DN DS+ ++ + AK ++ ++ S +
Sbjct: 51 APRLELITEYYLDGTVTGVTRIKT--IDN-YDLDSLYISVKHAKAVIVAWNASSFTIDTK 107
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
S+H +E + L E V + +L +M L + G + D+
Sbjct: 108 SLHYYE--KGLVESNFFEPECSSVAVSDEANSFYTCLLFQNDRMAFLPIIEKG---LDDD 162
Query: 218 DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+ SG F + S ++ LD +++V D F+H Y E M IL + + W G +
Sbjct: 163 EMPESGQVF----DPSFIVKASRLDKRIENVMDICFLHEYRETTMGILFQPKRAWVGMKN 218
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
T + +S+ K +I + LP DA K++ +P+P+GG L++ ANTI Y S
Sbjct: 219 ILKDTVSYAIVSVDVHQKNSTVIGTLNGLPVDAQKVIPLPAPLGGSLIICANTILYIDSS 278
Query: 336 ASCALALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQN--DVALLSTKTGDLVLLTV 391
AS + N +S + R S+ + L+ A ++Q + ALL T+ G L
Sbjct: 279 ASYTGVMVNNTHRQNSDLIVSRDQSTLDLRLEGAEVCFIQELGNTALLVTEDGQFFSLLF 338
Query: 392 VYDGRVVQRLDLSKTNPS--VLT--SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
DGR V L+L P +L+ S + + FLGSR GDSLLV++ G S
Sbjct: 339 NKDGRRVASLELRPIEPDNFILSQPSSVAAGPDGTIFLGSRAGDSLLVKWYHGEPESQPE 398
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVR 506
L D N + LYG + TE + + +
Sbjct: 399 ETL--------------------------DDGNESDDDLYGGDTAQTEDTTNRPLKLRLA 432
Query: 507 DSLVNIGPLKDFSYGLRINAD----ASATGISKQSNYELV--------------ELPGCK 548
D ++ +GP++ + G + + TG+ S ++ ++PG +
Sbjct: 433 DRMLGMGPMQSLALGKNRGSQGVEFVTTTGVGANSALAILTSALMPYKRKSLYKDMPGGQ 492
Query: 549 GIWTVYHK-SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
W+V + G A S D ++YL A V+E L T+ ++ +FV
Sbjct: 493 -FWSVPVRFEEEGEVAKSRTYVVSSDSENSYLYYVDAAG--VIEDVSLSTKKKKTKKHFV 549
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
T + ++QV I D S + +T + +
Sbjct: 550 SNVTTIFSSSMLDSALLQVCLETVNIYDAKI---------GQPHKYSLPQGTTAVEARVL 600
Query: 668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
YVL+ +SDG +++L VS+ +++++ + + G +T
Sbjct: 601 GNYVLVLLSDGQVKILEA------VSINKRPFLKAAQVSIEPASESKAIG------IYAT 648
Query: 728 DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
D+ L+ G G P VVCY G+L + S I
Sbjct: 649 DSSLTFGAPSKKRTRQGSPAQDSRPVVVVCYADGSL------------LLQGLNSDDRLI 696
Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
+D ++++ +E GQ +++V++A+ H +LT
Sbjct: 697 LDA----------SDLSGFIKEKDGQLYDA---PLELVDIALSPLGDDHILRDYLVLLTP 743
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
++ Y+ Y + LRF + L E TP
Sbjct: 744 QQLVVYEPYHYND----------------------------KLRFRKIFL-----ERTPT 770
Query: 908 GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD----GSIVAFTVLHNV 963
+R+T I+G ++G + + L P+L + VAFT
Sbjct: 771 INSDRRLTQVPLINGKHTLGVTGET---AYILVKTLHTSPRLIEFGETKGAVAFT----- 822
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL-KATPHQITYFAEKNLYPLIVSV 1022
+ + F Y+T G + C+ + + WPV+ + L T ++TY ++Y
Sbjct: 823 SWDGKFAYLTQAGEVAECRFDPSFSLETNWPVKHVQLCGETISKVTYHETMDVY------ 876
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSV--DLHRTYTVEEYEVRILEPDRAGGPWQTR 1080
V+ V ++ D+ D+ + S+ D+ T + +RI+ P W
Sbjct: 877 -VIATHKTVPHVVRDE------DDEVIESLTPDIMPATTYQG-AIRIVNP----YSWTVI 924
Query: 1081 ATIPMQ-SSENALTVRVVTLFNTTTK-ENETLLAIGTAYVQGEDVAARGRVLLFSTGR-- 1136
+ + +E AL V L + K + ++A+GT+ ++GED+AARG + LF
Sbjct: 925 DSYEFEMPAEAALCCESVKLSISDRKSQKREVVAVGTSILRGEDLAARGALYLFDVIEIV 984
Query: 1137 -NADNPQN--LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-ELNGIAFYD 1192
+ P+ + ++ ++GA +A+ + G LL G K+++ L +AF D
Sbjct: 985 PEKERPETNRRLKKLVQDRVRGAFTAVCEVSGRLLAVQGQKLLVQALQDDLTLVPVAFLD 1044
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
YV + + +LLGD +S+ F+ + Q+ A+D + +F I+G
Sbjct: 1045 MQ-TYVAVAKSLNSMLLLGDATRSVQFVGFSMDPYQMIPFARDLQRVLVTTCDFAIEGEN 1103
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
L+ VV+D QK + I Y P +S+ G +LL R+ F+ G + D +
Sbjct: 1104 LTFVVADLQKRLHILEYDPDDPQSYSGARLLRRSVFYSGKVI-------------DSSAM 1150
Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
P ++ +RF ++ DGS+ + P E +RRL ++Q ++ D HV GL+PR++R
Sbjct: 1151 VPINE--DRFMVIGVCSDGSVTDVVPCPEDAYRRLYAIQTQITDKEAHVCGLHPRAYRYD 1208
Query: 1373 H----SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
+ HRP I+D L + LP +Q A++ G Q++
Sbjct: 1209 PILPGTGNSPHRP----ILDGHTLIRFANLPRNKQNVYANRLGQRYQQLI 1254
>sp|A8XPU7|CPSF1_CAEBR Probable cleavage and polyadenylation specificity factor subunit 1
OS=Caenorhabditis briggsae GN=cpsf-1 PE=3 SV=1
Length = 1454
Score = 213 bits (542), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 193/748 (25%), Positives = 344/748 (45%), Gaps = 83/748 (11%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
++ DA +S+ GE D +D YS VV +++G + I +P+ V+
Sbjct: 737 KRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLVVAHDNGRITIHSLPDLELVYQ 791
Query: 777 VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT--------GQGRKENIHSM------ 822
+ +F + +VD + E K+ + + ++ E+ + ++ ++S
Sbjct: 792 IGRFSNVPELLVDMTVEEEEKEKKAKQTAAQEKEKETEKKKDDAKNEEDQVNSEMKKLCE 851
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
KVVE + + + P L AI+ D ++ Y+ + P+ V+ + + +
Sbjct: 852 KVVEAQIVGMGINQAHPVLIAII-DEEVVLYEMFASYNPQPGHLG---VAFRKLPHLIGL 907
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE 941
S N+ R P + E HG I F+ IS + G + G+ P +V+
Sbjct: 908 RTSPYVNIDGKRAPFEM----EMEHGKRYTLIHPFERISSINNGVMIGGAVPTL-LVYGA 962
Query: 942 --RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKI 998
++ H DGSI AFT +N N HGF+Y+T Q L+I ++ YD +PV+KI
Sbjct: 963 WGGMQTHQMTIDGSIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYDMPYPVKKI 1022
Query: 999 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHR 1056
+ T H + Y ++Y ++ SVP KP N++ ++ D QE H+ D + + +
Sbjct: 1023 EVGKTVHNVRYLMNSDIYAVVSSVP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPP 1078
Query: 1057 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN------ETL 1110
YT+ + Q A +P E V + + K +T
Sbjct: 1079 KYTLNLFSS------------QDWAAVPNTEFEFEDMEAVTAMEDVPLKSESRYGGLDTY 1126
Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQG 1165
LA+ T GE+V RGR++L P + +Y KE KG ++ L ++ G
Sbjct: 1127 LALATVNNYGEEVLVRGRIILCEVIEVVPEPGQPTSNRKIKVLYDKEQKGPVTGLCAING 1186
Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
LL G K+ + ++ +L GI+F D YV L+ ++ L D +S+ + ++E+
Sbjct: 1187 LLLSGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSIRTIALALDARESMSLIRFQEE 1245
Query: 1226 GAQLNLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
+++ ++D C A+EFL+DG + ++SDE NI +F Y+P+ ES G++
Sbjct: 1246 NKAMSIASRD--DRKCAQAPMASEFLVDGMHIGFLLSDEHGNITLFSYSPEAPESNGGER 1303
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
L +A ++G ++ FLR++ + D + + R +FG+LDGS G I PL E
Sbjct: 1304 LTVKAAINIGTNINAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGSFGYIRPLTE 1363
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYE 1396
++RRL LQ + P +AGL+ + R Q NG+ R +++D +++ Y
Sbjct: 1364 KSYRRLHFLQTFIGSVTPQIAGLHIKGARSSKPSQPIVNGRNAR----NLIDGDVVEQYL 1419
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
L + ++ ++A + G R IL +L L
Sbjct: 1420 HLSVYDKTDLARRLGVGRYHILDDLMQL 1447
Score = 188 bits (478), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 150/576 (26%), Positives = 266/576 (46%), Gaps = 81/576 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G ++ P+V+ DP
Sbjct: 92 QDSILMTFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFTTYFNPPIVRTDPAN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + ++ S++I L+ +D + +V
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLEGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
LL++P P+GG +V G+NTI Y +Q+ C + LN+ D + P +
Sbjct: 251 CNSLLSIPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + ++ GDL LL +V G V+ L+ SK + + +T
Sbjct: 308 LDCSTSVYMEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG 481
F+GSRLGDS L+++T ++ S K+ R + + ++
Sbjct: 368 HLFVGSRLGDSQLLEYTL-----------------LKVTKESAKKQRLEQQNPSEIELDE 410
Query: 482 EELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
+++ LYG A +++ E ++ F D L+N+GP+K +G R N ++ +K+
Sbjct: 411 DDIELYGGAIEMQQNDDDEQISESLQFRELDRLLNVGPVKSMCFG-RPNYMSNDLIDAKR 469
Query: 537 SN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLIISL 583
+ ++LV G G V+ +S R SS + ++E H YLI+S
Sbjct: 470 KDPVFDLVTASGHGKNGALCVHQRSMRPEIITSSLLEGAEQLWAVGRKENESHKYLIVS- 528
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMTQD 642
R+ ++ E + T+AAG L +QV A + DG M Q+
Sbjct: 529 RVRSTLILELGEELVELEEQLFVTNEPTVAAGELLQGALAVQVTSTCIALVTDGQQM-QE 587
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
+ N V+ SI DPYV + +G
Sbjct: 588 VHI----------DSNFPVVQASIVDPYVAVLTQNG 613
>sp|Q1E5B0|CFT1_COCIM Protein CFT1 OS=Coccidioides immitis (strain RS) GN=CFT1 PE=3 SV=1
Length = 1387
Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 160/559 (28%), Positives = 275/559 (49%), Gaps = 49/559 (8%)
Query: 891 RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
RF +P AY PH + + + +I G++ F+SGS PC+ M +L
Sbjct: 860 RFDPSP-KAYM----PHS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLR 911
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYF 1010
++ + + H C GF YV + ++++C+LPS + +DN W +K+ + + YF
Sbjct: 912 GEAVSSLSSFHIPACEKGFAYVDASNMVRMCRLPSNTRFDNSWVTRKVHVGDQIDCVEYF 971
Query: 1011 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP 1070
A +Y L S V L + D E+ + + +S + +E +++L P
Sbjct: 972 AHSEIYALGSSHKVDFKLPE------DDEIHPEWRSEVISFMP-----QLERGCIKLLSP 1020
Query: 1071 DRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRV 1129
W + + +E + ++ + + + T E + +L +GTA V+GED+ RG +
Sbjct: 1021 RT----WSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSI 1076
Query: 1130 LLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KW 1180
+F A +P N ++++K+ +KGA++A++ + QG L++A G K ++ K
Sbjct: 1077 YVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKE 1136
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
G+ L +AF D YV L ++ ++GD K I+F + E+ +L L KD
Sbjct: 1137 DGSLLP-VAFMDMQ-CYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEY 1194
Query: 1239 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1298
L A +FL DG L ++V+D+ I + Y P+ S KG +LL R+ FH G H T +
Sbjct: 1195 LQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTG-HFTSTM 1253
Query: 1299 RLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
L + SS + P D + + +L + +GSIG + PL E ++RRL +LQ
Sbjct: 1254 TL-LPEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQS 1312
Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
+LV S+ H GLNP+++R S+G R IVD LL + + ++ + EIA + G
Sbjct: 1313 QLVTSMEHPCGLNPKAYRAVESDGFGGR----GIVDGNLLLRWLDMGVQRKAEIAGRVGA 1368
Query: 1413 TRSQILSNLNDLALGTSFL 1431
I +L ++ G FL
Sbjct: 1369 DIESIRVDLETISGGLDFL 1387
Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 172/731 (23%), Positives = 294/731 (40%), Gaps = 94/731 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + G+ N+ + R ++ L LV Y L G + L
Sbjct: 28 NLIVAKTSILQVFSLVNVAYGTSAPPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ ++++++ +AK+S++E+D HG+ S+H +E E +H
Sbjct: 85 GRVKI--LDSRSGGEALLVSTRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE-----DTFGSGGG---- 225
P L+ VDP RC +L +G+ + IL Q G LV DE D G
Sbjct: 142 LRLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDEDLDEKPEGASNIPA 200
Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ SS V+ L LD + H F++ Y EP IL+ T +
Sbjct: 201 QAAVANDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + + ++ + + + LP D +K++ +P PIGG L++G+N IH
Sbjct: 261 LHDRKDIVSYAVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L D LL G + +L
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380
Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DGR V +L K S+L + + ++G F GS DSLL+ ++ S
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGWSRPS-Q 439
Query: 444 SMLSSGLK---EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
SM ++ + FG + D VN LS S +N +
Sbjct: 440 SMRKPKVESADDVFG--DHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495
Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
F F D L N+GP+ D + G R +AD + N
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555
Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
+ +++ G+W++ + DS+ Y YL+ S + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
++V E ++ ++ + T+ G L G RV+QV + R D + +
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
P E S+ +V+S S A+PYVL+ D S+ LL D S V I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725
Query: 706 PVSSCTLYHDK 716
+S C LY DK
Sbjct: 726 WLSGC-LYLDK 735
>sp|Q9N4C2|CPSF1_CAEEL Probable cleavage and polyadenylation specificity factor subunit 1
OS=Caenorhabditis elegans GN=cpsf-1 PE=3 SV=2
Length = 1454
Score = 190 bits (482), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 176/705 (24%), Positives = 323/705 (45%), Gaps = 69/705 (9%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD-TYMREALKDSETEINSSSEEGTGQ 813
+V +E+G L I +P V+ + +F + +VD T E + ++ E
Sbjct: 777 IVSHENGRLSIHSLPEMEVVYQIGRFSNVPELLVDLTVEEEEKERKAKAQQAAKEASVPT 836
Query: 814 GRKENIHSM------KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
E +++ +V+E + + + P L AI+ D ++ Y+ + S
Sbjct: 837 DEAEQLNTEMKQLCERVLEAQIVGMGINQAHPILMAIV-DEQVVLYEMF---------SS 886
Query: 868 DDPVSTSRSLSVSNV-------SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
+P+ +S + ++S L N R P + + +G I F+ +
Sbjct: 887 SNPIPGHLGISFRKLPHFICLRTSSHL-NSDGKRAPFEM----KINNGKRFSLIHPFERV 941
Query: 921 SG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QG 976
S + G + G+ P +V+ ++ H DG I AFT +N N HG +Y+T +
Sbjct: 942 SSVNNGVMIVGAVPTL-LVYGAWGGMQTHQMTVDGPIKAFTPFNNENVLHGIVYMTQHKS 1000
Query: 977 ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
L+I ++ Y+ +PV+KI + T H + Y ++Y ++ S+P KP N++ ++
Sbjct: 1001 ELRIARMHPDFDYEMPYPVKKIEVGRTIHHVRYLMNSDVYAVVSSIP--KPSNKIWVVMN 1058
Query: 1037 D--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
D QE H+ D + + + YT+ + + D A P I + E
Sbjct: 1059 DDKQEEIHEKDENFV--LPAPPKYTLNLFSSQ----DWAAVP---NTEISFEDMEAVTAC 1109
Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1148
V L + +T ETLLA+GT GE+V RGR++L P + +
Sbjct: 1110 EDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIEVVPEPDQPTSNRKIKVL 1169
Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
+ KE KG ++ L ++ G LL G K+ + ++ +L GI+F D YV L+ ++
Sbjct: 1170 FDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSLRTIA 1228
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNI 1264
+ D +S+ + ++E +++ ++D C A++ ++DG+ + ++SDE NI
Sbjct: 1229 IACDARESMSLIRFQEDNKAMSIASRD--DRKCAQPPMASQLVVDGAHVGFLLSDETGNI 1286
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
+F YAP+ ES G++L RA ++G ++ F+RL+ + R
Sbjct: 1287 TMFNYAPEAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNEDEKEAIEQRMTT 1346
Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAH 1379
+F +LDGS G + PL E ++RRL LQ + P +AGL+ + R Q NG+
Sbjct: 1347 VFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRNA 1406
Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
R +++D +++ Y L L ++ ++A + G R I+ +L L
Sbjct: 1407 R----NLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQL 1447
Score = 186 bits (473), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 162/589 (27%), Positives = 273/589 (46%), Gaps = 84/589 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G + + PLV+ DP
Sbjct: 92 QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + S RI S +VI L+ +D + ++
Sbjct: 149 RCAACLVYGKHIAILPFHEN-----------------SKRIHS-YVIPLKQIDPRLDNIA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSS---FSVE 363
+LL +P P+GG LV G+NT+ Y +Q+ C L LN+ D + P +
Sbjct: 251 CSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS---CYDGFTKFPLKDLKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + ++ GDL LL ++ G V+ L+ SK + + +T
Sbjct: 308 LDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD--ALQDMV 479
F+GSRLGDS L+++T T + KRL+ + D A + +
Sbjct: 368 HLFVGSRLGDSQLLEYTLLKTTRDC----------------AVKRLKIDNKDPAAAEIEL 411
Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ +++ LYG A +++ E ++ F D L N+GP+K G R N ++ +
Sbjct: 412 DEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVGPVKSMCVG-RPNYMSNDLVDA 470
Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
K+ + ++LV G G V+ +S R SS + ++E H YLI+
Sbjct: 471 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLEGAEQLWAVGRKENESHKYLIV 530
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
S R+ ++ E + T+AAG L +QV A + DG M
Sbjct: 531 S-RVRSTLILELGEELVELEEQLFVTGEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 588
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDP 687
Q++ N V+ SI DPYV L +G + L LV +P
Sbjct: 589 QEVHI----------DSNFPVIQASIVDPYVALLTQNGRLLLYELVMEP 627
>sp|Q4WCL1|CFT1_ASPFU Protein cft1 OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 /
CBS 101355 / FGSC A1100) GN=cft1 PE=3 SV=2
Length = 1401
Score = 189 bits (479), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 152/538 (28%), Positives = 260/538 (48%), Gaps = 40/538 (7%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
+ I NIS F+ G RP ++ + H G V + L + + + GFI
Sbjct: 884 LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942
Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1030
YV S+ +L+IC+ PS + +D W ++KI + + Y Y L S +
Sbjct: 943 YVDSKDVLRICRFPSETLFDYTWALRKISIGEQVDHLAYATSSETYVLGTS------HSA 996
Query: 1031 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1090
L D E+ N L L + + ++++ P W + + E
Sbjct: 997 DFKLPDDDELHPDWRNEGLVISFLPE---LRQCSLKVVSPRT----WTVIDSYSLGPDEY 1049
Query: 1091 ALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
+ V+ + L + T E ++ +GTA+ +GED+ +RG + +F + +P+ T+
Sbjct: 1050 VMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIYVFEVIKVVPDPEKPETDRK 1109
Query: 1148 --VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVS 1200
+ KEL KGA++AL+ + QG L+ A G K ++ K G+ L +AF D YV
Sbjct: 1110 LKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVNV 1167
Query: 1201 LNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
L +K ++GD K ++F + E+ +++L KD G L+ A EFL DG L ++V+
Sbjct: 1168 LKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLFILVA 1227
Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS-- 1316
D N+ + Y P+ +S G +LL+R++FH+G T L SS++ A P S
Sbjct: 1228 DSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSME 1287
Query: 1317 --DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
+T +L + GS+G + + E ++RRL +LQ +L +S+ H GLNPR++R S
Sbjct: 1288 IDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVES 1347
Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTSFL 1431
+G A R ++D LL + + ++EIA + G +I ++L + A G +L
Sbjct: 1348 DGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLGYL 1401
Score = 136 bits (342), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)
Query: 57 NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV +V++I+ +++VQ E+ + + D + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83
Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
+ + S+ G + +++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 84 ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
+ + G ++ VDP RC V +G++ + IL Q G L D+ F
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196
Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
G G ++ SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
T +S + + + ++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316
Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
+H + A+ +N +A + + + +S ++ L+ + + LL +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376
Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
+VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
+ ++ D +S DA + D+ E E G + +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
+ F D L NIGPL+D + G + + + K + EL EL +G
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551
Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
+WT G D ++ + EY Y+I+S +
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607
Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
T ET+++ T+ + F TI G L ++RV+QV R SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663
Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
D+ G P E S+ +S S+ADPY+ + D ++ +L D S
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718
Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
V+ A + K SC LY DK
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740
>sp|A1DB13|CFT1_NEOFI Protein cft1 OS=Neosartorya fischeri (strain ATCC 1020 / DSM 3700 /
FGSC A1164 / NRRL 181) GN=cft1 PE=3 SV=1
Length = 1400
Score = 187 bits (474), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 150/542 (27%), Positives = 260/542 (47%), Gaps = 50/542 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRER----LRVHPQLCDGSIVAFTVLHNVNCNHGF 969
+ I NIS F+ G + + + R+ + G ++ L + + + GF
Sbjct: 885 LRILPNISDLSAVFMPGPSASFILKTAKSCPHVFRLRGEFVRG--LSIFDLASPSLDKGF 942
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
IYV S+ +L+IC+ PS + +D W ++KI + + Y Y L S +
Sbjct: 943 IYVDSKDVLRICRFPSETLFDYTWALRKIGIGEQVDHLAYATSSETYVLGTS------HS 996
Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
L D E+ N +S + R +++ R W + + +E
Sbjct: 997 ADFKLPDDDELHPDWRNEVISFLPELRQCSLKVVSPRT---------WTVIDSYSLGPAE 1047
Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE- 1147
+ V+ + L + T E ++ +GTA+ GED+ +RG + +F + +P+ T+
Sbjct: 1048 YVMAVKNMDLEVSENTHERRNMIVVGTAFAWGEDIPSRGCIYVFEVIKVVPDPEKPETDR 1107
Query: 1148 ---VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1199
+ KEL KGA++AL+ + QG L+ A G K ++ K G+ L +AF D YV
Sbjct: 1108 KLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYV- 1164
Query: 1200 SLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
N+VK ++GD K ++F + E+ +++L KD G L+ A EFL DG L
Sbjct: 1165 --NVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLF 1222
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1314
++V+D N+ + Y P+ +S G +LL+R++FH+G T L SS++ A P
Sbjct: 1223 ILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMADP 1282
Query: 1315 GS----DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
S +T +L + GS+G + + E ++RRL +LQ +L +S+ H GLNPR++R
Sbjct: 1283 DSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLTNSLEHPCGLNPRAYR 1342
Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTS 1429
S+G A R ++D LL + + ++EIA + G +I ++L + A G
Sbjct: 1343 AVESDGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLG 1398
Query: 1430 FL 1431
+L
Sbjct: 1399 YL 1400
Score = 131 bits (330), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 181/751 (24%), Positives = 310/751 (41%), Gaps = 127/751 (16%)
Query: 57 NLVVTAANVIEIY-VVRVQEE---GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
NLVV +V++I+ +++VQ G+ E K++ D + L L Y L G
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHLRGGTIEGKSARP-------DRVETTKLVLEREYPLSGT 80
Query: 113 VESLA---ILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
V + IL+ GG ++++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 81 VVDICRVKILNPKSGG-------EALLLAFRNAKLSLVEWDPERHGISTLSIHYYERDDL 133
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGD-------EDT 219
+ + G ++ VDP RC V +G++ + IL Q G L D +D
Sbjct: 134 TRSPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDD 192
Query: 220 FGS-----GGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
F G ++ SS V+ L LD + H F++ Y EP +L+
Sbjct: 193 FNQVSDHVGNDLKSKDRTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGVLY 252
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
+ T + + + + ++ + + S LP D +K++A+P P+GG L++
Sbjct: 253 SQIATSHALLPERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLI 312
Query: 325 GANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
G+N +H + A+ +N +A + + + +S ++ L+ L + LL
Sbjct: 313 GSNELVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHLSDSTGDLLLVL 372
Query: 382 KTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLL 434
+G++VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L
Sbjct: 373 SSGNMVLVHFQLDGRSVSGISLRPLPAQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVL 432
Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASN 492
+ ++ S + ++ D +S D + D+ E E G +
Sbjct: 433 LSWSSMSSN---PKKPRPRMSNVAEDREEASVDSQSEEDVYEDDLYTAEPETPALGRRPS 489
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG-----------LRINADASATGISKQS---N 538
S + F + D L NIGPL+D + G L NA + I+ Q N
Sbjct: 490 AETSGVGVYIFQILDRLPNIGPLRDITLGKPASTVENTGRLIENACSELELIAAQGSGRN 549
Query: 539 YELV--------------ELPGCKGIWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIIS 582
LV + +G+WT G D R+ + EY Y+I+S
Sbjct: 550 GGLVLMKREIEPDVAASFDAQSVQGVWTAVVALGSGAPLVPDEQRI---NQEYRQYVILS 606
Query: 583 L-------EARTMVLETADL----LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
++ + + DL E + D TI G L +RRV+QV
Sbjct: 607 KPEAPDKEQSEVFIADKQDLKPFKAPEFNPNNDV-----TIEIGTLSCKRRVVQVLRNEV 661
Query: 632 RILDGSYMTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
R SY D+ G P E S+ +S S+ADPY+ + D ++ LL D
Sbjct: 662 R----SY---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMLLQAD 712
Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
S V+ + + K SC LY DK
Sbjct: 713 DSGDLDEVELDDSTRAGK--WRSCCLYWDKA 741
>sp|A1C3U1|CFT1_ASPCL Protein cft1 OS=Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 /
DSM 816 / NCTC 3887 / NRRL 1) GN=cft1 PE=3 SV=1
Length = 1401
Score = 185 bits (469), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 158/559 (28%), Positives = 262/559 (46%), Gaps = 55/559 (9%)
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG---------SRPCWCMVF 939
N R P D+ T + + + I +ISG+ F+ G SR C +
Sbjct: 861 NHVLPRIPPDSDTNISDKEPSNHRPLCILPDISGYSAVFMPGTSASFIFKTSRSC-PHIL 919
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
R R V L D FT + + GFIYV S+ +++ICQLP + YD W ++K+
Sbjct: 920 RLRGGVVRSLSD---FDFT---DPSLGRGFIYVDSKDVVRICQLPPETIYDYSWTLKKVA 973
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT 1059
+ + Y Y L S + L D E+ + N +S + R
Sbjct: 974 IGEHVDHLAYSISSETYVLGTS------HSADFKLPEDDELHPEWRNEAISFLPELRQCC 1027
Query: 1060 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYV 1118
+ +++ P W + + E + V+ + L + T E + ++ +GTA
Sbjct: 1028 L-----KVVHPKT----WTVIDSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTALA 1078
Query: 1119 QGEDVAARGRVLLFSTGRNADNPQNLVTE----VYSKEL-KGAISALASL--QGHLLIAS 1171
+GED+ ARG + +F + +P+ T+ + KEL KGA++AL+ + QG L+ A
Sbjct: 1079 RGEDIPARGCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAAQ 1138
Query: 1172 GPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGA 1227
G K ++ K G+ L +AF D YV L +K ++GD K I+F + E+
Sbjct: 1139 GQKCMVRGLKEDGSLLP-VAFMDVQ-CYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEPY 1196
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
+++L KD + A +FL DG L ++V+D N+ + Y P+ S G KLL R++
Sbjct: 1197 KMSLFGKDLEYPEVVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRSK 1256
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSD-----KTNRFALLFGTLDGSIGCIAPLDEL 1342
FH+G H T L L T+S +A + +L + GSIG + + E
Sbjct: 1257 FHMG-HFTSTLTLLPRTTASYEIPSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPEE 1315
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
++RRL +LQ +L ++V H GLNPR++R S+G A R ++D LL + + +
Sbjct: 1316 SYRRLSALQSQLANTVEHPCGLNPRAYRAIESDGTAGR----GMLDGNLLYQWLSMSKQR 1371
Query: 1403 QLEIAHQTGTTRSQILSNL 1421
++EIA + G +I ++L
Sbjct: 1372 RMEIAARVGAHEWEIKADL 1390
Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 167/723 (23%), Positives = 294/723 (40%), Gaps = 127/723 (17%)
Query: 57 NLVVTAANVIEIYV---VRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
NLVV +V++I+ V EG + S D + + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLNVSCSAEGEIIAAKSARP------DQLQSTKLILEREYSLSGTV 81
Query: 114 ESLA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
L + ++ G D +I+LAF +AK+S++E+D +G+ S+H +E +
Sbjct: 82 SDLCRVKLLKTKSGGD------AILLAFRNAKLSLVEWDPERYGISTISIHYYERDDITR 135
Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GD----------- 216
+ + G ++ VDP RC V +G++ + IL Q G LV GD
Sbjct: 136 SPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLVMGDYESDSQKQSHE 194
Query: 217 ---EDTFGS-----GGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
+D+ G+ G SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 195 HEMDDSAGNSKSKEGAVHQTPYASSFVLPLTALDSAILHPVSLAFLYEYREPTFGILYSQ 254
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + ++ + ++ S LP D +K++A+P P+GG L++G
Sbjct: 255 IATSNSLLHERKDAIFYTVFTLDLEQRASTMLLSVTRLPSDLFKVVALPPPVGGALLIGY 314
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + + +S ++ L+ L N LL+ +
Sbjct: 315 NELVHVDQAGKTNAVGVNEFSRQVSTFSMADQSELALRLEGCVVELLGNSSGDLLLALSS 374
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLV 435
G +VL+ DGR V + + + P +I ++G+ F GS +S+L+
Sbjct: 375 GTMVLVHFKLDGRSVSGISI-RPLPGHAGGNILKAAASASASLGSDKVFFGSEDAESVLL 433
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN-- 493
++ S + S + E IE D S D +D LY +A +
Sbjct: 434 GWSLSSSNARKS---RSESKRIEKDHEEGSDDSESEEDVYED-------DLYSAAPDTPA 483
Query: 494 -------TESAQKTFSFAVRDSLVNIGPLKDFSYG-------------------LRINAD 527
S ++ F V D L N PL+D + G L + A
Sbjct: 484 LGHRLSVAPSTFASYKFKVHDVLPNTAPLRDIALGQPAMPVEDTGSHLDNICSELELVAA 543
Query: 528 ASATG-----ISKQSNYELVE----LPGCKGIWT---VYHKSSRGHNADSSRMAAYDDEY 575
+ G + K+ +V+ + G+WT +++ + D + + +E+
Sbjct: 544 YGSNGNGGLVVMKRELEPVVKASLNVGPIHGVWTASIALGSAAKPMSGDQTNI----EEW 599
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFE 628
Y+I++ + +T+ E +++ ++ F +I G L R+RV+QV
Sbjct: 600 RQYVILT-KPQTIDKEESEVFIVDGLNLKPFKAPEFNPNNDISIQVGTLSNRKRVVQVLR 658
Query: 629 RGARILDGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
R D DL P E S+ LS S+ADPY+ + D ++ LL
Sbjct: 659 NEVRSYD-----SDLELAQIYPVWDE--DTSDERMALSASLADPYIAILRDDSTLLLLQA 711
Query: 686 DPS 688
D S
Sbjct: 712 DDS 714
>sp|Q6BHK3|CFT1_DEBHA Protein CFT1 OS=Debaryomyces hansenii (strain ATCC 36239 / CBS 767 /
JCM 1990 / NBRC 0083 / IGC 2968) GN=CFT1 PE=3 SV=2
Length = 1342
Score = 155 bits (392), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 121/500 (24%), Positives = 231/500 (46%), Gaps = 46/500 (9%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
N + ++L + P +AY+ T +R+ F N++G F++G P +
Sbjct: 810 NFKLVKEKDLIITGAPDNAYSLGTTIE----RRLVYFPNVNGFTSIFVTGITPYYISKTT 865
Query: 941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL 1000
+ + V+F + +G IY+ + +IC++P Y+N WP++KIP+
Sbjct: 866 HSVPRIFKFTKLPAVSFAPYSDDKIKNGLIYLDNSKNARICEIPVDFNYENNWPIKKIPI 925
Query: 1001 KATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS--SVDLHRTY 1058
K + +TY N + V+ ++ +D+E G I + S S + ++ Y
Sbjct: 926 KESIKSVTYHELSNTF-------VISTYEEIPYDCLDEE-GKPIVGVDKSKPSANSYKGY 977
Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAIG 1114
++++ P W TI + E + V+ + L ++TK+ + L+ IG
Sbjct: 978 ------IKLISPYN----WSVIDTIELVDGEIGMNVQSMVLDVGSSTKKFKNKKELIVIG 1027
Query: 1115 TAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
T + ED++A G +F G+ N + E++ ++ KGA++++ + G
Sbjct: 1028 TGKYRMEDLSANGSFKIFEIIDIIPEPGKPETNHK--FKEIHQEDTKGAVTSICEISGRF 1085
Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
L++ G KII+ + +AF D +YV N ++LGD KSI+ + +
Sbjct: 1086 LVSQGQKIIIRDLQDDGVVPVAFLDTS-VYVSEAKSFGNLLILGDSLKSIWLAGFDAEPF 1144
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
++ +L KD SLD +F+I + ++++D + + Y P+ S GQ+L+ +A
Sbjct: 1145 RMVMLGKDLQSLDVNCADFIIKDEEIFILIADNNSTLHLVKYDPEDPTSSNGQRLIHKAS 1204
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
F++ + T + + P S T F + T+DGS + P++E ++RR+
Sbjct: 1205 FNINSTPT------CIRSIPKNEEINPSS--TEVFQSIGSTIDGSFYTVFPINEASYRRM 1256
Query: 1348 QSLQKKLVDSVPHVAGLNPR 1367
LQ+++ D H GLNPR
Sbjct: 1257 YILQQQITDKEYHFCGLNPR 1276
Score = 92.0 bits (227), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/414 (21%), Positives = 178/414 (42%), Gaps = 64/414 (15%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L+V A V++++ + E +++ K L+LV ++LHG + +
Sbjct: 29 LIVGKATVLQVFEIITTETKTQQYK------------------LKLVEQFKLHGLITDIK 70
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+ +NS+ D ++++ + AK+S++++D ++ + S+H +E+ E
Sbjct: 71 AIRT--VENSQL-DYLLVSSKGAKMSLIKWDHHLNSISTVSLHYYENSIQ---SSTYEKL 124
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMII-----------------LKASQGGSGLVGDEDTF 220
LV V+P C + L + + S G +++
Sbjct: 125 TTTDLV-VEPNNNCTCLRFKNLLTFLPFETLDEEEEDDDDDEEMNGSSGSDKKATNKENG 183
Query: 221 GSGGG-FSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
S G S ESS +I+ R LD + + D F++ Y EP + I+ + WAG +
Sbjct: 184 NSNGEEVSELFESSFMIDGRTLDSRIGDIIDMQFLYNYREPTIAIIFSKAHAWAGNLPKV 243
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSA 336
LS+ K + NLP D K++ +P P+ G L++G N IH +
Sbjct: 244 KDNINFIVLSLDLVTKASTTVLKIDNLPFDIDKIIPLPQPLNGSLLMGCNEIIHVDNGGI 303
Query: 337 SCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVY 393
+ LALN + S+ +S + +S +++L+ + ND L+ GD +
Sbjct: 304 TRRLALNQFTSSITTSLKNYHDQSDLNLKLENCSVKPIPNDNKVLMILNNGDFYYINFKI 363
Query: 394 DGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
DG+ +++ D+ T P +I T+ N+L F+ ++ G++ L++
Sbjct: 364 DGKTIKKFFVEKVSDLNYDDIQLTYP----GEIATLDNNLMFISNKNGNNPLLE 413
>sp|P0CM62|CFT1_CRYNJ Protein CFT1 OS=Cryptococcus neoformans var. neoformans serotype D
(strain JEC21 / ATCC MYA-565) GN=CFT1 PE=3 SV=1
Length = 1431
Score = 150 bits (379), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 129/529 (24%), Positives = 228/529 (43%), Gaps = 48/529 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
I F NI G G F++G +P W + HP L ++ H F
Sbjct: 931 IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
I + IC LP D P + ++ IT+ Y S+ V P
Sbjct: 987 IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEV--PFQ 1042
Query: 1030 QVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
D+E Q+ D +L R+ T+E + + PW+
Sbjct: 1043 AY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWKVIDGYEFDQ 1089
Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1144
+E +++ V L + +A+GT + GED A RG +F +
Sbjct: 1090 NEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGP 1149
Query: 1145 -------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPL 1196
+ + + ++A+ + G+LL +GPK+ + ++L G+AF D L
Sbjct: 1150 GSVPGWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDSQLMGLAFLDIQ-L 1208
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
Y ++ + KNF+L+GD+ KS +F+S +E + ++KD + +FL+ ++ +
Sbjct: 1209 YATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTFI 1268
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
SD ++++ + P +S G++L+ R E+H G+ T + T+ + AP +
Sbjct: 1269 SSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAATVSKVIARRKTAEEE--FAPQT 1326
Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
+++ T DG++ + + + F+RLQ + +LV + HVAGLNPR+FR N
Sbjct: 1327 Q------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-ND 1379
Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
RP I+D +LL+ + + P+ Q E+ Q GT + S+L L
Sbjct: 1380 LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428
Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 161/716 (22%), Positives = 300/716 (41%), Gaps = 108/716 (15%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---------------- 96
NLVV A V+ ++ +R + E K ++ E ++ V M+ +
Sbjct: 48 NLVVAGAEVLRVFEIREESVPIIENVKLEEDVAEGEKDVQMEEVGDGFFDDGHAERAPLK 107
Query: 97 --SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
+ L L+ + L+G + LA ++ D +I++F+DAK+++LE+ S +
Sbjct: 108 YQTTRRLHLLTQHELNGTITGLAA-TRTLESTIDGLDRLIVSFKDAKMALLEW--SRGDI 164
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +E ++ +S+ PL++ DP R + + + +L Q S L
Sbjct: 165 ATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDSLAVLPLIQEQSEL- 220
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDM--KHVKDFIFVHGYIEPVMVILHERELTWAG 272
D G A S V++L D+ + K+++D +F+ G+ P + +L TW+G
Sbjct: 221 ---DPLSEGFSRDAPYSPSFVLSLSDMSITIKNIQDLLFLPGFHSPTIALLFSPMHTWSG 277
Query: 273 RV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
R+ + K C+ I +S+ +PL+ S LP D+ L+A PS +GG+++V + I
Sbjct: 278 RLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSELGGIVLVTSTGIV 336
Query: 331 YHSQ----SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
+ Q +A+C A + SL S + S + L+ + ++ LL + G +
Sbjct: 337 HVDQGGRVTAACVNAWWSRITSLKCS--MASVSQKLTLEGSRCVFVTPHDMLLVLQNGAV 394
Query: 387 VLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ +GR V++ LD P SD+T G+ F+GS GDS L +
Sbjct: 395 HQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSWLAKVNVVRQV 452
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
S K+E +++ D + L +DA D E L+G A+ +
Sbjct: 453 VERSEKKKDEM-EVDWD----EDLYGDINDAALDEKAQE---LFGPAA---------ITL 495
Query: 504 AVRDSLVNIGPLKDFSYGL-----------------------RINADASATGISKQSNYE 540
+ D L +G + D +G+ IN I+K+ +
Sbjct: 496 SPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTINVFRRGIPITKRRRFN 555
Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
EL +G+W + G + + A +++S E L ++ T
Sbjct: 556 --ELLNAEGVWFLPIDRQTGQ-----KFKDIPEAERATILLSSEGNAT--RVFALFSKPT 606
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENS 659
+ G+T++A F R +++V +LD + + Q + G G +
Sbjct: 607 PQQIGRLDGKTLSAAPFFQRSCILRVSPLEVVLLDNNGKIIQTV------CPRGDGPK-- 658
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
+++ SI+DP+V++ +D S+ VGD TV+ + P E + ++ D
Sbjct: 659 -IVNASISDPFVIIRRADDSVTFFVGDTVARTVA-EAPIVSEGESPVCQAVEVFTD 712
>sp|P0CM63|CFT1_CRYNB Protein CFT1 OS=Cryptococcus neoformans var. neoformans serotype D
(strain B-3501A) GN=CFT1 PE=3 SV=1
Length = 1431
Score = 150 bits (379), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 129/529 (24%), Positives = 228/529 (43%), Gaps = 48/529 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
I F NI G G F++G +P W + HP L ++ H F
Sbjct: 931 IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
I + IC LP D P + ++ IT+ Y S+ V P
Sbjct: 987 IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEV--PFQ 1042
Query: 1030 QVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
D+E Q+ D +L R+ T+E + + PW+
Sbjct: 1043 AY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWKVIDGYEFDQ 1089
Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1144
+E +++ V L + +A+GT + GED A RG +F +
Sbjct: 1090 NEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGP 1149
Query: 1145 -------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPL 1196
+ + + ++A+ + G+LL +GPK+ + ++L G+AF D L
Sbjct: 1150 GSVPGWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDSQLMGLAFLDIQ-L 1208
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
Y ++ + KNF+L+GD+ KS +F+S +E + ++KD + +FL+ ++ +
Sbjct: 1209 YATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTFI 1268
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
SD ++++ + P +S G++L+ R E+H G+ T + T+ + AP +
Sbjct: 1269 SSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAATVSKVIARRKTAEEE--FAPQT 1326
Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
+++ T DG++ + + + F+RLQ + +LV + HVAGLNPR+FR N
Sbjct: 1327 Q------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-ND 1379
Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
RP I+D +LL+ + + P+ Q E+ Q GT + S+L L
Sbjct: 1380 LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428
Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 161/716 (22%), Positives = 300/716 (41%), Gaps = 108/716 (15%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---------------- 96
NLVV A V+ ++ +R + E K ++ E ++ V M+ +
Sbjct: 48 NLVVAGAEVLRVFEIREESVPIIENVKLEEDVAEGEKDVQMEEVGDGFFDDGHAERAPLK 107
Query: 97 --SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
+ L L+ + L+G + LA ++ D +I++F+DAK+++LE+ S +
Sbjct: 108 YQTTRRLHLLTQHELNGTITGLAA-TRTLESTIDGLDRLIVSFKDAKMALLEW--SRGDI 164
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +E ++ +S+ PL++ DP R + + + +L Q S L
Sbjct: 165 ATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDSLAVLPLIQEQSEL- 220
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDM--KHVKDFIFVHGYIEPVMVILHERELTWAG 272
D G A S V++L D+ + K+++D +F+ G+ P + +L TW+G
Sbjct: 221 ---DPLSEGFSRDAPYSPSFVLSLSDMSITIKNIQDLLFLPGFHSPTIALLFSPMHTWSG 277
Query: 273 RV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
R+ + K C+ I +S+ +PL+ S LP D+ L+A PS +GG+++V + I
Sbjct: 278 RLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSELGGIVLVTSTGIV 336
Query: 331 YHSQ----SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
+ Q +A+C A + SL S + S + L+ + ++ LL + G +
Sbjct: 337 HVDQGGRVTAACVNAWWSRITSLKCS--MASVSQKLTLEGSRCVFVTPHDMLLVLQNGAV 394
Query: 387 VLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ +GR V++ LD P SD+T G+ F+GS GDS L +
Sbjct: 395 HQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSWLAKVNVVRQV 452
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
S K+E +++ D + L +DA D E L+G A+ +
Sbjct: 453 VERSEKKKDEM-EVDWD----EDLYGDINDAALDEKAQE---LFGPAA---------ITL 495
Query: 504 AVRDSLVNIGPLKDFSYGL-----------------------RINADASATGISKQSNYE 540
+ D L +G + D +G+ IN I+K+ +
Sbjct: 496 SPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTINVFRRGIPITKRRRFN 555
Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
EL +G+W + G + + A +++S E L ++ T
Sbjct: 556 --ELLNAEGVWFLPIDRQTGQ-----KFKDIPEAERATILLSSEGNAT--RVFALFSKPT 606
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENS 659
+ G+T++A F R +++V +LD + + Q + G G +
Sbjct: 607 PQQIGRLDGKTLSAAPFFQRSCILRVSPLEVVLLDNNGKIIQTV------CPRGDGPK-- 658
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
+++ SI+DP+V++ +D S+ VGD TV+ + P E + ++ D
Sbjct: 659 -IVNASISDPFVIIRRADDSVTFFVGDTVARTVA-EAPIVSEGESPVCQAVEVFTD 712
>sp|Q0UUE2|CFT1_PHANO Protein CFT1 OS=Phaeosphaeria nodorum (strain SN15 / ATCC MYA-4574 /
FGSC 10173) GN=CFT1 PE=3 SV=1
Length = 1375
Score = 145 bits (365), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 133/521 (25%), Positives = 226/521 (43%), Gaps = 44/521 (8%)
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
P +S L N+ +L R D E + NI+G+
Sbjct: 836 PSRSSSDLWTHNLRWVKLSQQHVPRYMEDGAQEEAADEPGFESTLLALDNINGYSTVIQR 895
Query: 930 GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
G P + + L + + T H +C GF Y+ S L+I QLP + Y
Sbjct: 896 GRSPAFILKESSSAPRVIGLSGNPVKSLTRFHTSSCQRGFAYLDSTDTLRISQLPPSTHY 955
Query: 990 DNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVLSLLIDQEVGHQID 1045
+ W +++P+ A H + Y LY + P L P + L +E +
Sbjct: 956 GHLGWAARRMPMDAEVHALAYHP-SGLYVIGTGQPEEYTLDPNDTFHYELPKEETSFKPK 1014
Query: 1046 -NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTT 1103
H + V +T+TV + +L+P E L ++ + L + T
Sbjct: 1015 VEHGIIKVMDEKTWTV--IDTHVLDP-----------------QEVILCIKTLNLEVSET 1055
Query: 1104 TKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAIS 1158
T + + ++A+GTA V GED+A +G + +F P + T + E+KG +S
Sbjct: 1056 THQRKDVIAVGTAIVLGEDLATKGNIRIFEVITVVPEPDHPETNKRLKLIVKDEVKGTVS 1115
Query: 1159 ALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGD 1212
A++ L QG L++A G K ++ K GT L +AF D YV +L + N +L+GD
Sbjct: 1116 AISDLGTQGFLIMAQGQKSMVRGLKEDGTLLP-VAFMDMQ-CYVTTLKTLPNTGMLLMGD 1173
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+K +F + E+ ++ L + L+C +FL L ++V+D N+Q+ + P
Sbjct: 1174 AYKGAWFTGYTEEPYKMMLFGRSKHHLECITADFLPFEEQLHIIVADADMNLQVLQFDPD 1233
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGAAPGSDKTNRFALLFGTL 1329
+S G +LL ++ FH G + LQ + T+S+ T + S ++ +L +
Sbjct: 1234 HPKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSSLPLHQ--ILCTSQ 1291
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
G++ I PL E ++RRL L L + GLN ++FR
Sbjct: 1292 SGTLALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFR 1332
Score = 132 bits (331), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 167/727 (22%), Positives = 295/727 (40%), Gaps = 142/727 (19%)
Query: 57 NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NL+V ++++++ V V G E+ N+ E L + A L LV
Sbjct: 28 NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
W + +F + DP RC + + IL
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+A+Q G E G+ G + + SS V+ L +LD + H F+H Y EP
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
++ + T A ++ + + ++ K + S LP+D +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316
Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
L+VG+N IH + +A+N A + S ++ ++ L+ L D
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI---TTIGNSLFFLGSRLG 430
++ G + +LT DGR V + + + ++L + T +G F+GS G
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPVPADNGGNILKTRASCSTNLGRGRLFIGSEDG 436
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
+S+L+ +T ++ +LRR S+ Q + E++S
Sbjct: 437 ESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEEE 472
Query: 491 S---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD----------- 517
N+T + K T++F V D L +I P++D
Sbjct: 473 EVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTES 532
Query: 518 FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------GH 561
+ G + ++ A G N EL ELP G+W V+ K G
Sbjct: 533 LTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFGQ 592
Query: 562 NADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAAG 615
+A+++ A+ D +Y YL++S T+V E TE D+ +G T++ G
Sbjct: 593 DAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSVG 650
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
L +V+QV R D + + P E N +++ S ADPY+L+
Sbjct: 651 VLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQR 706
Query: 676 SDGSIRL 682
D S+++
Sbjct: 707 EDSSVKI 713
>sp|Q5AFT3|CFT1_CANAL Protein CFT1 OS=Candida albicans (strain SC5314 / ATCC MYA-2876)
GN=CFT1 PE=3 SV=1
Length = 1420
Score = 132 bits (332), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 120/572 (20%), Positives = 246/572 (43%), Gaps = 73/572 (12%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +A+ P+G +R + F N++G F++G P +
Sbjct: 855 NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 909
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
+ Q + ++ + + +G I++ +Q +IC+LP Y+ P++ +
Sbjct: 910 VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVD 969
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLH 1055
+ + I Y + VL Q+ +D+E G D + ++
Sbjct: 970 IGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEEGKPIAGIIKDIKDTPAMSFK 1022
Query: 1056 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV-----------------T 1098
+ ++++ P W TI + +E +T++ + +
Sbjct: 1023 GS-------IKLVSPYN----WTVIETIELGDNEVGMTLKSMILDVGSESGSTLGSDPNS 1071
Query: 1099 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSK 1151
L K+ + IG + ED+AA G ++ G+ N + E++ +
Sbjct: 1072 LIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKE 1129
Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
E +GAI+++ L G L++ G K+I+ +AF D P +YV N ++LG
Sbjct: 1130 ETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILG 1188
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ K + + + + ++ +L KD + +F+I+ + ++V+D + + Y P
Sbjct: 1189 DLLKGCWLVGFDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDP 1248
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTNR 1321
+S G KLL++A F + + ++ L ++ S +T A P + +N
Sbjct: 1249 DDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDALTNIAVPPPLPPNTTSNY 1308
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGK 1377
F ++ T DGS + P++E +RR+ LQ++L+D H GLNPR R + +N
Sbjct: 1309 FQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNET 1368
Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
+P I+D +L+ + L + + +A++
Sbjct: 1369 NTKP----ILDYDLIRSFTKLSDDRKRNLANK 1396
Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 104/221 (47%), Gaps = 17/221 (7%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I+ LD + V D F+H Y EP + +L ++ WAG + L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY-- 345
LK ++ NLP++ +++ +PSP+ G L+VG N IH + +A+N +
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336
Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
S S Q+ +S +++L+ + +D LL +TG+ + DG+ ++R+
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394
Query: 403 LSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
+ + ++ + ++ F+ + G+S L+Q
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 435
>sp|Q6FSD2|CFT1_CANGA Protein CFT1 OS=Candida glabrata (strain ATCC 2001 / CBS 138 / JCM
3761 / NBRC 0622 / NRRL Y-65) GN=CFT1 PE=3 SV=1
Length = 1361
Score = 120 bits (301), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 157/337 (46%), Gaps = 23/337 (6%)
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKG 1155
+T TK + +G Y EDV G ++ P T E++ ++++G
Sbjct: 1026 DTRTKRKREYIIVGIGYATMEDVPPTGEFHIYDITEVVPEPGKPNTNFKLKEIFKEDIRG 1085
Query: 1156 AISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
+S + + G LI+ KI++ + +AF D P ++V SL N I++GD
Sbjct: 1086 IVSVVNGISGRFLISQSQKIMVRDVQQDNSVIPVAFLDVP-VFVTSLKTFGNLIVIGDAM 1144
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+ I F+ + + ++ L + + EFL++ + +V+D + + YAP
Sbjct: 1145 QGIQFVGFDAEPYRMITLGSSITKFEVISVEFLVNNGDIYFLVTDRDSIMHVLKYAPDQP 1204
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR-FALLFGTLDGSI 1333
+ GQ+L+ + F++ + ML +D P + +R F + +DGSI
Sbjct: 1205 NTLSGQRLVHCSSFNLHS----LNNCTMLLPKNDE---FPRDQRYSRSFQTITAQVDGSI 1257
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ---FHSNGKAHRPGPDSIVDCE 1390
I P+ E T+RRL +Q++++D P +AGLNPR RQ ++ G + RP ++D
Sbjct: 1258 SKIVPVKEETYRRLYFIQQQIIDKEPQLAGLNPRMERQDNKYYHLGHSLRP----MLDFN 1313
Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRS-QILSNLNDLAL 1426
++ ++ + + + I + G + ++ +L DL
Sbjct: 1314 IIKRFKDMSMNRRSHIVQKLGKNSNLEVWRDLIDLEF 1350
Score = 57.4 bits (137), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 132/674 (19%), Positives = 265/674 (39%), Gaps = 122/674 (18%)
Query: 96 ISAASLELVCHYRLHGNVESLAIL---SQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
I + L L+ ++L G + +A++ S G N ++L+ AK+S+L +++
Sbjct: 43 IRSGRLYLMEEHKLSGRINDVALIPKHSNGSNGNGINLSYLLLSTGVAKLSLLMYNNMTS 102
Query: 153 GLRITSMHC----FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
+ S+H FES L L AR ++++P G +++ ++ +
Sbjct: 103 SIETISLHFYEDKFESATMLDL-------ARNSQLRIEPNGNYA--MLFNNDVLAILPFY 153
Query: 209 GGSGLVGDED----------------TFGSGGGFSARIESSH---VINLRDL--DMKHVK 247
G DED F G + + +H +IN +L +K++K
Sbjct: 154 TGINEDEDEDYINNDKSKINDNSKKSLFKRKKGKTQNNKVTHPSIIINCSELGPQIKNIK 213
Query: 248 DFIFVHGYIEPVMVILHERELTWAGR---VSWKHHTCMIS---ALSISTTLKQHPLIWSA 301
D F+ G+ + + +L++ +L W G V + +IS SI T +I
Sbjct: 214 DIQFLCGFTKSTIGVLYQPQLAWCGNSQLVPLPTNYAIISLDMKFSIDATTFDKAIISEI 273
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVS-LDSSQELPRS 358
LP D + + + G L++G N I + + L LN+Y+ L + + +S
Sbjct: 274 SQLPSDWH---TIAPTLSGSLILGVNEIAFLDNTGVLQSILTLNSYSDKVLPKVRVIDKS 330
Query: 359 SFSVELDAAHATWL----QNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLS------ 404
S V + L +N+ + LL + G + + + +GR++ + +++
Sbjct: 331 SHEVFFNTGSKFALIPSNENERSVENILLFDENGCIFNVDLKSEGRLLTQFNITKLPLGE 390
Query: 405 -----KTNP---SVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFG 455
K+NP S++ +D + F+G + GD+ +++ S + +++
Sbjct: 391 DVLSQKSNPSSVSIIWAD-GRLDTYTIFIGFQSGDATMLKLNHLHSAIEVEEPTFMKDYV 449
Query: 456 DIEADAPSTKRLRRSS-------SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
+ +A A SD D VN + +G+ SN +AQ+
Sbjct: 450 NKQASAAYNNEDDDDDDDDFNLYSDEENDQVNNKNDRTFGTNESNEPFTAQELM------ 503
Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
L NIGP+ G + + + G+ + E+ + T + NA +
Sbjct: 504 ELRNIGPINSMCVGKVSSIEDNVKGLPNPNKQEI------SIVCTSGYGDGSHLNAILAS 557
Query: 568 MAAYDDEYHAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR----- 610
+ ++ ++ I+ ++ + L T D + E+ + QGR
Sbjct: 558 VQPRVEKALKFISITKIWNLHIKGKDKFLITTDSTQSQSNIYEIDNNFSQHKQGRLRRDA 617
Query: 611 -TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TI + +R++QV + D ++ + + V+ VS+ DP
Sbjct: 618 TTIHIATIGDNKRIVQVTTNHLYLYDLTF-----------RRFSTIKFDYEVVHVSVMDP 666
Query: 670 YVLLGMSDGSIRLL 683
YVL+ +S G I++
Sbjct: 667 YVLITLSRGDIKVF 680
>sp|Q6CTT2|CFT1_KLULA Protein CFT1 OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 /
DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=CFT1 PE=3
SV=1
Length = 1300
Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 164/338 (48%), Gaps = 30/338 (8%)
Query: 1088 SENALTVRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP--- 1141
SEN++ + T+ N+ T+ L+ IG+++V+ ED + G +L+ P
Sbjct: 954 SENSMVNDIKTMLIQLNSKTRRKRELVIIGSSFVKEEDQPSTGCLLVLDITEVVAEPGKP 1013
Query: 1142 -QNL-VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNG---IAFYDAPPL 1196
N +++ +E++G+++A+ + G +I K ++ E N +AF D P +
Sbjct: 1014 DSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQSSKALVRDMQ--EDNSAVPVAFLDMP-V 1070
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
++ N +++GD + F+ + + ++ +L K EFL++ ++ +
Sbjct: 1071 FITDAKSFSNLMIIGDSMQGFTFVGFDAEPYRMIVLGKSTSKFQVMNLEFLVNNGNINFI 1130
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
V+D Q ++ + YAP + S GQ+L+ F++ +++L R GS
Sbjct: 1131 VTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFT-TNNYMKLV-------RKHVEFGS 1182
Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFH 1373
+N AL T DGSI + PL+E ++RR +Q++L+D +AG N + R +++
Sbjct: 1183 KTSNYIALGCQT-DGSIFRMIPLNEASYRRFYLVQQQLLDHEIPLAGFNTKMERLDNEYY 1241
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
G + RP DS ++L Y LP+ ++ I ++ G
Sbjct: 1242 HKGHSLRPTLDS----QVLKKYIHLPITKRTTIENRVG 1275
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 136/639 (21%), Positives = 257/639 (40%), Gaps = 111/639 (17%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
A L L ++L G + + +L Q G S + IL+ +K+S++ FD L
Sbjct: 45 AQKLVLAYEWKLAGKIIDMQLLPQIG---SPLKMLAILS-SKSKVSLVRFDPVAESLETL 100
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD 216
S+H + ++++L S ++ VDP RC +LV+ ++ IL + D
Sbjct: 101 SLHYYHD-KFVNL--STSSLKTESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEID 155
Query: 217 EDTFGSGGGFSARIESSHVINLRDLDM---------KHVKDFIFVHGYIEPVMVILHERE 267
ED G + R++ + I + M KHV D +++ + +P + IL++
Sbjct: 156 EDENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPV 215
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
L W G +T LS+ ++ +I +LP+D + L VP G VL +G N
Sbjct: 216 LAWCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLPNDLHTL--VPLKRGYVL-IGVN 272
Query: 328 TIHYHSQSA---SCALALNNYAVSLDSSQELPRSSFSVELDAA----HATWLQNDVALLS 380
+ Y S S SC + LN +A S +++ S ++ L + + ++D+ +L
Sbjct: 273 ELLYISASGALQSC-IRLNTFATSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILI 331
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL-----GD---- 431
+ + + +G ++ + D + I N + F SRL GD
Sbjct: 332 DENCRMYNIITESEGNLLTKFDCVQ----------VPIVNEI-FKNSRLPLSVCGDLNLE 380
Query: 432 --SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
+L+ F G + LK F + ++L + D E +LYG
Sbjct: 381 TGRVLIGFLSGDAMFLQLKNLKVAFA-------AKRQLVETVDDDDD-----EYSALYGE 428
Query: 490 ASNNTES----AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
+ NNT + Q+ F ++ DS+ NIGPL + G + + + + + E +
Sbjct: 429 SQNNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEF-SIV 487
Query: 546 GCKGI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
G+ T H + + H + + + ++ + ++ + L T D E +
Sbjct: 488 ATSGVGRGSHLTALHSTVQPHIEQALKFTSATRIWN----LKIKGKDKYLVTTDADKEKS 543
Query: 601 E------------SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-----MTQDL 643
+ + D+ RTI + +R++QV G + D + +T D+
Sbjct: 544 DVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTIDI 603
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
++ I DPY+L + G+I++
Sbjct: 604 E----------------IVHACIIDPYILFTDARGNIKI 626
>sp|Q06632|CFT1_YEAST Protein CFT1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=CFT1 PE=1 SV=1
Length = 1357
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>sp|Q75EY8|CFT1_ASHGO Protein CFT1 OS=Ashbya gossypii (strain ATCC 10895 / CBS 109.51 /
FGSC 9923 / NRRL Y-1056) GN=CFT1 PE=3 SV=1
Length = 1305
Score = 105 bits (262), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 79/343 (23%), Positives = 150/343 (43%), Gaps = 33/343 (9%)
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1154
N+ TK L +G YV+ ED+ G L+ P T +++ ++++
Sbjct: 967 LNSNTKRRREYLVVGNTYVRDEDIGGTGSFYLYDITEVVPEPGKPDTNYKFKDIFQEDIR 1026
Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
G +S + + G +I+ K ++ + +AF D P +++ N +++GD
Sbjct: 1027 GTVSTVCEISGRFMISQSSKAMVRDIQEDNSVVPVAFLDMP-VFITDAKSFGNLMIIGDS 1085
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+ FL + + ++ L K L+ EFL++ + +V+D + + YAP
Sbjct: 1086 MQGFSFLGFDAEPYRMLTLGKSVSKLETMCVEFLVNNGDVYFLVTDRNNLMHVLKYAPDE 1145
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR--------FALL 1325
S GQ+L+ F++ + T L +D G K +R F +
Sbjct: 1146 PNSLSGQRLVHCTSFNLHSTNT----CMRLIKKNDEFG------KVSRGFGIYMPSFQCI 1195
Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPG 1382
DG+I + PL E ++R L +Q++L+D + GLNPR R F+ G RP
Sbjct: 1196 GSQADGTIFKVVPLSEASYRSLYLIQQQLIDKEVQLCGLNPRMERLENPFYQMGHILRP- 1254
Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTG-TTRSQILSNLNDL 1424
++D +L + L + ++ +A + G ++I +L D+
Sbjct: 1255 ---MLDFTVLKRFATLSIPTRMTMASKAGRQAHAEIWRDLIDI 1294
Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 120/624 (19%), Positives = 248/624 (39%), Gaps = 124/624 (19%)
Query: 140 AKISVLEFDDSIHGLRITSMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
++S++ FD L S+H +++ E L G P ++ +P RC +LV+
Sbjct: 82 GRVSIVRFDAENQTLETESLHYYDAKFEELSALTVGA-----APRLEQEPAARC--LLVH 134
Query: 198 GLQMIILKASQGGSGLV-------------GDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+ + +G D G G S + +SH+ + D+K
Sbjct: 135 NGDCLAVLPLRGHEEEGEEAEEEEEHPAKRARTDADGRLVGASTVMPASHLHS----DIK 190
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
+VKD F+ G + + +L++ +L+W G T LS+ ++ +I L
Sbjct: 191 NVKDMRFLRGLNKSAVGVLYQPQLSWCGNEKLTRQTMKFIILSLDLDDEKSTVINMLQGL 250
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC--ALALNNYAVS-----------LDS 351
P+ + ++ + + G ++ G N + Y + + A++LN ++ S L +
Sbjct: 251 PNTLHTIIPLSN---GCVLAGVNELLYVDNTGALQGAISLNAFSNSGLNTRIQDNSKLQA 307
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL---------D 402
E P F+ + + D+ LL + + + + +GR++ +
Sbjct: 308 FFEQPLCYFATQSNG-------RDILLLMDEKARMYNVIIEAEGRLLTTFNCVQLPIVNE 360
Query: 403 LSKTN--PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+ K N P+ + ++ SL F+G + GD++ V+ + L S L+
Sbjct: 361 IFKRNMMPTSICGNMNLETGSL-FIGFQSGDAMHVRL------NNLKSSLEH-------- 405
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGP 514
+ + S+ L+ + + + LYG NN E +K F D L+NIGP
Sbjct: 406 -------KGTVSETLE--TDEDYMELYG---NNAEKEKKNLETESPFDIECLDRLLNIGP 453
Query: 515 LKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADSSRMAA 570
+ + G + + + ++ + EL + G T+ + + + +
Sbjct: 454 VTSLAVGKASSIEHTVAKLANPNKDELSIVATSGNGTGSHLTILENTIVPTVQQALKFIS 513
Query: 571 YDDEYH-------AYLIISLEARTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
++ YL+ + ++T + + D + ++ D+ T++ G +R
Sbjct: 514 VTQIWNLKIKGKDKYLVTTDSSQTRSDIYSIDRDFKPFKAADFRKNDTTVSTAVTGGGKR 573
Query: 623 VIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
++QV +G + D ++ MT + F V+ V I DP++LL S G
Sbjct: 574 IVQVTSKGVHLFDINFKRMMTMNFDF--------------EVVHVCIKDPFLLLTNSKGD 619
Query: 680 IRLLVGDPSTCTVSVQT--PAAIE 701
I++ +P V+T P A++
Sbjct: 620 IKIYELEPKHKKKFVKTVLPDALK 643
>sp|Q6E7D1|DDB1_SOLCE DNA damage-binding protein 1 OS=Solanum cheesmanii GN=DDB1 PE=3 SV=1
Length = 1095
Score = 87.8 bits (216), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 40/356 (11%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1139
+T P+ E ++ L + + ++ IGTAYV E+ +GR+L+F D
Sbjct: 764 STYPLDQFEYGCSI----LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIV---ED 816
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP----- 1194
L+ E KE KGA+ +L + G LL A KI L+KW E G
Sbjct: 817 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTECGHHG 873
Query: 1195 ---PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
LYV + +FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 874 HILALYVQTRG---DFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDDDI 930
Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
L + N +F K SE + +L E+H+G V +F ++
Sbjct: 931 YLG-----AENNFNLFT-VRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMR--- 981
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
P SD ++FGT++G IG IA L + L+ LQ L + V GL+
Sbjct: 982 ----LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQ 1037
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R F++ K + +D +L+ + L EI+ +++ + +L
Sbjct: 1038 WRSFYNEKKT--VDAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEEL 1091
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 113/502 (22%), Positives = 193/502 (38%), Gaps = 123/502 (24%)
Query: 95 GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
G+ L+ + ++G + +L + G +D + +A E K VL++D +
Sbjct: 49 GLQCICLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDTEASEV 104
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGL 213
+M + GR + G + +DP R G+ +Y GL +I ++G
Sbjct: 105 ITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK- 156
Query: 214 VGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
F+ R+E V++++ F++G +P +V+L++
Sbjct: 157 ----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ------DN 189
Query: 274 VSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
+H + +LK I W+ NL + A L+ VP P+ GVL++G TI
Sbjct: 190 KDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIGEETIV 243
Query: 331 YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
Y S SA A+ + + R+ V+ D + LL G L LL
Sbjct: 244 YCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGLLHLLV 286
Query: 391 VVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
+ ++ V L + + + S I+ + N+ F+GS GDS LV+
Sbjct: 287 ITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL----------- 335
Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDS 508
P TK S + L+ VN + + + + T S A +D
Sbjct: 336 ----------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD- 381
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
G L+ G+ IN AS VEL G KG+W++
Sbjct: 382 ----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--------------R 411
Query: 569 AAYDDEYHAYLIISLEARTMVL 590
+A DD Y +L++S + T VL
Sbjct: 412 SATDDPYDTFLVVSFISETRVL 433
>sp|Q6QNU4|DDB1_SOLLC DNA damage-binding protein 1 OS=Solanum lycopersicum GN=DDB1 PE=1
SV=1
Length = 1090
Score = 87.8 bits (216), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 40/356 (11%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1139
+T P+ E ++ L + + ++ IGTAYV E+ +GR+L+F D
Sbjct: 759 STYPLDQFEYGCSI----LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIV---ED 811
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP----- 1194
L+ E KE KGA+ +L + G LL A KI L+KW E G
Sbjct: 812 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTECGHHG 868
Query: 1195 ---PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
LYV + +FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 869 HILALYVQTRG---DFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDDDI 925
Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
L + N +F K SE + +L E+H+G V +F ++
Sbjct: 926 YLG-----AENNFNLFT-VRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMR--- 976
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
P SD ++FGT++G IG IA L + L+ LQ L + V GL+
Sbjct: 977 ----LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQ 1032
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R F++ K + +D +L+ + L EI+ +++ + +L
Sbjct: 1033 WRSFYNEKKT--VDAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEEL 1086
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 113/507 (22%), Positives = 196/507 (38%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
+ +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EASEVITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK I W+ NL + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S SA A+ + + R+ V+ D + LL G
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L LL + ++ V L + + + S I+ + N+ F+GS GDS LV+
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL------ 330
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
P TK S + L+ VN + + + + T S
Sbjct: 331 ---------------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
+A DD Y +L++S + T VL
Sbjct: 406 ----RSATDDPYDTFLVVSFISETRVL 428
>sp|Q9M0V3|DDB1A_ARATH DNA damage-binding protein 1a OS=Arabidopsis thaliana GN=DDB1A PE=1
SV=1
Length = 1088
Score = 85.1 bits (209), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ L EE E VR+L+ ++ +T P+ S E ++
Sbjct: 716 RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 771
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + T++ +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 772 ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 821
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 881 VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 937
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 938 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 990 TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDP 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+ G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK + WS +L + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S SA A+ + + ++ V++D + LL G
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGM 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+ LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA + S + L+ +N + + + + T S
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
D+ + +L++S + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428
>sp|O49552|DDB1B_ARATH DNA damage-binding protein 1b OS=Arabidopsis thaliana GN=DDB1B PE=2
SV=2
Length = 1088
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ + L + EE E VR+L+ ++ ++ P+ + E ++
Sbjct: 716 RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 771
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + T + +GTAYV E+ +GR+L+F + L+TE KE
Sbjct: 772 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL + KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
+GD+ KSI L +K + + A+D+ + A E L D L +D NI
Sbjct: 881 AVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLG---TDNCFNIFTVK 937
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
+ + + ++ E+H+G V +F ++ P SD ++FGT
Sbjct: 938 KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 990
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 991 VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + L+G + ++ + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
+ES E + G S G + +DP R G+ +Y GL +
Sbjct: 94 ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I ++G F+ R+E V++++ F++G +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK + WS NL + A L+ VPSP+
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S +A A+ + + ++ V+LD + LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+++ DA + S + L+ VN + + + +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428
>sp|A1A4K3|DDB1_BOVIN DNA damage-binding protein 1 OS=Bos taurus GN=DDB1 PE=2 SV=1
Length = 1140
Score = 81.3 bits (199), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 95/461 (20%), Positives = 171/461 (37%), Gaps = 116/461 (25%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP+ R G+ +Y ++ + L F+ R+E HVI+++
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 168
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
F++G P + +++ GR H + P W N+
Sbjct: 169 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 212
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
+A ++AVP P GG +++G +I YH+ A+A + + S V
Sbjct: 213 EAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 261
Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
+D + +L D+ G L +L + DG V ++ L + + + +T
Sbjct: 262 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 315
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ N + F+GSRLGDS LV+ S G+ +++ G I D R+
Sbjct: 316 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 374
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ T S A ++ G L+ G+ I+ AS
Sbjct: 375 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +R E L++S +T VL
Sbjct: 402 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
>sp|P33194|DDB1_CHLAE DNA damage-binding protein 1 OS=Chlorocebus aethiops GN=DDB1 PE=1
SV=1
Length = 1140
Score = 81.3 bits (199), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 95/461 (20%), Positives = 171/461 (37%), Gaps = 116/461 (25%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP+ R G+ +Y ++ + L F+ R+E HVI+++
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 168
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
F++G P + +++ GR H + P W N+
Sbjct: 169 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 212
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
+A ++AVP P GG +++G +I YH+ A+A + + S V
Sbjct: 213 EAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 261
Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
+D + +L D+ G L +L + DG V ++ L + + + +T
Sbjct: 262 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 315
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ N + F+GSRLGDS LV+ S G+ +++ G I D R+
Sbjct: 316 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 374
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ T S A ++ G L+ G+ I+ AS
Sbjct: 375 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +R E L++S +T VL
Sbjct: 402 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
>sp|Q16531|DDB1_HUMAN DNA damage-binding protein 1 OS=Homo sapiens GN=DDB1 PE=1 SV=1
Length = 1140
Score = 81.3 bits (199), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 95/461 (20%), Positives = 171/461 (37%), Gaps = 116/461 (25%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP+ R G+ +Y ++ + L F+ R+E HVI+++
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 168
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
F++G P + +++ GR H + P W N+
Sbjct: 169 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 212
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
+A ++AVP P GG +++G +I YH+ A+A + + S V
Sbjct: 213 EAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 261
Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
+D + +L D+ G L +L + DG V ++ L + + + +T
Sbjct: 262 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 315
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ N + F+GSRLGDS LV+ S G+ +++ G I D R+
Sbjct: 316 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 374
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ T S A ++ G L+ G+ I+ AS
Sbjct: 375 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +R E L++S +T VL
Sbjct: 402 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
>sp|Q3U1J4|DDB1_MOUSE DNA damage-binding protein 1 OS=Mus musculus GN=Ddb1 PE=1 SV=2
Length = 1140
Score = 80.9 bits (198), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
Score = 50.8 bits (120), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 96/461 (20%), Positives = 173/461 (37%), Gaps = 116/461 (25%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP+ R G+ +Y ++ + L F+ R+E HVI+++
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 168
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
F++G P + +++ GR H + P W N+
Sbjct: 169 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 212
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
+A ++AVP P GG +++G +I YH+ A+A + + S V
Sbjct: 213 EAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 261
Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
+D + +L D+ G L +L + DG V ++ L + + + +T
Sbjct: 262 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 315
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ N + F+GSRLGDS LV+ S G+ +++ G I D R+
Sbjct: 316 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 374
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ T S A ++ G L+ G+ I+ AS
Sbjct: 375 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D + L++S +T VL
Sbjct: 402 --------IDLPGIKGLWPL--RSDPGRETDDT------------LVLSFVGQTRVLMLN 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
>sp|Q5R649|DDB1_PONAB DNA damage-binding protein 1 OS=Pongo abelii GN=DDB1 PE=2 SV=1
Length = 1140
Score = 80.5 bits (197), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ +
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYPMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 109/550 (19%), Positives = 204/550 (37%), Gaps = 125/550 (22%)
Query: 96 ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
++A L V ++G + + + G +D + + + +LE+ S +
Sbjct: 44 VTAEGLRPVKEVGMYGKIAVMELFRPKG----ESKDLLFILTAKYNVCILEYKQSGESID 99
Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
I + + + + GR S G + +DP+ R G+ +Y ++ + L
Sbjct: 100 IIT----RAHGNVQDRIGRPS-ETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKEL-- 152
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
F+ R+E HVI+++ F++G P + +++ GR
Sbjct: 153 --------KAFNIRLEELHVIDVK-----------FLYGCQAPTICFVYQDP---QGR-- 188
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
H + P W N+ +A ++AVP P GG +++G +I YH+
Sbjct: 189 ---HVKTYEVSLREKEFNKGP--WKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGD 243
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSV---ELDAAHATWLQNDVALLSTKTGDLVLLTV- 391
A+A + + S V +D + +L D+ G L +L +
Sbjct: 244 KYLAIA-----------PPIIKQSTIVCHNRVDPNGSRYLLGDME------GRLFMLLLE 286
Query: 392 ---VYDGRV-VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS---GTS 444
DG V ++ L + + + +T + N + F+GSRLGDS LV+ S G+
Sbjct: 287 KEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSY 346
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
+++ G I D R+ + T S A
Sbjct: 347 VVAMETFTNLGPI-VDMCVVDLERQGQGQLV------------------------TCSGA 381
Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
++ G L+ G+ I+ AS ++LPG KG+W + +R
Sbjct: 382 FKE-----GSLRIIRNGIGIHEHAS------------IDLPGIKGLWPLRSDPNR----- 419
Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
E L++S +T VL E TE + + +T GN+ +++I
Sbjct: 420 ---------ETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNV-AHQQLI 469
Query: 625 QVFERGARIL 634
Q+ R++
Sbjct: 470 QITSASVRLV 479
>sp|Q6P6Z0|DDB1_XENLA DNA damage-binding protein 1 OS=Xenopus laevis GN=ddb1 PE=2 SV=1
Length = 1140
Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 144/333 (43%), Gaps = 42/333 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V ++ + GR+++F N L T V KE+KGA+ ++
Sbjct: 823 KDPTTYFVVGTAMVYPDEAEPKQGRIVVFQY-----NDGKLQT-VAEKEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSPPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIES 1096
Query: 1395 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ +M + L+I +G R + +L
Sbjct: 1097 FLDISRPKMQEVIANLQIDDGSGMKRETTVDDL 1129
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 97/455 (21%), Positives = 172/455 (37%), Gaps = 110/455 (24%)
Query: 193 GVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFV 252
G++ +MI L+ G ++ E F+ R+E HVI+++ L FV
Sbjct: 122 GIIDPDCRMIGLRLYDGLFKVIPLERDNKELKAFNIRLEELHVIDVKFLYSCQAPTICFV 181
Query: 253 HG-----YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
+ +++ V L E+E + + P W N+ +
Sbjct: 182 YQDPQGRHVKTYEVSLREKEFS------------------------KGP--WKQENVEAE 215
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV---EL 364
A ++AVP P GG +++G +I YH+ A+A + + S V +
Sbjct: 216 ASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCHNRV 264
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDITTIG 419
D + +L D+ G L +L + DG V ++ L + + + +T +
Sbjct: 265 DVNGSRYLLGDME------GRLFMLLLEKEEQMDGSVTLKDLRVELLGETSIAECLTYLD 318
Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
N + F+GSRLGDS LV+ T S + E F ++ + DM
Sbjct: 319 NGVVFVGSRLGDSQLVKLTTESNEQGSYVVVMETFTNL---------------GPIVDMC 363
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
+ T S A ++ G L+ G+ I+ AS
Sbjct: 364 -------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---------- 401
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
++LPG KG+W + R+AA D + L++S +T VL E
Sbjct: 402 --IDLPGIKGLWPL-------------RVAA-DRDTDDTLVLSFVGQTRVLTLTGEEVEE 445
Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
T+ + +T GN+ +++IQ+ R++
Sbjct: 446 TDLAGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
>sp|Q9ESW0|DDB1_RAT DNA damage-binding protein 1 OS=Rattus norvegicus GN=Ddb1 PE=2 SV=1
Length = 1140
Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 125/285 (43%), Gaps = 38/285 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
K+ T +GTA V E+ + GR+++F S G+ + V KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSGGK--------LQTVAEKEVKGAVYSMV 874
Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 875 EFNGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 930 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986
Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+ Q L FH+G V F L +Q L +S T ++L GT++G
Sbjct: 987 EERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLLGTVNGM 1036
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
IG + L E + L +Q +L + V + +R FH+ K
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 94/461 (20%), Positives = 170/461 (36%), Gaps = 116/461 (25%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP+ R G+ +Y ++ + L F+ R+E HVI+++
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 168
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
F++G P + +++ GR H + P W N+
Sbjct: 169 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 212
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
+A ++AVP P GG +++G +I YH+ A+A + + S V
Sbjct: 213 EAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 261
Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
+D + +L D+ G L +L + DG V ++ L + + + +T
Sbjct: 262 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 315
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ N + F+GSRLGDS V+ S G+ +++ G I D R+
Sbjct: 316 YLDNGVVFVGSRLGDSQPVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 374
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ T S A ++ G L+ G+ I+ AS
Sbjct: 375 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +R E L++S +T VL
Sbjct: 402 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
>sp|Q805F9|DDB1_CHICK DNA damage-binding protein 1 OS=Gallus gallus GN=DDB1 PE=2 SV=1
Length = 1140
Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 874
Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 875 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 930 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986
Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+ Q L H+G V F L +Q L +S T + +LFGT++G
Sbjct: 987 EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 1036
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
IG + L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLI 1094
Query: 1393 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ +M + L+I +G R + +L
Sbjct: 1095 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 77/347 (22%), Positives = 132/347 (38%), Gaps = 85/347 (24%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++AVP P GG +++G +I YH+ A+A + +
Sbjct: 207 WKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQ 255
Query: 359 SFSV---ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSV 410
S V +D + +L D+ G L +L + DG V ++ L + +
Sbjct: 256 STIVCHNRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETS 309
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRL 467
+ +T + N + F+GSRLGDS LV+ S G+ +++ G I D
Sbjct: 310 IAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLE 368
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
R+ + T S A ++ G L+ G+ I+
Sbjct: 369 RQGQGQLV------------------------TCSGAFKE-----GSLRIIRNGIGIHEH 399
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
AS ++LPG KG+W + S R E L++S +T
Sbjct: 400 AS------------IDLPGIKGLWPLRSDSHR--------------EMDNMLVLSFVGQT 433
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
VL E TE + +T GN+ +++IQ+ R++
Sbjct: 434 RVLMLNGEEVEETELTGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
>sp|Q9XYZ5|DDB1_DROME DNA damage-binding protein 1 OS=Drosophila melanogaster GN=pic PE=1
SV=1
Length = 1140
Score = 65.1 bits (157), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D + +L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVAPETISSLMSAKLGDDPNTYYV 831
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ + + + L L+++L + V + +R F N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINSKVE--PSEGFIDGDLIES 1097
Query: 1395 Y 1395
+
Sbjct: 1098 F 1098
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>sp|Q52E49|RSE1_MAGO7 Pre-mRNA-splicing factor RSE1 OS=Magnaporthe oryzae (strain 70-15 /
ATCC MYA-4617 / FGSC 8958) GN=RSE1 PE=3 SV=2
Length = 1216
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 84/369 (22%), Positives = 164/369 (44%), Gaps = 52/369 (14%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
I + ++E AL++ VV+ +++ E+ L +GT G+D+ R F+ G R
Sbjct: 879 IDLDNNEAALSMAVVSF---ASQDGESFLVVGT----GKDMVVNPR--RFTEGYIHVYRF 929
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
+++ + L ++ +++ +AL QG L+ G + ++ +L A + P
Sbjct: 930 SEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGIGRMLRIYDLGLRQLLRKAQAEVAPQL 988
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
+VSLN + I++GD+ + ++++K + +L A D + T + ST
Sbjct: 989 IVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAG--- 1045
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL-----RLQMLA-------- 1304
+D+ N+ I K S+ +E H+ H +L RL ++A
Sbjct: 1046 ADKFGNLWILRCPEKASQESDEPG----SEVHL-VHSRDYLHGTSNRLALMAHVYTQDIP 1100
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
TS +T G + LL+G G+IG + P ++ F QSL++ L P +
Sbjct: 1101 TSICKTNLVVGGQE----VLLWGGFQGTIGVLIPFVSREDADF--FQSLEQHLRSEDPPL 1154
Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
AG + +R + K ++D +L Y MLP +++ IA + + +I +
Sbjct: 1155 AGRDHLMYRGCYVPVKG-------VIDGDLCERYTMLPNDKKQMIAGELDRSVREIERKI 1207
Query: 1422 NDLALGTSF 1430
+D+ ++F
Sbjct: 1208 SDIRTRSAF 1216
Score = 41.6 bits (96), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 62/280 (22%), Positives = 108/280 (38%), Gaps = 70/280 (25%)
Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
LL T+ GDL +T+ V D V+RL + + +++++ + + F+ S G
Sbjct: 310 LLQTEDGDLFKVTIDMVEDAEGNPTGEVRRLKIKYFDTIPVSNNLCILKSGFLFVASEFG 369
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG-EELSLYGS 489
+ L QF E+ GD + L SSD D E + Y
Sbjct: 370 NHLFYQF--------------EKLGD------DDEELEFFSSDFPVDPKEPYEPVYFYPR 409
Query: 490 ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV--- 542
+ N A+ +S+ ++ PL D DA + +G +S + ++
Sbjct: 410 PTEN---------LALVESIDSMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRMLKHG 460
Query: 543 ---------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
+LPG +WT + DDEY AY+++S T+VL
Sbjct: 461 LEVNEIVASQLPGTPSAVWTTKLRR--------------DDEYDAYIVLSFTNGTLVLSI 506
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ + EV+++ F+ A G ++QV +G R
Sbjct: 507 GETVEEVSDT--GFLSSVPTLAVQQLGDDGLVQVHPKGIR 544
>sp|Q4PGM6|RSE1_USTMA Pre-mRNA-splicing factor RSE1 OS=Ustilago maydis (strain 521 / FGSC
9021) GN=RSE1 PE=3 SV=1
Length = 1221
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 150/374 (40%), Gaps = 54/374 (14%)
Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFST 1134
QT + M +E A ++ VV + E E +L +G+A DV R +T
Sbjct: 878 QTTHRLEMDDNEAAFSIAVVPF---ASAEKEVMLVVGSAV----DVVLSPRSCKKAYLTT 930
Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
R DN + L ++ E+ L + QG LL G + ++ +L +
Sbjct: 931 YRLLDNGRELEL-LHKTEVDDIPLVLRAFQGRLLAGIGKALRIYDLGKKKLLRKCENRSF 989
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----------------FGS 1238
P VVSL+ + I++GD+ +SI F S+K +L A D +
Sbjct: 990 PTAVVSLDAQGSRIVVGDMQESIIFASYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAA 1049
Query: 1239 LDCFATEFL--IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
D F ++ +DG+T S V ++ + I + P + + L+ A F VG +T
Sbjct: 1050 ADKFGNIYVLRLDGNT-SRSVDEDPTGMTIVHEKPVLMGAAHKASLV--AHFFVGDIITS 1106
Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLV 1355
R M+A R LL+ L GSIG + P + + L +L+ L
Sbjct: 1107 LHRTAMVA--------------GGREVLLYTGLSGSIGALVPFVSKEDVDTLSTLESHLR 1152
Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
+ G + ++R ++ K S++D +L + +L +Q IA +
Sbjct: 1153 QENNSIVGRDHLAYRSSYAPVK-------SVIDGDLCETFGLLSPAKQNAIAGELDRKPG 1205
Query: 1416 QILSNLNDLALGTS 1429
+I L L G +
Sbjct: 1206 EINKKLAQLREGAT 1219
>sp|Q21554|DDB1_CAEEL DNA damage-binding protein 1 OS=Caenorhabditis elegans GN=ddb-1 PE=1
SV=2
Length = 1134
Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 56/281 (19%), Positives = 119/281 (42%), Gaps = 17/281 (6%)
Query: 1104 TKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
T ++ T +GT + ++ + GR+++F D ++ + V+ ++G+ A+
Sbjct: 814 TNDSSTYYVVGTGLIYPDETETKIGRIVVFEVD---DVERSKLRRVHELVVRGSPLAIRI 870
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
L G L+ A I L +WT + + + + L ++ + + D+ +S+ LS+
Sbjct: 871 LNGKLVAAINSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSY 930
Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
+ +AKD+ S EF+ S L +++ P + G+ +
Sbjct: 931 RMLEGNFEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDD---GRYV 987
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLD 1340
L + + K + L + D +++ ++FGT G+IG I +D
Sbjct: 988 LEPTGYWYLGELPKVMTRSTLVIQPE--------DSIIQYSQPIMFGTNQGTIGMIVQID 1039
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
+ + L +++K + DSV + + S+R F +A P
Sbjct: 1040 DKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQKRAEPP 1080
Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 166/396 (41%), Gaps = 96/396 (24%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VP IGGV+V+G+N++ Y + Y SL L ++F+ +
Sbjct: 210 DSSVLIPVPHAIGGVIVLGSNSVLYKPNDNLGEVV--PYTCSL-----LENTTFTCHGIV 262
Query: 365 DAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
DA+ +L LS G L++L T G V+ + + + + I I N
Sbjct: 263 DASGERFL------LSDTDGRLLMLLLNVTESQSGYTVKEMRIDYLGETSIADSINYIDN 316
Query: 421 SLFFLGSRLGDSLLVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
+ F+GSRLGDS L++ T +G S S + E + +I ++DMV
Sbjct: 317 GVVFVGSRLGDSQLIRLMTEPNGGSY--SVILETYSNI---------------GPIRDMV 359
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
E ++ + T + A +D G L+ G+ I+ AS
Sbjct: 360 MVE---------SDGQPQLVTCTGADKD-----GSLRVIRNGIGIDELAS---------- 395
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
V+L G GI+ + S NAD+ Y+I+SL T VL+ E
Sbjct: 396 --VDLAGVVGIFPIRLDS----NADN------------YVIVSLSDETHVLQITGEELED 437
Query: 600 TESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
+ ++ TI A LFG ++Q E+ R++ S +++ + P+N E S
Sbjct: 438 VKLLEINTDLPTIFASTLFGPNDSGIILQATEKQIRLMSSSGLSK--FWEPTNGEIISK- 494
Query: 657 ENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
+SV+ A+ ++L D ++ LL TC V
Sbjct: 495 -----VSVNAANGQIVLAARD-TVYLL-----TCIV 519
>sp|Q7RYR4|RSE1_NEUCR Pre-mRNA-splicing factor rse-1 OS=Neurospora crassa (strain ATCC
24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
GN=rse-1 PE=3 SV=2
Length = 1209
Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 168/392 (42%), Gaps = 62/392 (15%)
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQG 1120
RA G W + +I SE ++ + L N ++E E+ L +GT G
Sbjct: 847 RAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEGESFLVVGT----G 902
Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
+D+ R F+ G R ++ ++L ++ ++ AL QG LL G +
Sbjct: 903 KDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPLALIPFQGRLLAGVGKTL 959
Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
++ +L A D P +VSL N I++GD+ + I ++ +K +G +L A D
Sbjct: 960 RIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGITYVVYKAEGNRLIPFADD 1019
Query: 1236 FGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
+L+ + T ++D S+ D+ NI I ++S+ +E H+ H
Sbjct: 1020 --TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPERVSQETDEPG----SEIHL-MHA 1070
Query: 1295 TKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
+L RL Q L TS +T G LL+ L G++G P
Sbjct: 1071 RNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVS 1126
Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+++ F Q+L+ + P +AG + +R +++ K ++D +L + +L
Sbjct: 1127 REDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLL 1177
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
P +++ IA + + +I ++D+ ++F
Sbjct: 1178 PNDKKQMIAGELDRSVREIERKISDIRTRSAF 1209
Score = 37.0 bits (84), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/139 (24%), Positives = 65/139 (46%), Gaps = 17/139 (12%)
Query: 543 ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
ELPG +WT +++ YD +Y AY+++S T+VL + + EV++
Sbjct: 470 ELPGTPSAVWT-------------TKLTKYD-QYDAYIVLSFTNGTLVLSIGETVEEVSD 515
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 661
S + T+A + G +IQV +G R + + + + + + + +EN V
Sbjct: 516 S-GFLTTAPTLAVQQM-GEDGLIQVHPKGIRHIVQGRVNEWPAPQHRSIVAATANENQVV 573
Query: 662 LSVSIADPYVLLGMSDGSI 680
+++S + SDGS+
Sbjct: 574 IALSSGEIVYFEMDSDGSL 592
>sp|O13807|DDB1_SCHPO DNA damage-binding protein 1 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=ddb1 PE=1 SV=1
Length = 1072
Score = 54.3 bits (129), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 95/492 (19%), Positives = 190/492 (38%), Gaps = 97/492 (19%)
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RES GPL+ VDP R + VY + I+ + + + FS RI+
Sbjct: 111 RES-QSGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQEL 169
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V+ D ++ P + +L++ + ++K ++
Sbjct: 170 NVV-----------DIAMLYNSSRPSLAVLYKDSKSIVHLSTYK------------INVR 206
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
+ + + + HD + +PS GGV V G ++Y S+ + L Y
Sbjct: 207 EQEIDEDDV-VCHDIEEGKLIPSENGGVFVFGEMYVYYISKDIQVSKLLLTY-------- 257
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P ++FS + T L + + +++ ++G L ++ V ++L K S + S
Sbjct: 258 --PITAFSPSISNDPETGLDSSIYIVADESGMLYKFKALFTDETVS-MELEKLGESSIAS 314
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ + ++ F+GS +S+L+Q PS + +
Sbjct: 315 CLIALPDNHLFVGSHFNNSVLLQL------------------------PSITK-NNHKLE 349
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
LQ+ VN +S + + T S+ T S A +D G L+ + I
Sbjct: 350 ILQNFVNIAPISDFIIDDDQTGSSIITCSGAYKD-----GTLRIIRNSINI--------- 395
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
N L+E+ G K ++V S A YD+ + +L + E R +++
Sbjct: 396 ---ENVALIEMEGIKDFFSV------------SFRANYDN--YIFLSLICETRAIIVSPE 438
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
+ + + D + TI ++G +++Q+ + R+ DG + +S P + G
Sbjct: 439 GVF---SANHDLSCEESTIFVSTIYGNSQILQITTKEIRLFDGKKLHSWIS--PMSITCG 493
Query: 654 SGSENSTVLSVS 665
S ++ ++V+
Sbjct: 494 SSFADNVCVAVA 505
Score = 42.0 bits (97), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 64/290 (22%), Positives = 124/290 (42%), Gaps = 38/290 (13%)
Query: 1111 LAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+ +GT + +D GR+++F +DN + E +++G+++ L L HL++
Sbjct: 770 VVVGTGFNFPDQDAPDSGRLMVFEM--TSDNNIEMQAE---HKVQGSVNTLV-LYKHLIV 823
Query: 1170 A--SGPKIILHKWTGTE--LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
A + I GT N I P Y + +++ ++ I+ D+ KSI L + +
Sbjct: 824 AGINASVCIFEYEHGTMHVRNSIR----TPTYTIDISVNQDEIIAADLMKSITVLQFIDD 879
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY---YAPKMSESWKGQKL 1282
QL +A+D+ L + E L S V++ N I +P++S+ +KL
Sbjct: 880 --QLIEVARDYHPLWATSVEIL---SERKYFVTEADGNAVILLRDNVSPQLSDR---KKL 931
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
+F++G + K R D++ P LL T+DGS+ +
Sbjct: 932 RWYKKFYLGELINK-TRHCTFIEPQDKSLVTP--------QLLCATVDGSLMIVGDAGMS 982
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
L LQ + +P GL+ + ++++ + P ++D L+
Sbjct: 983 NTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET---SPSDLIDGSLI 1029
>sp|Q54SA7|SF3B3_DICDI Probable splicing factor 3B subunit 3 OS=Dictyostelium discoideum
GN=sf3b3 PE=3 SV=1
Length = 1256
Score = 54.3 bits (129), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 54/289 (18%), Positives = 127/289 (43%), Gaps = 29/289 (10%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
+Y E++ + A+A QG L+ G I ++ +L P +V+++ + +
Sbjct: 979 LYKTEVEEPVYAMAQFQGKLVCGVGKSIRIYDMGKKKLLRKCETKNLPNTIVNIHSLGDR 1038
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
+++GDI +SI+F+ +K L + A D + ++D T++ +D+ NI +
Sbjct: 1039 LVVGDIQESIHFIKYKRSENMLYVFADDLAP-RWMTSSVMLDYDTVA--GADKFGNIFVL 1095
Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGA---------HVTKFLRLQMLATSSDRTGAAPGSDK 1318
+S+ + ++ +F G H+ F + T + + G +
Sbjct: 1096 RLPLLISDEVEEDPTGTKLKFESGTLNGAPHKLDHIANFFVGDTVTTLNKTSLVVGGPE- 1154
Query: 1319 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
+L+ T+ G+IG + P +++ F +L+ + + G + ++R ++
Sbjct: 1155 ----VILYTTISGAIGALIPFTSREDVDF--FSTLEMNMRSDCLPLCGRDHLAYRSYYFP 1208
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
K +I+D +L + L ++QL I+ + + S+++ L ++
Sbjct: 1209 VK-------NIIDGDLCEQFSTLNYQKQLSISEELSRSPSEVIKKLEEI 1250
Score = 43.9 bits (102), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 124/338 (36%), Gaps = 75/338 (22%)
Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQN 374
GGVLV + I Y +Q + + +PR S L +H++ Q
Sbjct: 256 GGVLVASEDYIVYRNQDHA------------EVRSRIPRRYGSDPNKGVLIISHSSHKQK 303
Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
+ L+ ++ GDL +T+ Y G V ++++ + VL + +T + N F S GD
Sbjct: 304 GMFFFLVQSEHGDLYKITLDYQGDQVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDH 363
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL----RRSSSDALQDMVNGEELSLYG 488
L F S G +EE G + L R S ++++ N E S
Sbjct: 364 TLYFFK--------SIGDEEEEGQAKRLEDKDGHLWFTPRNSCGTKMEELKNLEPTSHLS 415
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGP-------------LKDFSYGLRINADASATGISK 535
S S F V D + P LK +GL + +A
Sbjct: 416 SLS-------PIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTAN---- 464
Query: 536 QSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
LPG GIWTV +S NA D+ Y+++S T VL D
Sbjct: 465 --------LPGVPSGIWTVPKSTS--PNA--------IDQTDKYIVVSFVGTTSVLSVGD 506
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ E ES ++ T G +IQVF G R
Sbjct: 507 TIQENHES--GILETTTTLLVKSMGDDAIIQVFPTGFR 542
>sp|P0CR22|RSE1_CRYNJ Pre-mRNA-splicing factor RSE1 OS=Cryptococcus neoformans var.
neoformans serotype D (strain JEC21 / ATCC MYA-565)
GN=RSE1 PE=3 SV=1
Length = 1217
Score = 50.8 bits (120), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 32/285 (11%)
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
LA QG LL G + L++ L + P VV++N+ I++GD+ +S ++
Sbjct: 951 LAGFQGFLLAGIGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1010
Query: 1220 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
++ QL + A D + C + +D T++ D+ NI I P +SE
Sbjct: 1011 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1065
Query: 1277 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
G +L F +GA H T+ + + + TS + G R L++ T
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1121
Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
+ G++G + P D++ F + +L+ + + G + ++R ++ K
Sbjct: 1122 ISGAVGALVPFVSSDDIEF--MSTLEMHMRTQDISLVGRDHIAYRGYYVPIKG------- 1172
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+VD +L + +LP +Q IA + +L L + ++F
Sbjct: 1173 VVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQMRTSSAF 1217
Score = 35.4 bits (80), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 57/267 (21%), Positives = 102/267 (38%), Gaps = 47/267 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL + + ++G V L + + + + + + ++ S D L QF
Sbjct: 313 LLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCILKRGYIYVASEFSDQNLYQF 372
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAP----STKRLRR----SSSDALQDMVNGEELSL 486
G SS E G+I+ P + LR + +L + + ++L
Sbjct: 373 QSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLLLVDTVPSLDPITDAHVVNL 432
Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
G AS++T R + + +GL + S+ LPG
Sbjct: 433 LG-ASSDTPQIYAACGRGARSTF------RTLKHGLDVAEMVSS------------PLPG 473
Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+WT+ DDEY +Y+++S T+VL + + EV ++ +
Sbjct: 474 VPTNVWTL--------------KLTEDDEYDSYIVLSFPNGTLVLSIGETIEEVNDT-GF 518
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGAR 632
G T+A L G ++QV G R
Sbjct: 519 LSSGPTLAVQQL-GNAGLLQVHPYGLR 544
>sp|P0CR23|RSE1_CRYNB Pre-mRNA-splicing factor RSE1 OS=Cryptococcus neoformans var.
neoformans serotype D (strain B-3501A) GN=RSE1 PE=3 SV=1
Length = 1217
Score = 50.8 bits (120), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 32/285 (11%)
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
LA QG LL G + L++ L + P VV++N+ I++GD+ +S ++
Sbjct: 951 LAGFQGFLLAGIGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1010
Query: 1220 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
++ QL + A D + C + +D T++ D+ NI I P +SE
Sbjct: 1011 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1065
Query: 1277 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
G +L F +GA H T+ + + + TS + G R L++ T
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1121
Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
+ G++G + P D++ F + +L+ + + G + ++R ++ K
Sbjct: 1122 ISGAVGALVPFVSSDDIEF--MSTLEMHMRTQDISLVGRDHIAYRGYYVPIKG------- 1172
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+VD +L + +LP +Q IA + +L L + ++F
Sbjct: 1173 VVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQMRTSSAF 1217
Score = 35.4 bits (80), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 57/267 (21%), Positives = 102/267 (38%), Gaps = 47/267 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL + + ++G V L + + + + + + ++ S D L QF
Sbjct: 313 LLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCILKRGYIYVASEFSDQNLYQF 372
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAP----STKRLRR----SSSDALQDMVNGEELSL 486
G SS E G+I+ P + LR + +L + + ++L
Sbjct: 373 QSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLLLVDTVPSLDPITDAHVVNL 432
Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
G AS++T R + + +GL + S+ LPG
Sbjct: 433 LG-ASSDTPQIYAACGRGARSTF------RTLKHGLDVAEMVSS------------PLPG 473
Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+WT+ DDEY +Y+++S T+VL + + EV ++ +
Sbjct: 474 VPTNVWTL--------------KLTEDDEYDSYIVLSFPNGTLVLSIGETIEEVNDT-GF 518
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGAR 632
G T+A L G ++QV G R
Sbjct: 519 LSSGPTLAVQQL-GNAGLLQVHPYGLR 544
>sp|Q4WLI5|RSE1_ASPFU Pre-mRNA-splicing factor rse1 OS=Neosartorya fumigata (strain ATCC
MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=rse1 PE=3
SV=1
Length = 1225
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 98/462 (21%), Positives = 197/462 (42%), Gaps = 48/462 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + +Q + ++ S DN + IPL TP ++ E+ L+ +I S
Sbjct: 759 QCVEGMVGIQAQNL----RIFSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDN 814
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
VL P + + LI+ + + L D + ++I++P A
Sbjct: 815 NVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---VI 869
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
+TI ++ +E A+++ V +++++ET L +GTA + +A G + ++ R
Sbjct: 870 STIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL QG LL G + ++ +L +
Sbjct: 924 EDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRIYDLGMKQLLRKCQAQVVSKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQDNILIPFVDDSVSRWTTSTT-MVDYETVA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDRT 1310
D+ N+ + K SE G L+ + GA L + Q + TS +T
Sbjct: 1040 DKFGNLWLVRCPKKASEEADEDGSGAHLIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
+R +++ K ++D +L Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188
Score = 35.4 bits (80), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
DE+ AY+I+S T+VL + + EVT++ + T+A L G +IQV RG R
Sbjct: 484 DEFDAYIILSFANGTLVLSIGETVEEVTDT-GFLSTAPTLAVQQL-GEDSLIQVHPRGIR 541
>sp|B0M0P5|DDB1_DICDI DNA damage-binding protein 1 OS=Dictyostelium discoideum GN=repE PE=1
SV=1
Length = 1181
Score = 48.1 bits (113), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 57/270 (21%), Positives = 115/270 (42%), Gaps = 25/270 (9%)
Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----N 1206
+ + ++ L S G L+ A ++ ++T ++ + ++ I+K +
Sbjct: 913 KFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVISSESVHKGHTMILKLASRGH 972
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
FIL+GD+ KS+ L + G+ L +A++ + + + D + ++ N +
Sbjct: 973 FILVGDMMKSMSLLVEQSDGS-LEQIARNPQPIWIRSVAMINDDY---FIGAEASNNFIV 1028
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA---PGSDKTNRFA 1323
+ + + L S +H+G + +S R G+ P SD+
Sbjct: 1029 VKKNNDSTNELERELLDSVGHYHIGESI-----------NSMRHGSLVRLPDSDQPIIPT 1077
Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
+L+ +++GSIG +A + E F LQK L V V G + ++R F ++ H
Sbjct: 1078 ILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAFSND--HHTIDS 1135
Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
+ +D +L+ + L E QL+ G T
Sbjct: 1136 KNFIDGDLIETFLDLKYESQLKAVADLGIT 1165
Score = 40.8 bits (94), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V N+R L+ V D F++G P + +L + + H S T L
Sbjct: 192 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 243
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
P WS N+ Y L VP P+GGVLVV N I Y + + ++A+ +Y L ++
Sbjct: 244 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 298
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
V+ D + L G L +L +++ + V L + + S
Sbjct: 299 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 342
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
I+ + + + ++GS GDS L++
Sbjct: 343 SISYLDSGVVYIGSSSGDSQLIRL 366
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.134 0.393
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 521,901,764
Number of Sequences: 539616
Number of extensions: 22279362
Number of successful extensions: 51710
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 42
Number of HSP's that attempted gapping in prelim test: 51273
Number of HSP's gapped (non-prelim): 220
length of query: 1431
length of database: 191,569,459
effective HSP length: 130
effective length of query: 1301
effective length of database: 121,419,379
effective search space: 157966612079
effective search space used: 157966612079
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 68 (30.8 bits)