BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000897
(1234 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359473774|ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Vitis vinifera]
Length = 1238
Score = 1295 bits (3351), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 743/1290 (57%), Positives = 886/1290 (68%), Gaps = 117/1290 (9%)
Query: 1 MG-KDVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLY 59
MG +DVEEGEISD+ASVEEISEEDF KQE V+V++E KP +A RVWTMRDL
Sbjct: 10 MGIEDVEEGEISDSASVEEISEEDFN-KQE--VRVLREAKP------KADTRVWTMRDLQ 60
Query: 60 N--KYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGA 117
+ KY C GY P L+NLAWAQAVQNKPLN+IFVM+ E+ S SS S S ++
Sbjct: 61 DLYKYHQACSGYTPRLYNLAWAQAVQNKPLNDIFVMDDEESKRSSSSSNTSRDDSSSA-- 118
Query: 118 AAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIE------LDLESESNEK-------VS 164
K V KV+IDDSGDE++ + ++ E E +DL+SE + K V+
Sbjct: 119 -------KEVAKVIIDDSGDEMDVKMDDVSEKEEGELEEGEIDLDSEPDVKDEGGVLDVN 171
Query: 165 E---QVKEEMKLINVESIREALESV--LRGDISFEGVCSKLEFTLESLREL-----VNEN 214
E +KE + V+SI+E LESV + + SF GVCS+L+ TL SL+++ V E+
Sbjct: 172 EPEIDLKERELVERVKSIQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGES 231
Query: 215 NVPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKE 274
+VPTKDAL Q +A+++++ VFCSMN KE NK++ SRLLS ++ + P+FS IKE
Sbjct: 232 SVPTKDALAQQLINAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKE 291
Query: 275 MEAMLSSLVT-------RANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPL---P 324
+E M+S L T A+DK D+ G+N + E++ +K+ L
Sbjct: 292 VEVMMSFLDTPAAQSSAEASDKVNDVQVTDGMNRNILDSSVESSGRAFASAKKLSLDSIS 351
Query: 325 VDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVV 384
V+S QN P +A KPG R R + PLLD HK HD DSLPSPT + PV ++ +V
Sbjct: 352 VESYNQNNP-DALKPGLSSSRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPVNKSELV 410
Query: 385 GDGMVKSWAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEE 444
AK++H + YETDAL+A S+YQQKFG SF +LPSPTPSEE
Sbjct: 411 ----------TAKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEE 460
Query: 445 SGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSA 504
SGD GD GE+SS++ + P N P LG VSS P MD S VQ T N++
Sbjct: 461 SGDTYGDISGEVSSSSTISAPITANAPALGHPIVSSAPQ-----MDSSIVQGPTVGRNTS 515
Query: 505 PASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNA--LNLNHQPAPILHNAPKVEPVGRVM 562
SSG P + + V A KSRDPRLR ASS+A L+LN +P P + N+PKV+P+G ++
Sbjct: 516 LVSSG--PHLDSSVVASA--KSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIV 571
Query: 563 SSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLL 622
SSRKQK+ EEP+LDGP KRQRNG + VRD + + SGGWLED++ PQ+MNRN L
Sbjct: 572 SSRKQKSAEEPLLDGPVTKRQRNGLTSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQL 631
Query: 623 VDSAESNSRKLDNGAT-SPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTML 681
+++ ++ +KL++ T + I P V V+GNE P STT SL +LLKDIAVNP +
Sbjct: 632 IENTGTDPKKLESKVTVTGIGCDKPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVW 691
Query: 682 LNILKMGQQQKLAADAQQKSNDSSMNTMHPPIPSSI----PPVSVTCSIPSGILSKP--- 734
+NI +QQK S D + NT+ PP +SI PP SV PS + KP
Sbjct: 692 MNIFNKVEQQK--------SGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGA 743
Query: 735 --------MDELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLN 785
MDE GKVRMKPRDPRR+LH N+ QRSGS G E FKT N
Sbjct: 744 LQVPQTGPMDESGKVRMKPRDPRRILHANSFQRSGSSGSEQFKT---------------N 788
Query: 786 FQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQ 845
QKQ E K V S SV PDI+QQFTKNLK+IAD MS SQ + P Q Q Q
Sbjct: 789 AQKQEDQTETKSVPSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQ 848
Query: 846 IKSG-ADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKE 904
+ + D+KA V++ D+ T GS PE+ ++ WGDVEHLF+GYDDQQKAAIQ+E
Sbjct: 849 VNTDRMDVKATVSDSGDQLTANGSKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRE 908
Query: 905 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
R RR+EEQKKMFSARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREK RHLF
Sbjct: 909 RARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLF 968
Query: 965 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
RFPHMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+
Sbjct: 969 RFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISK 1028
Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1084
GDDGD DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR
Sbjct: 1029 GDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1088
Query: 1085 QFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKIL 1144
QFGL GPSLLEIDHDER EDGTLASSL VIER+H+ FFS+++LD+VDVRNILA+EQRKIL
Sbjct: 1089 QFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKIL 1148
Query: 1145 AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 1204
AGCRIVFSRVFPVGEANPHLHPLWQTAE FGAVCT ID+QVTHVVANSLGTDKVNWALS
Sbjct: 1149 AGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALS 1208
Query: 1205 TGRFVVHPGWVEASALLYRRANEQDFAIKP 1234
TGRFVVHPGWVEASALLYRRANEQDFAIKP
Sbjct: 1209 TGRFVVHPGWVEASALLYRRANEQDFAIKP 1238
>gi|224053553|ref|XP_002297869.1| predicted protein [Populus trichocarpa]
gi|222845127|gb|EEE82674.1| predicted protein [Populus trichocarpa]
Length = 1117
Score = 1232 bits (3187), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 707/1204 (58%), Positives = 831/1204 (69%), Gaps = 130/1204 (10%)
Query: 72 GLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDDKKVVEKVV 131
GL+NLAWA+AVQNKPLNE+ VV
Sbjct: 3 GLYNLAWARAVQNKPLNEL--------------------------------------TVV 24
Query: 132 IDDSGDE------IEKEEGELEEGEIELDLESE-----SNEKVSEQVKEEMKLINVESIR 180
IDDSGDE I+ E+ E E E E+DL+SE S VS V+ +K SIR
Sbjct: 25 IDDSGDEMDVVKVIDIEKEEGELEEGEIDLDSEPVVVQSEGMVSVDVENRVK-----SIR 79
Query: 181 EALE--SVLRGDISFEGVCSKLEFTLESLRELV--NENNVPTKDALIQLAFSAVQSVHSV 236
+ LE SV+ + SFE VC KL LESL+ELV N+N+ P+KD L+QL F A++ V+SV
Sbjct: 80 KDLESVSVIETEKSFEAVCLKLHKVLESLKELVGGNDNSFPSKDGLVQLLFMAIRVVNSV 139
Query: 237 FCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAM 296
FCSMN LKEQNK + SR SL+ SH PP FS Q KE+ N+ D LA
Sbjct: 140 FCSMNKKLKEQNKGVFSRFFSLLNSHYPPFFSPGQNKEV----------LNENHNDSLA- 188
Query: 297 HGVNGKDSNIVTENAVNDL-NFKEKVPLPVDSLMQNKP---LEASK-PGPPGYRSRGVLL 351
+ A DL EK+P ++ +QNKP +EA K PG P ++SRGVLL
Sbjct: 189 ------------KTAGYDLTTMSEKLP-AAETFVQNKPNKSIEAPKPPGVPSFKSRGVLL 235
Query: 352 PLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYE 411
PLLD K HD DSLPSPT+ETTP PVQR L +GDGMV S K++ AE + YE
Sbjct: 236 PLLDLKKYHDEDSLPSPTQETTP-FPVQRLLAIGDGMVSSGLPVPKVTPVAEEPRMHPYE 294
Query: 412 TDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMP 471
TDAL+A SSYQQKF RNSFF N ELPSPTPSEESG+GDGDT GE+SS++ V + VN P
Sbjct: 295 TDALKAVSSYQQKFNRNSFFTN-ELPSPTPSEESGNGDGDTAGEVSSSSTVVNYRTVNPP 353
Query: 472 TLGQQ---PVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRD 528
Q+ P D S+++ + NSAP SSG P+ +KA KSRD
Sbjct: 354 VSDQKNAPPSPPPLPPPPPHPDSSNIRGVVPTRNSAPVSSG------PSSTIKASAKSRD 407
Query: 529 PRLRFASSNALNLNH--QPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNG 586
PRLR+ + +A L+H + P+++N P+VEP G ++ S+K K +EE VLD P+LKRQRN
Sbjct: 408 PRLRYVNIDACALDHNQRALPMVNNLPRVEPAGAIVGSKKHK-IEEDVLDDPSLKRQRNS 466
Query: 587 FENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTP 646
F+N G VRD +++ G+GGWLEDTDM EPQ +N+N +++ N A SP G
Sbjct: 467 FDNYGAVRDIESMTGTGGWLEDTDMAEPQTVNKNQWAENSNVNG---SGNAQSPFM-GIS 522
Query: 647 NVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSM 706
N ++G+E A T+ +TT SLP LLKDIAVNPTML+NILKMGQQQ+LA D QQ +D +
Sbjct: 523 N--ITGSEQAQVTSTATT-SLPDLLKDIAVNPTMLINILKMGQQQRLALDGQQTLSDPAK 579
Query: 707 NTMHPPIPSS----IPPVSVTCSIPSGILSKPM-----------DELGKVRMKPRDPRRV 751
+T HPPI ++ IP V+V S PSGI +P DE GK+RMKPRDPRR
Sbjct: 580 STSHPPISNTVLGAIPTVNVASSQPSGIFPRPAGTPVPSQIATSDESGKIRMKPRDPRRF 639
Query: 752 LHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQ 811
LH N+LQR+GS+G E P TQG+K++ N QKQ G E KP + PDI+
Sbjct: 640 LHNNSLQRAGSMGSEQFKTTTLTPTTQGTKDDQNVQKQEGLAELKPTVP-----PDISFP 694
Query: 812 FTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKS-GADMKAVVTNHDDKQTGTGSGP 870
FTK+L++IAD +SVSQ T+ P +SQN QP Q KS D K ++ D K TG S P
Sbjct: 695 FTKSLENIADILSVSQASTTPPFISQNVASQPMQTKSERVDGKTGISISDQK-TGPASSP 753
Query: 871 EAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHT 930
E +H Q+ W DVEHLFEGYDDQQKAAIQ+ER RRLEEQKKMF+ARKLCLVLDLDHT
Sbjct: 754 EVVAASSHSQNTWKDVEHLFEGYDDQQKAAIQRERARRLEEQKKMFAARKLCLVLDLDHT 813
Query: 931 LLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLF 990
LLNSAK +HDEILRKKEEQDREKP+RH+FR PHMGMWTKLRPGIW FLE+ASKLF
Sbjct: 814 LLNSAKAILSSSLHDEILRKKEEQDREKPYRHIFRIPHMGMWTKLRPGIWNFLEKASKLF 873
Query: 991 EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 1050
E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME
Sbjct: 874 ELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 933
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASS 1110
S VVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER EDGTLA S
Sbjct: 934 SGVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACS 993
Query: 1111 LGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
VIE++H+ FF+H+SLD+ DVRNILA+EQRKIL GCRI+FSRVFPVGE NPHLHPLWQ
Sbjct: 994 FAVIEKIHQNFFTHRSLDEADVRNILASEQRKILGGCRILFSRVFPVGEVNPHLHPLWQM 1053
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
AEQFGAVCT ID+QVTHVVANSLGTDKVNWALSTGR VVHPGWVEASALLYRRANEQDF
Sbjct: 1054 AEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRIVVHPGWVEASALLYRRANEQDF 1113
Query: 1231 AIKP 1234
+IKP
Sbjct: 1114 SIKP 1117
>gi|296088169|emb|CBI35661.3| unnamed protein product [Vitis vinifera]
Length = 1184
Score = 1204 bits (3115), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 703/1282 (54%), Positives = 833/1282 (64%), Gaps = 195/1282 (15%)
Query: 1 MG-KDVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLY 59
MG +DVEEGEISD+ASVEEISEEDF KQE V+V++E KP +A RVWTMRDL
Sbjct: 50 MGIEDVEEGEISDSASVEEISEEDFN-KQE--VRVLREAKP------KADTRVWTMRDLQ 100
Query: 60 N--KYPAICRGYGPGLHNLAWAQAVQNKPLNEIFV--------MEAEQDDVSKRSSPASS 109
+ KY C GY P L+NLAWAQAVQNKPLN+IFV M+ + DDVS++
Sbjct: 101 DLYKYHQACSGYTPRLYNLAWAQAVQNKPLNDIFVIIDDSGDEMDVKMDDVSEKEEGELE 160
Query: 110 VASVNSGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKE 169
++ DS +++ E G L+ E E+DL KE
Sbjct: 161 EGEIDL------------------DSEPDVKDEGGVLDVNEPEIDL------------KE 190
Query: 170 EMKLINVESIREALESV--LRGDISFEGVCSKLEFTLESLREL-----VNENNVPTKDAL 222
+ V+SI+E LESV + + SF GVCS+L+ TL SL+++ V E++VPTKDAL
Sbjct: 191 RELVERVKSIQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDAL 250
Query: 223 IQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSL 282
Q +A+++++ VFCSMN KE NK++ SRLLS ++ + P+FS IKE+E M+S L
Sbjct: 251 AQQLINAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFL 310
Query: 283 VT-------RANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLE 335
T A+DK D+ G+N NI+ + V+S +
Sbjct: 311 DTPAAQSSAEASDKVNDVQVTDGMN---RNILDSS--------------VES--SGRAFA 351
Query: 336 ASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAA 395
++K +R R + PLLD HK HD DSLPSPT + PV ++ +V
Sbjct: 352 SAK----KFRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPVNKSELV----------T 397
Query: 396 AKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGE 455
AK++H + YETDAL+A S+YQQKFG SF +LPSPTPSEESGD GD GE
Sbjct: 398 AKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGE 457
Query: 456 ISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVK 515
+SS++ + P N P LG VSS P MDI VQ L N+ +S +N +++
Sbjct: 458 VSSSSTISAPITANAPALGHPIVSSAPQ-----MDI--VQGLVVPRNTGAVNSRFNSILR 510
Query: 516 PNPVVKAPIKSRDPRLRFASSNA--LNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEP 573
A KSRDPRLR ASS+A L+LN +P P + N+PKV+P+G ++SSRKQK+ EEP
Sbjct: 511 ------ASAKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEP 564
Query: 574 VLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKL 633
+LDGP KRQRNG S + KL
Sbjct: 565 LLDGPVTKRQRNGLT---------------------------------------SPATKL 585
Query: 634 DNGAT-SPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQK 692
++ T + I P V V+GNE P STT SL +LLKDIAVNP + +NI +QQK
Sbjct: 586 ESKVTVTGIGCDKPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQK 645
Query: 693 LAADAQQKSNDSSMNTMHPPIPSSI----PPVSVTCSIPSGILSKP-------------- 734
S D + NT+ PP +SI PP SV PS + KP
Sbjct: 646 --------SGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMNP 697
Query: 735 MDELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAP 793
DE GKVRMKPRDPRR+LH N+ QRSGS G E FKT N QKQ
Sbjct: 698 QDESGKVRMKPRDPRRILHANSFQRSGSSGSEQFKT---------------NAQKQEDQT 742
Query: 794 EAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSG-ADM 852
E K V S SV PDI+QQFTKNLK+IAD MS SQ + P Q Q Q+ + D+
Sbjct: 743 ETKSVPSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDV 802
Query: 853 KAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
KA V++ D+ T GS PE+ ++ WGDVEHLF+GYDDQQKAAIQ+ER RR+EEQ
Sbjct: 803 KATVSDSGDQLTANGSKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQ 862
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
KKMFSARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREK RHLFRFPHMGMW
Sbjct: 863 KKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMW 922
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+GDDGD D
Sbjct: 923 TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLD 982
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPS
Sbjct: 983 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPS 1042
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFS 1152
LLEIDHDER EDGTLASSL VIER+H+ FFS+++LD+VDVRNILA+EQRKILAGCRIVFS
Sbjct: 1043 LLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFS 1102
Query: 1153 RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHP 1212
RVFPVGEANPHLHPLWQTAE FGAVCT ID+QVTHVVANSLGTDKVNWALSTGRFVVHP
Sbjct: 1103 RVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHP 1162
Query: 1213 GWVEASALLYRRANEQDFAIKP 1234
GWVEASALLYRRANEQDFAIKP
Sbjct: 1163 GWVEASALLYRRANEQDFAIKP 1184
>gi|356523718|ref|XP_003530482.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Glycine max]
Length = 1244
Score = 1181 bits (3055), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 680/1289 (52%), Positives = 842/1289 (65%), Gaps = 100/1289 (7%)
Query: 1 MGK---DVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRD 57
MGK DVEEGEISDTASVEEIS EDF KQ+ V K G +A RVW + D
Sbjct: 1 MGKEAEDVEEGEISDTASVEEISAEDFN-KQD----VKLLNNNNKPNGSDA--RVWAVHD 53
Query: 58 LYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQD---DVSKRSSPASSVASVN 114
LY+KYP ICRGY GL+NLAWAQAVQNKPLN+IFVME + D + ++ SS + +VN
Sbjct: 54 LYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMEVDSDANANSNRNSSHRLASVAVN 113
Query: 115 SGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLI 174
D K ++ + + E E GE E + + +SE + V V + +L
Sbjct: 114 PKDVVVVDVDKEEGELEEGEIDADAEPE-GEAESVVVAVS-DSEKLDDVKMDVSDSEQL- 170
Query: 175 NVESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAVQS 232
R LE V ++ SF CSKL+ TL E+++ KD L++L+F+A +
Sbjct: 171 ---GARGVLEGVTVANVVESFAQTCSKLQNTLP---EVLSRPAGSEKDDLVRLSFNATEV 224
Query: 233 VHSVFCSMNHVLKEQNKEILSRLLSLIK-SHEPPLFSSNQIKEMEAMLSSL-------VT 284
V+SVFCSM+ KEQNK+ + RLLS +K + LFS +KE++ M++++ +
Sbjct: 225 VYSVFCSMDSSEKEQNKDSILRLLSFVKDQQQAQLFSPEHVKEIQGMMTAIDSVGALVNS 284
Query: 285 RANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQ-NKPL--------E 335
A KEK++ ++S + E ++++ +E + L+ +KPL +
Sbjct: 285 EAIGKEKELQTTEIKTQENSAV--EVQIHEIKTQENQAVEAAELISYSKPLHRDITGTSQ 342
Query: 336 ASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAA 395
A K G + RGVLLPLLD HK HD DSLPSPTRE PV + L VG+ MV+S +A+
Sbjct: 343 ALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGESMVRSGSAS 402
Query: 396 AKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGE 455
AK+ ++E K YETDAL+A S+YQQKFGR+S F N + PSPTPS + D DT E
Sbjct: 403 AKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEVVDTNEE 462
Query: 456 ISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVK 515
+SSA+ D L Q PVS+ MD S S + T S P
Sbjct: 463 VSSASTGDFLTSTKPTLLDQPPVSATSMDRSSMHGFISSRVDATGPGSFP---------- 512
Query: 516 PNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVL 575
VK+ K+RDPRLRF +S+A +++ + +++N KVE G +S RKQK EEP L
Sbjct: 513 ----VKSSAKNRDPRLRFINSDASAVDNL-STLINNMSKVEYSGTTIS-RKQKAAEEPSL 566
Query: 576 DGPALKRQRNGFENSGVVRDEKNI----YGSGGWLEDTDMFEPQIMNRNLLVDSAESNSR 631
D KR ++ EN+ E N+ GSGGWLE+ Q++ RN L+D ++
Sbjct: 567 DVTVSKRLKSSLENT-----EHNMSEVRTGSGGWLEENTGPGAQLIERNHLMDKFGPEAK 621
Query: 632 KLDNGATSPIT-SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQ 690
K N +S T S N NE AP T + SLPALLK+ +VNP ML+NIL++ +
Sbjct: 622 KTLNTVSSSCTGSDNFNATSIRNEQAPITASNVLASLPALLKEASVNPIMLVNILRLAEA 681
Query: 691 QKLAADAQQKSNDSSMNTMHP----PIPSSIPPVSVTCSIPSGILSKPM----------- 735
QK +AD+ +++ +HP P + S+ S+ +G+L +
Sbjct: 682 QKKSADS------AAIMLLHPTSSNPAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTS 735
Query: 736 ------DELGKVRMKPRDPRRVLH-GNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQ 787
D+ GK+RMKPRDPRR+LH N +Q+SG LG E FK Q + +N+N
Sbjct: 736 TAQTLQDDSGKIRMKPRDPRRILHTNNTIQKSGDLGNEQFKAIVSPVSNNQRTGDNVNAP 795
Query: 788 KQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIK 847
K G + K V +QS QPDI +QFT+NLK+IAD MSVSQ ++ VSQN +
Sbjct: 796 KLEGRVDNKLVPTQSSAQPDIARQFTRNLKNIADIMSVSQESSTHTPVSQNFSSASVPLT 855
Query: 848 SG-ADMKAVVTNHDDKQTGTGSGPE-AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKER 905
S + K+VV++ + Q S E A V + QS WGDVEHLFEGYD+QQKAAIQ+ER
Sbjct: 856 SDRGEQKSVVSSSQNLQADMASAHETAASVTSRSQSTWGDVEHLFEGYDEQQKAAIQRER 915
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
RR+EEQ KMF+ARKLCLVLDLDHTLLNSAKF EVDP+HDEILRKKEEQDREKPHRHLFR
Sbjct: 916 ARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFR 975
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
FPHMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG
Sbjct: 976 FPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1035
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
DD D DG+ERVPKSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ
Sbjct: 1036 DDTDSVDGEERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1095
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILA 1145
FGL GPSLLEIDHDER E GTLASSL VIE++H+IFF+ QSL++VDVRNILA+EQRKILA
Sbjct: 1096 FGLPGPSLLEIDHDERPEAGTLASSLAVIEKIHQIFFASQSLEEVDVRNILASEQRKILA 1155
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT ID+QVTHVVANS GTDKVNWAL+
Sbjct: 1156 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSPGTDKVNWALNN 1215
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAIKP 1234
GRFVVHPGWVEASALLYRRANEQDFAIKP
Sbjct: 1216 GRFVVHPGWVEASALLYRRANEQDFAIKP 1244
>gi|449487451|ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
phosphatase-like 3-like [Cucumis sativus]
Length = 1249
Score = 1170 bits (3026), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 701/1301 (53%), Positives = 849/1301 (65%), Gaps = 130/1301 (9%)
Query: 3 KDVEEGEISDTASVEEISEEDF-KIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNK 61
+DVEEGEISDTASVEEISEEDF K+ KVV +K RVWTM DLY
Sbjct: 10 EDVEEGEISDTASVEEISEEDFNKLDSSASPKVVVPSK-----DSNRETRVWTMSDLYKN 64
Query: 62 YPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGK 121
YPA+ GY GL+NLAWAQAVQNKPLN+IFVMEA+ D+ SK SS + + G+ K
Sbjct: 65 YPAMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDGSNTTK 124
Query: 122 DDKKVVEKVVIDDSGDEIEKEEGELEEGEIE-----LDLESESNEKVSE----------- 165
++ ++VVIDDSGDE+ + E+ E E +D+++E E+V++
Sbjct: 125 EE----DRVVIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDM 180
Query: 166 ---------QVKEEMKLINVESIREALESVL--RGDISFEGVCSKLEFTLESLRELVNEN 214
+ KE +L+ I++ L+ V SF+ VCS++ ++E+ EL+
Sbjct: 181 DINGQEFDLETKELDELLKF--IQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGK 238
Query: 215 NVPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKE 274
VP KDALIQ ++A++ ++SVFCSMN KE++KE LSRLLS +K+ +PPLFS QIK
Sbjct: 239 VVPRKDALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKS 298
Query: 275 MEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNF----------------- 317
+E + S D + +M G + K+ I N V D++F
Sbjct: 299 VEVKMPS-----TDSLDHLPSMRG-SAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKL 352
Query: 318 -KEKVPLPVDSLMQNKPL-EASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPS 375
+ +P V L E + G + RG LLPLLD HK HD DSLPSPTRE
Sbjct: 353 ASDSIPFGVKGKNNLNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTI 412
Query: 376 VPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSE 435
VQ KS A K++ + ++ YETDAL+A S+YQQKFGR+SF M
Sbjct: 413 FSVQ----------KSGNAPTKMAFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADR 462
Query: 436 LPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMD--ISQPMDISS 493
LPSPTPSEE DG GD GGE+SS++ + K N+ GQ+ S+ + + MD SS
Sbjct: 463 LPSPTPSEEH-DGGGDIGGEVSSSSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSS 521
Query: 494 VQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNA--LNLNHQPAPILHN 551
+ L + N AP SS NP VKP KSRDPRLR +S+A ++LN + + +
Sbjct: 522 TRVLISPLNVAPPSSVSNPTVKP------LAKSRDPRLRIVNSDASGMDLNPRTMASVQS 575
Query: 552 APKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVV-RDEKNIYGSGGWLEDTD 610
+ +E + RKQK EP DGP +KR R G +N V D + + GSGGWLEDT
Sbjct: 576 SSILESAA-TLHLRKQKMDGEPNTDGPEVKRLRIGSQNLAVAASDVRAVSGSGGWLEDTM 634
Query: 611 MFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPAL 670
P++ NRN + + AE+N+ + N N SGNE P S SLP+L
Sbjct: 635 PAGPRLFNRNQM-EIAEANATEKSN---------VTNNSGSGNECTPTVNNSNDASLPSL 684
Query: 671 LKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNTMHP----PIPSSIPPVSVTCSI 726
LKDI VNPTMLLN+LKM QQQ+LAA+ + KS++ N + P P S P ++ +
Sbjct: 685 LKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEPEKNAICPTSLNPCQGSSPLINAPVAT 744
Query: 727 PSGILSKP------------MDELGKVRMKPRDPRRVLHGNALQRSGSLG-PEFKTDGPS 773
SGIL + D+LGKVRMKPRDPRRVLHGN+LQ+ GSLG + K P+
Sbjct: 745 -SGILQQSAGTPSASPVVGRQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPT 803
Query: 774 APCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEP 833
A T+GS++ N KQ G ++K SQ++L PDI +QFT NLK+IAD MSV P TS P
Sbjct: 804 ASNTEGSRDIPNGHKQEGQGDSKLASSQTIL-PDIGRQFTNNLKNIADIMSVPSPPTSSP 862
Query: 834 MVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGY 893
S S D K V T S Q AWGD+EHLF+ Y
Sbjct: 863 NSSSKP-----VGSSSMDSKPVTTAFQAVDMAASS---------RSQGAWGDLEHLFDSY 908
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
DD+QKAAIQ+ER RR+EEQKKMF+ARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEE
Sbjct: 909 DDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEE 968
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
QDREK RHLFRFPHMGMWTKLRPG+W FLE+AS+L+E+HLYTMGNKLYATEMAKVLDPK
Sbjct: 969 QDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPK 1028
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 1073
GVLFAGRVISRGDDGDP DGD+RVPKSKDLEGVLGMES VVIIDDS+RVWPHNK+NLIVV
Sbjct: 1029 GVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVV 1088
Query: 1074 ERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVR 1133
ERYTYFPCSRRQFGLLGPSLLEIDHDER EDGTLASSLGVI+R+H+ FFS+ LD VDVR
Sbjct: 1089 ERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVIQRIHQXFFSNPELDQVDVR 1148
Query: 1134 NILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS 1193
IL+AEQ+KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGA CT ID+QVTHVVANS
Sbjct: 1149 TILSAEQQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANS 1208
Query: 1194 LGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 1234
LGTDKVNWALSTGRFVVHPGWVEASALLYRRA EQDFAIKP
Sbjct: 1209 LGTDKVNWALSTGRFVVHPGWVEASALLYRRATEQDFAIKP 1249
>gi|449445782|ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Cucumis sativus]
Length = 1249
Score = 1169 bits (3025), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 701/1301 (53%), Positives = 849/1301 (65%), Gaps = 130/1301 (9%)
Query: 3 KDVEEGEISDTASVEEISEEDF-KIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNK 61
+DVEEGEISDTASVEEISEEDF K+ KVV +K RVWTM DLY
Sbjct: 10 EDVEEGEISDTASVEEISEEDFNKLDSSASPKVVVPSK-----DSNRETRVWTMSDLYKN 64
Query: 62 YPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGK 121
YPA+ GY GL+NLAWAQAVQNKPLN+IFVMEA+ D+ SK SS + + G+ K
Sbjct: 65 YPAMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDGSNTTK 124
Query: 122 DDKKVVEKVVIDDSGDEIEKEEGELEEGEIE-----LDLESESNEKVSE----------- 165
++ ++VVIDDSGDE+ + E+ E E +D+++E E+V++
Sbjct: 125 EE----DRVVIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDM 180
Query: 166 ---------QVKEEMKLINVESIREALESVL--RGDISFEGVCSKLEFTLESLRELVNEN 214
+ KE +L+ I++ L+ V SF+ VCS++ ++E+ EL+
Sbjct: 181 DINGQEFDLETKELDELLKF--IQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGK 238
Query: 215 NVPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKE 274
VP KDALIQ ++A++ ++SVFCSMN KE++KE LSRLLS +K+ +PPLFS QIK
Sbjct: 239 VVPRKDALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKS 298
Query: 275 MEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNF----------------- 317
+E + S D + +M G + K+ I N V D++F
Sbjct: 299 VEVKMPS-----TDSLDHLPSMRG-SAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKL 352
Query: 318 -KEKVPLPVDSLMQNKPL-EASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPS 375
+ +P V L E + G + RG LLPLLD HK HD DSLPSPTRE
Sbjct: 353 ASDSIPFGVKGKNNLNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTI 412
Query: 376 VPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSE 435
VQ KS A K++ + ++ YETDAL+A S+YQQKFGR+SF M
Sbjct: 413 FSVQ----------KSGNAPTKMAFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADR 462
Query: 436 LPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMD--ISQPMDISS 493
LPSPTPSEE DG GD GGE+SS++ + K N+ GQ+ S+ + + MD SS
Sbjct: 463 LPSPTPSEEH-DGGGDIGGEVSSSSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSS 521
Query: 494 VQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNA--LNLNHQPAPILHN 551
+ L + N AP SS NP VKP KSRDPRLR +S+A ++LN + + +
Sbjct: 522 TRVLISPLNVAPPSSVSNPTVKP------LAKSRDPRLRIVNSDASGMDLNPRTMASVQS 575
Query: 552 APKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVV-RDEKNIYGSGGWLEDTD 610
+ +E + RKQK EP DGP +KR R G +N V D + + GSGGWLEDT
Sbjct: 576 SSILESAA-TLHLRKQKMDGEPNTDGPEVKRLRIGSQNLAVAASDVRAVSGSGGWLEDTM 634
Query: 611 MFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPAL 670
P++ NRN + + AE+N+ + N N SGNE P S SLP+L
Sbjct: 635 PAGPRLFNRNQM-EIAEANATEKSN---------VTNNSGSGNECTPTVNNSNDASLPSL 684
Query: 671 LKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNTMHP----PIPSSIPPVSVTCSI 726
LKDI VNPTMLLN+LKM QQQ+LAA+ + KS++ N + P P S P ++ +
Sbjct: 685 LKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEPEKNAICPTSLNPCQGSSPLINAPVAT 744
Query: 727 PSGILSKP------------MDELGKVRMKPRDPRRVLHGNALQRSGSLG-PEFKTDGPS 773
SGIL + D+LGKVRMKPRDPRRVLHGN+LQ+ GSLG + K P+
Sbjct: 745 -SGILQQSAGTPSASPVVGRQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPT 803
Query: 774 APCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEP 833
A T+GS++ N KQ G ++K SQ++L PDI +QFT NLK+IAD MSV P TS P
Sbjct: 804 ASNTEGSRDIPNGHKQEGQGDSKLASSQTIL-PDIGRQFTNNLKNIADIMSVPSPPTSSP 862
Query: 834 MVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGY 893
S S D K V T S Q AWGD+EHLF+ Y
Sbjct: 863 NSSSKP-----VGSSSMDSKPVTTAFQAVDMAASS---------RSQGAWGDLEHLFDSY 908
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
DD+QKAAIQ+ER RR+EEQKKMF+ARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEE
Sbjct: 909 DDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEE 968
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
QDREK RHLFRFPHMGMWTKLRPG+W FLE+AS+L+E+HLYTMGNKLYATEMAKVLDPK
Sbjct: 969 QDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPK 1028
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 1073
GVLFAGRVISRGDDGDP DGD+RVPKSKDLEGVLGMES VVIIDDS+RVWPHNK+NLIVV
Sbjct: 1029 GVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVV 1088
Query: 1074 ERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVR 1133
ERYTYFPCSRRQFGLLGPSLLEIDHDER EDGTLASSLGVI+R+H+ FFS+ LD VDVR
Sbjct: 1089 ERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVIQRIHQSFFSNPELDQVDVR 1148
Query: 1134 NILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS 1193
IL+AEQ+KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGA CT ID+QVTHVVANS
Sbjct: 1149 TILSAEQQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANS 1208
Query: 1194 LGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 1234
LGTDKVNWALSTGRFVVHPGWVEASALLYRRA EQDFAIKP
Sbjct: 1209 LGTDKVNWALSTGRFVVHPGWVEASALLYRRATEQDFAIKP 1249
>gi|356567192|ref|XP_003551805.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Glycine max]
Length = 1221
Score = 1149 bits (2971), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 677/1281 (52%), Positives = 830/1281 (64%), Gaps = 107/1281 (8%)
Query: 1 MGK---DVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRD 57
MGK DVEEGEISDTASVEEIS EDF KQ+ VKV+ K G +A RVW + D
Sbjct: 1 MGKEVEDVEEGEISDTASVEEISAEDFN-KQD--VKVLNNNN--KPNGSDA--RVWAVHD 53
Query: 58 LYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPA----SSVASV 113
LY+KYP ICRGY GL+NLAWAQAVQNKPLN+IFVME + D + +S +SVA V
Sbjct: 54 LYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMEVDSDANANSNSNNSNRLASVA-V 112
Query: 114 NSGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEG-EIELDLESESNEKVSEQVKEEMK 172
N D K ++ + + E E GE E + + +SE + V V +
Sbjct: 113 NPKDVVVVDVDKEEGELEEGEIDADAEPE-GEAESVVAVPVVSDSEKLDDVKRDVSNSEQ 171
Query: 173 LINVESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAV 230
L +R LE V ++ SF CSKL+ +L E+++ +D L++L+F+A
Sbjct: 172 L----GVRGVLEGVTVANVAESFAQTCSKLQ---NALPEVLSRPADSERDDLVRLSFNAT 224
Query: 231 QSVHSVFCSMNHVLKEQNKEILSRLLSLIK-SHEPPLFSSNQIKEMEAMLSSLVTRANDK 289
+ V+SVFCSM+ + KEQNK+ + RLLS +K + LFS IKE++ M++++ D
Sbjct: 225 EVVYSVFCSMDSLKKEQNKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAI-----DY 279
Query: 290 EKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQ-NKPLE--------ASKPG 340
++ + GK+ + T +++ +E + L+ NKPL A K G
Sbjct: 280 FGALVNSEAI-GKEKELQTTVQTHEIKTQENQAVEAAELISYNKPLHSDIIGASHALKFG 338
Query: 341 PPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSH 400
+ RGVLLPLLD HK HD DSLPSPTRE PV + L G K+
Sbjct: 339 QNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSPESG---------KMEL 389
Query: 401 NAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSAT 460
++E K YETDAL+A S+YQQKFGR+S F N + PSPTPS + D DT E+SSA+
Sbjct: 390 DSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEIVDTNEEVSSAS 449
Query: 461 AVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVV 520
D L PVS+ D S S + S P V
Sbjct: 450 TGDFLTSTKPTLLDLPPVSATSTDRSSLHGFISSRVDAAGPGSLP--------------V 495
Query: 521 KAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPAL 580
K+ K+RDPRLRF +S+A +++ P+ ++HN PKVE G +S RKQK EEP LD
Sbjct: 496 KSSAKNRDPRLRFVNSDASAVDN-PSTLIHNMPKVEYAGTTIS-RKQKAAEEPSLDVTVS 553
Query: 581 KRQRNGFENSGVVRDEKNI----YGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNG 636
KRQ++ EN+ E N+ G GGWLE+ Q + RN L+D +K N
Sbjct: 554 KRQKSPLENT-----EHNMSEVRTGIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLNT 608
Query: 637 ATSPIT-SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAA 695
+S T S N NE AP T+ + SLPALLK AVNPTML+N+L++ A
Sbjct: 609 VSSSCTGSDNFNATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLLRI-------A 661
Query: 696 DAQQKSNDSSMNTM-HPPIPSSI----PPVSVTCSIPSGILSKPM------------DEL 738
+AQ+KS DS+ N + HP +S S+ S+ +G+L + D+
Sbjct: 662 EAQKKSADSATNMLLHPTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQQTLQDDS 721
Query: 739 GKVRMKPRDPRRVLH-GNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAK 796
GK+RMKPRDPRR+LH N +Q+SG+LG E FK QG+ +N+N QK G ++K
Sbjct: 722 GKIRMKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRVDSK 781
Query: 797 PVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQ--NSPIQPGQIKSGADMKA 854
V +Q QPDI +QF +NLK+IAD MSVSQ ++ V+Q +S P G + K+
Sbjct: 782 LVPTQPSAQPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRG-EQKS 840
Query: 855 VVTNHDDKQTGTGSGPEAGPVG-AHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQK 913
VV+N + + G S E G Q+ WGDVEHLFEGYD+QQKAAIQ+ER RR+EEQ
Sbjct: 841 VVSNSQNLEAGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEEQN 900
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
KMF+ARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT
Sbjct: 901 KMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 960
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
KLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKG+LFAGRVISRGDD D DG
Sbjct: 961 KLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTDSVDG 1020
Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1093
+ER PKSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSL
Sbjct: 1021 EERAPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSL 1080
Query: 1094 LEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSR 1153
LEIDHDER E GTLASSL VIE++H+IFF+ +SL++VDVRNILA+EQRKILAGCRIVFSR
Sbjct: 1081 LEIDHDERPEAGTLASSLAVIEKIHQIFFASRSLEEVDVRNILASEQRKILAGCRIVFSR 1140
Query: 1154 VFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPG 1213
VFPVGEANPHLHPLWQTAEQFGA CT ID+QVTHVVANS GTDKVNWAL+ GRFVVHPG
Sbjct: 1141 VFPVGEANPHLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHPG 1200
Query: 1214 WVEASALLYRRANEQDFAIKP 1234
WVEASALLYRRANEQDFAIKP
Sbjct: 1201 WVEASALLYRRANEQDFAIKP 1221
>gi|255543174|ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
gi|223548611|gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
Length = 1195
Score = 1144 bits (2958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 614/982 (62%), Positives = 721/982 (73%), Gaps = 70/982 (7%)
Query: 288 DKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKP---LEASKPGPPGY 344
+KEK+ L VN KD+++ +++ +D++ K LP DS + NK +E K G +
Sbjct: 249 EKEKEPLISTVVNKKDNDVNGKSSGHDMSAVNK--LPTDSFVNNKANLSIEGPKTGVSSF 306
Query: 345 RSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEV 404
+SR LLPLLD HK HD DSLPSPTRE+ +P R L MV + N+ +
Sbjct: 307 KSRAALLPLLDLHKDHDADSLPSPTRESALPLPAYRVLT--PKMVLD-------TGNSRM 357
Query: 405 HKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQ 464
H YETDAL+A SSYQQKF ++SF + LPSPTPSEESG+GDGDTGGE+SS+ +V
Sbjct: 358 HP---YETDALKAVSSYQQKFSKSSFALTDRLPSPTPSEESGNGDGDTGGEVSSSLSVSS 414
Query: 465 PKPVNMPTLGQQPVSSQPMDISQP-MDISSVQALTTANNSAPASSGYNPVVKPNPVVKAP 523
+P N T GQ S IS P MD SS+ + + ++ ASS P+ VKA
Sbjct: 415 FRPANPLTSGQSNAS-----ISLPRMDGSSLPGVISIKSAVRASSA------PSLTVKAS 463
Query: 524 IKSRDPRLRF--ASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALK 581
KSRDPRLRF + SNAL+ NH+ P++ N KVEP+G M+ ++QK V++P+ DG +LK
Sbjct: 464 AKSRDPRLRFVNSDSNALDQNHRAVPVV-NTLKVEPIGGTMNKKRQKIVDDPIPDGHSLK 522
Query: 582 RQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPI 641
RQ+N ENSGVVRD K + GSGGWLEDTDM PQ MN+N LVD+AES+ R+ D G
Sbjct: 523 RQKNALENSGVVRDVKTMVGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGVCTS 582
Query: 642 TSGTPNVVVSGNEPAPATTPS------------TTVSLPALLKDIAVNPTMLLNILKMGQ 689
+S +V +SG E P T S +T ++P LLK+IAVNPTML+NILKMGQ
Sbjct: 583 SSCISSVNISGTEQIPVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILKMGQ 642
Query: 690 QQKLAADAQQKSNDSSMNTMHPPIPSS-IPPVSVTCSIPSGILSKPM------------D 736
QQ+LA +AQQK D + +T +P +S + V V + SGIL +P D
Sbjct: 643 QQRLALEAQQKPVDPAKSTTYPLNSNSMLGTVPVVGAAHSGILPRPAGTVQVSPQLGTAD 702
Query: 737 ELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEA 795
+LGK+RMKPRDPRRVLH NALQR+GS+G E KT+ S P Q +K+N N QKQ G E
Sbjct: 703 DLGKIRMKPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQVEK 762
Query: 796 KPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAV 855
KPV QS+ PDI+ FTKNLK+IAD +SVS TS+P+V QN QP M+
Sbjct: 763 KPVPLQSLALPDISMPFTKNLKNIADIVSVSHASTSQPLVPQNPASQP--------MRTT 814
Query: 856 VTNHDDKQTGTGSGPEAGPVGA---HPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
+++ D+ G GS P A A Q+AWGDVEHLFEGY+DQQKAAIQ+ER RR+EEQ
Sbjct: 815 ISS-SDQFLGIGSAPGAAAAAAAGPRTQNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQ 873
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
KK+FSARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREK HRHLFRFPHMGMW
Sbjct: 874 KKLFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLFRFPHMGMW 933
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDP GVLF GRVISRGDDG+PFD
Sbjct: 934 TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDGEPFD 993
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
GDER+PKSKDLEGVLGMES VVI+DDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPS
Sbjct: 994 GDERIPKSKDLEGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPS 1053
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFS 1152
LLEIDHDER EDGTLA SL VIER+H+ FF+H SLD+ DVRNILA+EQRKILAGCRIVFS
Sbjct: 1054 LLEIDHDERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFS 1113
Query: 1153 RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHP 1212
RVFPVGEANPHLHPLWQTAEQFGAVCT ID+QVTHVVANSLGTDKVNWALSTGRFVV+P
Sbjct: 1114 RVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYP 1173
Query: 1213 GWVEASALLYRRANEQDFAIKP 1234
GWVEASALLYRRANEQDFAIKP
Sbjct: 1174 GWVEASALLYRRANEQDFAIKP 1195
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 95/231 (41%), Positives = 122/231 (52%), Gaps = 43/231 (18%)
Query: 4 DVEEGEISDTASVEEISEEDFKIKQEEVVK----VVKETKPIKVGGGEAAARVWTMRDLY 59
DVEEGEISDTAS+EEISEEDF + VVK + TK + G G RVWT+ DLY
Sbjct: 14 DVEEGEISDTASIEEISEEDFNKQDVVVVKPPSSNNETTKQKEQGNGN--GRVWTISDLY 71
Query: 60 NKYPAICRGYGPGLHNLAWAQAVQ------NKPLNEIFVMEAEQDDVSKRSSPASSVASV 113
+Y + G+ GL+NLAWAQAVQ NKPLNE+F E+ D S + S SS A+
Sbjct: 72 -RYQMVG-GHVSGLYNLAWAQAVQSKPGKSNKPLNELFADVVEELDESSKRSSPSSSAAS 129
Query: 114 NSGAAAGKD--DKKVVEKVVIDDSGDEIEKEEGELEEGEI-----------ELDLESESN 160
+ D KKVVEKVVIDD+GDE+ + + ++ E+DL+ E
Sbjct: 130 VNSNNKDGDEEKKKVVEKVVIDDNGDEMMDDNNRNKIVDVVEKEEGELEEGEIDLDMEPG 189
Query: 161 EKVSEQVKEEMKLINVE-------------SIREALESVLRGDISFEGVCS 198
EK + M + +E SIR+ALESV I F C+
Sbjct: 190 EKANNGDVLNMNIDGLEVESGEKGFEKKMNSIRDALESVT---IEFVLACT 237
>gi|357502711|ref|XP_003621644.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355496659|gb|AES77862.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 1213
Score = 1068 bits (2761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 639/1288 (49%), Positives = 814/1288 (63%), Gaps = 150/1288 (11%)
Query: 1 MGK---DVEEGEISDTASVEEISEEDFKIKQEEV------VKVVKETKPIKVGGGEAA-- 49
MGK DVEEGEISD+AS+EEI+EEDFK K ++V VK K +K GGG
Sbjct: 18 MGKEVEDVEEGEISDSASLEEITEEDFK-KGDDVKVNNSDVKTDKSDNKVKTGGGGGGGG 76
Query: 50 -ARVWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPAS 108
+RVW ++DLY+KYP ICRGY GL+NLAWAQAVQNKPLN+IFVME +++ + ++ +
Sbjct: 77 DSRVWAVQDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMELDKNANANSNNSGN 136
Query: 109 SVASVNSGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVK 168
+N + EK E + +G+ ++ + + E+ SN +V V+
Sbjct: 137 KDGELNKSSKEIVVVDDDDEKEE---GELEEGEIDGDADDDCVIVGSENFSNSEVL-GVR 192
Query: 169 EEMKLINVESIREALESVLRGDISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFS 228
++ + V S+ E SF C +++ TL+S ++ + + KD L++L F+
Sbjct: 193 GVLEGVTVASVAE----------SFAETCRRIQGTLQS--KVFSGFDSAEKDDLVRLLFN 240
Query: 229 AVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRAND 288
AV+ V+SVFC M+++ KE+NK+ +SRLLS +K+ LF+ +K++ + ++T +
Sbjct: 241 AVEVVYSVFCCMDNLQKEENKDNISRLLSFLKNQH--LFTMEHMKKVIFNIQVMITVIDS 298
Query: 289 KEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVP-LPVDSLMQNKPL---------EASK 338
+ A+ G + + E V LN E++P L D + + L EA +
Sbjct: 299 ----VFAL----GNNEVVGKEEKVEALNTTEQIPGLKADEYISSSQLVHDNSTYASEALQ 350
Query: 339 PGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALV-VGDGMVK------S 391
G RG++LPL D HK HD+DSLPSPTRE PV + +GDG+ +
Sbjct: 351 YGQSNVVGRGLMLPLFDLHKDHDLDSLPSPTREAPSCFPVNKLFSDLGDGIDRFGLPPAV 410
Query: 392 WAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDG- 450
A K+ + + K YETDAL+A S+YQQKF R+S+F + + PSPTPS GD +G
Sbjct: 411 CTEAEKMELDGKDSKLHIYETDALKAVSTYQQKFSRSSYFTDDKFPSPTPS---GDCEGE 467
Query: 451 --DTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASS 508
DT E+SSA+ P L Q PVSS +D + + T + S PA
Sbjct: 468 AVDTNDEVSSASIASSLTSFKPPPLDQIPVSSTSLDRPNMHGLVDSRIDATGSGSYPA-- 525
Query: 509 GYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQK 568
K+ KSRDPRLRF + +A L+ + H+ P+VE GRV+S RKQK
Sbjct: 526 ------------KSSAKSRDPRLRFINPDASTLDLNQSLGTHSMPRVEYGGRVIS-RKQK 572
Query: 569 TVEEPVLDGPALKRQRNGFENS-GVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAE 627
TVEEP LD A KR R ENS R+E+ + G GGW E+ + Q+ RN L+ E
Sbjct: 573 TVEEPSLDATAPKRLRRSLENSEHNTREERAMAGKGGWFEENTVAGSQLAERNHLMQKGE 632
Query: 628 SNSRKLDNGATSPITSGTPNVVVS--GNEPAPATTPSTTVSLPA-LLKDIAVNPTMLLNI 684
+ ++ I++ + N+ VS GNE A T+ S T SLP LL ++AVNP ML+++
Sbjct: 633 TELKR-------TISTSSSNLTVSNNGNELASVTSSSATASLPTYLLNNVAVNPAMLIHM 685
Query: 685 LKMGQQQKLAADAQQKSNDSSMNT-----MHPP------------IPSSIPPVSVTCSIP 727
+ Q + A+AQ+K DS+ T P +P+S P S+T ++P
Sbjct: 686 ILEHQHNE--AEAQKKPVDSARGTDATVNTGPAMTAGLTQSSVGILPASSPATSMTQTLP 743
Query: 728 SGILSKPMDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLNFQ 787
++ GK+RMKPRDPRR LHG++ Q
Sbjct: 744 --------EDSGKIRMKPRDPRRFLHGSS----------------------------TLQ 767
Query: 788 KQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQN-SPIQPGQI 846
K E K QS+ QPDIT+QFTKNLK+IAD MSV Q +S P +QN S +
Sbjct: 768 KFDVRVETKLAPIQSIAQPDITRQFTKNLKNIADIMSVPQETSSNPPATQNVSSASVPFM 827
Query: 847 KSGADMKAVVTNHDDKQTGTGSGPE-AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKER 905
++ K+ V N + + G GS PE P + PQ+ W DVEHLFE YD +QKAAIQ+ER
Sbjct: 828 SDRSEQKSGVPNSQNLKDGVGSAPETCAPGSSRPQNTWADVEHLFEAYDVKQKAAIQRER 887
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
+RRLEEQKKMF+ARKLCLVLDLDHTLLNSAKF EVDPVHDE+LRKKE++DREKP RHLFR
Sbjct: 888 SRRLEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEMLRKKEQEDREKPQRHLFR 947
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
FPHMGMWTKLRPG+W FLE+A KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG
Sbjct: 948 FPHMGMWTKLRPGVWNFLEKAGKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1007
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
DD + D KSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ
Sbjct: 1008 DDAETAD-----TKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1062
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILA 1145
FGL GPSLLEIDHDER E GTLASSLGVIER+H+ FF+ QSL++VDVRNILA+EQRKIL
Sbjct: 1063 FGLPGPSLLEIDHDERPESGTLASSLGVIERIHQNFFASQSLEEVDVRNILASEQRKILD 1122
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GCRIVFSR+FPVG+ANPHLHPLWQTAEQFGA CT IDDQVTHVVA+S GTDKVNWA++
Sbjct: 1123 GCRIVFSRMFPVGDANPHLHPLWQTAEQFGASCTNQIDDQVTHVVAHSPGTDKVNWAIAN 1182
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAIK 1233
G+FVVHPGWVEASALLYRRANEQDFAIK
Sbjct: 1183 GKFVVHPGWVEASALLYRRANEQDFAIK 1210
>gi|56547717|gb|AAV92930.1| putative transcription regulator CPL1 [Solanum lycopersicum]
Length = 1227
Score = 1044 bits (2700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 640/1301 (49%), Positives = 798/1301 (61%), Gaps = 152/1301 (11%)
Query: 3 KDVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGE-------AAARVWTM 55
+D EEGEISD+ASVEEISE+ F + T IK+ E A RVWTM
Sbjct: 8 EDAEEGEISDSASVEEISEDAFNRQDPPTTST---TSKIKIASNENQNQNSTTATRVWTM 64
Query: 56 RDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNS 115
RD+Y KYP I R Y GL+NLAWAQAVQNKPL+E+FVM + D S + + S ++
Sbjct: 65 RDVY-KYP-ISRDYARGLYNLAWAQAVQNKPLDELFVMTS---DNSNQCANGESKVIIDV 119
Query: 116 GAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLIN 175
++ +E+ ID L+ ++ ++ E+N
Sbjct: 120 DVDDDAKEEGELEEGEID------------LDSADLVVNFGKEAN--------------- 152
Query: 176 VESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAVQSV 233
IRE L+SV + SF VCSKL+ +L +L EL + D LIQL +A++++
Sbjct: 153 --FIREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQD--KNDILIQLFMTALRTI 208
Query: 234 HSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAML----SSLV---TRA 286
+SVF SMN K+QN +ILSRLL K+ P L SS Q+KE++A++ SLV T+
Sbjct: 209 NSVFYSMNDHQKQQNTDILSRLLFNAKTQLPALLSSEQLKELDALILSINHSLVSSNTQD 268
Query: 287 NDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLM------QNKPLEASKPG 340
ND + + ++ KDS+ +ENA D K L S+ Q+ E+ KPG
Sbjct: 269 NDTVNGINVVQLLDMKDSHKSSENANQDFTSVNKYDLGDVSIKSSGLKEQSVSSESVKPG 328
Query: 341 PPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVK---SWAAAAK 397
+++G+ PLLD HK HD D+LPSPTR+ P P + GMVK A+
Sbjct: 329 LDNSKAKGLSFPLLDLHKDHDEDTLPSPTRQIGPQFPATQT----HGMVKLDLPIFPASL 384
Query: 398 LSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEIS 457
N+ +H YETDAL+A SSYQQKFGR+S F++ LPSPTPSEE G GDTGGE++
Sbjct: 385 DKGNSLLHP---YETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEDDSGKGDTGGEVT 441
Query: 458 SATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPN 517
S V +N ++GQ +SS P +I Q L T + P S PN
Sbjct: 442 SFDVVHNASHLNESSMGQPILSSVPQ-----TNILDGQGLGTTRTADPLS------FLPN 490
Query: 518 PVVKAPI-KSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLD 576
P +++ KSRDPRLR A+S+ + N PI K+E ++ S+KQKTV+ D
Sbjct: 491 PSLRSSTAKSRDPRLRLATSDTVAQN-TILPIPDIDLKLEASLEMIVSKKQKTVDLSAFD 549
Query: 577 GPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNG 636
P KRQR+ +S +V D + G+GGWLED E I + N +++++ RKL+
Sbjct: 550 APLPKRQRSEQTDSIIVSDVRPSIGNGGWLEDRGTAELPITSSNCATYNSDNDIRKLEQ- 608
Query: 637 ATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAAD 696
T+ I + P+V+V+ E P T ST+ +L +LLKDIA+NP++ +NI+K QQ+ A
Sbjct: 609 VTATIAT-IPSVIVNAAENFPVTGISTSTTLHSLLKDIAINPSIWMNIIKTEQQKSADAS 667
Query: 697 AQQKSNDSSMNTMHPPIPSSI---PPVSVTCSIPSGILSKPM------------------ 735
+ SS ++ +PS++ P S GIL P
Sbjct: 668 RTNTAQASSSKSILGAVPSTVAVAPRSSAIGQRSVGILQTPTHTASAASSIYNLLMNDFI 727
Query: 736 ----------------------DELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPS 773
DE+ VRMKPRDPRRVLH A+ + GS+G + G +
Sbjct: 728 YSVIFTASIAQFPFYFFLTFSRDEVAIVRMKPRDPRRVLHSTAVLKGGSVGLDQCKTGVA 787
Query: 774 APCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEP 833
T + NL+FQ Q + K ++ S PDI QFTKNLK+IAD M P TS
Sbjct: 788 G--THATISNLSFQSQEDQLDRKSAVTLSTTPPDIACQFTKNLKNIAD-MISVSPSTSPS 844
Query: 834 MVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGA-HPQSAWGDVEHLFEG 892
+ SQ + +S +++K V+ + G E G G+ PQ +WGDVEHLFEG
Sbjct: 845 VASQTQTLCIQAYQSRSEVKGAVSEPSEWVNDAGLASEKGSPGSLQPQISWGDVEHLFEG 904
Query: 893 YDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
Y DQQ+A IQ+ERTRRLEEQKKMFS F E+DPVH+EILRKKE
Sbjct: 905 YSDQQRADIQRERTRRLEEQKKMFS-------------------FVEIDPVHEEILRKKE 945
Query: 953 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
EQDREKP+RHLFRFPHMGMWTKLRPGIW FLE+AS LFE+HLYTMGNKLYATEMAK+LDP
Sbjct: 946 EQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASNLFELHLYTMGNKLYATEMAKLLDP 1005
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV 1072
KG LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV
Sbjct: 1006 KGDLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV 1065
Query: 1073 VERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDV 1132
VERY YFPCSRRQFGL GPSLLEIDHDER EDGTLAS LGVI+R+H+ FF+H+S+D+ DV
Sbjct: 1066 VERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASCLGVIQRIHQNFFTHRSIDEADV 1125
Query: 1133 RNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVAN 1192
RNILA EQ+KILAGCRIVFSRVFPVGEA+PHLHPLWQTAEQFGAVCT IDDQVTHVVAN
Sbjct: 1126 RNILATEQKKILAGCRIVFSRVFPVGEASPHLHPLWQTAEQFGAVCTSQIDDQVTHVVAN 1185
Query: 1193 SLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
SLGTDKVNWALSTGR VVHPGWVEASALLYRRANE DFAIK
Sbjct: 1186 SLGTDKVNWALSTGRSVVHPGWVEASALLYRRANEHDFAIK 1226
>gi|297826809|ref|XP_002881287.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
lyrata]
gi|297327126|gb|EFH57546.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
lyrata]
Length = 1248
Score = 1013 bits (2620), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 628/1289 (48%), Positives = 791/1289 (61%), Gaps = 124/1289 (9%)
Query: 4 DVEEGEISDTASVEEISEEDFKIKQEEVVKV--------VKETKPIKVGGGEAAARVWTM 55
DVEEGEI D+ + E ++KQ+ GG +RVWTM
Sbjct: 26 DVEEGEIPDSGNTE------IEVKQKTTTTADVGGDVDVGGRGGGGGGGGSNGNSRVWTM 79
Query: 56 RDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNS 115
DL KYP GL N AW+QAVQNK LNE VM+ E + K S
Sbjct: 80 EDLLTKYPGYRLYATSGLSNFAWSQAVQNKSLNEGLVMDYEPRESDKIVIEDSGDEKEEG 139
Query: 116 GAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLIN 175
G+ D +VE D+ ++KE E + + + ++++ +++ E K
Sbjct: 140 ELEEGEID--LVENASDDNLVASVDKET----ESVVLISADKVEDDRIQKEIDLEKK--- 190
Query: 176 VESIREALES--VLRGDISFEGVCSKLEFTLESLRELVNENN-VPTKDALIQLAFSAVQS 232
V+ IR LES ++ FEGVCS++ LESLRELV++N+ P +D L+QL+F+++Q+
Sbjct: 191 VKLIRGVLESTSLVEAQTGFEGVCSRILGALESLRELVSDNDDFPKRDTLVQLSFASLQT 250
Query: 233 VHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKD 292
++SVFCS+N+V KE+NKE +SRLL+L+ H SSNQ E+EAM D +
Sbjct: 251 INSVFCSLNNVSKERNKETMSRLLTLVNDHFSRFLSSNQKNEIEAM-------NQDLSRS 303
Query: 293 MLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK-PLEASKPGPPGYRSRGVLL 351
+A++ + +E VN + P DS + K E + G RSR +L
Sbjct: 304 AIAVY------TGTSSEENVNRMT----QPSNGDSFLAKKLSSEGTHRGASYVRSRLPML 353
Query: 352 PLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYE 411
PLLD HK HD DSLPSPTRETTPS+PV MVK + S E K YE
Sbjct: 354 PLLDLHKDHDADSLPSPTRETTPSLPVNGR----HTMVKPGFPVGRESQTTEGAKVYPYE 409
Query: 412 TDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMP 471
+DAL+A S+Y QKFG NS F +LPSPTPS E DG+GD GE+SS+ P +
Sbjct: 410 SDALKAVSTYHQKFGLNSVFKTDDLPSPTPSGEPNDGNGDIDGEVSSSVVKSS-NPGTLL 468
Query: 472 TLGQQ-PVSSQPMDISQPMD--ISSV---QALTTANNSAPASSGYNPVVKPNPVVKAPIK 525
GQ P+ S S P+ +SS L+ SAP S V + VK K
Sbjct: 469 MYGQDVPLPSNFNSRSMPVANAVSSTVPPHHLSIHTISAPTGSTQT-VFASDQTVKPSAK 527
Query: 526 SRDPRLRFASSNALN--LNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQ 583
SRDPRLR A + N +N + N KVE +++ RKQK +E +DGPA KRQ
Sbjct: 528 SRDPRLRLAKPDTANVTINSYSSGDARNLFKVELSADLVNPRKQKAADELFIDGPAWKRQ 587
Query: 584 RNGFENSGVVRDEKNIYGSGGWLEDTDMFE-PQIMNRNLLVDSAESNSRKLDNGATSPIT 642
++ D G GGWLEDT+ P++ ES R ++NG TS T
Sbjct: 588 KSD-------TDAPKAAGIGGWLEDTESSGLPKL----------ESKPRLIENGVTSMTT 630
Query: 643 SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSN 702
S P VS ++ P T + SL +LL+DIAVNPTMLLN+LKMG++ K+ A QK
Sbjct: 631 SVMPTSAVSVSQKVP-TASTDAASLQSLLQDIAVNPTMLLNLLKMGERHKVPEKALQKPM 689
Query: 703 DSSMNTMHPPIPSSIPPVSVTCSIPS-----------GIL-----SKPMDELGKVRMKPR 746
D P S +P VS IP+ G+L + P DE G +RMKPR
Sbjct: 690 DPR-RAAQLPGSSVLPGVSAPLHIPASNALATNSSKRGVLQDSSQNAPTDESGSIRMKPR 748
Query: 747 DPRRVLHGNALQRS-GSLGPEFKTDGPSAPCT---QGSKENLNFQKQL---------GAP 793
DPRR+LHG LQR+ S+ + K + S T +G E+L QL G
Sbjct: 749 DPRRILHGGTLQRTDSSMEKQSKVNDSSTLGTLTMKGKTEDLETPSQLLPRQNISQNGTS 808
Query: 794 EAK---PVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGA 850
+ K +LS+ PD + QFTK++K+IAD + VSQ + P + + Q+K+
Sbjct: 809 KMKISGELLSEKT--PDFSTQFTKSVKNIADMVVVSQQAGNPPASTHSI-----QLKTER 861
Query: 851 DMKAVVTNHDDKQ-----TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKER 905
D+K +N +D++ + AGP + ++WGDVEHLFEGYDD Q+ AIQ+ER
Sbjct: 862 DVKQNPSNPNDQEEDVSVSAASVTATAGPTRS--MNSWGDVEHLFEGYDDTQRVAIQRER 919
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
RRLEEQKKMF+++KL LVLD+DHTLLNSAKF+EV+ H+EILRKKEEQDREKP+RHLFR
Sbjct: 920 VRRLEEQKKMFASQKLSLVLDIDHTLLNSAKFNEVEFRHEEILRKKEEQDREKPYRHLFR 979
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
FPHMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAK+LDPKG+LF GRVIS+G
Sbjct: 980 FPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGILFNGRVISKG 1039
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
DDGDP DGDERVPKSKDLEGV+GMES+VVIIDDSVRVWP+NK+NLI VERY YFP SRRQ
Sbjct: 1040 DDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPYNKMNLIAVERYLYFPRSRRQ 1099
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILA 1145
FGLLGPSLLE+D DE E+GTLASSL VIE++HK FFSH SLD+VDVRNILA+EQRKILA
Sbjct: 1100 FGLLGPSLLELDRDEVPEEGTLASSLAVIEKIHKNFFSHTSLDEVDVRNILASEQRKILA 1159
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GCRIVFSR+ PVGEA PHLHPLWQTAEQFGAVCT +D+ VTHVV NSLGTDKVNWAL+
Sbjct: 1160 GCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKVNWALTR 1219
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAIKP 1234
GRFVVHPGWVEASA LY+RANE +AI P
Sbjct: 1220 GRFVVHPGWVEASAFLYQRANENLYAINP 1248
>gi|30685744|ref|NP_180912.2| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
thaliana]
gi|238055326|sp|Q8LL04.2|CPL3_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 3;
Short=FCP-like 3; AltName: Full=Carboxyl-terminal
phosphatase-like 3; Short=AtCPL3; Short=CTD
phosphatase-like 3
gi|330253756|gb|AEC08850.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
thaliana]
Length = 1241
Score = 1004 bits (2595), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 620/1251 (49%), Positives = 772/1251 (61%), Gaps = 156/1251 (12%)
Query: 52 VWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVA 111
VWTM +L ++YPA GL NLAWA+AVQNKP NE VM+ E P S
Sbjct: 79 VWTMEELISQYPAYRPYANSGLSNLAWARAVQNKPFNEGLVMDYE---------PRES-- 127
Query: 112 SVNSGAAAGKDDKKVVEKVVIDDSGDE----------IEKEEGELEEGEIELDLESE--- 158
+K+VI+DS DE I+ + ++ +E D ES
Sbjct: 128 ----------------DKIVIEDSDDEKEEGELEEGEIDLVDNASDDNLVEKDTESVVLI 171
Query: 159 SNEKVSEQ--VKEEMKLINVESIREALES--VLRGDISFEGVCSKLEFTLESLRELVNEN 214
S +KV + +KE V+ IR LES ++ FEGVCS++ LESLRELV++N
Sbjct: 172 SADKVEDDRILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRILGALESLRELVSDN 231
Query: 215 N-VPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIK 273
+ P +D L+QL+F+++Q+++ VFCSMN++ KE+NKE +SRLL+L+ H S NQ
Sbjct: 232 DDFPKRDTLVQLSFASLQTINYVFCSMNNISKERNKETMSRLLTLVNDHFSQFLSFNQKN 291
Query: 274 EMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK- 332
E+E M D + +A+ + N VN + P DS + K
Sbjct: 292 EIETM-------NQDLSRSAIAVFAGTSSEEN------VNQMT----QPSNGDSFLAKKL 334
Query: 333 PLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSW 392
E++ G RSR +LPLLD HK HD DSLPSPTRETTPS+PV MV+
Sbjct: 335 TSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLPSPTRETTPSLPVNGR----HTMVRPG 390
Query: 393 AAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDT 452
+ S E K YE+DA +A S+YQQKFG NS F +LPSPTPS E DG+GD
Sbjct: 391 FPVGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDV 450
Query: 453 GGEISSATAV------------DQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTA 500
GGE+SS+ D P P N + PV++ P +S + A+
Sbjct: 451 GGEVSSSVVKSSNPGSHLIYGQDVPLPSNFNSRSM-PVANSVSSTVPPHHLS-IHAI--- 505
Query: 501 NNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLN--HQPAPILHNAPKVEPV 558
SAP +S + VK KSRDPRLR A +A N+ + N KVE
Sbjct: 506 --SAPTAS--------DQTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDARNLSKVELS 555
Query: 559 GRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMN 618
+++ RKQK +E ++DGPA KRQ++ D G+GGWLEDT+ +
Sbjct: 556 ADLVNPRKQKAADEFLIDGPAWKRQKSD-------TDAPKAAGTGGWLEDTE-------S 601
Query: 619 RNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNP 678
LL ES R ++NG TS +S P VS ++ T + T SL +LLKDIAVNP
Sbjct: 602 SGLL--KLESKPRLIENGVTSMTSSVMPTSAVSVSQKV-RTASTDTASLQSLLKDIAVNP 658
Query: 679 TMLLNILKMGQQQKLAADAQQKSNDSSMNTMHPPIPSSIPPVSVTCSIP----------- 727
TMLLN+LKMG++QK+ A QK D P S P VS SIP
Sbjct: 659 TMLLNLLKMGERQKVPEKAIQKPMDPR-RAAQLPGSSVQPGVSTPLSIPASNALAANSLN 717
Query: 728 SGIL-----SKPMDELGKVRMKPRDPRRVLHGNALQRS-GSLGPEFKTDGPSAPCT---Q 778
SG+L + P E G +RMKPRDPRR+LHG+ LQR+ S+ + K + PS T +
Sbjct: 718 SGVLQDSSQNAPAAESGSIRMKPRDPRRILHGSTLQRTDSSMEKQTKVNDPSTLGTLTMK 777
Query: 779 GSKENLNFQKQLGAPE-------AKPVLSQSVLQ---PDITQQFTKNLKHIADFMSVSQP 828
G E+L QL + +K +S +L PD + QFTKNLK IAD + VSQ
Sbjct: 778 GKAEDLETPPQLDPRQNISQNGTSKMKISGELLSGKTPDFSTQFTKNLKSIADMVVVSQQ 837
Query: 829 LTSEPMVSQNSPIQPGQIKSGADMKAVVTN-----HDDKQTGTGSGPEAGPVGAHPQSAW 883
L + P + + Q+K+ D+K +N D + AGP + ++W
Sbjct: 838 LGNPP-----ASMHSVQLKTERDVKHNPSNPNAQDEDVSVSAASVTAAAGPTRS--MNSW 890
Query: 884 GDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 943
GDVEHLFEGYDD Q+ AIQ+ER RRLEEQ KMF+++KL LVLD+DHTLLNSAKF+EV+
Sbjct: 891 GDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESR 950
Query: 944 HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
H+EILRKKEEQDREKP+RHLFRF HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYA
Sbjct: 951 HEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 1010
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 1063
TEMAK+LDPKGVLF GRVIS+GDDGDP DGDERVPKSKDLEGV+GMES+VVIIDDSVRVW
Sbjct: 1011 TEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVW 1070
Query: 1064 PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS 1123
P +K+NLI VERY YFPCSRRQFGLLGPSLLE+D DE E+GTLASSL VIE++H+ FFS
Sbjct: 1071 PQHKMNLIAVERYLYFPCSRRQFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIHQNFFS 1130
Query: 1124 HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID 1183
H SLD+VDVRNILA+EQRKILAGCRIVFSR+ PVGEA PHLHPLWQTAEQFGAVCT +D
Sbjct: 1131 HTSLDEVDVRNILASEQRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVD 1190
Query: 1184 DQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 1234
+ VTHVV NSLGTDKVNWAL+ GRFVVHPGWVEASA LY+RANE +AI P
Sbjct: 1191 EHVTHVVTNSLGTDKVNWALTRGRFVVHPGWVEASAFLYQRANENLYAINP 1241
>gi|22212705|gb|AAM94371.1|AF486633_1 CTD phosphatase-like 3 [Arabidopsis thaliana]
Length = 1241
Score = 1003 bits (2592), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 619/1251 (49%), Positives = 771/1251 (61%), Gaps = 156/1251 (12%)
Query: 52 VWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVA 111
VWTM +L ++YPA GL NLAWA+AVQNKP NE VM+ E P S
Sbjct: 79 VWTMEELISQYPAYRPYANSGLSNLAWARAVQNKPFNEGLVMDYE---------PRES-- 127
Query: 112 SVNSGAAAGKDDKKVVEKVVIDDSGDE----------IEKEEGELEEGEIELDLESE--- 158
+K+VI+DS DE I+ + ++ +E D ES
Sbjct: 128 ----------------DKIVIEDSDDEKEEGELEEGEIDLVDNASDDNLVEKDTESVVLI 171
Query: 159 SNEKVSEQ--VKEEMKLINVESIREALES--VLRGDISFEGVCSKLEFTLESLRELVNEN 214
S +KV + +KE V+ IR LES ++ FEGVCS++ LESLRELV++N
Sbjct: 172 SADKVEDDRILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRILGALESLRELVSDN 231
Query: 215 N-VPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIK 273
+ P +D L+QL+F+++Q+++ VFCSMN++ KE+NKE +SRLL+L+ H S NQ
Sbjct: 232 DDFPKRDTLVQLSFASLQTINYVFCSMNNISKERNKETMSRLLTLVNDHFSQFLSFNQKN 291
Query: 274 EMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK- 332
E+E M D + +A+ + N VN + P DS + K
Sbjct: 292 EIETM-------NQDLSRSAIAVFAGTSSEEN------VNQMT----QPSNGDSFLAKKL 334
Query: 333 PLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSW 392
E++ G RSR +LPLLD HK HD DSLPSPTRETTPS+PV MV+
Sbjct: 335 TSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLPSPTRETTPSLPVNGR----HTMVRPG 390
Query: 393 AAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDT 452
+ S E K YE+DA +A S+YQQKFG NS F +LPSPTPS E DG+GD
Sbjct: 391 FPVGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDV 450
Query: 453 GGEISSATAV------------DQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTA 500
GGE+SS+ D P P N + PV++ P +S + A+
Sbjct: 451 GGEVSSSVVKSSNPGSHLIYGQDVPLPSNFNSRSM-PVANSVSSTVPPHHLS-IHAI--- 505
Query: 501 NNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLN--HQPAPILHNAPKVEPV 558
SAP +S + VK KSRDPRLR A +A N+ + N KVE
Sbjct: 506 --SAPTAS--------DQTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDARNLSKVELS 555
Query: 559 GRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMN 618
+++ RKQK +E ++DGPA KRQ++ D G+GGWLEDT+ +
Sbjct: 556 ADLVNPRKQKAADEFLIDGPAWKRQKSD-------TDAPKAAGTGGWLEDTE-------S 601
Query: 619 RNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNP 678
LL ES R ++NG TS +S P VS ++ T + T SL +LLKDIAVNP
Sbjct: 602 SGLL--KLESKPRLIENGVTSMTSSVMPTSAVSVSQKV-RTASTDTASLQSLLKDIAVNP 658
Query: 679 TMLLNILKMGQQQKLAADAQQKSNDSSMNTMHPPIPSSIPPVSVTCSIP----------- 727
TMLLN+LKMG++QK+ A QK D P S P VS SIP
Sbjct: 659 TMLLNLLKMGERQKVPEKAIQKPMDPR-RAAQLPGSSVQPGVSTPLSIPASNALAANSLN 717
Query: 728 SGIL-----SKPMDELGKVRMKPRDPRRVLHGNALQRS-GSLGPEFKTDGPSAPCT---Q 778
SG+L + P E G +RMKPRDPRR+LHG+ LQR+ S+ + K + PS T +
Sbjct: 718 SGVLQDSSQNAPAAESGSIRMKPRDPRRILHGSTLQRTDSSMEKQTKVNDPSTLGTLTMK 777
Query: 779 GSKENLNFQKQLGAPE-------AKPVLSQSVLQ---PDITQQFTKNLKHIADFMSVSQP 828
G E+L QL + +K +S +L PD + QFTKNLK IAD + VSQ
Sbjct: 778 GKAEDLETPPQLDPRQNISQNGTSKMKISGELLSGKTPDFSTQFTKNLKSIADMVVVSQQ 837
Query: 829 LTSEPMVSQNSPIQPGQIKSGADMKAVVTN-----HDDKQTGTGSGPEAGPVGAHPQSAW 883
L + P + + Q+K+ D+K +N D + AGP + ++W
Sbjct: 838 LGNPP-----ASMHSVQLKTERDVKHNPSNPNAQDEDVSVSAASVTAAAGPTRS--MNSW 890
Query: 884 GDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 943
GDVEHLFEGYDD Q+ AIQ+ER RRLEEQ KMF+++KL LVLD+DHTLLNSAKF+EV+
Sbjct: 891 GDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESR 950
Query: 944 HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
H+EILRKKEEQDREKP+RHLFRF HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLY
Sbjct: 951 HEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYV 1010
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 1063
TEMAK+LDPKGVLF GRVIS+GDDGDP DGDERVPKSKDLEGV+GMES+VVIIDDSVRVW
Sbjct: 1011 TEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVW 1070
Query: 1064 PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS 1123
P +K+NLI VERY YFPCSRRQFGLLGPSLLE+D DE E+GTLASSL VIE++H+ FFS
Sbjct: 1071 PQHKMNLIAVERYLYFPCSRRQFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIHQNFFS 1130
Query: 1124 HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID 1183
H SLD+VDVRNILA+EQRKILAGCRIVFSR+ PVGEA PHLHPLWQTAEQFGAVCT +D
Sbjct: 1131 HTSLDEVDVRNILASEQRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVD 1190
Query: 1184 DQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 1234
+ VTHVV NSLGTDKVNWAL+ GRFVVHPGWVEASA LY+RANE +AI P
Sbjct: 1191 EHVTHVVTNSLGTDKVNWALTRGRFVVHPGWVEASAFLYQRANENLYAINP 1241
>gi|77551160|gb|ABA93957.1| NLI interacting factor-like phosphatase family protein, expressed
[Oryza sativa Japonica Group]
Length = 1272
Score = 897 bits (2319), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 573/1288 (44%), Positives = 753/1288 (58%), Gaps = 142/1288 (11%)
Query: 16 VEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPGLHN 75
+EEIS +DFK + + +RVW YN I R Y P H+
Sbjct: 57 LEEISADDFKKESSAAGGAAAAAA------AQQRSRVWMG---YN----IPRSYAPAFHS 103
Query: 76 LAWAQAVQNKPLNEIFVMEAEQDDV---------SKRSSPASSVASVNSGAAAGKDDKKV 126
AWAQAVQNKPL A++D+V K +V + +
Sbjct: 104 FAWAQAVQNKPLVPRAADAADEDEVEHVVDTSDEEKEEGEIEEGEAVQTTTTSSSSPPCA 163
Query: 127 VEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLINVESIREALE-- 184
ID D EK E + E+V + V SI E LE
Sbjct: 164 QPPETIDLDSDAPEKSESMVAMYGGGAAPAGAEEEEVDFDQR-------VGSILEELEMV 216
Query: 185 SVLRGDISFEGVCSKLEFTLESLRELVNENN--VPTKDALIQLAFSAVQSVHSVFCSMNH 242
S+ + SFEG C++L E+L+ L E+ +P DAL+Q AF + ++ +V S +
Sbjct: 217 SIEEAEKSFEGACTRLRTCFENLKPLFPESGSPMPMLDALVQQAFVGIDTITTVANSYDM 276
Query: 243 VLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGVNGK 302
+EQ K +L +LL IK+ + + +Q E+++ + LV + KD NG
Sbjct: 277 PKREQTKNMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVF---EDGKD-----NANGP 328
Query: 303 DSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDV 362
++ A + E+ LP +S N + P ++R ++ PLLD H +D
Sbjct: 329 NATSTNAAAPSGQVLSER--LPFESGAGNSFSKVEIPA----KNR-MVSPLLDLHADYDE 381
Query: 363 DSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYET--DALRAFSS 420
+SLPSPTR++ P V + +G G + + E K Y++ DAL+A
Sbjct: 382 NSLPSPTRDSKPPFDVPKP--IGYGALPMAPDRPSVLERVEPAKNSSYQSFNDALKAVCY 439
Query: 421 YQQKFGRNSFFMNSELPSPTPS---EESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQP 477
YQQK G+ S F + +LPSPTPS ++SGD GD GE+SS +A ++ + +P + Q P
Sbjct: 440 YQQKHGQKSNFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNK---IALPIVNQMP 496
Query: 478 VSSQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAPIKSRDPRLR 532
S+P +SS +++ A GY N V N ++KA KSRDPRL+
Sbjct: 497 --------SRPSTVSS-----NSDSFAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLK 543
Query: 533 F--------ASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQR 584
F A +N +P P + + G ++SRK K V+EP++D ALKR R
Sbjct: 544 FLNRDTGGVADANRRVNFAEPNP---SKDRTMGGGVSINSRKNKAVDEPMVDENALKRSR 600
Query: 585 NGFENSGVVRDEKNI--YGSGGWLED--------TDMFEPQIMNRNLLVDSAESNSRKLD 634
GV+ + +++ G GGW +D +D F+P N+N + + + + +
Sbjct: 601 ------GVIGNLRDMQPTGRGGWAKDGGNISSYSSDGFQP---NQNTRLGNNTTGNHNIR 651
Query: 635 NGAT--------SPITSGTPNVVVS-GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNIL 685
+T + + +P +V + AP T+ + VSLPA+LKDIAVNPTML+ +
Sbjct: 652 TDSTLASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWI 711
Query: 686 KMGQQQKLAADAQQKSNDSSMNT------MHPPIPSSIPPVSVTCSIPS-----GILSKP 734
+M QQ+ A++ QQK S T M P+ + P + ++PS + S P
Sbjct: 712 QMEQQKMSASEPQQKVTASVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAP 770
Query: 735 M---DELGKVRMKPRDPRRVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQ 787
M ++ G +RMKPRDPRR+LH N +Q++ ++ P + K++G + P +Q SK++L Q
Sbjct: 771 MHSQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQ 830
Query: 788 KQLGAPEAKPVLSQSVLQPDI-TQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQI 846
Q K Q++ P + + + A+ +S SQ + M + Q
Sbjct: 831 DQ------KAEQLQAIALPSLPVTSSARPVTMNANPVSNSQLAATALMPPHGNTKQTSSS 884
Query: 847 KSGADMK-AVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKER 905
+ AD + A N + T +GP P P S +GDV+HL +GYDDQQKA IQKER
Sbjct: 885 VNKADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKER 944
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
RR++EQ KMF+ARKLCLVLDLDHTLLNSAKF EVD +H EILRKKEEQDRE+ RHLF
Sbjct: 945 ARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFC 1004
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
F HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRG
Sbjct: 1005 FNHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRG 1064
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
DDGDPFD DERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQ
Sbjct: 1065 DDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQ 1124
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILA 1145
FGL GPSLLEID DER EDGTLASSL VIER+HK FFSH +L+D DVR+ILA+EQ++IL
Sbjct: 1125 FGLPGPSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSILASEQQRILG 1184
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GCRIVFSR+FPVGEANPH+HPLWQTAEQFGAVCT IDD+VTHVVANSLGTDKVNWALST
Sbjct: 1185 GCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALST 1244
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAIK 1233
GRFVVHPGWVEASALLYRRA+E DFA+K
Sbjct: 1245 GRFVVHPGWVEASALLYRRASELDFAVK 1272
>gi|222616055|gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japonica Group]
Length = 1267
Score = 873 bits (2256), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 525/1101 (47%), Positives = 688/1101 (62%), Gaps = 111/1101 (10%)
Query: 192 SFEGVCSKLEFTLESLRELVNENN--VPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNK 249
SFEG C++L E+L+ L E+ +P DAL+Q AF + ++ +V S + +EQ K
Sbjct: 219 SFEGACTRLRTCFENLKPLFPESGSPMPMLDALVQQAFVGIDTITTVANSYDMPKREQTK 278
Query: 250 EILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTE 309
+L +LL IK+ + + +Q E+++ + LV + KD NG ++
Sbjct: 279 NMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVF---EDGKD-----NANGPNATSTNA 330
Query: 310 NAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPT 369
A + E+ LP +S N + P ++R ++ PLLD H +D +SLPSPT
Sbjct: 331 AAPSGQVLSER--LPFESGAGNSFSKVEIPA----KNR-MVSPLLDLHADYDENSLPSPT 383
Query: 370 RETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYET--DALRAFSSYQQKFGR 427
R++ P V + +G G + + E K Y++ DAL+A YQQK G+
Sbjct: 384 RDSKPPFDVPKP--IGYGALPMAPDRPSVLERVEPAKNSSYQSFNDALKAVCYYQQKHGQ 441
Query: 428 NSFFMNSELPSPTPS---EESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMD 484
S F + +LPSPTPS ++SGD GD GE+SS +A ++ + +P + Q P
Sbjct: 442 KSNFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNK---IALPIVNQMP------- 491
Query: 485 ISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAPIKSRDPRLRF------ 533
S+P +SS +++ A GY N V N ++KA KSRDPRL+F
Sbjct: 492 -SRPSTVSS-----NSDSFAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLKFLNRDTG 545
Query: 534 --ASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENSG 591
A +N +P P + + G ++SRK K V+EP++D ALKR R G
Sbjct: 546 GVADANRRVNFAEPNP---SKDRTMGGGVSINSRKNKAVDEPMVDENALKRSR------G 596
Query: 592 VVRDEKNI--YGSGGWLED--------TDMFEPQIMNRNLLVDSAESNSRKLDNGAT--- 638
V+ + +++ G GGW +D +D F+P N+N + + + + + +T
Sbjct: 597 VIGNLRDMQPTGRGGWAKDGGNISSYSSDGFQP---NQNTRLGNNTTGNHNIRTDSTLAS 653
Query: 639 -----SPITSGTPNVVVS-GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQK 692
+ + +P +V + AP T+ + VSLPA+LKDIAVNPTML+ ++M QQ+
Sbjct: 654 NLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQMEQQKM 713
Query: 693 LAADAQQKSNDSSMNT------MHPPIPSSIPPVSVTCSIPS-----GILSKPM---DEL 738
A++ QQK S T M P+ + P + ++PS + S PM ++
Sbjct: 714 SASEPQQKVTASVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAPMHSQNDT 772
Query: 739 GKVRMKPRDPRRVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQKQLGAPE 794
G +RMKPRDPRR+LH N +Q++ ++ P + K++G + P +Q SK++L Q Q
Sbjct: 773 GVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQ----- 827
Query: 795 AKPVLSQSVLQPDI-TQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMK 853
K Q++ P + + + A+ +S SQ + M + Q + AD +
Sbjct: 828 -KAEQLQAIALPSLPVTSSARPVTMNANPVSNSQLAATALMPPHGNTKQTSSSVNKADPR 886
Query: 854 -AVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
A N + T +GP P P S +GDV+HL +GYDDQQKA IQKER RR++EQ
Sbjct: 887 LAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERARRIKEQ 946
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
KMF+ARKLCLVLDLDHTLLNSAKF EVD +H EILRKKEEQDRE+ RHLF F HMGMW
Sbjct: 947 HKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCFNHMGMW 1006
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRPGIW FLE+ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRGDDGDPFD
Sbjct: 1007 TKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGDDGDPFD 1066
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
DERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQFGL GPS
Sbjct: 1067 SDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQFGLPGPS 1126
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFS 1152
LLEID DER EDGTLASSL VIER+HK FFSH +L+D DVR+ILA+EQ++IL GCRIVFS
Sbjct: 1127 LLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSILASEQQRILGGCRIVFS 1186
Query: 1153 RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHP 1212
R+FPVGEANPH+HPLWQTAEQFGAVCT IDD+VTHVVANSLGTDKVNWALSTGRFVVHP
Sbjct: 1187 RIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALSTGRFVVHP 1246
Query: 1213 GWVEASALLYRRANEQDFAIK 1233
GWVEASALLYRRA+E DFA+K
Sbjct: 1247 GWVEASALLYRRASELDFAVK 1267
>gi|242068555|ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
gi|241935397|gb|EES08542.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
Length = 1197
Score = 861 bits (2224), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 549/1263 (43%), Positives = 704/1263 (55%), Gaps = 141/1263 (11%)
Query: 14 ASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAI---CRGYG 70
S+EEIS +DF+ + +R W PA+ R +G
Sbjct: 33 GSIEEISADDFRKDSSSALGGPAAAA-----AAGQRSRSWV------GPPAVGYMARNFG 81
Query: 71 PGLHNLAWAQAVQNKPLNEIFVMEAEQD------DVSKRSSPASSVASVNSGAAAGKDDK 124
++ AW+QAV+NKPL +++D D S + + A +
Sbjct: 82 HAFNSFAWSQAVRNKPLGLQPPPASDEDEVEHAVDASDGEKEEGEIEEGEAVEAEASPAR 141
Query: 125 KVVEKVVIDDSGDEIEKEEG--------ELEEGEIELDLESESNEKVSEQVKEEMKLINV 176
E + +D D +EK E EE E+ LD S + EE++++++
Sbjct: 142 AQPETIDLDADADALEKSESLAGAVPASAAEEEEVNLDQRVGS-------ILEELEMVSI 194
Query: 177 ESIREALESVLRGDISFEGVCSKLEFTLESLRELVNE--NNVPTK--DALIQLAFSAVQS 232
E + SFEG C +L E+L+ L E N P + L+Q AF + +
Sbjct: 195 E----------EAEKSFEGACGRLHTCFENLKPLFQELENGSPMAILEPLMQQAFIGIDT 244
Query: 233 VHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKD 292
+ +V S N EQNK L + L IK+ + + Q E+++ + LV D D
Sbjct: 245 LTTVAISYNLPRSEQNKTTLLKSLFHIKNRYSDMLTPEQRDELDSRVRKLVFGEKDNVSD 304
Query: 293 MLAMHGVNGKDSNIVTENAVNDLNFKEKVP----LPVDSLMQNKPLEASKPGPPGYRSRG 348
G N A+N L +V LP +S N + P R
Sbjct: 305 PSTSSGTN----------AINVLAPSGQVSSSGGLPFESGAANPFSSLPRLEVPAKR--- 351
Query: 349 VLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTP 408
+ PLLD H +D +SLPSPTR+ P PV + +G G E K
Sbjct: 352 -ISPLLDLHADYDENSLPSPTRDNAPPFPVPKP--IGFGAFPMVPEKLSFPERVEPAKNS 408
Query: 409 HYET--DALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEI-SSATAVDQP 465
Y + D L+A SSYQQK+G+ S F + +LPSPTPS + G D GG+I S ++ P
Sbjct: 409 LYPSLNDPLKAVSSYQQKYGQKSVFPSDDLPSPTPSGDEGKS-ADKGGDIFSEVSSFPVP 467
Query: 466 KPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIK 525
K + +P+ Q P S QP +SS + A PV PN +KA K
Sbjct: 468 KSIALPSTSQMPAS-------QPSTVSSSGISYASGPPGFAKQIEQPVAGPNHAIKAASK 520
Query: 526 SRDPRLRFA---SSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKR 582
SRDPRLRF S+ A ++N + G + +RK K +++P +D LKR
Sbjct: 521 SRDPRLRFLNRDSAGATDVNRRAN--FSELKDGNLGGASVGNRKHKAIDDPQVDENVLKR 578
Query: 583 QRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPIT 642
R G N RD +P L+ A +NS ++ P
Sbjct: 579 FRGGTANP---RD----------------LQPTGNPNQLMNIRAPTNSSGINMKTLQPPQ 619
Query: 643 SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQ-KS 701
+ P+V + P P+ LLKDIAVNPT+L+++++M Q+K A++ Q S
Sbjct: 620 TTAPHVSAAPAVPVPSM----------LLKDIAVNPTLLMHLIQMEHQKKSASETQGGMS 669
Query: 702 NDSSMNTMHPPI--PSSIPPVSVTCSIPSGILSKP--------MDELGKVRMKPRDPRRV 751
+ S N + + P + P + +PS P ++ G +RMKPRDPRR+
Sbjct: 670 SGMSNNGIAGMVFTPGNAPKTTEAAQVPSVRPQVPAQTPSLNSQNDGGILRMKPRDPRRI 729
Query: 752 LHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQ 810
LH N Q+S ++ E KT+G + P +QG+K+ Q + ++P L SV +P
Sbjct: 730 LHNNVAQKSDAMVLEQVKTNGITQPDSQGTKD------QTSSMPSQPTLPSSVARP---- 779
Query: 811 QFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGP 870
FT N KH+ D +S SQ + M + G I AV N + T
Sbjct: 780 -FT-NTKHV-DPVSNSQLAATAIMAPTQQAL--GSINKVDPRLAVEQNGQNADATTTDAS 834
Query: 871 EAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHT 930
P S WG+++HL +GYDD+QKA IQKER RR+ EQ KMFSARKLCLVLDLDHT
Sbjct: 835 ATELEATQPVSPWGNLDHLLDGYDDKQKALIQKERARRITEQHKMFSARKLCLVLDLDHT 894
Query: 931 LLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLF 990
LLNSAKF EV+P+H+E+LRKKEEQDR P RHL+RF HM MWTKLRPGIW FLE+AS LF
Sbjct: 895 LLNSAKFIEVEPIHEEMLRKKEEQDRTLPERHLYRFHHMNMWTKLRPGIWNFLEKASNLF 954
Query: 991 EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 1050
E+HLYTMGNKLYATEMAKVLDP G LFAGRVISRGDDGDPFD DERVPKSKDL+GVLGME
Sbjct: 955 ELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLDGVLGME 1014
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASS 1110
SAVVIIDDSVRVWPHN+ NLIVVERYTYFPCSRRQFGL GPSLLEID DER EDGTLASS
Sbjct: 1015 SAVVIIDDSVRVWPHNRHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASS 1074
Query: 1111 LGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
L VIER+H FFSH +L++ DVR+ILA+EQR+ILAGCRIVFSRVFPVG+A+PHLHPLWQT
Sbjct: 1075 LAVIERIHHNFFSHPNLNEADVRSILASEQRRILAGCRIVFSRVFPVGDASPHLHPLWQT 1134
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
AEQFGAVCT +DD+VTHVVANS GTDKVNWALS G+FVVHPGWVEASALLYRRANE DF
Sbjct: 1135 AEQFGAVCTNLVDDRVTHVVANSPGTDKVNWALSKGKFVVHPGWVEASALLYRRANEHDF 1194
Query: 1231 AIK 1233
A+K
Sbjct: 1195 AVK 1197
>gi|218185830|gb|EEC68257.1| hypothetical protein OsI_36281 [Oryza sativa Indica Group]
Length = 1255
Score = 852 bits (2202), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 556/1288 (43%), Positives = 734/1288 (56%), Gaps = 159/1288 (12%)
Query: 16 VEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPGLHN 75
+EEIS +DFK +E + +RVW YN I R Y P H+
Sbjct: 57 LEEISADDFK---KESSAAGGVAAAAAAAAAQQRSRVWMG---YN----IPRSYAPAFHS 106
Query: 76 LAWAQAVQNKPLNEIFVMEAEQDDV---------SKRSSPASSVASVNSGAAAGKDDKKV 126
AWAQAVQNKPL A++D+V K +V + +
Sbjct: 107 FAWAQAVQNKPLVPRAADAADEDEVEHVVDTSDEEKEEGEIEEGEAVQTTTTSSSSPPCA 166
Query: 127 VEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVS-----EQVKEEMKLINVESI-R 180
ID D EK E + E+V + EE++++++E +
Sbjct: 167 QPPETIDLDSDAPEKSESMVAMDGGGAAPAGAEEEEVDFDQRVGSILEELEMVSIEEAEK 226
Query: 181 EALESVLRGDI----------------------SFEGVCSKLEFTLESLRELVNENN--V 216
L +L G + SFEG C++L E+L+ L E+ +
Sbjct: 227 YGLMILLYGKVHVLDVFWCMIQLLRDPILIFCRSFEGACTRLRTCFENLKPLFPESGSPM 286
Query: 217 PTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEME 276
P DAL+Q AF + ++ +V S + +EQ K +L +LL IK+ + + +Q E++
Sbjct: 287 PMLDALVQQAFVGIDTITTVANSYDMPKREQTKNMLLKLLFHIKNRYSDMLTPDQRDELD 346
Query: 277 AMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEA 336
+ + LV + KD NG ++ A + E+ LP +S N +
Sbjct: 347 SRVRQLVF---EDGKD-----NANGPNATSTNAAAPSGQVLSER--LPFESGAGNSFSKV 396
Query: 337 SKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAA 396
P ++R ++ PLLD H +D +SLPSPTR++ P V + +G G +
Sbjct: 397 EIPA----KNR-MVSPLLDLHADYDENSLPSPTRDSAPPFDVPKP--IGYGALPMAPDRP 449
Query: 397 KLSHNAEVHKTPHYET--DALRAFSSYQQKFGRNSFFMNSELPSPTPS---EESGDGDGD 451
+ E K Y++ DAL+A YQQK G+ S F + +LPSPTPS ++SGD GD
Sbjct: 450 SVLERVEPAKNSSYQSFNDALKAVCYYQQKHGQKSNFASDDLPSPTPSGDGDKSGDKGGD 509
Query: 452 TGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGY- 510
GE+SS +A ++ + +P + Q P S+P +SS +++ A GY
Sbjct: 510 VFGEVSSFSASNK---IVLPIVNQMP--------SRPSTVSS-----NSDSFAGGPPGYA 553
Query: 511 ----NPVVKPNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRK 566
N V N ++KA KSRDPRL+F + + G V + +
Sbjct: 554 KQIENSVSGSNHLLKATAKSRDPRLKFLNRD--------------------TGGVADANR 593
Query: 567 QKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDS- 625
+ EP N + V + ++I + ++T + N N+ DS
Sbjct: 594 RVNFAEP-----------NPLKIGPWVVEYQSIAPN----QNTRLGNNTTGNHNIRTDST 638
Query: 626 -AESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNI 684
A + + +N TSP P + + P ++ P+ VSLPA+LKDIAVNPTML+
Sbjct: 639 LASNLNNTTNNSGTSPGIVQAPQ---TNSAPQTSSAPA--VSLPAMLKDIAVNPTMLMQW 693
Query: 685 LKMGQQQKLAADAQQKSNDSSMNT------MHPPIPSSIPPVSVTCSIPS-----GILSK 733
++M + A++ QQK S T M P+ + P + ++PS + S
Sbjct: 694 IRMEHHKMSASEPQQKVTASVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSA 752
Query: 734 PM---DELGKVRMKPRDPRRVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNF 786
PM ++ G +RMKPRDPRR+LH N +Q++ ++ P + K++G + P +Q SK++L
Sbjct: 753 PMHSQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLN 812
Query: 787 QKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQI 846
Q Q A + + + S+ + T N A+ +S SQ + M + Q
Sbjct: 813 QDQ-KAEQLQAIALPSLPVTSSARPVTMN----ANPVSNSQLAATALMPPHGNTKQTSSS 867
Query: 847 KSGADMK-AVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKER 905
+ AD + A N + T +GP P P S +GDV+HL +GYDDQQKA IQKER
Sbjct: 868 VNKADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKER 927
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
RR++EQ KMF+ARKLCLVLDLDHTLLNSAKF EVD +H EILRKKEEQDRE+ RHLF
Sbjct: 928 ARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFC 987
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
F HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRG
Sbjct: 988 FNHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRG 1047
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
DDGDPFD DERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQ
Sbjct: 1048 DDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQ 1107
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILA 1145
FGL GPSLLEID DER EDGTLASSL VIER+HK FFSH +L+D DVR+ILA+EQ++IL
Sbjct: 1108 FGLPGPSLLEIDRDERPEDGTLASSLTVIERIHKNFFSHPNLNDADVRSILASEQQRILG 1167
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GCRIVFSR+FPVGEANPH+HPLWQTAEQFGAVCT IDD+VTHVVANSLGTDKVNWALST
Sbjct: 1168 GCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALST 1227
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAIK 1233
GRFVVHPGWVEASALLYRRA+E DFA+K
Sbjct: 1228 GRFVVHPGWVEASALLYRRASELDFAVK 1255
>gi|357156660|ref|XP_003577532.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Brachypodium distachyon]
Length = 1259
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 556/1298 (42%), Positives = 728/1298 (56%), Gaps = 150/1298 (11%)
Query: 8 GEISD---TASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPA 64
GE SD + S+EEI+ DF+ + +RVW
Sbjct: 40 GETSDEDSSESLEEITAADFQKESSGGAAAGTAAS-----AAAQRSRVWMGY-------T 87
Query: 65 ICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDDK 124
+ R Y P H+ AWAQAVQNKPL + A++D+V + D
Sbjct: 88 MSRSYAPAFHSFAWAQAVQNKPL--VPPPAADEDEVEHIVDTSDEEKEEGEIEEGEAVDT 145
Query: 125 KV----VEKVVIDDSGDEIEKEEGELEEGE----IELDLESESNEKVSEQVKEEMKLINV 176
+ ID D EK E EG + ++ E + +++V + EE++++++
Sbjct: 146 SFPSPHAQPETIDLDSDVPEKSESMAVEGSNTAAVAVEEEVDFDQRVGS-ILEELEMVSI 204
Query: 177 ESIREALESVLRGDISFEGVCSKLEFTLESLRELVNENN--VPTKDALIQLAFSAVQSVH 234
E + SFEG C +L E+L+ L E+ +P DAL+Q F + ++
Sbjct: 205 E----------EAEKSFEGACERLRTCFENLKPLFLESGSPMPMLDALVQQGFVGIDTIT 254
Query: 235 SVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDML 294
+V S + QNKE+L +LL +++ + + +Q E+++ + L ++ D
Sbjct: 255 TVANSYAMPKRVQNKEMLLKLLFHLRNRYSDMLTPDQRVELDSRVRQLAFVDGEENTD-- 312
Query: 295 AMHGVNGKDSNIVTENAVNDLNFKEKVP---LPVDSLMQNKPLEASKPGPPGYRSRGVLL 351
G +++ T N+ N + +VP LP +S N +S P ++
Sbjct: 313 ------GPNASCST-NSTNVVVPTGQVPSERLPFESGATNPFSGSSLPWLETQTKNRMVS 365
Query: 352 PLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYE 411
PLLD H HD +SLPSPTR+ P V + + G L+ AE K Y
Sbjct: 366 PLLDLHADHDENSLPSPTRDNAPQFSVPKPIGFG---AFPMGPDRSLTERAEPSKKNLYP 422
Query: 412 T--DALRAFSSYQQKFGRNSFFMNSELPSPTPS---EESGDGDGDTGGEISSATAVDQPK 466
+ D+L SSY+QK+ + S F N +LPSPTPS ++S D DGD GEISS ++ ++
Sbjct: 423 SVNDSLD-VSSYKQKYSQKSNFANDDLPSPTPSGDGDKSEDKDGDMFGEISSFSSSNK-- 479
Query: 467 PVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVK 521
+P++ Q P S+P +SS +N S GY V PN +K
Sbjct: 480 -TALPSVSQIPA-------SRPSTVSS------SNGSFSGPPGYAKKIEQSVSGPNLALK 525
Query: 522 APIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSR-KQKTVEEPVLDGPAL 580
KSRDPRLR+ LN P EP + + K K V +P++D +
Sbjct: 526 PSAKSRDPRLRY-------LNRDPGDANRCMNFAEPNASLGGTLGKHKAVGQPLMDENMV 578
Query: 581 KRQRNGFENSGVV-----RDEKNI--YGSGGWL--EDTDMFEPQIMNRNLLVDSA-ESNS 630
KR R N + RD NI Y S ++T + N NL DS SN
Sbjct: 579 KRARGSIGNPRDLQVPPGRDGSNISFYPSDRVQSNQNTRLDTKTTGNPNLRADSQLLSNV 638
Query: 631 RKLDNG---ATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKM 687
+ N +T + +G P+ V P T+ + +VSLPA+LKDIAVNPT+L++ ++M
Sbjct: 639 SSITNSSVTSTKTLNAGQPDSV-------PQTSAAPSVSLPAVLKDIAVNPTVLMHWIQM 691
Query: 688 GQQQKLAADAQQKS-----------NDSSMNTMHPP-------IPSSIPPVSVTCSIPSG 729
QQ++ A++ QQ N+ + + PP + IP + C +
Sbjct: 692 EQQKRSASEPQQTVNTLGGISSGMINNDTAGMVIPPGSALKTADAAQIPSIRPQCPTQTA 751
Query: 730 ILSKPMDELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENL---- 784
+ D G +RMKPRDPRR+LH N ++ + E +++G P +Q SK+N+
Sbjct: 752 PVISQTDA-GVIRMKPRDPRRILHNNTSPKNDTTNSEQARSNGIVLPVSQDSKDNMINRE 810
Query: 785 --NFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQ 842
Q Q GA ++PV ++ +P + D +S SQ S M Q +
Sbjct: 811 QQAEQLQTGALPSQPVSLSNIARPSTMS------ASMVDPVSNSQLAASSLMAPQQT--- 861
Query: 843 PGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQ 902
G I A N + T + P A P + WGD++ L GYDDQQKA IQ
Sbjct: 862 SGSINRADPRLAPGQNDPNADAATNASPATTLGAAPPANQWGDLDDLLSGYDDQQKALIQ 921
Query: 903 KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH 962
KER RR+ EQ+KMFSARKLCLVLDLDHTLLNSAKF EVDP+H+EILRKKEEQDRE+P RH
Sbjct: 922 KERARRIMEQQKMFSARKLCLVLDLDHTLLNSAKFLEVDPIHEEILRKKEEQDRERPERH 981
Query: 963 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
LFR HM MWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDP G LF GRVI
Sbjct: 982 LFRLHHMSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGALFEGRVI 1041
Query: 1023 SRGDDGDP-------FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVER 1075
SRG DG FD D+RVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK N+IVVER
Sbjct: 1042 SRGGDGTSRGGDGDSFDSDDRVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKNNMIVVER 1101
Query: 1076 YTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNI 1135
YTYFPCSRRQFGL GPSLLEID DER EDGTLASSL VI R+H+ FFSH +L+D DVR+I
Sbjct: 1102 YTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIGRIHQNFFSHPNLNDADVRSI 1161
Query: 1136 LAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG 1195
LA+EQR+ILAGCRIVFSR+FPVGEANPHLHPLWQ+AEQFGAVCT IDD+VTHVVANSLG
Sbjct: 1162 LASEQRRILAGCRIVFSRIFPVGEANPHLHPLWQSAEQFGAVCTNQIDDRVTHVVANSLG 1221
Query: 1196 TDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
TDKVNWAL TGR+VVHPGWVEASALLYRRA+E DFA+K
Sbjct: 1222 TDKVNWALQTGRYVVHPGWVEASALLYRRASEHDFAVK 1259
>gi|413920930|gb|AFW60862.1| hypothetical protein ZEAMMB73_799152, partial [Zea mays]
Length = 1234
Score = 842 bits (2175), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 551/1294 (42%), Positives = 721/1294 (55%), Gaps = 154/1294 (11%)
Query: 8 GEISD---TASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPA 64
GE SD + S+EEI+ +DFK K +R W PA
Sbjct: 25 GEGSDRDSSGSIEEITADDFK-------KDSSSALGGAAAAAGPRSRSWVAP------PA 71
Query: 65 I---CRGYGPGLHNLAWAQAVQNKPLN-----------EIFVMEAEQDDVSKRSSPASSV 110
+ R + ++ AW+QAV+NKPL E V ++ + +V
Sbjct: 72 VGYMARNFRYAFNSFAWSQAVRNKPLGLQPPAPDDDEVEHAVDVSDGEKEEGEIEEGEAV 131
Query: 111 ASVNSGAAA-------GKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKV 163
++ S A A D + E V ID S + EE E+ LD S
Sbjct: 132 EALASPAPAQPETIDLDSDAPEKSESVAIDGSASVVPVPAA--EEEEVNLDQRVGS---- 185
Query: 164 SEQVKEEMKLINVESIREALESVLRGDI-------SFEGVCSKLEFTLESLRELVN--EN 214
+ EE++++++E + + + SFEG C++L E+L+ L EN
Sbjct: 186 ---ILEELEMVSIEEAEKYMGICFMFFLEQRLCFRSFEGACARLHTCFENLKPLFQELEN 242
Query: 215 NVPTK--DALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQI 272
P + L+Q AF + ++ +V N +EQNK L +LL IK+ + + Q
Sbjct: 243 GSPMAILEPLMQQAFIGIDTLTTVANLYNLPRREQNKTTLLKLLFHIKNRYSDMLTPEQR 302
Query: 273 KEMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK 332
+EM++ + LV D D G + + + + N LP +S N
Sbjct: 303 EEMDSRVRKLVFGEKDNVSDPSTSCGTSAINVSAPSGQVSNTGG------LPFESGAANL 356
Query: 333 PLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSW 392
+ P R+ PLL+ H +D +SLPSPTR+ P P + +G G
Sbjct: 357 FSSLPRLEVPAKRNS----PLLNLHADYDENSLPSPTRDNAPPFPALKP--IGFGAFPMV 410
Query: 393 AAAAKLSHNAEVHKTPHYE--TDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDG 450
E K Y D L+A SSYQQK+G+ S + + +LPSPTPS + G
Sbjct: 411 PEKLSFLDRVEPTKNSLYPPLNDPLKAVSSYQQKYGQKSVYPSDDLPSPTPSGDEGK-PA 469
Query: 451 DTGGEI-SSATAVDQPKPVNMPTLGQQPVSSQP---------------MDISQPMDISSV 494
D GG+I S ++ PK + +P+ Q P S M SQP+ +SS
Sbjct: 470 DKGGDIFSDVSSFPVPKSIVLPSTSQMPASQPSTVSSSSISYASSTSQMAASQPITVSSS 529
Query: 495 QALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFA---SSNALNLNHQPAPILHN 551
+ A PN +KA KSRDPRLRF S+ A ++N + +
Sbjct: 530 GISYASGPPGFAKQIEQSTAGPNHAIKAASKSRDPRLRFLNRDSAGATDVNWRAN---FS 586
Query: 552 APKVEPVGRV-MSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTD 610
K +G V + +RKQK V++P +D ALKR R G N ++++ +G
Sbjct: 587 ELKDGNLGGVSVGNRKQKAVDDPQVDDNALKRFRGGIAN------QRDMQPTGN------ 634
Query: 611 MFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPAL 670
Q+MN S+ N + L P + P+V + P P P L
Sbjct: 635 --PNQLMNIRAPTHSSSINMKTL-----QPPQTTAPHVSAAPAVPLP----------PML 677
Query: 671 LKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNTMHPPI---PSSIPPVSVTCSIP 727
LKDIAVNP +L+++++M Q+K A+++Q + N + P + P ++ +P
Sbjct: 678 LKDIAVNPALLMHLIQMEHQKKSASESQGGMSSGMTNNGIAGMVFTPGNAPKITEAAQVP 737
Query: 728 S-----GILSKPM---DELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQ 778
S + + P+ ++ G VRMKPRDPRR+LH N Q+S ++ E K +G + P +Q
Sbjct: 738 SVRPQVPVQTPPLNSQNDGGIVRMKPRDPRRILHNNIAQKSDAMSLEQVKNNGTTQPDSQ 797
Query: 779 GSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQN 838
G+K+ + PV SQ L I + F+ + KH+ D +S SQ + M
Sbjct: 798 GTKD-----------QTTPVPSQPALPSSIARPFS-SAKHV-DPVSNSQLAATAIM---- 840
Query: 839 SPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGA-HPQSAWGDVEHLFEGYDDQQ 897
+P Q + D + V + T +G A + A P S WGDV+HL +GYDDQQ
Sbjct: 841 APTQALSSVNKVDPRLAVEQNGQNADATTNGASATTLEATQPVSPWGDVDHLLDGYDDQQ 900
Query: 898 KAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE 957
KA IQKER RR+ EQ KMFSARKLCLVLDLDHTLLNSAKF EV+P+H+E+LRKKEEQDR
Sbjct: 901 KALIQKERARRITEQHKMFSARKLCLVLDLDHTLLNSAKFIEVEPIHEEMLRKKEEQDRT 960
Query: 958 KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
P RHL+RF HM MWTKLRPGIW FL++AS LFE+HLYTMGNKLYATEMAKVLDP G LF
Sbjct: 961 LPERHLYRFHHMNMWTKLRPGIWNFLQKASNLFELHLYTMGNKLYATEMAKVLDPTGTLF 1020
Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
AGRVISRGDDGDPFD DERVPKSKDL+GVLGMESAVVIIDDSVRVWPHN+ NLIVVERYT
Sbjct: 1021 AGRVISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNRHNLIVVERYT 1080
Query: 1078 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILA 1137
YFPCSRRQFGL GPSLLEID DER EDGTLASSL VIER+H FFSH +L++ DVR+ILA
Sbjct: 1081 YFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHHNFFSHPNLNEADVRSILA 1140
Query: 1138 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 1197
+EQR+IL GCRIVFSRVFPVG+A+PHLHPLWQTAEQFGAVCT +DD+VTH+VANS GTD
Sbjct: 1141 SEQRRILTGCRIVFSRVFPVGDASPHLHPLWQTAEQFGAVCTNLVDDRVTHIVANSPGTD 1200
Query: 1198 KVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
KVNWALS G+FVVHPGWVEASALLYRRANE DFA
Sbjct: 1201 KVNWALSKGKFVVHPGWVEASALLYRRANEHDFA 1234
>gi|326532556|dbj|BAK05207.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 891
Score = 770 bits (1988), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/923 (49%), Positives = 576/923 (62%), Gaps = 76/923 (8%)
Query: 352 PLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLS-HNAEVHKTPHY 410
PLLD H +D SLPSPTR++ P PV + + G V A S E+ K Y
Sbjct: 4 PLLDLHADYDESSLPSPTRDSAPPFPVPKPIGFG---VFPMAPDRYFSVERVELSKKVLY 60
Query: 411 ET--DALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPV 468
+ DAL+ SSY+QK+G+ S F + +LPSPTPS++ GD D G I V
Sbjct: 61 PSVNDALKDVSSYRQKYGQTSTFASYDLPSPTPSDD-GDKSEDKDGGIF----------V 109
Query: 469 NMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAP 523
+P+ S+ P P SV +++ ++ A GY V P+ +K
Sbjct: 110 EVPSFSDSNKSAPPSGNLLPASRPSV-VISSNDSFAGGPPGYAKQIEQSVSGPSHALKPS 168
Query: 524 IKSRDPRLRFA---SSNALNLNHQPAPILHNAPKVEPVGRVMS--SRKQKTVEEPVLDGP 578
KSRDPRLRF S + N NA + + V+S SRK K +P++D
Sbjct: 169 AKSRDPRLRFLNRDSGGTADANRHVNLAEPNASRDGTLWGVVSDNSRKHKATGQPLIDES 228
Query: 579 ALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGAT 638
LKR R E +G RD + G G + + N++ +++ + ++ + N ++
Sbjct: 229 VLKRAR---ECAGNPRDMQVPPGRDGSNISSYSGDRVQSNQHTWLETKTAGNQLISNVSS 285
Query: 639 SPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQ 698
P ++G + S P T+ + VSLPA+LKDIAVNPT+L++ ++M Q++ ++ Q
Sbjct: 286 IPDSTGA--LHASQPNSFPQTSAAPIVSLPAVLKDIAVNPTVLMHWIQMEHQKRSPSEPQ 343
Query: 699 QKS--------NDSSMNTMHPP-----------IPSSIPPVSVTCSIPSGILSKPMDELG 739
S N+ + + P IPS P + T S+ S ++ G
Sbjct: 344 PASGIISSGMINNVTAGMVISPGNALKTAEVAHIPSYRPQATSTASVNS------QNDPG 397
Query: 740 KVRMKPRDPRRVLHGNALQRSGSLGP-EFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPV 798
+RMK RDPRRVLH N ++ + + K++G + P Q SK+NL ++Q+ V
Sbjct: 398 VIRMKSRDPRRVLHNNTSHKNDTPNSDQAKSNGIALPANQDSKDNLINREQVAEQLQTIV 457
Query: 799 L-SQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVT 857
L SQ V I +Q T + + D +S SQ S + Q + + + AD +
Sbjct: 458 LPSQPVSSSSIARQSTMSASKV-DSVSNSQLAASSLIAPQETLVSINR----ADPRVATG 512
Query: 858 NHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFS 917
+D + P P + WGD++ L GYDDQQKA IQKER RR+ EQ MFS
Sbjct: 513 QNDSNDAAPATTLGTRP----PANQWGDLDDLLNGYDDQQKALIQKERARRIMEQHTMFS 568
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
+RKLCLVLDLDHTLLNSAKF EVDP+H+EIL KKEEQDRE+ RHLFRF HM MWTKLRP
Sbjct: 569 SRKLCLVLDLDHTLLNSAKFIEVDPIHEEILWKKEEQDRERSERHLFRFHHMQMWTKLRP 628
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG-------DP 1030
GIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDP G LFAGRVISRG DG D
Sbjct: 629 GIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGTLFAGRVISRGGDGISRGGDGDT 688
Query: 1031 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
FD D+RVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK N+IVVERYTYFPCSRRQFGL G
Sbjct: 689 FDSDDRVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKNNMIVVERYTYFPCSRRQFGLPG 748
Query: 1091 PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIV 1150
PSLLEID DER EDGTLASSL VI R+H+ FFSH +L+D DVR+ILA+EQR+ILAGCRIV
Sbjct: 749 PSLLEIDRDERPEDGTLASSLAVIGRIHQNFFSHPNLNDADVRSILASEQRRILAGCRIV 808
Query: 1151 FSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVV 1210
FSR+FPVGEANP LHPLWQTAEQFGAVCT IDD+VTHVVANSLGTDKVNWAL TGRFVV
Sbjct: 809 FSRIFPVGEANPQLHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALQTGRFVV 868
Query: 1211 HPGWVEASALLYRRANEQDFAIK 1233
HPGWVEASALLYRRANE DFA+K
Sbjct: 869 HPGWVEASALLYRRANEHDFAVK 891
>gi|115485681|ref|NP_001067984.1| Os11g0521900 [Oryza sativa Japonica Group]
gi|113645206|dbj|BAF28347.1| Os11g0521900 [Oryza sativa Japonica Group]
Length = 664
Score = 738 bits (1905), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/669 (58%), Positives = 476/669 (71%), Gaps = 45/669 (6%)
Query: 601 GSGGWLED--------TDMFEPQIMNRNLLVDSAESNSRKLDNGAT--------SPITSG 644
G GGW +D +D F+P N+N + + + + + +T + +
Sbjct: 5 GRGGWAKDGGNISSYSSDGFQP---NQNTRLGNNTTGNHNIRTDSTLASNLNNTTNNSGT 61
Query: 645 TPNVVVS-GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSND 703
+P +V + AP T+ + VSLPA+LKDIAVNPTML+ ++M QQ+ A++ QQK
Sbjct: 62 SPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQMEQQKMSASEPQQKVTA 121
Query: 704 SSMNT------MHPPIPSSIPPVSVTCSIPS-----GILSKPM---DELGKVRMKPRDPR 749
S T M P+ + P + ++PS + S PM ++ G +RMKPRDPR
Sbjct: 122 SVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAPMHSQNDTGVIRMKPRDPR 180
Query: 750 RVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQ 805
R+LH N +Q++ ++ P + K++G + P +Q SK++L Q Q A + + + S+
Sbjct: 181 RILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQ-KAEQLQAIALPSLPV 239
Query: 806 PDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMK-AVVTNHDDKQT 864
+ T N A+ +S SQ + M + Q + AD + A N +
Sbjct: 240 TSSARPVTMN----ANPVSNSQLAATALMPPHGNTKQTSSSVNKADPRLAAGQNESNDDA 295
Query: 865 GTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLV 924
T +GP P P S +GDV+HL +GYDDQQKA IQKER RR++EQ KMF+ARKLCLV
Sbjct: 296 ATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERARRIKEQHKMFAARKLCLV 355
Query: 925 LDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLE 984
LDLDHTLLNSAKF EVD +H EILRKKEEQDRE+ RHLF F HMGMWTKLRPGIW FLE
Sbjct: 356 LDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCFNHMGMWTKLRPGIWNFLE 415
Query: 985 RASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLE 1044
+ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRGDDGDPFD DERVPKSKDL+
Sbjct: 416 KASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLD 475
Query: 1045 GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSED 1104
GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQFGL GPSLLEID DER ED
Sbjct: 476 GVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPED 535
Query: 1105 GTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHL 1164
GTLASSL VIER+HK FFSH +L+D DVR+ILA+EQ++IL GCRIVFSR+FPVGEANPH+
Sbjct: 536 GTLASSLAVIERIHKNFFSHPNLNDADVRSILASEQQRILGGCRIVFSRIFPVGEANPHM 595
Query: 1165 HPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 1224
HPLWQTAEQFGAVCT IDD+VTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR
Sbjct: 596 HPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 655
Query: 1225 ANEQDFAIK 1233
A+E DFA+K
Sbjct: 656 ASELDFAVK 664
>gi|2459436|gb|AAB80671.1| unknown protein [Arabidopsis thaliana]
Length = 1066
Score = 723 bits (1865), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 493/1093 (45%), Positives = 632/1093 (57%), Gaps = 156/1093 (14%)
Query: 52 VWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVA 111
VWTM +L ++YPA GL NLAWA+AVQNKP NE VM+ E P S
Sbjct: 56 VWTMEELISQYPAYRPYANSGLSNLAWARAVQNKPFNEGLVMDYE---------PRES-- 104
Query: 112 SVNSGAAAGKDDKKVVEKVVIDDSGDE----------IEKEEGELEEGEIELDLESE--- 158
+K+VI+DS DE I+ + ++ +E D ES
Sbjct: 105 ----------------DKIVIEDSDDEKEEGELEEGEIDLVDNASDDNLVEKDTESVVLI 148
Query: 159 SNEKVSEQ--VKEEMKLINVESIREALES--VLRGDISFEGVCSKLEFTLESLRELVNEN 214
S +KV + +KE V+ IR LES ++ FEGVCS++ LESLRELV++N
Sbjct: 149 SADKVEDDRILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRILGALESLRELVSDN 208
Query: 215 N-VPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIK 273
+ P +D L+QL+F+++Q+++ VFCSMN++ KE+NKE +SRLL+L+ H S NQ
Sbjct: 209 DDFPKRDTLVQLSFASLQTINYVFCSMNNISKERNKETMSRLLTLVNDHFSQFLSFNQKN 268
Query: 274 EMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK- 332
E+E M D + +A+ + N VN + P DS + K
Sbjct: 269 EIETM-------NQDLSRSAIAVFAGTSSEEN------VNQMT----QPSNGDSFLAKKL 311
Query: 333 PLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSW 392
E++ G RSR +LPLLD HK HD DSLPSPTRETTPS+PV MV+
Sbjct: 312 TSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLPSPTRETTPSLPVNGRHT----MVRPG 367
Query: 393 AAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDT 452
+ S E K YE+DA +A S+YQQKFG NS F +LPSPTPS E DG+GD
Sbjct: 368 FPVGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDV 427
Query: 453 GGEISSATAV------------DQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTA 500
GGE+SS+ D P P N + PV++ P +S + A+
Sbjct: 428 GGEVSSSVVKSSNPGSHLIYGQDVPLPSNFNSRSM-PVANSVSSTVPPHHLS-IHAI--- 482
Query: 501 NNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLN--HQPAPILHNAPKVEPV 558
SAP +S + VK KSRDPRLR A +A N+ + N KVE
Sbjct: 483 --SAPTAS--------DQTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDARNLSKVELS 532
Query: 559 GRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMN 618
+++ RKQK +E ++DGPA KRQ++ D G+GGWLEDT+ +
Sbjct: 533 ADLVNPRKQKAADEFLIDGPAWKRQKSD-------TDAPKAAGTGGWLEDTE-------S 578
Query: 619 RNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNP 678
LL ES R ++NG TS +S P VS ++ T + T SL +LLKDIAVNP
Sbjct: 579 SGLL--KLESKPRLIENGVTSMTSSVMPTSAVSVSQKV-RTASTDTASLQSLLKDIAVNP 635
Query: 679 TMLLNILKMGQQQKLAADAQQKSNDSSMNTMHPPIPSSIPPVSVTCSIP----------- 727
TMLLN+LKMG++QK+ A QK D P S P VS SIP
Sbjct: 636 TMLLNLLKMGERQKVPEKAIQKPMDPR-RAAQLPGSSVQPGVSTPLSIPASNALAANSLN 694
Query: 728 SGIL-----SKPMDELGKVRMKPRDPRRVLHGNALQRS-GSLGPEFKTDGPSAPCT---Q 778
SG+L + P E G +RMKPRDPRR+LHG+ LQR+ S+ + K + PS T +
Sbjct: 695 SGVLQDSSQNAPAAESGSIRMKPRDPRRILHGSTLQRTDSSMEKQTKVNDPSTLGTLTMK 754
Query: 779 GSKENLNFQKQLGAPE-------AKPVLSQSVLQ---PDITQQFTKNLKHIADFMSVSQP 828
G E+L QL + +K +S +L PD + QFTKNLK IAD + VSQ
Sbjct: 755 GKAEDLETPPQLDPRQNISQNGTSKMKISGELLSGKTPDFSTQFTKNLKSIADMVVVSQQ 814
Query: 829 LTSEPMVSQNSPIQPGQIKSGADMKAVVTN-----HDDKQTGTGSGPEAGPVGAHPQSAW 883
L + P + + Q+K+ D+K +N D + AGP + ++W
Sbjct: 815 LGNPP-----ASMHSVQLKTERDVKHNPSNPNAQDEDVSVSAASVTAAAGPTRS--MNSW 867
Query: 884 GDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 943
GDVEHLFEGYDD Q+ AIQ+ER RRLEEQ KMF+++KL LVLD+DHTLLNSAKF+EV+
Sbjct: 868 GDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESR 927
Query: 944 HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
H+EILRKKEEQDREKP+RHLFRF HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYA
Sbjct: 928 HEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 987
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 1063
TEMAK+LDPKGVLF GRVIS+GDDGDP DGDERVPKSKDLEGV+GMES+VVIIDDSVRVW
Sbjct: 988 TEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVW 1047
Query: 1064 PHNKLNLIVVERY 1076
P +K+NLI VERY
Sbjct: 1048 PQHKMNLIAVERY 1060
>gi|357478637|ref|XP_003609604.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355510659|gb|AES91801.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 1064
Score = 603 bits (1556), Expect = e-169, Method: Compositional matrix adjust.
Identities = 282/369 (76%), Positives = 310/369 (84%), Gaps = 7/369 (1%)
Query: 867 GSGPEAGPVGA-HPQSAWG-DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLV 924
GS E G+ P + W +VEHL EGYD QQKA IQ+ER RRLEEQ KMF+ARKLCLV
Sbjct: 691 GSAHETCASGSCQPHNTWAANVEHLLEGYDAQQKAVIQRERARRLEEQNKMFAARKLCLV 750
Query: 925 LDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLE 984
LD+DHTLLNSAKF EVDP HD+ILRKKE+Q+R KP RHLFR PHMGMWTKLRPG+W FLE
Sbjct: 751 LDIDHTLLNSAKFVEVDPEHDKILRKKEKQERGKPRRHLFRLPHMGMWTKLRPGVWNFLE 810
Query: 985 RASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLE 1044
+ASKLFEMHLYTMGNKLYATEMAKVLDP GVLFAGRVISRGDD + D K KDLE
Sbjct: 811 KASKLFEMHLYTMGNKLYATEMAKVLDPNGVLFAGRVISRGDDPETVD-----IKCKDLE 865
Query: 1045 GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSED 1104
GVLG+ES+VVIIDDS RVWPHN+LNLI VERY YF CSRRQFGL GPSL EIDHDER
Sbjct: 866 GVLGLESSVVIIDDSPRVWPHNQLNLITVERYIYFLCSRRQFGLSGPSLFEIDHDERPGA 925
Query: 1105 GTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHL 1164
GTLASSLGVIER+H+ FF+ QSL+++DVRNILA+EQRKIL GCRIVFS VFPVGE NPHL
Sbjct: 926 GTLASSLGVIERIHQNFFASQSLEEMDVRNILASEQRKILGGCRIVFSGVFPVGETNPHL 985
Query: 1165 HPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 1224
HPLW+TAEQFGA CT +D QVTHVVA S GTDKVNW +S G+FVV+P WVEAS LLYRR
Sbjct: 986 HPLWRTAEQFGASCTNKVDPQVTHVVAQSPGTDKVNWGISNGKFVVYPNWVEASTLLYRR 1045
Query: 1225 ANEQDFAIK 1233
NEQDFA+K
Sbjct: 1046 MNEQDFAVK 1054
Score = 245 bits (625), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 255/803 (31%), Positives = 381/803 (47%), Gaps = 174/803 (21%)
Query: 3 KDVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKP-------IKVGGGEAAARVWTM 55
+DVEEGEISDT+SV+ I E+D K + VVKV + K IK G +RV
Sbjct: 6 EDVEEGEISDTSSVKVIIEKDLN-KVDHVVKVDSDVKSNNNNIDKIKTCGN---SRVL-- 59
Query: 56 RDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNS 115
DL N Y + G GL+NLAWAQAVQNKPLN+IF ME ++D +S + N+
Sbjct: 60 -DLQNFYSSCYYASGGGLYNLAWAQAVQNKPLNDIFAMEIDKD----TDVNVTSNTNSNN 114
Query: 116 GAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEG-EIELDLESESNEKVSEQ----VKEE 170
K K+V+ D E+E+ E ++++ + +S E VSE V++
Sbjct: 115 NDDLNKPLKEVIFVDDDDKEEGELEEGEIDVDDDTNCAIVGGGDSFENVSESDVIGVRDV 174
Query: 171 MKLINVESIREALESVLRGDISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAV 230
+K I+V ++ E SF C++++ L+S ++ + KD L+ L F+AV
Sbjct: 175 LKCISVANVSE----------SFAETCTRIQSALQS--KVFSGIAGSEKDDLVCLLFNAV 222
Query: 231 QSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKE 290
+ V+SV SM++ KE+NK+ + RLLS +K+ LF+ +KE+ AM++ + N
Sbjct: 223 EVVYSVLFSMDNFQKEENKDNILRLLSFLKNERAHLFTPEHMKEIHAMITLVDVSGN--- 279
Query: 291 KDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVL 350
++ N +EK +++L + + + G R
Sbjct: 280 ----------------------SEANSEEK---KLEALDETRKI-------LGLR----- 302
Query: 351 LPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGM-------VKSWAAAAKLSHNAE 403
HD+D+LPS T+E VPV + VGDG VK+ A K+ + +
Sbjct: 303 ---------HDLDNLPSLTQE----VPVNKLFSVGDGTDRFGLPPVKT--EAEKMELDGK 347
Query: 404 VHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVD 463
+K +ETDAL+A S+ QQKF R+SFF + E PSPTPS + G DT E+SSA+
Sbjct: 348 DYKLHIHETDALKAASTCQQKFSRSSFFTDDEFPSPTPSGDCEGGAVDTNDEVSSASIAS 407
Query: 464 QPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAP 523
P L Q VSS ++ S + + + + S PA K
Sbjct: 408 SLTSSKPPPLDQMLVSSTYINRSNMHGLINSRIDASGAGSYPA--------------KTS 453
Query: 524 IKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQ 583
+KSRDPRLRF N++ Q + + PKVE V+ SRK+KTVEE LD A KR
Sbjct: 454 VKSRDPRLRF------NISDQ-SSTKNIMPKVEYAEGVI-SRKRKTVEESSLDATAPKRL 505
Query: 584 RNGFENSG-VVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPIT 642
ENS R+E+ + GGWL +N S +T
Sbjct: 506 TRSLENSQHNSREEQTMDAKGGWLA--------------------------ENTVASNLT 539
Query: 643 SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAA------- 695
+ + +GNE AP + L AL +VN TMLLN L + Q+LA
Sbjct: 540 TTS-----NGNEQAPVISSCAATPLLALFNSESVNSTMLLNKL-LDIHQRLAEVKRPINF 593
Query: 696 ---------DAQQKSNDSSMNTMHPPIPSSIPPVSVT---CSIPSGILSKPMD-ELGKVR 742
+ +S++NT P + S +P S+ S P+ +++ + + K+
Sbjct: 594 ATSALHLTNSNSARGTNSTVNT-SPTMTSGVPQNSIGMLPTSSPTTSMAQTLQVDSEKIC 652
Query: 743 MKPRDPRRVLHGNA-LQRSGSLG 764
+KPRDPRR LH ++ +Q+SGSLG
Sbjct: 653 LKPRDPRRSLHASSTVQKSGSLG 675
>gi|302761896|ref|XP_002964370.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
gi|300168099|gb|EFJ34703.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
Length = 766
Score = 506 bits (1303), Expect = e-140, Method: Compositional matrix adjust.
Identities = 232/355 (65%), Positives = 279/355 (78%), Gaps = 5/355 (1%)
Query: 885 DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 944
+++ D+ ++ A KER RR++EQ KM S +KLCLVLDLDHTLLNSAKF E++
Sbjct: 412 ELQEFLIDLDEAERIAFIKERQRRMDEQDKMLSEKKLCLVLDLDHTLLNSAKFMEIEQEW 471
Query: 945 DEILRKKEEQDREK-----PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGN 999
D LR E +R K R L+RFP+M MWTKLRPGIW FL RAS+L+E+HLYTMGN
Sbjct: 472 DRFLRATETIERNKDAKEGTRRELYRFPYMSMWTKLRPGIWRFLARASQLYELHLYTMGN 531
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 1059
K YATEMAK+LDP GVLFAGRVIS+GDDGD GDE+ P+SKDL+GVLGMESAV+IIDDS
Sbjct: 532 KAYATEMAKLLDPTGVLFAGRVISKGDDGDALYGDEKTPRSKDLDGVLGMESAVLIIDDS 591
Query: 1060 VRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHK 1119
RVWPH+K NLIVVERY YFPCSR+QFGL GPSLLE+ HDER DG LAS LGV+ER+H+
Sbjct: 592 ARVWPHHKDNLIVVERYMYFPCSRKQFGLPGPSLLEVGHDEREADGMLASILGVVERVHE 651
Query: 1120 IFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT 1179
F+S +VD+R +L+ QR+IL GC+I+FSRVFPV E P LHPLW+ AEQFGAVCT
Sbjct: 652 EFYSRPLPKEVDIREVLSVVQRRILGGCKIIFSRVFPVEETQPQLHPLWRMAEQFGAVCT 711
Query: 1180 KHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 1234
+++ VTHVVA S+GTDK NWAL+TGRF+V P WVEAS +LYRRANE+DF + P
Sbjct: 712 TRMEEDVTHVVAISMGTDKSNWALATGRFLVRPAWVEASTVLYRRANERDFPVPP 766
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 41/176 (23%), Positives = 75/176 (42%), Gaps = 14/176 (7%)
Query: 158 ESNEKVSEQVKEEMKLINVESIREALESVLRGDISFEGVCSKLEFTLESLRELVNENNVP 217
+S K ++ +KL+N S+ A +S C++L L+ L EL +
Sbjct: 77 DSESKHDNSIQTVVKLVNNVSVGNACKSP-------NDACAQLHDALQILEELDQPSKKS 129
Query: 218 TKDALIQ-LAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEME 276
L+Q L + + V+C N + EQ +L L L K + LF++ Q++E++
Sbjct: 130 PNCELVQSLVLKILDKIRIVYCVYNAIGDEQKPGVLQSLAKLAKFYTDKLFTAKQVEELK 189
Query: 277 AMLSS----LVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSL 328
A+ + L T ++ KE + G G + T +++ P+D L
Sbjct: 190 ALYEAVNPKLETSSDGKEHAYIPWDG--GTSAEHPTSSSLTSYTHYMSASFPMDDL 243
>gi|302768485|ref|XP_002967662.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
gi|300164400|gb|EFJ31009.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
Length = 762
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 232/355 (65%), Positives = 279/355 (78%), Gaps = 5/355 (1%)
Query: 885 DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 944
+++ D+ ++ A KER RR++EQ KM S +KLCLVLDLDHTLLNSAKF E++
Sbjct: 408 ELQEFLIDLDEAERIAFIKERQRRMDEQDKMLSEKKLCLVLDLDHTLLNSAKFMEIEQEW 467
Query: 945 DEILRKKEEQDREK-----PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGN 999
D LR E +R K R L+RFP+M MWTKLRPGIW FL RAS+L+E+HLYTMGN
Sbjct: 468 DRFLRATETIERNKDAKEGTRRELYRFPYMSMWTKLRPGIWRFLARASQLYELHLYTMGN 527
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 1059
K YATEMAK+LDP GVLFAGRVIS+GDDGD GDE+ P+SKDL+GVLGMESAV+IIDDS
Sbjct: 528 KAYATEMAKLLDPTGVLFAGRVISKGDDGDALYGDEKTPRSKDLDGVLGMESAVLIIDDS 587
Query: 1060 VRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHK 1119
RVWPH+K NLIVVERY YFPCSR+QFGL GPSLLE+ HDER DG LAS LGV+ER+H+
Sbjct: 588 ARVWPHHKDNLIVVERYMYFPCSRKQFGLPGPSLLEVGHDEREADGMLASILGVVERVHE 647
Query: 1120 IFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT 1179
F+S +VD+R +L+ QR+IL GC+I+FSRVFPV E P LHPLW+ AEQFGAVCT
Sbjct: 648 EFYSRPLPKEVDIREVLSVVQRRILGGCKIIFSRVFPVEETQPQLHPLWRMAEQFGAVCT 707
Query: 1180 KHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 1234
+++ VTHVVA S+GTDK NWAL+TGRF+V P WVEAS +LYRRANE+DF + P
Sbjct: 708 TRMEEDVTHVVAISMGTDKSNWALATGRFLVRPAWVEASTVLYRRANERDFPVPP 762
Score = 45.1 bits (105), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 18/176 (10%)
Query: 158 ESNEKVSEQVKEEMKLINVESIREALESVLRGDISFEGVCSKLEFTLESLRELVNENNVP 217
+S K ++ +KL+N S+ A +S C++L L+ L EL + P
Sbjct: 77 DSESKHDNSIQTVVKLVNNVSVGNACKSP-------NDACAQLHDALQILEEL----DQP 125
Query: 218 TKDALIQ-LAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEM- 275
+ L+Q L + + V+C N + EQ +L L L K + LF++ Q++E+
Sbjct: 126 SNCELVQSLVLKILDKIRIVYCVYNAIGDEQKPGVLQSLAKLAKFYTDKLFTAKQVEELK 185
Query: 276 ---EAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSL 328
EA+ L T ++ KE + G G T +++ P+D+L
Sbjct: 186 GLYEAVNPKLETSSDGKEHAYIPWDG--GTSGEHPTSSSLTSYTHYMSASFPMDAL 239
>gi|168018017|ref|XP_001761543.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687227|gb|EDQ73611.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1984
Score = 494 bits (1271), Expect = e-136, Method: Compositional matrix adjust.
Identities = 263/521 (50%), Positives = 335/521 (64%), Gaps = 66/521 (12%)
Query: 736 DELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLNFQKQLGAPEA 795
+++G+ RM+PRDPRR+L NA++ + + + + P + + Q +PE
Sbjct: 1303 EDVGRHRMRPRDPRRILLENAVETA-------QVNPMNVPVNDAAGSEMTLQYTNRSPEV 1355
Query: 796 KPVLSQSVLQPDITQQFTKNL----------------KHIADFMSVSQPLTSEPMVSQNS 839
V P++ Q + N +D VS L++E ++
Sbjct: 1356 ADV------SPNLNNQPSTNTLGNPPNQRDPRLNPYESSQSDSAVVSIQLSTEHKTTEWK 1409
Query: 840 PIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQ-------------SAWG-- 884
+ +S D ++ ++ + SG G G H S WG
Sbjct: 1410 TLDERIKESERD------SNRNQGSEVSSGESTGDKGDHLHPWDPLLRKPRFGPSHWGGN 1463
Query: 885 ----DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV 940
D E L E D++Q+ AIQ ER RRL+EQ +MF A KLCLVLDLDHTLLNSAKF E+
Sbjct: 1464 DMHRDFEQLLEDLDEKQRIAIQNERKRRLQEQDRMFIAGKLCLVLDLDHTLLNSAKFSEI 1523
Query: 941 DPVHDEILRKKEEQDREKP------HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHL 994
+P + LR+ E +R + + L+RFPHM MWTKLRPGIW FL +AS+L+E+H+
Sbjct: 1524 EPEWEARLRQAENMERSRALKDPSMKQELYRFPHMSMWTKLRPGIWKFLAKASELYELHV 1583
Query: 995 YTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV 1054
YTMGNK YATEMAK+LDP G LFAGRVIS+GD+ D D KSKDL+GVLGMESAVV
Sbjct: 1584 YTMGNKAYATEMAKLLDPTGTLFAGRVISKGDEVDGSD------KSKDLDGVLGMESAVV 1637
Query: 1055 IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVI 1114
IIDDS RVWPH++ NLIVVERY YFP SRRQFGLLGPSLLE+ HDER+ DG L+S+ GVI
Sbjct: 1638 IIDDSSRVWPHHRENLIVVERYMYFPSSRRQFGLLGPSLLEVGHDERAADGMLSSASGVI 1697
Query: 1115 ERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQF 1174
+R+HK FFS++ L +VDVR ILAAEQR++L GCR++FSR+FPVGEANPHLHPLW+ AEQF
Sbjct: 1698 DRIHKNFFSNKRLREVDVRAILAAEQRRVLDGCRVLFSRIFPVGEANPHLHPLWRLAEQF 1757
Query: 1175 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 1215
GA C HI+D+VTHVVA SLGTDKVNWA +TGR VV P W
Sbjct: 1758 GASCCLHINDKVTHVVAISLGTDKVNWAAATGRPVVRPAWC 1798
>gi|168040198|ref|XP_001772582.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676137|gb|EDQ62624.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1881
Score = 489 bits (1258), Expect = e-135, Method: Compositional matrix adjust.
Identities = 268/541 (49%), Positives = 337/541 (62%), Gaps = 67/541 (12%)
Query: 719 PVSVTCSIPSGILSKPMDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQ 778
P + + G + ++GK RM+PRDPRR L +A + + S P +
Sbjct: 1325 PFGINSTKSEGQAGEEEQDIGKHRMRPRDPRRALLDSA-------ADIVQVNQRSPPIIE 1377
Query: 779 GSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIAD--------------FMS 824
+ Q + G S V QP I +N ++ D ++
Sbjct: 1378 AADSGTTLQIETGTSLPTNTSSDLVKQPSINS--LENPLNLRDPRLSSNNSTQSNNATLA 1435
Query: 825 VSQPLTS--------EPMV-----------SQNSPIQPGQIKSGADMKAVVTNHDDKQTG 865
QP T EP+V S+N I ++ SG + V +H
Sbjct: 1436 PEQPSTEQKNMTVEEEPVVDERNNARERESSRNQGIDAREVFSGESILDEV-DHLHPWDP 1494
Query: 866 TGSGPEAGPVGAHPQSAWG------DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSAR 919
P GP S WG D E L E D+ Q+ IQ ER RR++EQ +MFSA
Sbjct: 1495 VLRKPRFGP------SHWGGSNLHRDFEQLLEDLDEDQRITIQNERKRRIQEQDRMFSAG 1548
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRFPHMGMWT 973
KLCLVLDLDHTLLNSAKF E++P + LR+ E +R + + L+RFPHM MWT
Sbjct: 1549 KLCLVLDLDHTLLNSAKFSEIEPEFEARLRQAENMERSRSTKDPNMKQELYRFPHMSMWT 1608
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
KLRPGIW FL +AS+L+E+H+YTMGNK YATEMAK+LDP G+LF+GRVIS+GD+ D D
Sbjct: 1609 KLRPGIWKFLAKASELYELHVYTMGNKAYATEMAKLLDPTGILFSGRVISKGDEVDGSD- 1667
Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1093
KSKDL+GVLGMESAVVIIDDS RVWPH++ NLIVVERY YFP SRRQFGLLGPSL
Sbjct: 1668 -----KSKDLDGVLGMESAVVIIDDSSRVWPHHRENLIVVERYMYFPSSRRQFGLLGPSL 1722
Query: 1094 LEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSR 1153
LE+ HDER+ DG L+S+ GVI+R+H+ FFS++ L +VDVR ILAAEQR++L GCR++FSR
Sbjct: 1723 LEVGHDERAVDGMLSSASGVIDRIHRNFFSNKKLREVDVRAILAAEQRRVLDGCRVLFSR 1782
Query: 1154 VFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPG 1213
+FPVGEANPHLHPLW+ AEQFGA C +I+D+VTHVVA SLGTDKVNWA +TGR VV P
Sbjct: 1783 IFPVGEANPHLHPLWRLAEQFGASCCLYINDKVTHVVAISLGTDKVNWATATGRPVVRPT 1842
Query: 1214 W 1214
W
Sbjct: 1843 W 1843
>gi|224075473|ref|XP_002304648.1| predicted protein [Populus trichocarpa]
gi|222842080|gb|EEE79627.1| predicted protein [Populus trichocarpa]
Length = 238
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 215/238 (90%), Positives = 224/238 (94%)
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 1056
MGNKLYATEMAKVLDPKGVLFAGRV+SRGDDGD DGDERVPKSKDLEGVLGMES VVII
Sbjct: 1 MGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDLLDGDERVPKSKDLEGVLGMESGVVII 60
Query: 1057 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIER 1116
DDS+RVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER EDGTLA SL VIER
Sbjct: 61 DDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIER 120
Query: 1117 LHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGA 1176
+H+ FF+H SLD+ DVRNILA+EQRKILAGCRIVFSRVFPVGE NPHLHPLWQ+AEQFGA
Sbjct: 121 IHQNFFTHHSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGA 180
Query: 1177 VCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 1234
VCT ID+QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP
Sbjct: 181 VCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 238
>gi|168059994|ref|XP_001781984.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666557|gb|EDQ53208.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 563
Score = 287 bits (735), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 151/334 (45%), Positives = 208/334 (62%), Gaps = 18/334 (5%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
R E +++ +KL LV+DLDHT+LNSA+F EV P + I + L +
Sbjct: 173 RNAELRRVTGKQKLLLVVDLDHTMLNSARFSEV-PAEERIYLTWTAGQQHGRVSSLHQLT 231
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
+GMWTKLRP FLE ASKL+EM++YTMG K+YA MA++LDP G LF GR+IS+ D
Sbjct: 232 KLGMWTKLRPFAHKFLEEASKLYEMYVYTMGEKIYAQAMAELLDPTGQLFGGRIISQTDS 291
Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
+KDL+ VLG ESAVVI+DD+ VWP+++ NLI++ERY +F S QF
Sbjct: 292 TKR--------HTKDLDVVLGAESAVVILDDTEAVWPNHRSNLILMERYHFFTSSCHQFR 343
Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS-HQS--------LDDVDVRNILAA 1138
+ PSL ++ DE DGTLA++L ++ +H FF+ H+ L+ DVR+++ +
Sbjct: 344 VRAPSLAQMHRDECEIDGTLATTLKTLQAIHHEFFNGHKGKSMKRRPPLELPDVRDVIRS 403
Query: 1139 EQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDK 1198
+ K+L+GC IVFSR+FP G NP HP WQ A + GA C+ D THVVA GTDK
Sbjct: 404 IRGKLLSGCHIVFSRIFPTGLQNPEFHPFWQLAVELGARCSTVCDHTTTHVVALDRGTDK 463
Query: 1199 VNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
WA G +VHP WVEA++ L++R E+DF +
Sbjct: 464 ARWAKQHGISLVHPRWVEAASYLWKRPREKDFPV 497
>gi|384247094|gb|EIE20582.1| hypothetical protein COCSUDRAFT_57726 [Coccomyxa subellipsoidea
C-169]
Length = 1018
Score = 285 bits (729), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 146/334 (43%), Positives = 205/334 (61%), Gaps = 22/334 (6%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP--HRHLFRFPHMGMW 972
+ R+LCLVLDLDHTL+NSAKF EV+P H ++L ++ +++ P + L R + MW
Sbjct: 693 LLRQRRLCLVLDLDHTLVNSAKFSEVEPEHLKLLERQLQREAALPAEEKRLHRLDRIAMW 752
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
T LRPG+ L + LF++ + T ++ YA MA++LDP G LF R+IS+GDDG
Sbjct: 753 TALRPGLRQMLAAVAPLFQLWIQTNASRAYALAMAELLDPTGELFGQRIISKGDDGSAL- 811
Query: 1033 GDERVPKSKDL-EGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 1091
+ SK L +G+ E+ +I+DDS VW H+ NL+ VERYTYFP SRRQ L GP
Sbjct: 812 ----INHSKRLMQGLEECEAVCIIVDDSDDVWRHHAHNLLHVERYTYFPSSRRQLNLRGP 867
Query: 1092 SLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD-----------DVDVRNILAAEQ 1140
S LE DE + G LA +LGV+ R+H F+ +LD DVR++L +
Sbjct: 868 SFLEAHKDECDKTGILAVTLGVLLRVHIAVFA--ALDAPPTAGIREEHHWDVRHVLGLLR 925
Query: 1141 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVN 1200
+++L G R++FS+VFP+G+A P W+ AE +GA CT +D+ VTHVVA S GT K
Sbjct: 926 KQVLLGVRVLFSKVFPLGQA-PSEQLYWKQAEAYGASCTSQLDEHVTHVVALSRGTHKAQ 984
Query: 1201 WALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 1234
WAL G+ VV P W+E S L++RA E+ + P
Sbjct: 985 WALQAGKHVVSPAWLECSCTLWQRAKERAYPAPP 1018
>gi|255570505|ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
Length = 478
Score = 280 bits (715), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 148/327 (45%), Positives = 205/327 (62%), Gaps = 12/327 (3%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
R + K + RKL LVLDLDHTLLNS + + +E L+ + + ++ + LF
Sbjct: 154 RNTDMKNLLRHRKLYLVLDLDHTLLNSTQLMHL-TAEEEYLKSQIDSMQDVSNGSLFMVD 212
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
M M TKLRP I TFL+ AS++FEM++YTMG++ YA EMAK LDP F RVISR D
Sbjct: 213 FMHMMTKLRPFIRTFLKEASQMFEMYIYTMGDRAYALEMAKFLDPGREYFNARVISRDD- 271
Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
G +R K D+ VLG ESAV+I+DD+ W +K NLI++ERY +F S RQFG
Sbjct: 272 -----GTQRHQKGLDI--VLGQESAVLILDDTENAWTKHKDNLILMERYHFFASSCRQFG 324
Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS--HQSLDDVDVRNILAAEQRKILA 1145
SL ++ DE DG LAS L V+ R+H IFF ++D DVR +L+ ++ +L
Sbjct: 325 FECKSLSQLKSDENESDGALASVLKVLRRIHHIFFDELEDAIDGRDVRQVLSTVRKDVLK 384
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GC+IVFSRVFP + H LW+ AEQ GA C++ +D VTHVV+ GT+K WAL
Sbjct: 385 GCKIVFSRVFPT-QFQADNHHLWKMAEQLGATCSREVDPSVTHVVSAEAGTEKSRWALKN 443
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAI 1232
+F+VHP W+EA+ +++R E++F++
Sbjct: 444 DKFLVHPRWIEATNYMWQRQPEENFSV 470
>gi|115463681|ref|NP_001055440.1| Os05g0390500 [Oryza sativa Japonica Group]
gi|57863785|gb|AAS86390.2| unknown protein [Oryza sativa Japonica Group]
gi|113578991|dbj|BAF17354.1| Os05g0390500 [Oryza sativa Japonica Group]
gi|215695102|dbj|BAG90293.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631469|gb|EEE63601.1| hypothetical protein OsJ_18418 [Oryza sativa Japonica Group]
Length = 536
Score = 279 bits (713), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 199/323 (61%), Gaps = 13/323 (4%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K + RKL L+LDLDHTL+NS K ++ +E+ + ++ P R LF M M
Sbjct: 163 KNLLRERKLVLILDLDHTLINSTKLFDLSAAENELGIQSAAKEV-VPDRSLFTLETMQML 221
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRP + FL+ AS +FEM++YTMG+K YA E+AK+LDP V F +VIS D
Sbjct: 222 TKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSD------ 275
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
+R K D+ VLG ES VI+DD+ VW +K NLI++ERY YF S RQFG S
Sbjct: 276 CTQRHQKGLDV--VLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGARS 333
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRI 1149
L E DER DG LA+ L V+ER+H IFF + L DVR ++ ++++L GC++
Sbjct: 334 LSETMQDERENDGALATILDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQGCKL 393
Query: 1150 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 1209
VF+RVFP+ + +W+ AEQ GAVC +D VTHVVA LGT+K WA+S +F+
Sbjct: 394 VFTRVFPLHQRQQD-QMIWKMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAVSNKKFL 452
Query: 1210 VHPGWVEASALLYRRANEQDFAI 1232
VHP W+EA+ ++R E+DF +
Sbjct: 453 VHPRWIEAANFRWQRQQEEDFPV 475
>gi|242093742|ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
Length = 558
Score = 278 bits (712), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 142/323 (43%), Positives = 195/323 (60%), Gaps = 13/323 (4%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K + RKL L+LDLDHTL+NS K ++ + L + ++ P+R +F M M
Sbjct: 157 KNLLRERKLVLILDLDHTLINSTKLQDISSAEKD-LGIQTAASKDDPNRSIFSLDSMQML 215
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRP + FL+ AS +FEM++YTMG+K YA E+AK+LDP + F +VIS D
Sbjct: 216 TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSD------ 269
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
+R K D+ +LG ES VI+DD+ VW +K NLI++ERY +F S RQFG S
Sbjct: 270 CTQRHQKGLDV--ILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRS 327
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFFS---HQSLDDVDVRNILAAEQRKILAGCRI 1149
L E DER DG LA+ L V++R+H IFF L DVR ++ A +++IL GC+I
Sbjct: 328 LSESMQDERESDGALATVLDVLKRIHSIFFDLAVETDLSSQDVRQVIKAVRKEILQGCKI 387
Query: 1150 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 1209
VFSRVFP P LW+ AE GAVC+ +D VTHVV LGT+K W ++ +F+
Sbjct: 388 VFSRVFP-NNTRPQEQMLWKMAEHLGAVCSTDVDSSVTHVVTVDLGTEKARWGVANKKFL 446
Query: 1210 VHPGWVEASALLYRRANEQDFAI 1232
VHP W+EA+ + R E+DF +
Sbjct: 447 VHPRWIEAANFRWHRQPEEDFPV 469
>gi|218196729|gb|EEC79156.1| hypothetical protein OsI_19829 [Oryza sativa Indica Group]
Length = 574
Score = 278 bits (710), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 197/323 (60%), Gaps = 13/323 (4%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K + RKL L+LDLDHTL+NS K ++ +E L + P R LF M M
Sbjct: 189 KNLLRERKLVLILDLDHTLINSTKLFDLSAAENE-LGIQSAAKEVVPDRSLFTLETMQML 247
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRP + FL+ AS +FEM++YTMG+K YA E+AK+LDP V F +VIS D
Sbjct: 248 TKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSD------ 301
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
+R K D+ VLG ES VI+DD+ VW +K NLI++ERY YF S RQFG S
Sbjct: 302 CTQRHQKGLDV--VLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGARS 359
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRI 1149
L E DER DG LA+ L V+ER+H IFF + L DVR ++ ++++L GC++
Sbjct: 360 LSETMQDERENDGALATILDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQGCKL 419
Query: 1150 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 1209
VF+RVFP+ + LW+ AEQ GAVC +D VTHVVA LGT+K WA+S +F+
Sbjct: 420 VFTRVFPLHQRQQD-QMLWKMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAVSNKKFL 478
Query: 1210 VHPGWVEASALLYRRANEQDFAI 1232
VHP W+EA+ ++R E+DF +
Sbjct: 479 VHPRWIEAANFRWQRQQEEDFPV 501
>gi|302764346|ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
gi|300166408|gb|EFJ33014.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
Length = 411
Score = 276 bits (706), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 156/349 (44%), Positives = 204/349 (58%), Gaps = 34/349 (9%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH----- 962
R +E +++ RKL LVLDLDHTLLNSA++ EV P L E P
Sbjct: 44 REDELRQVLGKRKLFLVLDLDHTLLNSARWMEVFPDETAYL---EHTYMNVPEDKIPALS 100
Query: 963 ----------------LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 1006
L R M +WTKLRP FLE ASKLFEM++YTMG ++YA M
Sbjct: 101 NGAPAVAGVIQPGGGGLHRIHGMQLWTKLRPFAHKFLEEASKLFEMYVYTMGERMYAVTM 160
Query: 1007 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 1066
A +LDP G F GRVIS+ D ++KDL+ VLG +SAV+I+DD+ VWP +
Sbjct: 161 AHLLDPTGKFFKGRVISQRDST--------CRQTKDLDIVLGADSAVLILDDTEAVWPKH 212
Query: 1067 KLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF--SH 1124
+ NLIV+ERY +F S RQFGL PSL + + DE ++G LA+ L V++R+H FF S
Sbjct: 213 RANLIVMERYHFFQSSCRQFGLENPSLTKAERDESKDEGALANVLKVLQRIHSDFFMESD 272
Query: 1125 QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDD 1184
S DVR+I + + +IL+GC++VFSR+FP P L PLW+ GA C DD
Sbjct: 273 DSRYTCDVRDITSVVRSEILSGCKLVFSRIFPTDCLEPELTPLWRLCVDLGAECVLAHDD 332
Query: 1185 QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
VTHVVA TDK WA +F+VHP WVEA+ L+RR NE +F ++
Sbjct: 333 SVTHVVALDRFTDKAKWAKEHRKFLVHPAWVEAAHSLWRRPNELEFPVR 381
>gi|449447765|ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Cucumis sativus]
Length = 452
Score = 276 bits (706), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 146/328 (44%), Positives = 203/328 (61%), Gaps = 12/328 (3%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
R +E K++ +KL LVLDLDHTLLNS + + V +E LR + + + LF
Sbjct: 125 RNKEMKELLQRKKLILVLDLDHTLLNSTELRYL-TVEEEYLRSQTDSLDDVTKGSLFLLN 183
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
+ TKLRP + +FL+ ASKLFEM++YTMG + YA EMAK+LDPK F+ +VISR DD
Sbjct: 184 SVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR-DD 242
Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
G K L+ VLG ESAV+I+DD+ W +K NLI++ERY +F S RQFG
Sbjct: 243 GTQ-------KHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFG 295
Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD--DVDVRNILAAEQRKILA 1145
SL E+ +DE DG L + L V++++H +FF+ S D D DVR +L + ++L
Sbjct: 296 FNCKSLSELKNDESETDGALTTILKVLKQVHHMFFNEVSGDLVDRDVRQVLKTVRAEVLE 355
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GC++VFSRVFP + H LW+ EQ G C+ +D VTHVVA GT+K WAL
Sbjct: 356 GCKVVFSRVFPT-KFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKE 414
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAIK 1233
+F+VHP W+EAS ++R E++F ++
Sbjct: 415 KKFLVHPRWIEASNYFWKRQMEENFTVE 442
>gi|357163276|ref|XP_003579679.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Brachypodium distachyon]
Length = 493
Score = 274 bits (700), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 195/323 (60%), Gaps = 13/323 (4%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K + RKL L+LDLDHTL+NS K H++ + L + + P + LF M M
Sbjct: 156 KSLLRERKLVLILDLDHTLINSTKLHDISAAERD-LGIQTFASEDAPEKSLFTLEAMQML 214
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRP + FL+ AS +FEM++YTMG+K YA E+AK+LDP V F +VIS D
Sbjct: 215 TKLRPFVCKFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPGNVYFGSKVISNSD------ 268
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
+R K L+ VLG E+ +I+DD+ VW +K NLI++ERY YF S RQFG +
Sbjct: 269 CTQR--HQKGLDVVLGAENVAIILDDTEYVWQKHKENLILMERYHYFASSCRQFGFSVKA 326
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFFS---HQSLDDVDVRNILAAEQRKILAGCRI 1149
L E DER DG LA++L V++R+H +FF +L DVR ++ ++++L GC++
Sbjct: 327 LSESMQDERESDGALATTLDVLKRIHTLFFDSAVETALSSRDVRQVIKKVRQEVLQGCKV 386
Query: 1150 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 1209
VFSRVFP + P +W+ AEQ GA+C +D VTHVVA GT+K WA+ + +
Sbjct: 387 VFSRVFP-SSSRPQDQIIWKMAEQLGAICCADMDSTVTHVVAVDSGTEKARWAVGNNKIL 445
Query: 1210 VHPGWVEASALLYRRANEQDFAI 1232
VHP W+EAS + R E+DF +
Sbjct: 446 VHPRWIEASNFRWHRQQEEDFPV 468
>gi|9758369|dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]
Length = 1065
Score = 273 bits (699), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 147/329 (44%), Positives = 203/329 (61%), Gaps = 33/329 (10%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------------LFRF 966
RKL LVLDLDHTLLN+ ++ P E++ K H H LF
Sbjct: 746 RKLYLVLDLDHTLLNTTILRDLKP----------EEEYLKSHTHSLQDGCNVSGGSLFLL 795
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
M M TKLRP + +FL+ AS++F M++YTMG++ YA +MAK+LDPKG F RVISR D
Sbjct: 796 EFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISR-D 854
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 1086
DG V K L+ VLG ESAV+I+DD+ WP +K NLIV+ERY +F S RQF
Sbjct: 855 DGT-------VRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQF 907
Query: 1087 GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDVDVRNILAAEQRKIL 1144
SL E+ DE DG LA+ L V+++ H +FF + + + + DVR +L +++IL
Sbjct: 908 DHRYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEGISNRDVRLMLKQVRKEIL 967
Query: 1145 AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 1204
GC+IVFSRVFP +A P HPLW+ AE+ GA C +D VTHVVA +GT+K WA+
Sbjct: 968 KGCKIVFSRVFPT-KAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVR 1026
Query: 1205 TGRFVVHPGWVEASALLYRRANEQDFAIK 1233
++VVH GW++A+ L+ + E++F ++
Sbjct: 1027 EKKYVVHRGWIDAANYLWMKQPEENFGLE 1055
>gi|413945235|gb|AFW77884.1| CPL3 [Zea mays]
Length = 533
Score = 272 bits (695), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 195/323 (60%), Gaps = 13/323 (4%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K + RKL L+LDLDHTL+NS K ++ + L + ++ P+R +F M M
Sbjct: 155 KNLLRERKLVLILDLDHTLINSTKLQDISSAEKD-LGIQSAASKDDPNRSIFALDLMPML 213
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRP + FL+ AS +FEM++YTMG+K YA E+AK+LDP + F +VIS D
Sbjct: 214 TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSD------ 267
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
+R K L+ +LG ES VI+DD+ VW +K NLI++ERY +F S RQFG S
Sbjct: 268 CTQR--HQKGLDVILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRS 325
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAEQRKILAGCRI 1149
L E DER DG LA+ L V++R+H FF + D+ D+R ++ +++IL GC+I
Sbjct: 326 LSESLQDERESDGALATVLDVLKRIHATFFDMAAETDLSSRDIRQVIKTLRKEILQGCKI 385
Query: 1150 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 1209
VFSRVFP P +W+ AE GAVC K +D VTHVV LGT+K W L+ +F+
Sbjct: 386 VFSRVFP-NNTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVDLGTEKARWGLNNKKFL 444
Query: 1210 VHPGWVEASALLYRRANEQDFAI 1232
VHP W+EA+ + R E+DF +
Sbjct: 445 VHPRWIEAANFRWHRQPEEDFPV 467
>gi|145334837|ref|NP_001078764.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
thaliana]
gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 4;
Short=FCP-like 4; AltName: Full=Carboxyl-terminal
phosphatase-like 4; Short=AtCPL4; Short=CTD
phosphatase-like 4
gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana]
gi|332009601|gb|AED96984.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
thaliana]
Length = 440
Score = 271 bits (694), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 147/329 (44%), Positives = 203/329 (61%), Gaps = 33/329 (10%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------------LFRF 966
RKL LVLDLDHTLLN+ ++ P E++ K H H LF
Sbjct: 121 RKLYLVLDLDHTLLNTTILRDLKP----------EEEYLKSHTHSLQDGCNVSGGSLFLL 170
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
M M TKLRP + +FL+ AS++F M++YTMG++ YA +MAK+LDPKG F RVISR D
Sbjct: 171 EFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISR-D 229
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 1086
DG V K L+ VLG ESAV+I+DD+ WP +K NLIV+ERY +F S RQF
Sbjct: 230 DGT-------VRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQF 282
Query: 1087 GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDVDVRNILAAEQRKIL 1144
SL E+ DE DG LA+ L V+++ H +FF + + + + DVR +L +++IL
Sbjct: 283 DHRYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEGISNRDVRLMLKQVRKEIL 342
Query: 1145 AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 1204
GC+IVFSRVFP +A P HPLW+ AE+ GA C +D VTHVVA +GT+K WA+
Sbjct: 343 KGCKIVFSRVFPT-KAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVR 401
Query: 1205 TGRFVVHPGWVEASALLYRRANEQDFAIK 1233
++VVH GW++A+ L+ + E++F ++
Sbjct: 402 EKKYVVHRGWIDAANYLWMKQPEENFGLE 430
>gi|226497696|ref|NP_001152445.1| CPL3 [Zea mays]
gi|195656359|gb|ACG47647.1| CPL3 [Zea mays]
Length = 531
Score = 271 bits (693), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 195/323 (60%), Gaps = 13/323 (4%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K + RKL L+LDLDHTL+NS K ++ + L + ++ P+R +F M M
Sbjct: 153 KNLLRERKLVLILDLDHTLINSTKLQDISSAEKD-LGIQSAASKDDPNRSIFALDLMPML 211
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRP + FL+ AS +FEM++YTMG+K YA E+AK+LDP + F +VIS D
Sbjct: 212 TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSD------ 265
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
+R K D+ +LG ES VI+DD+ VW +K NLI++ERY +F S RQFG S
Sbjct: 266 CTQRHQKGLDV--ILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRS 323
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAEQRKILAGCRI 1149
L E DER DG LA+ L V++R+H FF + D+ D+R ++ +++IL GC+I
Sbjct: 324 LSESLQDERESDGALATVLDVLKRIHATFFDMAAETDLSSRDIRQVIKTLRKEILQGCKI 383
Query: 1150 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 1209
VFSRVFP P +W+ AE GAVC K +D VTHVV LGT+K W L+ +F+
Sbjct: 384 VFSRVFP-NNTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVDLGTEKSRWGLNNKKFL 442
Query: 1210 VHPGWVEASALLYRRANEQDFAI 1232
VHP W+EA+ + R E+DF +
Sbjct: 443 VHPRWIEAANFRWHRQPEEDFPV 465
>gi|307106534|gb|EFN54779.1| hypothetical protein CHLNCDRAFT_134722 [Chlorella variabilis]
Length = 513
Score = 271 bits (693), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 148/336 (44%), Positives = 194/336 (57%), Gaps = 27/336 (8%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVH---------DEILRKK-EEQDREKPHRHL 963
++ + RKL L+LDLDHTLLNS +F EV P ++ LR + E Q + P L
Sbjct: 111 RLLAHRKLLLILDLDHTLLNSTRFTEVPPQGAVTEQREGGEQALRAQLEAQPKGAPM--L 168
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKL------FEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
+ PHM MWTKLRPG+ FLE A FE+ +YTMG++ YA EMAK+LDP G LF
Sbjct: 169 YCLPHMRMWTKLRPGVREFLEAAKDRQVGQVGFELAVYTMGDRDYAGEMAKLLDPAGSLF 228
Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
GR+IS GD + KDL+ VLG E V+I+DD+ VWP ++ NL+ +ERY
Sbjct: 229 HGRIISSGDSTQRY--------VKDLDVVLGRERCVLILDDTEGVWPRHRDNLVQIERYL 280
Query: 1078 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILA 1137
YFP +FG SLLE DE G LA+ L V+ + + FF DVR +L
Sbjct: 281 YFPADAARFGFRSQSLLERAVDEEGGGGALATCLRVMSGVQQQFFEQGDPGAADVRPLLG 340
Query: 1138 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 1197
A +R +LA CR++FSRV P+ A+P HPLWQ A + GA C + VTHVVA TD
Sbjct: 341 AARRAVLAECRLLFSRVMPLDCADPSAHPLWQLALKLGAECVRETGQGVTHVVATDT-TD 399
Query: 1198 KVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
K WA G+ VV P W+ A ++RA+E F +K
Sbjct: 400 KTRWACGQGKHVVSPSWLWCCAYTWQRADEAGFPVK 435
>gi|326518250|dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 271 bits (693), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 197/329 (59%), Gaps = 13/329 (3%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEI-LRKKEEQDREKPHRHLFRF 966
R E K + RKL L+LDLDHTL+NS + H++ ++ ++ ++ + P R LF
Sbjct: 150 RESEVKNLLRERKLVLILDLDHTLINSTRLHDISAAEMDLGIQTAASKNADDPERSLFTL 209
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
M M TKLRP + FLE AS +F+M++YTMG+K YA E+AK+LDP V F +VIS D
Sbjct: 210 QGMHMLTKLRPFVRKFLEEASNMFDMYIYTMGDKAYAIEIAKLLDPGNVYFDSKVISNSD 269
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 1086
+R K D+ VLG + VIIDD+ VW +K NLI++ERY YF S RQF
Sbjct: 270 ------CTQRHQKGLDV--VLGDDKVAVIIDDTEHVWQKHKENLILMERYHYFAASCRQF 321
Query: 1087 GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS---HQSLDDVDVRNILAAEQRKI 1143
G SL E+ DER DG LA+ L V++R+H IFF +L DVR ++ ++++
Sbjct: 322 GFSDQSLSELMQDERESDGALATILDVLKRIHTIFFDSGVETALSSRDVRQVIKRVRQEV 381
Query: 1144 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 1203
L GC++VFSRVFP + +W+ AEQ GAVC +D VTHVVA GT+K WA
Sbjct: 382 LQGCKLVFSRVFP-SDCRSQDQIMWKMAEQLGAVCCSEVDPSVTHVVAVHAGTEKARWAA 440
Query: 1204 STGRFVVHPGWVEASALLYRRANEQDFAI 1232
+F++HP W+EA + R E+DF +
Sbjct: 441 GNKKFLLHPRWIEACNYRWHRQPEEDFPV 469
>gi|242087817|ref|XP_002439741.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
gi|241945026|gb|EES18171.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
Length = 547
Score = 271 bits (693), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 197/329 (59%), Gaps = 14/329 (4%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
R + K + RKL L+LDLDHTL+NS K + + L + ++ P+R +F
Sbjct: 154 RCADLKNLLRERKLVLILDLDHTLINSTKLQNISSAEKD-LGIQTAASKDDPNRSIFALE 212
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
M + TKLRP + FL+ AS +FEM++YTMG+K YA E+AK+LDP + F +VIS
Sbjct: 213 SMQLLTKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPLKVIS---- 268
Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
+ D K L+ +LG S VI+DD+ VW +K NLI++ERY +F S R+FG
Sbjct: 269 ----NSDCTKRHQKGLDVILGAASVAVILDDTEFVWKKHKENLILMERYHFFASSCREFG 324
Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ-SLDDV---DVRNILAAEQRKI 1143
SL E+ DER DG LA+ L V++R+H IFF DD+ DVR ++ A +++I
Sbjct: 325 FAVRSLSELMQDERESDGALATVLDVLKRIHAIFFDMAVETDDLSSRDVRQVIKAVRKEI 384
Query: 1144 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 1203
L GC+IVFSRVFP P +W+ AE GAVC+ +D VTHVV LGT+K W +
Sbjct: 385 LQGCKIVFSRVFP-NNTRPQKQMVWKMAEYLGAVCSTDVDSSVTHVVTVDLGTEKARWGV 443
Query: 1204 STGRFVVHPGWVEASALLYRRANEQDFAI 1232
+ +F+VHP W+EA+ + R E+DF +
Sbjct: 444 ANKKFLVHPRWIEAANFRWHRQPEEDFPV 472
>gi|357129281|ref|XP_003566293.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Brachypodium distachyon]
Length = 492
Score = 268 bits (685), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 198/322 (61%), Gaps = 12/322 (3%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
KK+ RKL L+LDLDHTL+NS + H++ + L + ++ P R LF M M
Sbjct: 157 KKLLRERKLVLILDLDHTLINSTRLHDISAAEMD-LGIQTAALKDDPDRSLFTLERMHML 215
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRP + FL+ AS +FEM++YTMG+K Y+ E+AK+LDP V F +VIS D
Sbjct: 216 TKLRPFVRRFLKEASNMFEMYIYTMGDKAYSIEVAKLLDPGNVYFGSKVISNSD------ 269
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
+R K D+ VLG ES VI+DD+ VW +K NLI++ERY YF S RQFG S
Sbjct: 270 CTQRHQKGLDV--VLGAESIAVILDDTEDVWQKHKENLILMERYHYFASSCRQFGFSVRS 327
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAE--QRKILAGCRIV 1150
L E+ DER DG L++ L V++R+H IFF + R ++ + ++++L GC++V
Sbjct: 328 LSELMVDERESDGALSTILDVLKRIHTIFFDSGVETALSSRTLMVIKRVRQEVLQGCKLV 387
Query: 1151 FSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVV 1210
FSRVFP + P +W+ AE+ GA C H+D VTHVVA +GT+K WA+ +F++
Sbjct: 388 FSRVFP-SNSCPQDQIIWKMAEKLGASCCAHVDSTVTHVVAVDVGTEKARWAVENKKFLL 446
Query: 1211 HPGWVEASALLYRRANEQDFAI 1232
HP W+EAS +RR E+DF +
Sbjct: 447 HPRWIEASNYRWRRQPEEDFPV 468
>gi|307111295|gb|EFN59530.1| hypothetical protein CHLNCDRAFT_138191 [Chlorella variabilis]
Length = 1156
Score = 267 bits (682), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 202/335 (60%), Gaps = 22/335 (6%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP--HRHLFRFPHM 969
+++ KLCLVLDLDHTLLNSA F EV P +HD + + + P R LFR +
Sbjct: 361 EELLGRGKLCLVLDLDHTLLNSATFAEVGPTLHDSLKARAASEAATLPEDQRLLFRIDGI 420
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
MWTKLRPG+ FL+RA++ +++ ++T GN+ YA + ++LD G +F R+I++G +
Sbjct: 421 KMWTKLRPGVHKFLQRAARYYQLWIHTNGNRAYADSVVRLLDRGGAIFGDRIIAQGAE-- 478
Query: 1030 PFDGDERVP--KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
D+ VP + ++G+ ES VI+DDS VW ++ NL+ VERY YFP SR G
Sbjct: 479 --RVDQMVPDQAKRLMQGLDERESITVIVDDSHSVWSQHRHNLVAVERYIYFPSSRASLG 536
Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS------------LDDVDVRNI 1135
L GPSLL+ + DE E G L +L V+ R+H + + D R
Sbjct: 537 LKGPSLLDANRDECPEQGMLMVALSVLVRVHGAVMRALAAPPTVLPGGEVVFQNWDARQA 596
Query: 1136 LAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG 1195
LA E++K+LAG +VF+RV P+ E P HPLW+ A+ FGA C+ +D THV+A + G
Sbjct: 597 LAQERQKVLAGVHLVFTRVIPL-EMEPESHPLWRLAQSFGARCSGSLDASTTHVIAGASG 655
Query: 1196 TDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
T+KV A S G++VV P W+E S +L++RA+E+ F
Sbjct: 656 TEKVLSARSMGKWVVTPAWLECSCILWKRAHEERF 690
>gi|449532013|ref|XP_004172979.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
phosphatase-like 4-like, partial [Cucumis sativus]
Length = 340
Score = 267 bits (682), Expect = 4e-68, Method: Composition-based stats.
Identities = 146/328 (44%), Positives = 202/328 (61%), Gaps = 12/328 (3%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
R +E K++ KL LVLDLDHTLLNS + + V +E LR + + + LF
Sbjct: 13 RNKEMKELLQRXKLILVLDLDHTLLNSTELRYLT-VEEEYLRSQTDSLDDVTKGSLFLLN 71
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
+ TKLRP + +FL+ ASKLFEM++YTMG + YA EMAK+LDPK F+ +VISR DD
Sbjct: 72 SVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR-DD 130
Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
G K L+ VLG ESAV+I+DD+ W +K NLI++ERY +F S RQFG
Sbjct: 131 GTQ-------KHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFG 183
Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD--DVDVRNILAAEQRKILA 1145
SL E+ +DE DG L + L V++++H +FF+ S D D DVR +L + ++L
Sbjct: 184 FNCKSLSELKNDESETDGALTTILKVLKQVHHMFFNEVSGDLVDRDVRQVLKTVRAEVLE 243
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GC++VFSRVFP + H LW+ EQ G C+ +D VTHVVA GT+K WAL
Sbjct: 244 GCKVVFSRVFPT-KFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKE 302
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAIK 1233
+F+VHP W+EAS ++R E++F ++
Sbjct: 303 KKFLVHPRWIEASNYFWKRQMEENFTVE 330
>gi|356498756|ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Glycine max]
Length = 428
Score = 264 bits (675), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 148/336 (44%), Positives = 203/336 (60%), Gaps = 17/336 (5%)
Query: 901 IQKERTRRLE--EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+ E RL + K + +KL LVLDLDHTLLNS + +L + + R+
Sbjct: 104 LHDEEISRLRNTDMKSLLCRKKLYLVLDLDHTLLNSTHLAHLTSEESHLLNQTDSL-RDV 162
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
LF+ HM M TKLRP + FL+ AS++FEM++YTMG++ YA EMAK+LDP+G F
Sbjct: 163 SKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN 222
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
+VISR DDG K L+ VLG ESAV+I+DD+ W +K NLI++ERY +
Sbjct: 223 AKVISR-DDGTQ-------KHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHF 274
Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ-SLDDVDVRNILA 1137
F S RQFG SL E+ DE DG LA L V++++H +FF Q DD DVR +L+
Sbjct: 275 FGSSCRQFGFNCKSLAELKSDENETDGALAKILKVLKQVHCMFFDKQEDFDDRDVRQMLS 334
Query: 1138 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 1197
+R++L+GC I+FSR+ V A P L + AEQ GA C ID VTHVVA GT+
Sbjct: 335 LVRREVLSGCVIIFSRI--VHGAIPSLRKM---AEQMGATCLTEIDPSVTHVVATDAGTE 389
Query: 1198 KVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
K WA+ +FVVHP W+EA+ +++ E++F +K
Sbjct: 390 KCRWAVKEKKFVVHPLWIEAANYFWQKQPEENFILK 425
>gi|356564913|ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Glycine max]
Length = 442
Score = 264 bits (675), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 147/336 (43%), Positives = 204/336 (60%), Gaps = 17/336 (5%)
Query: 901 IQKERTRRLE--EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+ E RL + K + +KL LVLDLDHTLLNS ++ +L + +
Sbjct: 118 LHDEEISRLRNTDMKSLLGRKKLYLVLDLDHTLLNSTHLAQLTSEELHLLNQTDSLTNVS 177
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
LF+ HM M TKLRP + FL+ AS++FEM++YTMG++ YA EMAK+LDP+G F
Sbjct: 178 KGS-LFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN 236
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
+VISR DDG K L+ VLG ESAV+I+DD+ W +K NLI++ERY +
Sbjct: 237 AKVISR-DDGTQ-------KHQKGLDVVLGQESAVIILDDTEHAWMKHKDNLILMERYHF 288
Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ-SLDDVDVRNILA 1137
F S RQFG SL E+ DE DG LA L V++++H +FF Q DD DVR +L+
Sbjct: 289 FGSSCRQFGFNCKSLAELKSDEDETDGALAKILKVLKQVHCMFFDKQEDFDDQDVRQVLS 348
Query: 1138 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 1197
+ +R++L+GC I+FSR+ V A P L + AEQ GA C ID VTHVVA GT+
Sbjct: 349 SVRREVLSGCVIIFSRI--VHGAIPSLRKM---AEQMGATCLTEIDPSVTHVVATDAGTE 403
Query: 1198 KVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
K WA+ +FVVHP W+EA+ +++ E++F++K
Sbjct: 404 KCRWAVKEKKFVVHPLWIEAANYFWQKQPEENFSLK 439
>gi|384251210|gb|EIE24688.1| carboxyl-terminal phosphatase-like 4 [Coccomyxa subellipsoidea C-169]
Length = 439
Score = 264 bits (674), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 150/358 (41%), Positives = 208/358 (58%), Gaps = 38/358 (10%)
Query: 899 AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+A + ER R+ + K+ S RKL LVLDLDHTLLNS +F E +++ + + ++
Sbjct: 59 SASEAERVRQ-QSLKRALSNRKLLLVLDLDHTLLNSTRFDEAVGFEEQLAAIQRARPEDQ 117
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
P L+ HM +WTKLRP + FLE+A ++ EMH+YT GN YA EMA++LDP FA
Sbjct: 118 P-VSLYHLEHMRLWTKLRPYVREFLEKAHEVSEMHIYTHGNAEYAIEMARLLDPTKRFFA 176
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
R+IS+GD V KDL+ VLG E+AVVI+DD+ VWP ++ NL+ VERY +
Sbjct: 177 ERIISQGDST--------VKHVKDLDVVLGAETAVVILDDTAGVWPSHQQNLLQVERYVF 228
Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF-SHQSLDDVDVRNILA 1137
FP R+F L SLLE+ DE + G LAS+L R+H FF + DVR L
Sbjct: 229 FPACARRFQLNVQSLLELGRDEDEQHGMLASAL----RVHSRFFGASAGGGQQDVRQHLQ 284
Query: 1138 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAE-----------------------QF 1174
A + ++L+GCRI FSR+ P G+ P HP WQ A+ Q
Sbjct: 285 ALRLQVLSGCRISFSRIIPRGDRFPETHPHWQMAQQHKPIKFAVSNGGSRESQSIEWVQL 344
Query: 1175 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
GAV T +++ THVVA + TDKV+WA++ R +V P W+ ASA L+R+ +E F +
Sbjct: 345 GAVVTLGVEEDTTHVVAAAKDTDKVHWAVANERHIVSPDWLTASACLWRKMDEDRFPV 402
>gi|297793317|ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
lyrata]
gi|297310378|gb|EFH40802.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
lyrata]
Length = 1006
Score = 260 bits (664), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 140/333 (42%), Positives = 199/333 (59%), Gaps = 46/333 (13%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH---------------- 962
RKL LVLDLDHTLLNS ++ P E++ K H H
Sbjct: 691 RKLYLVLDLDHTLLNSTVLRDLKP----------EEEYLKSHTHSLQEPFDFLLISDVSG 740
Query: 963 --LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
LF M M TKLRP + +FL+ AS++F M++YTMG++ YA +MAK+LDP+G F R
Sbjct: 741 GSLFMLEFMHMMTKLRPFVHSFLKEASEMFVMYIYTMGDRAYARQMAKLLDPRGEYFGDR 800
Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+ISR DDG V K L+ VLG ESAV+I+DD+ WP++K NLIV+ERY +F
Sbjct: 801 IISR-DDGT-------VRHQKSLDVVLGQESAVLILDDTENAWPNHKDNLIVIERYHFFA 852
Query: 1081 CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 1140
S RQF SL E+ DE DG LA+ L ++ + + + DVR++L +
Sbjct: 853 SSCRQFDHKYKSLSELKSDESEPDGALATVLKNVD---------EDISNRDVRSMLKQVR 903
Query: 1141 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVN 1200
+++L GC++VFSRVFP +A P HPLW+ AE+ GA C +D VTHVVA +GT+K
Sbjct: 904 KEVLKGCKVVFSRVFPT-KAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKAR 962
Query: 1201 WALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
WA+ ++VVH GW++A+ L+++ E+ F+++
Sbjct: 963 WAVREKKYVVHRGWIDAANYLWKKQPEEKFSLE 995
>gi|224142399|ref|XP_002324546.1| predicted protein [Populus trichocarpa]
gi|222865980|gb|EEF03111.1| predicted protein [Populus trichocarpa]
Length = 312
Score = 254 bits (650), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 197/322 (61%), Gaps = 12/322 (3%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
+ K + +KL L+LDLDHTLLNS + + + +E L + + ++ LF M
Sbjct: 3 DMKNLLRHKKLYLILDLDHTLLNSTQLMHM-TLDEEYLNGQTDSLQDVSKGSLFMLSSMQ 61
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
M TKLRP + TFL+ AS++FEM++YTMG++ YA EMAK+LDP F +VISR D
Sbjct: 62 MMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDD---- 117
Query: 1031 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
G +R K D+ VLG ESAV+I+DD+ W +K NLI++ERY +F S QFG
Sbjct: 118 --GTQRHQKGLDV--VLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNC 173
Query: 1091 PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIV 1150
SL E DE +G LAS L V+ ++H+IFF +L +L ++ +L GC+IV
Sbjct: 174 KSLSEQKTDESESEGALASILKVLRKIHQIFFEDHTLS--LALQVLKTVRKDVLKGCKIV 231
Query: 1151 FSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVV 1210
FSRVFP ++ H LW+ AEQ GA C+ +D VTHVV+ GT+K +WA +F+V
Sbjct: 232 FSRVFPT-QSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWASKHNKFLV 290
Query: 1211 HPGWVEASALLYRRANEQDFAI 1232
PGW+EA+ ++R E++F++
Sbjct: 291 QPGWIEATNYFWQRQPEENFSV 312
>gi|145346053|ref|XP_001417510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577737|gb|ABO95803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 643
Score = 253 bits (647), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 147/347 (42%), Positives = 202/347 (58%), Gaps = 32/347 (9%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR----------KKEEQDREKPHRH 962
+++ ++RKL LVLDLDHTLLNS ++ + LR K+ E + R
Sbjct: 302 ERLIASRKLALVLDLDHTLLNSVLVPDL-RMDSNWLRNAMRLLDADVKRAEDANDPLKRS 360
Query: 963 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
+F H + TKLRPG+ FLERAS+LFE+H+ TMG++ YA +M ++LDP+ G V
Sbjct: 361 VFHLQHFDLLTKLRPGVRRFLERASRLFEIHINTMGSQAYADQMVELLDPEKRWIHGTV- 419
Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 1081
RG G+ G P K L+G L + A +I DD+ VW ++ NL+ ERY +FP
Sbjct: 420 -RGL-GEMEGGKLWAPAEKTLDGALEHLADACLIFDDTASVWESHRRNLVTCERYLFFPQ 477
Query: 1082 SRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS--------------HQSL 1127
+RRQFGL G SLLEI DE ++G L++++ V E +H +F+ Q
Sbjct: 478 ARRQFGLSGMSLLEIGQDESEDEGMLSTAMKVFESVHSAYFAGGYDKNVKHKVRALKQHA 537
Query: 1128 DDVD-VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV 1186
DV V+ IL A+++K+LA RIVFSRVFP+ +A+P HPLW AE FGA C + + D
Sbjct: 538 SDVRAVQEILCAQRKKVLADVRIVFSRVFPI-DADPTTHPLWILAEDFGATCGRTLCDDT 596
Query: 1187 THVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
THVV + TDKV A + G V P W+E S LL+RRANE F I
Sbjct: 597 THVVGTASSTDKVKAAKARGNVHAVTPHWLECSMLLWRRANEATFRI 643
>gi|218196728|gb|EEC79155.1| hypothetical protein OsI_19828 [Oryza sativa Indica Group]
Length = 430
Score = 252 bits (643), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 127/267 (47%), Positives = 170/267 (63%), Gaps = 12/267 (4%)
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
M M TKLRP + FL+ AS +FEM++YTMG+K YA E+AK+LDP V F +VIS D
Sbjct: 1 MQMLTKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSD-- 58
Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
+R K D+ VLG ES VI+DD+ VW +K NLI++ERY YF S RQFG
Sbjct: 59 ----CTQRHQKGLDV--VLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGF 112
Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILA 1145
SL E DER DG LA+ L V+ER+H IFF + L DVR ++ ++++L
Sbjct: 113 GARSLSETMQDERENDGALATILDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQ 172
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GC++VF+RVFP+ + P LW+ AEQ GAVC +D VTHVVA LGT+K WA+S
Sbjct: 173 GCKLVFTRVFPLHQ-RPQDQMLWKMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAISN 231
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAI 1232
+F+VHP W+EA+ ++R E+DF +
Sbjct: 232 KKFLVHPRWIEAANFRWQRQQEEDFPV 258
>gi|359494894|ref|XP_003634864.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Vitis vinifera]
Length = 278
Score = 248 bits (632), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 127/273 (46%), Positives = 173/273 (63%), Gaps = 12/273 (4%)
Query: 963 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
LF M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVI
Sbjct: 9 LFMLNTMHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVI 68
Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1082
S+ D K L+ VLG ESAV+I+DD+ VW +K NLI++ERY +F S
Sbjct: 69 SQADCTQR--------HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASS 120
Query: 1083 RRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAE 1139
RQFG SL E+ DE DG LA+ L V++R+H +FF + DD DVR ++
Sbjct: 121 CRQFGFNCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRV 180
Query: 1140 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 1199
++++L GC+IVFSRVFP H LW+ AEQ GA C +D VTHVV+ GT+K
Sbjct: 181 RKEVLKGCKIVFSRVFPT-RFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKS 239
Query: 1200 NWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
WAL +F+VHPGW+EA+ +++ E++F +
Sbjct: 240 RWALQEKKFLVHPGWIEAANYFWQKQPEENFPV 272
>gi|359497210|ref|XP_003635453.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Vitis vinifera]
Length = 278
Score = 247 bits (630), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 127/273 (46%), Positives = 172/273 (63%), Gaps = 12/273 (4%)
Query: 963 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
LF M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVI
Sbjct: 9 LFMLNTMHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVI 68
Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1082
S+ D K L+ VLG ESAV+I+DD+ VW +K NLI++ERY +F S
Sbjct: 69 SQADCTQR--------HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASS 120
Query: 1083 RRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAE 1139
RQFG SL E+ DE DG LA+ L V++R+H +FF + DD DVR ++
Sbjct: 121 CRQFGFNCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRV 180
Query: 1140 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 1199
++ +L GC+IVFSRVFP H LW+ AEQ GA C +D VTHVV+ GT+K
Sbjct: 181 RKDVLKGCKIVFSRVFPT-RFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKS 239
Query: 1200 NWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
WAL +F+VHPGW+EA+ +++ E++F +
Sbjct: 240 RWALQEKKFLVHPGWIEAANYFWQKQPEENFPV 272
>gi|296088193|emb|CBI35709.3| unnamed protein product [Vitis vinifera]
Length = 638
Score = 247 bits (630), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 126/267 (47%), Positives = 172/267 (64%), Gaps = 12/267 (4%)
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVIS+ D
Sbjct: 1 MHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQAD-- 58
Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
+R K L+ VLG ESAV+I+DD+ VW +K NLI++ERY +F S RQFG
Sbjct: 59 ----CTQR--HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 112
Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAEQRKILA 1145
SL E+ DE DG LA+ L V++R+H +FF + DD DVR ++ ++ +L
Sbjct: 113 NCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKDVLK 172
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GC+IVFSRVFP H LW+ AEQ GA C +D VTHVV+ GT+K WAL
Sbjct: 173 GCKIVFSRVFPT-RFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQE 231
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAI 1232
+F+VHPGW+EA+ +++ E++F +
Sbjct: 232 KKFLVHPGWIEAANYFWQKQPEENFPV 258
>gi|296090640|emb|CBI41034.3| unnamed protein product [Vitis vinifera]
Length = 264
Score = 246 bits (628), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 125/267 (46%), Positives = 171/267 (64%), Gaps = 12/267 (4%)
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVIS+ D
Sbjct: 1 MHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCT 60
Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
K L+ VLG ESAV+I+DD+ VW +K NLI++ERY +F S RQFG
Sbjct: 61 QR--------HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 112
Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAEQRKILA 1145
SL E+ DE DG LA+ L V++R+H +FF + DD DVR ++ ++++L
Sbjct: 113 NCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKEVLK 172
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GC+IVFSRVFP H LW+ AEQ GA C +D VTHVV+ GT+K WAL
Sbjct: 173 GCKIVFSRVFPT-RFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQE 231
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAI 1232
+F+VHPGW+EA+ +++ E++F +
Sbjct: 232 KKFLVHPGWIEAANYFWQKQPEENFPV 258
>gi|224142401|ref|XP_002324547.1| predicted protein [Populus trichocarpa]
gi|222865981|gb|EEF03112.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 243 bits (621), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 124/266 (46%), Positives = 173/266 (65%), Gaps = 11/266 (4%)
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
M M TKLRP + TFL+ AS++FEM++YTMG++ YA EMAK+LDP F +VISR D
Sbjct: 5 MQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDD-- 62
Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
G +R K D+ VLG ESAV+I+DD+ W +K NLI++ERY +F S QFG
Sbjct: 63 ----GTQRHQKGLDV--VLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGF 116
Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDVDVRNILAAEQRKILAG 1146
SL E DE +G LAS L V+ ++H+IFF +++D DVR +L ++ +L G
Sbjct: 117 NCKSLSEQKTDESESEGALASILKVLRKIHQIFFEELEENMDGRDVRQVLKTVRKDVLKG 176
Query: 1147 CRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG 1206
C+IVFSRVFP ++ H LW+ AEQ GA C+ +D VTHVV+ GT+K +WAL
Sbjct: 177 CKIVFSRVFPT-QSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWALKHN 235
Query: 1207 RFVVHPGWVEASALLYRRANEQDFAI 1232
+F+V PGW+EA+ ++R E++FA+
Sbjct: 236 KFLVQPGWIEAANYFWQRQPEENFAV 261
>gi|242063380|ref|XP_002452979.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
gi|241932810|gb|EES05955.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
Length = 518
Score = 243 bits (620), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 144/335 (42%), Positives = 193/335 (57%), Gaps = 27/335 (8%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF- 966
R+ + + + ARKL L+LDLDHTLLNS ++ P E + P LFR
Sbjct: 195 RVSDLETLLRARKLTLILDLDHTLLNSTGLDDLSPA--EQANGLTRHTKGDPTAGLFRLG 252
Query: 967 -PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
M TKLRP FLE+AS +FEM +YT+G++ YA + K+LDP G F GRV+S
Sbjct: 253 RARFRMLTKLRPFARGFLEQASAMFEMSVYTLGDRGYARAVVKLLDPDGAYFGGRVVS-- 310
Query: 1026 DDGDPFDGDERVPK-SKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSR 1083
DE + K L+ V G E+A VVI+DDS VWP ++ NLIV++RY YF S
Sbjct: 311 -------SDESTRRDRKSLDVVPGAEAAAVVILDDSSHVWPEHQENLIVMDRYLYFADSC 363
Query: 1084 RQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV------DVRNILA 1137
R +G SL E+ DER DG LA +L V+ R+H+ FF D V DVR ++
Sbjct: 364 RTYGCGVSSLAELRRDEREHDGALAVALQVLTRVHQGFF-----DSVLGGRFSDVREVIR 418
Query: 1138 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 1197
A + ++L GC + FSRV P+ E HP+W+ AEQ GAVCT D VTHVVA GTD
Sbjct: 419 AVRSEVLRGCTVAFSRVIPL-EGVAGDHPMWKLAEQLGAVCTADADATVTHVVALDPGTD 477
Query: 1198 KVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
K WA +F+V+P W+ A+++ + R EQ+F I
Sbjct: 478 KARWARDNCKFLVNPKWIMAASIRWCRPCEQEFPI 512
>gi|242063378|ref|XP_002452978.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
gi|241932809|gb|EES05954.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
Length = 464
Score = 236 bits (603), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 190/318 (59%), Gaps = 17/318 (5%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH---MGMWTKL 975
RKL LVLDLDHTLLNS + ++ + + +D + H LFR + + M TKL
Sbjct: 155 RKLILVLDLDHTLLNSTRLQDLSALEQRNGFTPDTED--ELHMELFRLEYSDNVRMLTKL 212
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
RP + FL++AS FEMH+YT+G + YA + +LDP GV F GRV+SR + +
Sbjct: 213 RPFVRGFLDQASSRFEMHVYTLGRQDYAKAVIDLLDPDGVYFRGRVVSRKE------STQ 266
Query: 1036 RVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 1094
R KS D+ + G + +AVVI+DD+ WP ++ NLI+++RY YF C+ R+F PS+
Sbjct: 267 RDVKSLDV--IPGADPAAVVILDDTDSAWPGHQDNLILMDRYHYFACTCRKFRYNIPSMA 324
Query: 1095 EIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV 1154
E DER DG+LA LGV+ R+H+ FF D DVR ++A +R++L C +VFS +
Sbjct: 325 EQARDEREHDGSLAVVLGVLNRIHQAFFDD---DRADVREVIAEVRRQVLPVCTVVFSYL 381
Query: 1155 FPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGW 1214
E P +W AE+ GA C K +D+ VTHVVA GT K WA G+F+V+P W
Sbjct: 382 EEYMEDFPEDTLMWTLAERLGAACQKDVDETVTHVVAEDPGTQKAQWAREHGKFLVNPEW 441
Query: 1215 VEASALLYRRANEQDFAI 1232
++A + R +E+DF +
Sbjct: 442 IKAVNFRWCRVDERDFPV 459
>gi|168012675|ref|XP_001759027.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689726|gb|EDQ76096.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 389
Score = 236 bits (602), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 128/324 (39%), Positives = 192/324 (59%), Gaps = 24/324 (7%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
E +++ +KL LV+DLDHT+LNSA+F +V PV + + + H+ +G
Sbjct: 3 ELRRVNKTKKLLLVVDLDHTVLNSARFADV-PVGMTWIAGELQAGGSSLHQ----MTKLG 57
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
+WTKLRP FL+ ASKL+EM++YTMG + YA +MAK+LDP LFA R+IS+ D
Sbjct: 58 LWTKLRPFAHEFLQEASKLYEMYIYTMGERKYAKKMAKLLDPTRQLFADRIISQNDSTKR 117
Query: 1031 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
+ +KDL+ VLG +SAVVI+DD+ VWP +K NLI++ERY +F S QFG+
Sbjct: 118 Y--------TKDLDVVLGADSAVVILDDTEAVWPSHKSNLILMERYHFFSSSCSQFGVNS 169
Query: 1091 PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS--LDDVDVRNILAAEQRKILAGCR 1148
SL ++ DE +GTLA++L + +H +F+ + + + ++ + + K+LAGC
Sbjct: 170 ASLAQLYRDESETEGTLATTLKTLRAIHHEYFNGKVYFFKQLSLFFVIRSLRAKLLAGCN 229
Query: 1149 IVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF 1208
+V P +HP WQ + GA C+ D THVVA GTD+ WA F
Sbjct: 230 VVL---------GPEIHPFWQLPAELGARCSTFCDHTTTHVVALDPGTDQALWAKEHDVF 280
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
+VHP WV+A++ L+ R E+D+ +
Sbjct: 281 LVHPRWVDATSYLWSRPPEEDYPV 304
>gi|357450477|ref|XP_003595515.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355484563|gb|AES65766.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 382
Score = 235 bits (599), Expect = 1e-58, Method: Composition-based stats.
Identities = 138/333 (41%), Positives = 196/333 (58%), Gaps = 20/333 (6%)
Query: 905 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HL 963
R R L + + + RKLCLVLDLDHTLLN+ H + P +E+ K E R L
Sbjct: 63 RVRSLH-TRNLLNRRKLCLVLDLDHTLLNTTSLHRLSP--EEMHLKTCTDSLEDIARGRL 119
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F H KLRP + TFL+ ASK+FEM++YTMG++ Y+ EMA++LDP+G F +VIS
Sbjct: 120 FVLEHRQRMAKLRPFVRTFLKEASKMFEMYIYTMGDRRYSLEMARLLDPQGKFFKDKVIS 179
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSR 1083
R DDG K KDL VLG ES+++I+DD+ +VW +K NLI++ERY +F S
Sbjct: 180 R-DDGTEM-------KEKDLNLVLGTESSILILDDNKKVWRMHKDNLILMERYHFFNSSC 231
Query: 1084 RQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD--DVDVRNILAAEQR 1141
++F L SL E+ DE DG LA L V+ ++ FF D D DVR +L++ +
Sbjct: 232 QEFDLNCKSLAELHIDENETDGALARILKVLRHINSKFFDELQGDLVDRDVRQVLSSLRG 291
Query: 1142 KILAGCRIVFSRVFPVGEANPH-LHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVN 1200
++L+GC IVFS F N H L L + AE+ GA C + VTH VAN L T++
Sbjct: 292 EVLSGCIIVFSCAF-----NGHDLRKLRRIAERLGATCLTELGPTVTHAVANELVTEESM 346
Query: 1201 WALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
WA +F+V+ W+EAS ++ E+++ ++
Sbjct: 347 WAEKENKFLVNRRWLEASNFFLQKQPEENYIVR 379
>gi|125541461|gb|EAY87856.1| hypothetical protein OsI_09278 [Oryza sativa Indica Group]
Length = 420
Score = 234 bits (597), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 134/327 (40%), Positives = 184/327 (56%), Gaps = 18/327 (5%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 974
+ ARKL LV+DLDHTL+NS +F + DE E+ + R LFR M TK
Sbjct: 101 LLRARKLILVVDLDHTLINSTRFAHLSD--DEKANGFTERTGDDRSRGLFRMGLFRMITK 158
Query: 975 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
LRP + FL AS +FEMH+YT+GN+ YAT +AK+LDP G F R+IS G+
Sbjct: 159 LRPFVHEFLREASAMFEMHVYTLGNRNYATAVAKLLDPDGAYFGERIISSGESSQ----- 213
Query: 1035 ERVPKSKDLEGVLG----ME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
P K L V G ME +AVVI+DD+ VW + NLI +ERY YF SR +FG+
Sbjct: 214 ---PDRKSLGDVFGWAPEMERAAVVILDDTAEVWKGYRDNLIEMERYLYFASSRGKFGIA 270
Query: 1090 GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ--SLDDVDVRNILAAEQRKILAGC 1147
SL E + DE +G LA +L V+ R+H FFS S DVR ++ +R++L GC
Sbjct: 271 ARSLAERNRDESEREGALAVALRVLRRVHGEFFSGSVCSGSFADVREVIRQARREVLRGC 330
Query: 1148 RIVFSRVFPVGEAN-PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG 1206
+ F+ V P G+ HP+W+ AEQ GA C + + VTHVVA T K WA + G
Sbjct: 331 TVAFTGVIPSGDGGRASDHPVWRKAEQLGATCADDVGEGVTHVVAGKPVTGKALWAQTHG 390
Query: 1207 RFVVHPGWVEASALLYRRANEQDFAIK 1233
+F+V W+ A+ + + E+ + +K
Sbjct: 391 KFLVDTEWINAAHFRWSKPEERMYPVK 417
>gi|308802952|ref|XP_003078789.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
gi|116057242|emb|CAL51669.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
Length = 457
Score = 230 bits (587), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 187/338 (55%), Gaps = 20/338 (5%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKF----HEVDPVHDEILR-----KKEEQDREKPHR 961
E ++ ARKL LVLDLDHTLLNS E + + + + + E+ + R
Sbjct: 121 ELSRLIKARKLALVLDLDHTLLNSVLVPSLRTEANSLQNAMRLLDHDVARAERTGDPLQR 180
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
F PH ++TKLRPG+ +FLERASKLFE+H+ TMG++ YA +M +LDP G V
Sbjct: 181 SCFHLPHFDLFTKLRPGVRSFLERASKLFEIHISTMGSQAYADQMVALLDPAKKWINGTV 240
Query: 1022 ISRGDDGDPFDGDERVPKSKDLE--GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
G+ + G P+ K L+ G+ + VI DD+ VW N +L ERY +F
Sbjct: 241 KGLGEMEN---GRLIAPRYKSLDDCGLGELTDVSVIFDDTTDVWAQNLKSLFTCERYLFF 297
Query: 1080 PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSL----DDVDVRNI 1135
P +RRQFGLLG SLLE+ DE +G L +++ V E +H +F + +++I
Sbjct: 298 PQARRQFGLLGSSLLEVGQDESESEGMLMTAINVFESVHAEYFKRRDALKGKKSPCMQDI 357
Query: 1136 LAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG 1195
L ++ +L+G +VFSRVFP+ P PLW AE FGA C+ I THVV S
Sbjct: 358 LEERRKVVLSGVHVVFSRVFPL-HVKPEEQPLWILAENFGANCSSEITSHTTHVVGTSKA 416
Query: 1196 TDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
T KV AL G V P W+E S L +RRA+E++F I
Sbjct: 417 TAKVREALKRGGIHAVTPHWLECSMLFWRRASEKNFTI 454
>gi|47497024|dbj|BAD19077.1| phosphatase-like [Oryza sativa Japonica Group]
gi|47497233|dbj|BAD19278.1| phosphatase-like [Oryza sativa Japonica Group]
gi|125584004|gb|EAZ24935.1| hypothetical protein OsJ_08715 [Oryza sativa Japonica Group]
Length = 420
Score = 230 bits (587), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 133/327 (40%), Positives = 183/327 (55%), Gaps = 18/327 (5%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 974
+ ARKL LV+DLDHTL+NS +F + DE E+ + R LFR M TK
Sbjct: 101 LLRARKLILVVDLDHTLINSTRFAHLSD--DEKANGFTERTGDDRSRGLFRMGLFRMITK 158
Query: 975 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
LRP + FL AS +FEMH+YT+GN+ YAT +AK+LDP G F R+IS G+
Sbjct: 159 LRPFVHEFLREASAMFEMHVYTLGNRNYATAVAKLLDPDGAYFGERIISSGESSQ----- 213
Query: 1035 ERVPKSKDLEGVLG----ME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
P K L V G ME +AVVI+DD+ VW + NLI +ERY YF SR +FG+
Sbjct: 214 ---PDRKSLGDVFGWAPEMERAAVVILDDTAEVWKGYRDNLIEMERYLYFASSRGKFGIA 270
Query: 1090 GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ--SLDDVDVRNILAAEQRKILAGC 1147
SL E + DE +G LA +L V+ R+H FFS S DVR ++ +R++L GC
Sbjct: 271 VRSLAERNRDESEREGALAVALRVLRRVHGEFFSGSVCSGSFADVREVIRQARREVLRGC 330
Query: 1148 RIVFSRVFPVGEAN-PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG 1206
+ F+ V P G+ HP+W+ AEQ GA C + + VTH VA T K WA + G
Sbjct: 331 TVAFTGVIPSGDGGRASDHPVWRRAEQLGATCADDVGEGVTHFVAGKPVTRKALWAQTHG 390
Query: 1207 RFVVHPGWVEASALLYRRANEQDFAIK 1233
+F+V W+ A+ + + E+ + +K
Sbjct: 391 KFLVDTEWINAAHFRWSKPEERMYPVK 417
>gi|308802003|ref|XP_003078315.1| CTD phosphatase-like protein 3 (ISS) [Ostreococcus tauri]
gi|116056766|emb|CAL53055.1| CTD phosphatase-like protein 3 (ISS) [Ostreococcus tauri]
Length = 480
Score = 230 bits (587), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 139/349 (39%), Positives = 200/349 (57%), Gaps = 30/349 (8%)
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD------RE 957
E+ +R EE++++ KL L+LDLDHTLLNSA+F E+ ++L + Q+ RE
Sbjct: 142 EKAKR-EEKERVLKDGKLTLILDLDHTLLNSAQFKELTQEQHDLLHQCIAQEANGLAERE 200
Query: 958 KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
+P + R HMG +TKLRP ++ FLE S++ + ++YTMG+K YA EM K++DP+G +F
Sbjct: 201 RPMLYCLR--HMGFFTKLRPHVFEFLEEVSQICQPYVYTMGDKAYAKEMVKLIDPEGKIF 258
Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
GRVIS + D KDL+ VLG E++ VI+DD+ RVWP N NLI ++RY
Sbjct: 259 HGRVIS--------NNDSTSSHVKDLDIVLGGETSAVIVDDTERVWPANHGNLIRLDRYH 310
Query: 1078 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLASS------LGVIERLHKIFFSHQSLDDVD 1131
+FP S F G S++E + E G++ + L VI+ H+ +F H S+++ D
Sbjct: 311 FFPSSAASFQQKGQSVMERSMVDEGELGSMGARAVLLDVLAVIQSAHRSYFKHASIEEPD 370
Query: 1132 VRNILAAEQRKI--LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID---DQV 1186
VR +L R L+G + V S V P+ + NP HPL A GA I+ D V
Sbjct: 371 VRTLLVKPDRIDLPLSGVKFVMSGVTPLSDRNPERHPLRLLASTLGAEFVSSIERDGDSV 430
Query: 1187 THVVANSLGTDKVNWALSTGR--FVVHPGWVEASALLYRRANEQDFAIK 1233
THV+A S GTDKV WA TG +V P W+ A A R +E + I+
Sbjct: 431 THVIARSSGTDKVKWAKKTGGRVLIVEPSWLVACAQANTRVSESLYPIE 479
>gi|226498568|ref|NP_001149751.1| CPL3 [Zea mays]
gi|195631558|gb|ACG36674.1| CPL3 [Zea mays]
Length = 493
Score = 227 bits (579), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 179/321 (55%), Gaps = 16/321 (4%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM--- 971
+ RKL LVLDLD TL+NSA+ D E +KPH LFR +
Sbjct: 156 LMRERKLILVLDLDSTLVNSARL--CDFSAQEKRNGFTRYTGDKPHMDLFRLKYSNKARK 213
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
TKLRP + FLE+AS +FEMH+YT+ + YA + +LDP GV F GRV+SR D
Sbjct: 214 LTKLRPFVRGFLEQASSMFEMHVYTLAKRAYAKAVIDLLDPNGVYFGGRVVSRKD----- 268
Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 1091
R KS D+ + G + V+I D VWP ++ NLI+++RY YF + R+F P
Sbjct: 269 -STRRDMKSLDV--IPGADPVAVVILDDTDVWPAHQDNLILMDRYHYFASTCRKFRYDIP 325
Query: 1092 SLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVF 1151
SL E DER +D +LA L V+ R+H+ FF D DVR ++ +R++L C + F
Sbjct: 326 SLAEQGRDEREQDNSLAVVLNVLRRIHQDFFDG---DQADVREVIREVRRQVLPECTVAF 382
Query: 1152 SRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVH 1211
S + E P +W AE+ GAVC K +D+ VTHVVA GT K WA G+F+V+
Sbjct: 383 SYLDDCMEDFPENTLMWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQWARDHGKFLVN 442
Query: 1212 PGWVEASALLYRRANEQDFAI 1232
P W++AS + R +EQ F +
Sbjct: 443 PEWIKASGFRWCRVDEQGFPV 463
>gi|413924219|gb|AFW64151.1| hypothetical protein ZEAMMB73_480827 [Zea mays]
Length = 490
Score = 227 bits (579), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 179/321 (55%), Gaps = 16/321 (4%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM--- 971
+ RKL LVLDLD TL+NSA+ D E +KPH LFR +
Sbjct: 153 LMRERKLILVLDLDSTLVNSARL--CDFSAQEKRNGFTRYTGDKPHMDLFRLKYSNKARK 210
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
TKLRP + FLE+AS +FEMH+YT+ + YA + +LDP GV F GRV+SR D
Sbjct: 211 LTKLRPFVRGFLEQASSMFEMHVYTLAKRAYAKAVIDLLDPNGVYFGGRVVSRKD----- 265
Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 1091
R KS D+ + G + V+I D VWP ++ NLI+++RY YF + R+F P
Sbjct: 266 -STRRDMKSLDV--IPGADPVAVVILDDTDVWPAHQDNLILMDRYHYFASTCRKFRYDIP 322
Query: 1092 SLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVF 1151
SL E DER +D +LA L V+ R+H+ FF D DVR ++ +R++L C I F
Sbjct: 323 SLAEQGRDEREQDNSLAVVLNVLRRIHQDFFDG---DQADVREVIREVRRQVLPECTIAF 379
Query: 1152 SRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVH 1211
S + E P +W AE+ GAVC K +D+ VTHVVA GT K WA G+F+V+
Sbjct: 380 SYLDDCMEDFPENTLMWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQWARDHGKFLVN 439
Query: 1212 PGWVEASALLYRRANEQDFAI 1232
P W++AS + R +EQ F +
Sbjct: 440 PEWIKASGFRWCRVDEQGFPV 460
>gi|357501219|ref|XP_003620898.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355495913|gb|AES77116.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 720
Score = 227 bits (579), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 135/332 (40%), Positives = 196/332 (59%), Gaps = 19/332 (5%)
Query: 903 KERTR-RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
KE +R R + K + + RKLCLVLDLDHTLLN+ H + P +E+ K E +
Sbjct: 198 KEISRVRSRDVKNLLNRRKLCLVLDLDHTLLNTTSLHRLSP--EEMHLKTHTDSLEDISK 255
Query: 962 -HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
LF H+ + TKLRP + TFL+ AS++FEM++YTMG++ Y+ EMA++LDP+G F +
Sbjct: 256 GSLFMLEHVQVMTKLRPFVRTFLKEASEMFEMYIYTMGDRQYSLEMARLLDPQGEYFKDK 315
Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
VISR DDG KDL+ VLG E+++VI+DD VWP + NLI++ERY +F
Sbjct: 316 VISR-DDGTQ-------KNVKDLDLVLGTENSIVILDDKEEVWPKYRDNLILMERYHFFN 367
Query: 1081 CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD--DVDVRNILAA 1138
S + FGL SL ++ DE DG LA L V+ +++ FF D D DVR +L++
Sbjct: 368 SSCQDFGLQCKSLAALNIDENEIDGALAKILEVLRQINYKFFDELQGDLVDRDVRQVLSS 427
Query: 1139 EQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDK 1198
+ ++L GC IVFS F + L L + AE+ GA C K +D VTHV+ T +
Sbjct: 428 FRGEVLRGCVIVFSLNF-----HGDLRILRRIAERLGATCLKKLDPTVTHVIGTDFVTKE 482
Query: 1199 VNWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
WA+ +F+V W+EA+ ++ E++F
Sbjct: 483 SRWAVQEKKFLVSRRWLEAANFFLQKQPEENF 514
>gi|147774299|emb|CAN76945.1| hypothetical protein VITISV_002430 [Vitis vinifera]
Length = 641
Score = 222 bits (566), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 118/245 (48%), Positives = 156/245 (63%), Gaps = 12/245 (4%)
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVIS+ D
Sbjct: 1 MHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQAD-- 58
Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
+R K L+ VLG ESAV+I+DD+ VW +K NLI++ERY +F S RQFG
Sbjct: 59 ----CTQR--HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 112
Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAEQRKILA 1145
SL E+ DE DG LA+ L V++R+H +FF + DD DVR ++ ++ +L
Sbjct: 113 NCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKDVLK 172
Query: 1146 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 1205
GC+IVFSRVFP H LW+ AEQ GA C +D VTHVV+ GT+K WAL
Sbjct: 173 GCKIVFSRVFPT-RFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQE 231
Query: 1206 GRFVV 1210
+F+V
Sbjct: 232 KKFLV 236
>gi|302816075|ref|XP_002989717.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
gi|302824047|ref|XP_002993670.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
gi|300138493|gb|EFJ05259.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
gi|300142494|gb|EFJ09194.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
Length = 312
Score = 211 bits (536), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 181/319 (56%), Gaps = 14/319 (4%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF-RFPHMGMWTKLRP 977
RKL LVLDLDHTL+NSA F EV L +D K L + + +WTK+RP
Sbjct: 4 RKLMLVLDLDHTLVNSASFDEVCAEEKPFLESMYARDPPKGRSKLLHKLDDLQLWTKIRP 63
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1037
FL +ASKLF++++YTMG ++YA M K+LDP GVLF G ++SR +D D D +R
Sbjct: 64 FALEFLAQASKLFDLYVYTMGTRIYAEAMLKLLDPTGVLFKG-LVSR-NDNDLTDHRDR- 120
Query: 1038 PKSKDLEGVLGMESAVVIIDDSVRVWPHNK-LNLIVVERYTYFPCSRRQFGL-LGPSLLE 1095
KDL+ VLG ES+V+I+DD WP + NLI ++RY +F S + FG SL
Sbjct: 121 ---KDLDTVLGQESSVLIVDDLPEAWPEEQHKNLIQIDRYHFFSSSCKSFGFDESSSLAR 177
Query: 1096 IDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV-DVRNILAAEQRKILAGCRIVFSRV 1154
DE G+LAS L +E +H+ FF + + DVR+ ++ + IL GC++ FS V
Sbjct: 178 RGIDESHSGGSLASLLQGLETIHRDFFQYGEFSFLEDVRDTVSELRSHILEGCKLAFSSV 237
Query: 1155 FPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGW 1214
P+ + LW E GA C IDD VTHVVA + + WA+ G+ +V+P W
Sbjct: 238 VPIDCED----SLWILCEGLGAECVLEIDDSVTHVVAMDPESARARWAVENGKHLVNPSW 293
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ A+A R E +F ++
Sbjct: 294 MRAAAFRLGRPRESEFQVR 312
>gi|242066826|ref|XP_002454702.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
gi|241934533|gb|EES07678.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
Length = 462
Score = 204 bits (518), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 188/325 (57%), Gaps = 24/325 (7%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP--HMGMW 972
+F RKL LVLDLD TLLNSA+ + V +E + +K +FR ++GM
Sbjct: 124 LFRERKLILVLDLDRTLLNSARL-DAFSVGEEWFGFTPDTG-DKVDMDIFRLDSDNLGML 181
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
TKLRP + R S +FEMHLYT+GN +YA +LDP GV F GRV+SR D+
Sbjct: 182 TKLRPFV-----RGS-MFEMHLYTLGNLVYAKAAIHLLDPNGVYFGGRVVSRDDESTQ-- 233
Query: 1033 GDERVPKSKDLEGVLGME--SAVVI--IDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
+K L+ + G + +AV++ +DD+ WP ++ NLI+ RY YF + R+
Sbjct: 234 -----GGTKSLDVIPGADPVAAVILDALDDTDVAWPEHQDNLILTNRYRYFASTCRKSRH 288
Query: 1089 LGPSLLEIDHDERSE-DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGC 1147
PSL E+ DE+ E G+LA +LGV++R+H FF + DVR ++A + ++L GC
Sbjct: 289 DIPSLAELRRDEKGEHGGSLAVALGVLKRVHDAFFDGRP--HADVREVIAELRGQVLRGC 346
Query: 1148 RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR 1207
+ FS + E +P LW AE+ GAVC K +D+ VTHVVA GT K WA G+
Sbjct: 347 TVAFSYLEQRMEDSPDDTRLWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQWAREHGK 406
Query: 1208 FVVHPGWVEASALLYRRANEQDFAI 1232
F+V+P W++A++ + R + Q+F +
Sbjct: 407 FLVNPEWIKAASFRWCRQDPQEFPV 431
>gi|255081919|ref|XP_002508178.1| predicted protein [Micromonas sp. RCC299]
gi|226523454|gb|ACO69436.1| predicted protein [Micromonas sp. RCC299]
Length = 318
Score = 202 bits (515), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 169/290 (58%), Gaps = 23/290 (7%)
Query: 961 RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
R L + +WTKLRPG+ FL + + +FE+H+ TMG + YA EM +++DP G
Sbjct: 25 RTLHFVERLQIWTKLRPGVKKFLRQVASMFEVHVITMGTQSYADEMRQLIDPGRQHIKGS 84
Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
VI G D F G+ + K L+G L G++S V++DD V VWP ++ NLI ++RY YF
Sbjct: 85 VIGLGQ-MDEF-GELQPADKKRLDGELSGLDSIAVVLDDHVGVWPDHEENLIEIDRYLYF 142
Query: 1080 PCSRRQFGLL--GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS----------- 1126
P + +QFG+ G SLLE DE ++ TLA++ V+ R+H+ FF+ ++
Sbjct: 143 PSALKQFGVWRNGASLLEKKVDEIADRSTLAAAFEVLRRVHQDFFAERAGHLALANKKAK 202
Query: 1127 ------LDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTK 1180
L V V ILA E++K+LAG +VFS VFP+ +A PH +W+ AEQFGA C
Sbjct: 203 DAAMAELARVTVPEILALEKKKVLAGTGLVFSGVFPL-DAPPHEQKMWRLAEQFGARCET 261
Query: 1181 HIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
+HVVA + GT K WA GR VV P W+ SA L+ +A+E+ F
Sbjct: 262 QPGPNTSHVVAKTWGTGKCQWAKENGRHVVSPDWLFCSAFLWSKADERAF 311
>gi|424513770|emb|CCO66392.1| predicted protein [Bathycoccus prasinos]
Length = 546
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 137/413 (33%), Positives = 199/413 (48%), Gaps = 72/413 (17%)
Query: 876 GAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA 935
GA+ ++A V +L EG K R + EE+ + KL LVLDLDHTLLNS
Sbjct: 147 GANAETA---VRYLHEGLTVSDKLL----REAKNEERMATLNQGKLFLVLDLDHTLLNSC 199
Query: 936 KFHEVDPVHDEIL-RKKEEQDREKPHRH---------------------LFRFPHMGMWT 973
+F E++ E L RK E+++ E R L+ H +T
Sbjct: 200 RFDELNDEERESLDRKVEKREEEDELRSKLLGLVGGGDAGGGRRPRFPDLYCLSHFSTYT 259
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
KLRP ++ FLE+ASK+ MH+YTMG+K YA EMA ++DP+G F GR+I +
Sbjct: 260 KLRPYVFEFLEQASKICRMHVYTMGDKNYAHEMASLIDPEGKYFHGRIIG--------NS 311
Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG-LLGPS 1092
D K+KDL+ VLG + +I+DD+ RVWP + NLI V+RY +F S F + S
Sbjct: 312 DSTCSKTKDLDIVLGGDDCTMIVDDTSRVWPRHARNLIRVDRYHFFRKSATSFREMEKSS 371
Query: 1093 LLEIDHDERSEDG---------TLASSLGVIERLHKIF-FS----------HQSLDDVDV 1132
++E DE + L L V+ H++ FS H+ + V
Sbjct: 372 VMERGLDEGEAEEEGAPAKHREVLKDVLAVLTVAHRMMAFSDGSGHREKKDHKDVRQVFE 431
Query: 1133 RNILAAEQRK--------------ILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVC 1178
RN +A+E +L GC I+ S + P + P HPL A GA
Sbjct: 432 RNGMASEGENGGNVESVDPASSSLLLKGCVILPSGITPSNDERPDRHPLLLVAVGLGATI 491
Query: 1179 TKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
++D VTHV+A + T+KV W G F+V+ W+ A+ + +E +FA
Sbjct: 492 ATAMNDNVTHVLARADNTEKVKWGRKRGLFIVNGNWLRECAMQNAKLDEHNFA 544
>gi|302793512|ref|XP_002978521.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
gi|300153870|gb|EFJ20507.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
Length = 346
Score = 186 bits (473), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 128/331 (38%), Positives = 175/331 (52%), Gaps = 23/331 (6%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH--LFR 965
R E +K+ +KL LVLDLDHTLLNSA F +VD L K + + P R L +
Sbjct: 25 RKEYTQKVLQQQKLILVLDLDHTLLNSASFSKVDEEERLYLEKIYDWQEKAPKRRKLLHK 84
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
+ +WTK+RP + FLE ASK F++H+YT G ++YA MAK+LDP G LF G + SR
Sbjct: 85 VESLQVWTKIRPFAFKFLEEASKFFDLHIYTNGREIYAETMAKLLDPTGSLFKGHIFSR- 143
Query: 1026 DDGDPFDGDERVPKS-KDLEGVLGMESAVVIIDDSVRVWP--HNKLNLI-VVERYTYFPC 1081
D K+ KDL+ V G ES +I+DDS VWP H+K NLI V +RY +F
Sbjct: 144 --------DHNCMKAMKDLDTVPGDESITLIVDDSDCVWPKKHHK-NLIPVYDRYLFFRS 194
Query: 1082 SRRQFGLL-GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 1140
S FGL SL DE + TLA L ++R+H FF DVR + +
Sbjct: 195 STGLFGLRESSSLTSKKKDEVATKATLAKLLEGLKRIHSEFFQESGCFAGDVRQTMREVK 254
Query: 1141 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTH-VVANSLGTDKV 1199
L+GC+IV + +A L LW + ++ GA C IDD VTH VVA+ +
Sbjct: 255 GHALSGCKIV---ICAKSQAAHEL--LWDSCQELGAECVVDIDDTVTHVVVASKQQPQGL 309
Query: 1200 NWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
+ G+++V P W+ + R +E F
Sbjct: 310 ELSAQAGKYLVWPSWIHTAHYRCCRPDEAAF 340
>gi|255080370|ref|XP_002503765.1| predicted protein [Micromonas sp. RCC299]
gi|226519032|gb|ACO65023.1| predicted protein [Micromonas sp. RCC299]
Length = 574
Score = 186 bits (472), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 145/448 (32%), Positives = 191/448 (42%), Gaps = 128/448 (28%)
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV----------------------- 940
E+ +R E+++ + S R L LVLDLDHTLLNSA+F E+
Sbjct: 100 EKAKREEKRRILLSGR-LVLVLDLDHTLLNSARFSELSQEEHYAMHRIIAAADCEANGGS 158
Query: 941 ---------------DPVHDEILRKKEE-----------------QDREKPH-------R 961
DPV E +KE+ + R+ P R
Sbjct: 159 KEEVQQAAAAIQPVEDPVAAESAEEKEDGADVDGEKGEEEAGGKERARDGPFPGTDPPLR 218
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
HL HM M+TKLRP FL AS+L M++YTMG++ YA EMAK+LDP G LF GRV
Sbjct: 219 HLNCLRHMAMFTKLRPHAHAFLRAASQLCTMYIYTMGDRNYAREMAKLLDPTGELFNGRV 278
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 1081
I GD + KDL+ VLG E V+I DD+ RVWP N NLI ++RY +F
Sbjct: 279 IGSGDSTSQY--------KKDLDIVLGAEPTVLITDDTDRVWPKNLANLIRIDRYHFFKQ 330
Query: 1082 SRRQFGLLGPSLLEIDHDERSEDG---TLASSLGVIERLHKIFF---------------- 1122
S F G S++E + ++G L L VI H+ FF
Sbjct: 331 SAAGFRQPGRSVMERQWRDEGDNGDRAQLRDVLAVIAAAHRRFFEGTAAANTADDAIAEM 390
Query: 1123 ----------------------------SHQSLDDVDVRNILAAEQRKILAGCRIVFSRV 1154
+ L+ DVR +L + LA R+VFSRV
Sbjct: 391 DASMLRSASATEGGDIRQTNHKPVSDEEASTKLESRDVRRLLTVPEDGPLANVRVVFSRV 450
Query: 1155 FPVGEANPHLHPLWQTAEQFGAVCTKHIDD--QVTHVVANS-------LGTDKVNWALST 1205
E P HPLW A GA +DD THVVA++ TDKV WA T
Sbjct: 451 VAQSEPRPERHPLWLLATALGAEVLNAVDDGKGATHVVAHAGTGDGDGRRTDKVKWAAKT 510
Query: 1206 -GRFVVHPGWVEASALLYRRANEQDFAI 1232
G V W+ A + R +E +++
Sbjct: 511 PGASAVSADWLAKCADEWARVDESRYSL 538
>gi|326510557|dbj|BAJ87495.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 384
Score = 183 bits (465), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 99/215 (46%), Positives = 132/215 (61%), Gaps = 9/215 (4%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
R + K + RKL L+LDLDHTL+NS K H++ + L + ++ P+ LF
Sbjct: 152 RGSDLKNLLRERKLILILDLDHTLINSTKLHDISAAENN-LGIQAAASKDDPNGSLFTLE 210
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
M M TKLRP + FL+ AS +FEM++YTMG+K YA E+AK+LDP+ V F +VIS D
Sbjct: 211 GMQMLTKLRPFVRKFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPRNVYFNSKVISNSD- 269
Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
+R K D+ VLG ES VI+DD+ VW +K NLI++ERY YF S RQFG
Sbjct: 270 -----CTQRHQKGLDM--VLGAESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFG 322
Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF 1122
SL E+ DER DG LA+ L V++R+H IFF
Sbjct: 323 FSVKSLSELMQDERGSDGALATILDVLKRIHTIFF 357
>gi|302774062|ref|XP_002970448.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
gi|300161964|gb|EFJ28578.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
Length = 346
Score = 183 bits (464), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 127/331 (38%), Positives = 174/331 (52%), Gaps = 23/331 (6%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH--LFR 965
R E +K+ +KL LVLDLDHTLLNSA F +VD L K + + P R L +
Sbjct: 25 RKEYTQKVLQQQKLILVLDLDHTLLNSASFSKVDEEERLYLEKIYDWQEKAPKRRKLLHK 84
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
+ +WTK+RP + FLE ASK F++H+YT G ++YA MAK+LDP G LF G + SR
Sbjct: 85 VESLQVWTKIRPFAFKFLEEASKFFDLHIYTNGREIYAETMAKLLDPTGSLFKGHIFSR- 143
Query: 1026 DDGDPFDGDERVPKS-KDLEGVLGMESAVVIIDDSVRVWP--HNKLNLI-VVERYTYFPC 1081
D K+ KDL+ V G ES +I+DDS VWP H+K NLI V ++Y +F
Sbjct: 144 --------DHNCMKAMKDLDTVPGDESITLIVDDSDYVWPKKHHK-NLIPVYDQYRFFRS 194
Query: 1082 SRRQFGLL-GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 1140
S FGL SL DE + TLA L ++R+H FF DVR + +
Sbjct: 195 STGLFGLRESSSLTSKKKDEVATKATLAKLLEGLKRIHSEFFQEYGCFAGDVRQTMREVK 254
Query: 1141 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTH-VVANSLGTDKV 1199
L+GC+IV + +A L LW + + GA C IDD VTH VVA+ +
Sbjct: 255 GHALSGCKIV---ICAKTQAAHEL--LWDSCQALGAECVVDIDDTVTHVVVASKQQPQGL 309
Query: 1200 NWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
+ G+++V P W+ + R +E F
Sbjct: 310 ELSAQAGKYLVWPSWIHTAHYRCCRPDEAAF 340
>gi|303276827|ref|XP_003057707.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460364|gb|EEH57658.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 692
Score = 182 bits (463), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 140/232 (60%), Gaps = 15/232 (6%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDP--VHDEILR-KKEEQDREKPHRHLFRFPHMGM 971
+ R+L LVLDLDHTLLNS F D + +L ++ E ++ R L R H+G+
Sbjct: 299 LLDRRRLTLVLDLDHTLLNSESFESKDGGRLQRGLLEIERLESTKDSNDRTLHRLNHIGL 358
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
WTKLRPG+ TFL +AS +FE+H+ TMG++ YA + ++LDP + G VI G F
Sbjct: 359 WTKLRPGVQTFLHKASAMFEIHISTMGSQPYADSIRRLLDPCRNVIKGSVIGLGG----F 414
Query: 1032 D--GDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
D G + P K LEGVL G E A VI+DD+ VW NLIV ERY YFP + + FG+
Sbjct: 415 DEFGAFKSPPQKKLEGVLAGTEPAAVILDDTAEVWTGYSENLIVCERYMYFPSACKNFGV 474
Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 1140
+GPSLLE DE + GTLA+ L V+ R+H FF+ ++ R +LA +Q
Sbjct: 475 VGPSLLERGVDESEKSGTLATVLEVLTRVHSEFFARRA-----ARQVLAEKQ 521
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 60/109 (55%), Gaps = 5/109 (4%)
Query: 1130 VDVRNILAAEQRKILAGCRIVFSRVFPVGEAN--PHLHPLWQTAEQFGAVCTKHIDDQVT 1187
V V +LA E++ +L G IVFS VF + P HPLW+ AE+ GA T
Sbjct: 581 VTVPKLLADERKNVLRGVEIVFSGVFDHNDKTLTPREHPLWRLAERLGARVVSEPGTSTT 640
Query: 1188 HVVANSLGTD---KVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
HVVA +GTD K W L + VV P WVEASA+++ R +E F +K
Sbjct: 641 HVVAKCVGTDLTHKAMWGLQYLKHVVSPAWVEASAMMWTRQDEGRFPVK 689
>gi|242093894|ref|XP_002437437.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
gi|241915660|gb|EER88804.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
Length = 271
Score = 167 bits (422), Expect = 5e-38, Method: Composition-based stats.
Identities = 112/317 (35%), Positives = 156/317 (49%), Gaps = 61/317 (19%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH---MGMWTKL 975
RKL LVLDLDHTLLNS + H+ D E ++ H LFR + + M TKL
Sbjct: 8 RKLILVLDLDHTLLNSTRLHQ-DLSALEQRNGFTPDTEDELHMELFRLEYSDNVRMLTKL 66
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
RP + FLE+AS +T +DP V
Sbjct: 67 RPFVRGFLEQAS------------SRASTSSRAPIDPAAV-------------------- 94
Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 1095
VI+DD+ WP ++ NLI+++RY YF C+ R+F PS+ E
Sbjct: 95 ------------------VILDDTDSAWPGHQDNLILMDRYHYFACTCRKFRYNIPSMAE 136
Query: 1096 IDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVF 1155
DER DG+LA LGV+ R+H+ FF D DVR ++A +R++L C +VFS +
Sbjct: 137 QARDEREHDGSLAVVLGVLNRIHQAFFDD---DRADVREVIAEVRRQVLPVCTVVFSYL- 192
Query: 1156 PVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 1215
E P +W AE+ GA C K +D+ VTHVVA GT K WA G+F+V+P W+
Sbjct: 193 ---EDFPEDTLMWTLAERLGAACQKDVDETVTHVVAEDPGTQKAQWAREHGKFLVNPEWI 249
Query: 1216 EASALLYRRANEQDFAI 1232
+A + R +E+DF +
Sbjct: 250 KAVNFRWCRVDERDFPV 266
>gi|145344421|ref|XP_001416731.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576957|gb|ABO95024.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 248
Score = 166 bits (420), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 102/248 (41%), Positives = 151/248 (60%), Gaps = 28/248 (11%)
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV-----DPVHDEILRKKEE-QDRE 957
E+ +R EE+ ++ KL L+LDLDHTLLNS +F E+ D +H+ I R+ E ++ +
Sbjct: 14 EKAKR-EEKARVLQNGKLTLILDLDHTLLNSTQFKELTQEQHDLLHECIAREAEGLKEGQ 72
Query: 958 KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
+P + R HMG +TKLRP ++ FLE SK+ + ++YTMG+K YA EM K++DP+G +F
Sbjct: 73 RPMLYCLR--HMGFFTKLRPHVFEFLESVSKICQPYVYTMGDKPYAREMVKLIDPEGTIF 130
Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
GRVIS + D KDL+ VLG E++ +I+DD+ RVWP N+ NLI ++RY
Sbjct: 131 HGRVIS--------NNDSTSSHVKDLDIVLGGEASAIIVDDTERVWPQNQGNLIRLDRYH 182
Query: 1078 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLASS------LGVIERLHKIFFSHQSLDDV- 1130
+FP S F G S++E + E G++ S L VIE +H+ FF ++ DD
Sbjct: 183 FFPGSASSFQQKGQSVMESSMVDEGELGSVGSRAVLLDVLAVIESVHRSFF--KNTDDGE 240
Query: 1131 --DVRNIL 1136
DVR +L
Sbjct: 241 EPDVRKLL 248
>gi|302838991|ref|XP_002951053.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f. nagariensis]
gi|300263748|gb|EFJ47947.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f. nagariensis]
Length = 699
Score = 165 bits (417), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 116/321 (36%), Positives = 160/321 (49%), Gaps = 67/321 (20%)
Query: 926 DLDHTLLNSAKFHEVDPVHD----EILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
DLDHTLLNS EV P E+LR++EE + P R L R +WTKLRPG++
Sbjct: 377 DLDHTLLNSVHTSEVGPDTATQLAEVLRREEEANL-GPRRLLHRLAENKLWTKLRPGVFE 435
Query: 982 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 1041
FLE +EMH+YTMG+K YA E+ K+LDP G LF+ VI++ +K
Sbjct: 436 FLEGLRDDYEMHIYTMGDKTYAAEVRKLLDPTGKLFSS-VIAK--------DHSTTATAK 486
Query: 1042 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDER 1101
DL+ +L + +++DD+ VWP ++ NL+ D DE
Sbjct: 487 DLDVLLSADELALVLDDTEAVWPGHRRNLLQ------------------------DSDES 522
Query: 1102 SEDGTLASSLGVIERLHKIFFSHQS-----LDDVDVRNILAAEQRKIL----AGCRIVFS 1152
+ DG LA+ + V+ +H FFS L+ DVR+IL+ ++R+IL GC I FS
Sbjct: 523 ATDGALAAHMRVLRAVHTRFFSADDPSLPPLERRDVRDILSEQRREILQLMPQGCCITFS 582
Query: 1153 RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVVANSLGTDKVNWALSTGRFVVH 1211
R +P + NP PLWQ A GA C D V THVVA + GT+K
Sbjct: 583 RCWPQ-DRNPLREPLWQLAMSLGANCLTTYDPGVTTHVVAAAGGTEK------------- 628
Query: 1212 PGWVEASALLYRRANEQDFAI 1232
A + R E+DFA+
Sbjct: 629 -----GCAHRWTRLPEEDFAV 644
>gi|325179818|emb|CCA14221.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 694
Score = 155 bits (391), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 120/349 (34%), Positives = 173/349 (49%), Gaps = 51/349 (14%)
Query: 913 KKMFSARKLCLVLDLDHTLLNS---AKFHEVDPV-HDEILRKKEEQDREKPHRHLFRFP- 967
++ A+KL LVLDLDHTLL++ A E P DEI H F+ P
Sbjct: 147 ERQLIAKKLSLVLDLDHTLLHAVYVADLLEQRPTASDEI--------------HYFKIPG 192
Query: 968 --HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
M KLRPG+ FL+ + +++ +YT G ++YA +A+++DP LF R+++R
Sbjct: 193 VMTMEYVVKLRPGLHQFLKSLREQYDLFIYTHGTRIYAEAIAEIIDPDDTLFRHRIVART 252
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
D D D KS L +S ++I+DD + VW N+ N+++++ + +F C+
Sbjct: 253 DTPD-IDH-----KSLKLLFPSCDDSMILILDDRLDVWKENEGNVLLIKPFHFFNCTAEI 306
Query: 1086 FGLLG----PSLLEIDHDE-----RSEDGTLASSLGVIERLHKIFFS----------HQS 1126
G PS + D D L +++R+H+ F+ S
Sbjct: 307 NNAPGETISPSASSQNQDSDPVEPTKMDTDFEYILKILQRVHQAFYQCTKAGRTAEEQMS 366
Query: 1127 LDDVDVRNILAAEQRKILAGCRIVFSRVFPVGE-ANPHLHPLWQTAEQFGAVCTKHIDD- 1184
DV+ ILA EQRKIL GC IVFS VFPV + P H LW+ A GAV + IDD
Sbjct: 367 GRGNDVKQILAQEQRKILQGCFIVFSGVFPVSDPRGPKSHSLWRLAADMGAVPSLVIDDF 426
Query: 1185 QVTHVVAN--SLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 1230
VTH+V + LGT K A T VV P W+ SA ++ RA E D+
Sbjct: 427 PVTHLVIHPMRLGTQKHVKARETPHIHVVTPDWLVRSARIWHRAPESDY 475
>gi|341882050|gb|EGT37985.1| hypothetical protein CAEBREN_32558 [Caenorhabditis brenneri]
Length = 673
Score = 154 bits (388), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 106/346 (30%), Positives = 174/346 (50%), Gaps = 53/346 (15%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP--H 968
++ + S RKL L++DLD T+++++ K +D EK H+ + R+ H
Sbjct: 134 DETNLVSNRKLVLLVDLDQTIIHTSD-------------KPMSEDSEK-HKDITRYGLNH 179
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
TKLRP FL + + ++EMH+ T G + YA ++A++LDP+ LF R++SR
Sbjct: 180 RKYITKLRPHTTEFLNKMATMYEMHIVTYGQRQYAHKIAQILDPEARLFGQRILSR---D 236
Query: 1029 DPFDGDERVPKSKDLEGVLGMESA--------VVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+ F K+++L+ ++ + A VVIIDD VW ++ LI ++ Y +F
Sbjct: 237 ELFSAQH---KTRNLKVIILFQKALFPCGDNLVVIIDDRADVWMYSDA-LIQIKPYRFFK 292
Query: 1081 ---------CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD-- 1129
S+ Q P +E D+ ED L V+ +H ++ L
Sbjct: 293 EVGDINAPQNSKEQM----PVQIE---DDAHEDKVLEEIERVLTNIHDKYYEKYDLKGSD 345
Query: 1130 ---VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV 1186
+DV+ ++ E+RK+L GC IVFS + P GE +++ +QFGA + DQV
Sbjct: 346 QCLLDVKEVIKEERRKVLDGCVIVFSGIVPTGEKLERT-DIYRLCQQFGATILPEVTDQV 404
Query: 1187 THVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
TH+V GT K++ AL G+FVV WV A + +A+E+ F +
Sbjct: 405 THIVGARYGTQKIHQALRLGKFVVTVQWVYACVEKWMKADEKLFEL 450
>gi|125541462|gb|EAY87857.1| hypothetical protein OsI_09279 [Oryza sativa Indica Group]
Length = 390
Score = 152 bits (384), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/330 (34%), Positives = 164/330 (49%), Gaps = 40/330 (12%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 974
+ ARKL LV+DLDHTL+NS +++ E + E H
Sbjct: 89 LLRARKLILVVDLDHTLVNSTADYDISGT--EYVNGLAELLVLGVHHQA---------QA 137
Query: 975 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
+RP W + + +YT+G++ YA +AK+LDP+GV F R+ISR D P
Sbjct: 138 VRP--WLPARSERHVRDARVYTLGDRDYAAAVAKLLDPEGVYFGERIISR--DESP---- 189
Query: 1035 ERVPKSKDLEGVLG-------MESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
P K L+ V G +AVVI+DD+ VW N NLI +ERY YF S R FG
Sbjct: 190 --QPDRKSLDVVFGSAPASAAERAAVVILDDTAEVWEGNSDNLIEMERYHYFASSCRDFG 247
Query: 1088 LLGPSLLEIDHD--ERSEDGT-LASSLGVIERLHKIFFSHQSLDDV-DVRNILAAEQRKI 1143
S E H ER D + A++L V+ R+H FF+ V DVR ++ +R++
Sbjct: 248 ----SPWECTHSLSERGVDESERAAALRVLRRVHAGFFAGGGGSFVADVREVIRRTRREV 303
Query: 1144 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 1203
L GC + F+R A+ H +W+ EQ GA C + VTHVVA + T K WA
Sbjct: 304 LRGCTVAFTRAI----ASDDHHSVWRRTEQLGATCADDVGPAVTHVVATNPTTFKAVWAQ 359
Query: 1204 STGRFVVHPGWVEASALLYRRANEQDFAIK 1233
G+F+V+P W+ + + + E+ F ++
Sbjct: 360 VFGKFLVNPEWINTAHFRWSKPKEEHFPVR 389
>gi|302769312|ref|XP_002968075.1| hypothetical protein SELMODRAFT_67516 [Selaginella moellendorffii]
gi|300163719|gb|EFJ30329.1| hypothetical protein SELMODRAFT_67516 [Selaginella moellendorffii]
Length = 141
Score = 151 bits (381), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 101/149 (67%), Gaps = 8/149 (5%)
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
KLRP FLE ASKLFEM++YTMG ++YA MA +LDP G F GRVIS+ D
Sbjct: 1 KLRPFAHKFLEEASKLFEMYVYTMGERMYAVTMAHLLDPTGKFFKGRVISQRDST----- 55
Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1093
++KDL+ VLG +SAV+I+DD+ VWP ++ NLIV+ERY +F S RQFGL PSL
Sbjct: 56 ---CRQTKDLDIVLGADSAVLILDDTEAVWPKHRANLIVMERYHFFQSSCRQFGLENPSL 112
Query: 1094 LEIDHDERSEDGTLASSLGVIERLHKIFF 1122
+ + DE ++G LA+ L V++R+H FF
Sbjct: 113 TKAERDESKDEGALANVLKVLQRIHSDFF 141
>gi|255540901|ref|XP_002511515.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
gi|223550630|gb|EEF52117.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
Length = 405
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 109/331 (32%), Positives = 163/331 (49%), Gaps = 33/331 (9%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
R E + S +KL LVLDLD TLL+S VD +E K + + + + R
Sbjct: 86 RDAETDFVLSKKKLFLVLDLDQTLLHST----VDLTPEENYLKNQMDSLQDIFKLITREG 141
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
+ KLRP + FL+ AS +F+M++YT NK YA +M +LDP + F R+I+R
Sbjct: 142 FSPSYAKLRPFVRNFLQEASTMFKMYVYTNANKSYARKMVNLLDPDNIYFKSRLITR--- 198
Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
D V K+L+ V+G E AVVI+DD VWP +K NLI V+RY YF +
Sbjct: 199 -----EDSTVSCQKNLDVVMGQERAVVILDDRTDVWPMHKDNLIQVQRYKYFASTANWSN 253
Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF----SHQSLDDVDVRNILAAEQRKI 1143
S + + DE ++ +A+ L +++++H FF +L DVR+++ Q I
Sbjct: 254 --SKSFAQREVDESTD--IMATYLEILKKIHSQFFDPKLDEDNLASRDVRDVMRTVQAGI 309
Query: 1144 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVA---------NSL 1194
L GC+++ + A L L + AE+ GA+C +D VTHVV L
Sbjct: 310 LQGCKLILRKNLT---AKYKLDNLSKMAEKLGAICVSELDPTVTHVVTLEAKPEDDQLQL 366
Query: 1195 GTDKVNWALSTGRF-VVHPGWVEASALLYRR 1224
K G + +V P W+ S L+ R
Sbjct: 367 ENKKKKKNKQKGTYHLVFPEWIRDSYKLWHR 397
>gi|268566337|ref|XP_002639695.1| C. briggsae CBR-FCP-1 protein [Caenorhabditis briggsae]
Length = 723
Score = 147 bits (371), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 104/348 (29%), Positives = 174/348 (50%), Gaps = 49/348 (14%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSA-KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ + + RKL L++DLD T+++++ K VD ++ ++R KP + F H
Sbjct: 133 DETNLITTRKLVLLVDLDQTIIHTSDKPMSVDA--------EKRRNRVKPQDNNLNFQHK 184
Query: 970 GMW----------TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
+ TKLRP FL + S ++EMH+ T G + YA +A++LDP LF
Sbjct: 185 DITKYNLHSRVYTTKLRPHTTEFLNKMSAMYEMHIVTYGQRQYAHRIAQILDPDARLFGQ 244
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
R++SR + F K+++L+ + ++ VVIIDD VW +++ LI ++ Y +
Sbjct: 245 RILSR---DELFSAQH---KTRNLKALFPCGDNLVVIIDDRADVWQYSE-ALIQIKPYRF 297
Query: 1079 FPCSRRQFGLLG---------PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD 1129
F ++ G + P +E D+ ED L V+ +H ++ L D
Sbjct: 298 F----KEVGDINAPKNSKEQMPVQIE---DDAHEDRVLEEIERVLTNIHDKYYEKHDLRD 350
Query: 1130 -----VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDD 1184
+DV+ ++ E+ K+L GC IVFS + P GE +++ QFGA + D
Sbjct: 351 AEQPLLDVKEVIKDERHKVLDGCVIVFSGIVPTGEKLERT-DIYRLCMQFGATIVPEVTD 409
Query: 1185 QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
+VTHVV GT K++ A G+FVV WV A + +A+E+ F +
Sbjct: 410 EVTHVVGARYGTQKIHQAHRLGKFVVTVQWVYACVEKWMKADEKQFEL 457
>gi|291001899|ref|XP_002683516.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
gi|284097145|gb|EFC50772.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
Length = 592
Score = 146 bits (368), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 178/366 (48%), Gaps = 49/366 (13%)
Query: 900 AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE-K 958
A +K R Q+++ +KL LVLDLDHTLL++ E H ++ + + +
Sbjct: 170 AYEKGLERGKANQQRLIEKKKLSLVLDLDHTLLHTINDFEYRREHHKVTYFNDIYNNSPE 229
Query: 959 PHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+H+ +F G + K RP + +FL+R S++FE+H++T G + YA ++ K+LD L
Sbjct: 230 LQKHIHKFFMRGSYHFVKFRPRLESFLKRCSEIFELHVFTHGERAYADQIGKMLDSSKSL 289
Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVER 1075
FA R++SR + D +K L V + +V++IDD VW N N+I +
Sbjct: 290 FADRILSRDECPD--------INTKTLSQVFPYSDKSVLVIDDKTDVWKDNVDNVIQIAP 341
Query: 1076 YTYFPCSRRQFGLLGPSLLEI------DHDERSE------------DGTLASS------- 1110
Y YF RR FG+ L++ D D + + T+
Sbjct: 342 YDYF---RRIFGV----QLDVNNAPGNDADRKKNNSQQTPRTFKVIENTIDDYNDDDQLD 394
Query: 1111 --LGVIERLHKIFF-SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPL 1167
++E++H +F S Q L + DV+ IL +++ IL G IVFS V P+ + P H
Sbjct: 395 IIYNLLEKVHLDYFNSPQDLSERDVKAILKEKKKDILKGAHIVFSGVIPLKQ-QPETHID 453
Query: 1168 WQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST-GRFVVHPGWVEASALLYRRAN 1226
W+ A GA C I +TH+VA GT+KV A + G VV W+ A +R N
Sbjct: 454 WKIATDLGAKCYTDITPNMTHLVARQKGTEKVKKAENMPGVHVVDLSWLHACLKYLKRQN 513
Query: 1227 EQDFAI 1232
E D+ +
Sbjct: 514 EFDYFL 519
>gi|226498676|ref|NP_001145873.1| hypothetical protein [Zea mays]
gi|219884795|gb|ACL52772.1| unknown [Zea mays]
gi|413939308|gb|AFW73859.1| hypothetical protein ZEAMMB73_968817 [Zea mays]
Length = 425
Score = 143 bits (361), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 88/244 (36%), Positives = 134/244 (54%), Gaps = 30/244 (12%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------LF 964
++ + RKL L+LDLDHTLLNS +++ PV E+ P+ LF
Sbjct: 202 DRATLMRERKLILILDLDHTLLNSTSLYDLSPV--------EQAKGFTPYTFGDTSIDLF 253
Query: 965 R--FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
R ++ M KL FL++A+ LFEMH+YT+G + YA ++LDP G+ F GR++
Sbjct: 254 RVDIDNLSMLVKLGAFARGFLKQANALFEMHVYTLGIRAYARAAVRLLDPNGIYFGGRIV 313
Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPC 1081
SR + +K L+ + G + A VVI+DD+ VWP NLI+++RY YF
Sbjct: 314 SRNE--------STKENTKSLDVIQGADPAMVVILDDTDGVWPGYPDNLILMDRYRYFAS 365
Query: 1082 SRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF-SHQSLDDVDVRNILAAEQ 1140
+ R F PSL E +ER DG+LA LG ++R+H+ FF H++ DVR ++A +
Sbjct: 366 TCRTFDYDIPSLAEQGLEEREHDGSLAVVLGALQRIHQGFFDGHRA----DVREVIAKVR 421
Query: 1141 RKIL 1144
++
Sbjct: 422 SQVC 425
>gi|159476674|ref|XP_001696436.1| cleavage and polyadenylation factor 6-related protein [Chlamydomonas
reinhardtii]
gi|158282661|gb|EDP08413.1| cleavage and polyadenylation factor 6-related protein [Chlamydomonas
reinhardtii]
Length = 2174
Score = 143 bits (360), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 113/365 (30%), Positives = 162/365 (44%), Gaps = 65/365 (17%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVD-PVHDEILRKKEEQDREKP--HRHLFRFPHMG--MWTK 974
+L LV+DLD L +S ++D P ++R+ + P R LFR P G +W K
Sbjct: 809 RLVLVVDLDGVLADSCWDAQLDGPTAAALVRRAAVEAAALPEDRRELFRLPLEGGALWLK 868
Query: 975 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
LRPG FL RA++ +E+ T + YA + ++LDP LF RV++ G
Sbjct: 869 LRPGARAFLARAAERYELWARTRQGRPYADAVVELLDPHQQLFGSRVVAAG--------- 919
Query: 1035 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLN--LIVVERYTYF---PCSR----RQ 1085
V + L + ++D W L+ L+ + Y YF PC+
Sbjct: 920 --VLAKRLLAALECRAPIAAVLDTPDAAWMGESLSGSLLALPPYAYFAVRPCAPGGAVAA 977
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHK-------------------------- 1119
G+ +LE+D DE +E G LA L ++E LH
Sbjct: 978 SGMASRCMLEVDRDEDAERGALAVGLPLLEALHGRVLHAYASGGAGASANGGTGSGSAAA 1037
Query: 1120 ----IFFSHQ----------SLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 1165
+ S Q +L+ DVR +L + +IL G I FSRV+ G A H
Sbjct: 1038 PPSPVHGSQQPPPAAQSAGPALEAWDVRRVLRELRERILTGTHITFSRVYSGGTAAGPQH 1097
Query: 1166 PLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRA 1225
PLW+ AE GA + D THVV+ S T K WA GRFVV+P W+E S +R+A
Sbjct: 1098 PLWRLAEACGATVSAACTDSTTHVVSLSGATAKALWAQQHGRFVVYPSWLECSCYTWRKA 1157
Query: 1226 NEQDF 1230
+E F
Sbjct: 1158 DESLF 1162
>gi|302834483|ref|XP_002948804.1| hypothetical protein VOLCADRAFT_89056 [Volvox carteri f. nagariensis]
gi|300265995|gb|EFJ50184.1| hypothetical protein VOLCADRAFT_89056 [Volvox carteri f. nagariensis]
Length = 2442
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 117/410 (28%), Positives = 168/410 (40%), Gaps = 103/410 (25%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP--HRHLFRFP----- 967
+ S KL LV+DLD L +S ++DP L + + P R LFR P
Sbjct: 2044 LLSRGKLVLVVDLDGVLADSCWDDQLDPAAAAALSRHAAAEAGLPEDRRELFRLPLDAGA 2103
Query: 968 -------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
G+W KLRPG FL RA + FE+ ++ + YA + ++LDP LF R
Sbjct: 2104 TAGAASGGSGLWLKLRPGARAFLARAHERFELWAHSRQGRPYADAVVELLDPSLALFGSR 2163
Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW--PHNKLNLIVVERYTY 1078
V+++G+ R+ + D + I+D W L+ + Y+Y
Sbjct: 2164 VVAQGELA------RRLLTALDARAPI-----TAILDTPSAAWMGEQLAPGLLPLPPYSY 2212
Query: 1079 F----PCSRRQF--------------------GLLGPSLLEIDHDERSEDGTLASSLGVI 1114
F C+ G+ G SLLE++ DE E G LA++L ++
Sbjct: 2213 FSYRPACTADAAAGVGGGGAAPGARTLAPSASGMAGRSLLEVNRDECPERGVLAAALPLL 2272
Query: 1115 ERLHKIFFSH-------------------------------------------QSLDDVD 1131
L+ S L+ D
Sbjct: 2273 YVLYTRVMSTYGAAATTTASSSAGSTAAAAAAAAIAKPPTDGPPPPPPPAPPLAGLEPWD 2332
Query: 1132 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHL---------HPLWQTAEQFGAVCTKHI 1182
VR +L +++++LAG I FSRVF G HPLW+ AE FGA
Sbjct: 2333 VRRVLREQRQRVLAGFHITFSRVFTSGGGGGDGGGGGAAGPQHPLWRLAEAFGATVGASC 2392
Query: 1183 DDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
D THVVA S GT+K WA GRFVV+P W+EAS L+R+ +E F +
Sbjct: 2393 SDATTHVVALSAGTEKAIWAAQHGRFVVYPSWLEASCFLWRKVDESLFLL 2442
>gi|159483481|ref|XP_001699789.1| hypothetical protein CHLREDRAFT_141879 [Chlamydomonas reinhardtii]
gi|158281731|gb|EDP07485.1| predicted protein [Chlamydomonas reinhardtii]
Length = 375
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 98/285 (34%), Positives = 140/285 (49%), Gaps = 61/285 (21%)
Query: 926 DLDHTLLNSAKFHEVD----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
DLDHTLLNS +EV P E+ R+++E + P R L +WTKLRPG++
Sbjct: 133 DLDHTLLNSVHMNEVGEDVAPRLAELQRREQEANL-GPRRLLHCLADKKLWTKLRPGVFE 191
Query: 982 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 1041
FLE +EMH+YTMG+K YA E+ ++LDP G LF+ VI++ +K
Sbjct: 192 FLEGLRDAYEMHIYTMGDKTYAAEVRRLLDPTGRLFSS-VIAK--------DHSTTATAK 242
Query: 1042 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDER 1101
L+ +L + +++DD+ VWP ++ NL+ V
Sbjct: 243 HLDVLLSADELALVLDDTEVVWPGHRRNLLQV---------------------------- 274
Query: 1102 SEDGTLASSLGVIERLHKIFFSHQS-----LDDVDVRNILAAEQRKILAGCRIVFSRVFP 1156
S++G+ FFS + LD+ DVR++LA ++ +IL G I FSR +
Sbjct: 275 ------MSAVGL------QFFSQEDSRLPPLDERDVRDVLAEQKARILTGVHITFSRCW- 321
Query: 1157 VGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVVANSLGTDKVN 1200
+ +P PLWQ AE GA C D V THVVA GT KV
Sbjct: 322 AQDKDPRKEPLWQLAEGLGATCLPAYDPAVTTHVVAAGGGTAKVR 366
>gi|328872613|gb|EGG20980.1| putative tfiif-interacting component of the c-terminal domain
phosphatase [Dictyostelium fasciculatum]
Length = 757
Score = 137 bits (345), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 115/392 (29%), Positives = 184/392 (46%), Gaps = 83/392 (21%)
Query: 907 RRLEEQ--KKMFSARKLCLVLDLDHTLLNS---AKFHEVDPVHDEILRKKEEQDREKPHR 961
+++EE+ K++ +KL LVLDLDHT++++ F EV P I DR+K +
Sbjct: 191 QQIEEKNAKRLLDNKKLSLVLDLDHTIIHAIMEQHFMEV-PYWRTI-------DRKKSNI 242
Query: 962 H-LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
H + + + KLRP ++ FL ++LFE+H+YTMG + YA ++A ++DPK +F R
Sbjct: 243 HEIILNGNQRYFIKLRPHLYEFLREVNRLFELHIYTMGTRNYAQKIASLVDPKQRVFKER 302
Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
V+SR D P D + K L+ + +S V+I+DD VW +K NLI + Y YF
Sbjct: 303 VLSR--DDTPNDMNH-----KTLKRLFPCDDSMVLIVDDRSDVWKKSK-NLIQIVPYLYF 354
Query: 1080 PCSRRQFGLL-------GPSLL------------------------------EIDHDERS 1102
+ LL L+ ++D E
Sbjct: 355 VGCKDMVNLLPTDKQSPHQQLMNDEPAAATAAAVNTPTPPSPGSPNVSSPKQKVDQQEED 414
Query: 1103 EDGTLASS-------------LGVIERLHKIFFSHQSLDDV-DVRNILAAEQRKILAGCR 1148
++ ++ + L + +H F+ S + V NI+ ++KIL
Sbjct: 415 DENSIYNENDESEGDHHLRVILSKLTEIHTEFYKQVSNNQKPHVTNIVDDIKKKILKDVY 474
Query: 1149 IVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVA-------NSLGTDKVNW 1201
IV S ++P+ + P PL AE+ GA I + THV+A GT KVN
Sbjct: 475 IVLSGIYPINSSQP--QPLRILAEEMGATVQNEITPKTTHVMAARVRRKRKRKGTSKVNQ 532
Query: 1202 ALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
A+S G VV+ W+E++A+ + + +E DF IK
Sbjct: 533 AISMGLHVVNSSWLESTAMHWTKLDENDFPIK 564
>gi|115533721|ref|NP_492423.2| Protein FCP-1 [Caenorhabditis elegans]
gi|82658167|emb|CAC70088.2| Protein FCP-1 [Caenorhabditis elegans]
Length = 659
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 166/339 (48%), Gaps = 46/339 (13%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-HM 969
++ + + RKL L++DLD T+++++ K D E H+ + ++ H
Sbjct: 134 DENNLITNRKLVLLVDLDQTIIHTSD-------------KPMTVDTEN-HKDITKYNLHS 179
Query: 970 GMWT-KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
++T KLRP FL + S ++EMH+ T G + YA +A++LDP LF R++SR
Sbjct: 180 RVYTTKLRPHTTEFLNKMSNMYEMHIVTYGQRQYAHRIAQILDPDARLFEQRILSR---D 236
Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF-------- 1079
+ F K+ +L+ + ++ VVIIDD VW +++ LI ++ Y +F
Sbjct: 237 ELFSAQH---KTNNLKALFPCGDNLVVIIDDRSDVWMYSEA-LIQIKPYRFFKEVGDINA 292
Query: 1080 -PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD-----VDVR 1133
S+ Q P +E D+ ED L V+ +H ++ L +DV+
Sbjct: 293 PKNSKEQM----PVQIE---DDAHEDKVLEEIERVLTNIHDKYYEKHDLRGSEEVLLDVK 345
Query: 1134 NILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS 1193
++ E+ K+L GC IVFS + P+GE +++ QFGAV + D VTHVV
Sbjct: 346 EVIKEERHKVLDGCVIVFSGIVPMGEKLERT-DIYRLCTQFGAVIVPDVTDDVTHVVGAR 404
Query: 1194 LGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
GT KV A +FVV WV A + +A+E F +
Sbjct: 405 YGTQKVYQANRLNKFVVTVQWVYACVEKWLKADENLFQL 443
>gi|15239576|ref|NP_200232.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|9759494|dbj|BAB10744.1| unnamed protein product [Arabidopsis thaliana]
gi|332009084|gb|AED96467.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 306
Score = 135 bits (339), Expect = 2e-28, Method: Composition-based stats.
Identities = 85/239 (35%), Positives = 127/239 (53%), Gaps = 13/239 (5%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D Q + I T+R+ Q F+ +KL LVLDLDHTLL++ + ++ EE
Sbjct: 62 DGLQLSDIAVTVTKRVTTQITCFNDKKLHLVLDLDHTLLHTVMISNLTKEETYLI---EE 118
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+D + R L KLRP + FL+ A+K+F M++YTMG++ YA + ++DP+
Sbjct: 119 EDSREDLRRLNGGYSSEFLIKLRPFVHEFLKEANKMFSMYVYTMGDRDYAMNVLNLIDPE 178
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 1073
V F RVI+R + P K L+ VL E VVI+DD+ VWP +K NL+ +
Sbjct: 179 KVYFGDRVITRNES----------PYIKTLDLVLADECGVVIVDDTPHVWPDHKRNLLEI 228
Query: 1074 ERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDV 1132
+Y YF R S E DE DG+LA+ L VI+++++ FFS D+D+
Sbjct: 229 TKYNYFSDKTRHDVKYTKSYAEEKRDESRNDGSLANVLKVIKQVYEGFFSGGVEKDLDI 287
>gi|452820283|gb|EME27327.1| phosphoprotein phosphatase [Galdieria sulphuraria]
Length = 734
Score = 134 bits (338), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 140/498 (28%), Positives = 215/498 (43%), Gaps = 116/498 (23%)
Query: 834 MVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGS--GPEAGPVGAHP----------QS 881
+V+QN IQP Q+ D V +H + G + G E A P ++
Sbjct: 132 VVNQNHVIQPQQMVLSVD----VCDHKVQFNGNCALCGIEMDIYSASPILESPREFSTRT 187
Query: 882 AW--GDVEHLFEGYDDQQKAA-------IQKERTRRLEEQKKMFSARKLCLVLDLDHTLL 932
W + HL Y Q ++ E TRRL +KK L LVLDLD+TL+
Sbjct: 188 DWTLSNKHHLNPAYTHPQLRVSRNELELVEGENTRRLLRRKK------LSLVLDLDNTLI 241
Query: 933 NSAKF----HEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM-----WTKLRPGIWTFL 983
++ E EI ++ E+ E + + + KLRP + FL
Sbjct: 242 HATLVSHFPQEWYQYKQEIYQQATEKALECSAPLMEDIHELDLDGSISLVKLRPNVRRFL 301
Query: 984 ERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDL 1043
E+ + +E+H+YTMG++ YA +A +LDP G LF R++SR D E + K L
Sbjct: 302 EKIHQRYELHIYTMGSRSYADAIATLLDPSGNLFQRRIVSRD------DFVEGMMNRKSL 355
Query: 1044 EGVLGM-ESAVVIIDDSVRVW-PHNKL----NLIVVERYTYF------------------ 1079
+ +S V+I+DD VW HN+ NLI + Y +F
Sbjct: 356 RRIFPCDDSMVIIVDDREDVWMDHNQGEMVPNLIRAKPYLFFVQDVHENMNNHLVWDSTT 415
Query: 1080 ----PCS---RRQFGLLGPSLLE-IDHDERSEDGTL--------------ASSLGVIERL 1117
P S + F + +L ++ E E G + LG +E+L
Sbjct: 416 TSIHPSSESHKESFANISTCMLTCLNWKENLESGCYFPYLPWVQKTVESDENYLGRLEQL 475
Query: 1118 ----HKIFF---------------SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG 1158
H+ FF +H L DV+ ILA + ++L C + F+ +F +
Sbjct: 476 LIQIHESFFKEMEQCHNVSQTDDSNHVKLPLRDVKLILAEMRHRVLRNCYLSFTGIFRLE 535
Query: 1159 EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS---LGTDKVNWALSTGR-FVVHPGW 1214
E+ P + +W+ AE+FGA+C K + Q TH++ +S L T K +AL G F+V W
Sbjct: 536 ES-PEVSTVWRLAEEFGAICNKQVTSQTTHLIVDSQRGLHTGKTKYALRRGDIFLVTLEW 594
Query: 1215 VEASALLYRRANEQDFAI 1232
+E S Y RA+E FA+
Sbjct: 595 LETSMQYYIRASELQFAL 612
>gi|297850432|ref|XP_002893097.1| hypothetical protein ARALYDRAFT_472260 [Arabidopsis lyrata subsp.
lyrata]
gi|297338939|gb|EFH69356.1| hypothetical protein ARALYDRAFT_472260 [Arabidopsis lyrata subsp.
lyrata]
Length = 281
Score = 134 bits (337), Expect = 4e-28, Method: Composition-based stats.
Identities = 88/246 (35%), Positives = 129/246 (52%), Gaps = 34/246 (13%)
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ------DREKP 959
T+ L Q + RKL +VLDLDHTLL+S + +LR+ + + DRE
Sbjct: 62 TKSLTTQLACLNERKLHVVLDLDHTLLHSVMVSRLSEGEKYLLRESDLREDLWTLDRE-- 119
Query: 960 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
M KLRP + FL A++ F M++YTMGN+ YA + K++DPK V F
Sbjct: 120 -----------MLIKLRPFVHEFLNEANEFFSMYVYTMGNRDYAQAVLKLIDPKKVYFGD 168
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
RVI+R + G SK L+ VL E VVI+DD+ VWP ++ NL+ + +Y+YF
Sbjct: 169 RVITRDESG----------FSKTLDLVLADECGVVIVDDTRHVWPDHERNLLQITKYSYF 218
Query: 1080 PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDVDVRNILA 1137
++ S E DE G+LA+ L V++++H+ FFS + LD DVR +L
Sbjct: 219 RDYNQED---SKSYAEEKRDESRSQGSLANVLKVLKKIHQEFFSGGIEKLDSKDVRLLLQ 275
Query: 1138 AEQRKI 1143
+ I
Sbjct: 276 EQHIAI 281
>gi|324504080|gb|ADY41763.1| RNA polymerase II subunit A C-terminal domain phosphatase [Ascaris
suum]
Length = 490
Score = 133 bits (334), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 171/343 (49%), Gaps = 49/343 (14%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKF-----HEVDPVHDEILRKKEEQDREKPHRHLFR 965
+Q+ + +R+L L++DLD TL+++ VD VH ++
Sbjct: 50 DQQLVLESRRLVLLVDLDQTLIHTTNHAFDMKDSVDVVH-------------------YK 90
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
+TK+RP TFL R S+L+EMH+ + G + YA ++A++LDP F R++SR
Sbjct: 91 LRGADFYTKIRPYTHTFLRRMSELYEMHIISYGERQYAHKIAEILDPDKRYFGHRILSR- 149
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP---- 1080
+ F + K+ +++ + + + IIDD VW ++ LI V+ Y +F
Sbjct: 150 --DELFSA---MYKTGNMKALFPCGDQLIAIIDDRPDVWQYSDA-LIQVKPYRFFKETGD 203
Query: 1081 ------CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDV-- 1130
C+ +Q L+ + +++ E D TL V+ R+H F++H +++ D
Sbjct: 204 INAPTICNAQQQSLVQERIAQVNV-EGDGDETLEFVATVLTRVHTTFYAHYDKNMSDRVQ 262
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DV+ ++A + ++L GC IV S + P+G + + QFGA T+ + D THV+
Sbjct: 263 DVKLVIAYMRSQVLRGCSIVLSGIVPIG-MDVRNTEAFHLCIQFGATVTESVTDTTTHVI 321
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
A GT KV+ A +V+P W+ A + +A+E++F +
Sbjct: 322 AARWGTTKVHDARRRANIAIVNPRWLYACVERWEKADEKEFEL 364
>gi|324508774|gb|ADY43701.1| RNA polymerase II subunit A C-terminal domain phosphatase [Ascaris
suum]
Length = 576
Score = 132 bits (332), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 171/343 (49%), Gaps = 49/343 (14%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKF-----HEVDPVHDEILRKKEEQDREKPHRHLFR 965
+Q+ + +R+L L++DLD TL+++ VD VH ++
Sbjct: 136 DQQLVLESRRLVLLVDLDQTLIHTTNHAFDMKDSVDVVH-------------------YK 176
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
+TK+RP TFL R S+L+EMH+ + G + YA ++A++LDP F R++SR
Sbjct: 177 LRGADFYTKIRPYTHTFLRRMSELYEMHIISYGERQYAHKIAEILDPDKRYFGHRILSR- 235
Query: 1026 DDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP---- 1080
+ F + K+ +++ + + + IIDD VW ++ LI V+ Y +F
Sbjct: 236 --DELFSA---MYKTGNMKALFPCGDQLIAIIDDRPDVWQYSDA-LIQVKPYRFFKETGD 289
Query: 1081 ------CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDV-- 1130
C+ +Q L+ + +++ E D TL V+ R+H F++H +++ D
Sbjct: 290 INAPTICNAQQQSLVQERIAQVNV-EGDGDETLEFVATVLTRVHTTFYAHYDKNMSDRVQ 348
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DV+ ++A + ++L GC IV S + P+G + + QFGA T+ + D THV+
Sbjct: 349 DVKLVIAYMRSQVLRGCSIVLSGIVPIG-MDVRNTEAFHLCIQFGATVTESVTDTTTHVI 407
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
A GT KV+ A +V+P W+ A + +A+E++F +
Sbjct: 408 AARWGTTKVHDARRRANIAIVNPRWLYACVERWEKADEKEFEL 450
>gi|15224433|ref|NP_178570.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|4585924|gb|AAD25584.1| hypothetical protein [Arabidopsis thaliana]
gi|330250795|gb|AEC05889.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 277
Score = 131 bits (329), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 88/269 (32%), Positives = 144/269 (53%), Gaps = 28/269 (10%)
Query: 880 QSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHE 939
+S + +++F+G +A T+ L + + +KL LVLDLDHTLL+S
Sbjct: 29 KSQFRKFDYIFKGLQLSNEAV---ALTKSLTTKHSCLNEKKLHLVLDLDHTLLHSKLVSN 85
Query: 940 VDPVHDEILRKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLY 995
+ ++++ + RE L++F +G KLRP + FL+ A+++F M +Y
Sbjct: 86 LSQAERYLIQEASSRTRED----LWKFRPIGHPIDRLIKLRPFVRDFLKEANEMFTMFVY 141
Query: 996 TMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVI 1055
TMG+++YA + +++DPK + F RVI++ + P+ K L VL E VVI
Sbjct: 142 TMGSRIYAKAILEMIDPKKLYFGNRVITKDES----------PRMKTLNLVLAEERGVVI 191
Query: 1056 IDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIE 1115
+DD+ +WPH+K NLI + +Y YF R+ GL S E DE DG LA+ L ++
Sbjct: 192 VDDTRDIWPHHKNNLIQIRKYKYF----RRSGLDSNSYSEKKTDEGENDGGLANVLKLLR 247
Query: 1116 RLHKIFF---SHQSLDDVDVRNILAAEQR 1141
+H+ FF + L+ +DVR++L R
Sbjct: 248 EVHRRFFIVEVEEVLESMDVRSLLKEGYR 276
>gi|297792855|ref|XP_002864312.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297310147|gb|EFH40571.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 305
Score = 130 bits (328), Expect = 4e-27, Method: Composition-based stats.
Identities = 86/244 (35%), Positives = 130/244 (53%), Gaps = 24/244 (9%)
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
T+R+ Q F+ +KL LVLDLDHTLL++ + +E E RE L++
Sbjct: 76 TKRVTTQITCFNDKKLHLVLDLDHTLLHTVMVSNLS--KEETYLIGEADSRED----LWK 129
Query: 966 FP---HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
F KLRP + FL+ A+++F M++YTMG++ YA + K++DP+ + F RVI
Sbjct: 130 FNGGYSSEFLIKLRPYVHEFLKEANEMFSMYVYTMGDRDYANNVLKLIDPEKIYFGHRVI 189
Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1082
+R + P K L+ VL E VVI+DD+ +VWP +K NL+ + +Y YF
Sbjct: 190 TRNES----------PYIKTLDLVLADECGVVIVDDTPQVWPDDKRNLLEITKYNYFSDK 239
Query: 1083 RRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS-----HQSLDDVDVRNILA 1137
R+ S E DE DG+LA+ L VI+ +++ FFS +D DVR +L
Sbjct: 240 TRRDVKYSKSYAEEKRDEGRNDGSLANVLKVIKEIYEGFFSGGVEKELDIDSKDVRLLLH 299
Query: 1138 AEQR 1141
+R
Sbjct: 300 NARR 303
>gi|384501479|gb|EIE91970.1| hypothetical protein RO3G_16681 [Rhizopus delemar RA 99-880]
Length = 494
Score = 127 bits (318), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 160/330 (48%), Gaps = 50/330 (15%)
Query: 909 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
+E K++ +KL L+LDLD T+++++ + + +R+ F P
Sbjct: 20 VENAKRLLDQKKLSLILDLDQTIVHASCDQRISQWQNPDIRQ-------------FNLPR 66
Query: 969 --MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
+ + KLRPG+ FL+ +L+E+H+YTMG K YA +AK +DP+G LF R++SR +
Sbjct: 67 SPLVYYIKLRPGLIEFLKEIEELYELHIYTMGTKDYAKAVAKEIDPEGCLFKERILSRDE 126
Query: 1027 DGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
G K L+ + + S VV++DD VW ++ NL+ ++ Y YF
Sbjct: 127 SG--------CLTQKKLQRIFPCDTSMVVVLDDRSDVWSYSP-NLVRIKPYEYF------ 171
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILA 1145
+G D + +++++H+ F+ ++ + DV I+ +R++L
Sbjct: 172 ---IGTG-----------DIHSPTKNKILKKIHQEFYKNKK--EGDVTKIIPNMKRQVLH 215
Query: 1146 GCRIVFS-RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 1204
C I F+ V P +P L +WQ FGA+C+ + + TH++A K A
Sbjct: 216 HCIISFAPDVIPSNLKDPTLSWIWQMGTSFGALCSNDLTGKTTHLIAVRWDA-KAKAAKD 274
Query: 1205 TGRF-VVHPGWVEASALLYRRANEQDFAIK 1233
G +V P W+ S + +E+ +A++
Sbjct: 275 YGHSKIVTPAWLLDSTARWAIQDEEAYALE 304
>gi|15229069|ref|NP_188382.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|9294142|dbj|BAB02044.1| unnamed protein product [Arabidopsis thaliana]
gi|332642446|gb|AEE75967.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 296
Score = 127 bits (318), Expect = 6e-26, Method: Composition-based stats.
Identities = 88/263 (33%), Positives = 135/263 (51%), Gaps = 30/263 (11%)
Query: 887 EHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE 946
++L +G +AA T+R Q + +KL LVLDLDHTLL+S + +
Sbjct: 55 DYLVQGLQLSHEAA---AFTKRFTTQFYCLNEKKLNLVLDLDHTLLHSIRVSLLSETEKC 111
Query: 947 ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 1006
++ + RE L++ TKLRP + FL+ A++LF M++YTMG ++YA +
Sbjct: 112 LIEEACSTTRED----LWKLDS-DYLTKLRPFVHEFLKEANELFTMYVYTMGTRVYAESL 166
Query: 1007 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 1066
K++DPK + F RVI+R + P K L+ VL E VVI+DD+ VW H+
Sbjct: 167 LKLIDPKRIYFGDRVITRDES----------PYVKTLDLVLAEERGVVIVDDTSDVWTHH 216
Query: 1067 KLNLIVVERYTYFPCSRRQFGLLGP----SLLEIDHDERSEDGTLASSLGVIERLHKIFF 1122
K NL+ + Y +F + GP S E DE +G LA+ L +++ +H FF
Sbjct: 217 KSNLVEINEYHFFRVN-------GPEESNSYTEEKRDESKNNGGLANVLKLLKEVHYGFF 269
Query: 1123 S-HQSLDDVDVRNILAAEQRKIL 1144
+ L+ DVR +L K+L
Sbjct: 270 RVKEELESQDVRFLLQEIDFKLL 292
>gi|297834870|ref|XP_002885317.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297331157|gb|EFH61576.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 592
Score = 126 bits (316), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 81/238 (34%), Positives = 127/238 (53%), Gaps = 25/238 (10%)
Query: 893 YDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
+D Q ++ T+RL + + +KL LVLDLDHTLL+S + + ++ +
Sbjct: 61 FDGLQLSSEALALTKRLTTKFSCLNMKKLHLVLDLDHTLLHSVRVQFLSEAEKYLIEEAG 120
Query: 953 EQDREKPHRHLFRFPHMG--------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
RE L++ G TKLRP + FL+ A+KLF M++YT G + YA
Sbjct: 121 STTRED----LWKMKVKGDPIPITIEYLTKLRPFLREFLKEANKLFTMYVYTKGTRRYAK 176
Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWP 1064
+ K++DPK + F RVI+R + P +K L+ VL E VVI+DD+ +WP
Sbjct: 177 AILKLIDPKKLYFGHRVITRNES----------PHTKTLDLVLADERGVVIVDDTRNIWP 226
Query: 1065 HNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF 1122
++K NL+V+ +Y YF R + +L P E DE +G LA+ L +++ +H+ FF
Sbjct: 227 NHKSNLVVIGKYKYF---RFEGRVLKPHSEEKTTDESENNGGLANVLKLLKEVHRKFF 281
Score = 110 bits (276), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 117/227 (51%), Gaps = 28/227 (12%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG----MWTK 974
+KL LVLDLDHTLL++ + +L + RE L++ +G TK
Sbjct: 389 KKLHLVLDLDHTLLHTVMVPSLSQAEKYLLEEAGSATRED----LWKIKAIGDPMEFLTK 444
Query: 975 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
LRP + FL+ A+++F M++YT G++ YA ++ +++DPK + F RVI++ +
Sbjct: 445 LRPFVREFLKEANQMFTMYVYTKGSRGYAKQVLELIDPKKLYFEDRVITKNES------- 497
Query: 1035 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 1094
P K L+ VL E VVI+DD VWP +K NL+ + +YTYF R G
Sbjct: 498 ---PHMKTLDLVLAEERGVVIVDDMRTVWPDHKSNLVDISKYTYF----RLKGQESMPYS 550
Query: 1095 EIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 1141
E DE DG LA+ L +++ +H FF V+ R + A R
Sbjct: 551 EEMTDESESDGGLANVLKLLKEVHSRFFR------VEGRQVAATRNR 591
>gi|15217916|ref|NP_173457.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
gi|9558594|gb|AAF88157.1|AC026234_8 Contains similarity to a FCP1 serine phosphatase from Xenopus laevis
gi|6689545 [Arabidopsis thaliana]
gi|332191840|gb|AEE29961.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
Length = 342
Score = 126 bits (316), Expect = 9e-26, Method: Composition-based stats.
Identities = 87/245 (35%), Positives = 126/245 (51%), Gaps = 37/245 (15%)
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ------DREKP 959
T+ L Q + RKL LVLDLDHTLL+S + +L + + + DRE
Sbjct: 62 TKSLTTQLACLNERKLHLVLDLDHTLLHSIMISRLSEGEKYLLGESDFREDLWTLDRE-- 119
Query: 960 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
M KLRP + FL+ A+++F M++YTMGN+ YA + K +DPK V F
Sbjct: 120 -----------MLIKLRPFVHEFLKEANEIFSMYVYTMGNRDYAQAVLKWIDPKKVYFGD 168
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
RVI+R + G SK L+ VL E VVI+DD+ VWP ++ NL+ + +Y+YF
Sbjct: 169 RVITRDESG----------FSKTLDLVLADECGVVIVDDTRHVWPDHERNLLQITKYSYF 218
Query: 1080 PCSRRQFG--LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDVDVRNI 1135
R + S E DE G+LA+ L V++ +H+ FF + LD DVR +
Sbjct: 219 ----RDYSHDKESKSYAEEKRDESRNQGSLANVLKVLKDVHQEFFRGGIEELDSKDVRLL 274
Query: 1136 LAAEQ 1140
L +
Sbjct: 275 LQEQH 279
>gi|297846748|ref|XP_002891255.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297337097|gb|EFH67514.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 210
Score = 125 bits (315), Expect = 1e-25, Method: Composition-based stats.
Identities = 78/221 (35%), Positives = 119/221 (53%), Gaps = 31/221 (14%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ------DREKPHRHLFRFPHMGMW 972
+KL LVLDLDHTL+++ ++ +L + + + +++ P+ +
Sbjct: 3 KKLHLVLDLDHTLIHTVLVSDLSEREKYLLEEADSRQDLWRCNKDSPYEFII-------- 54
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
KLRP + FL A+KLF MH+YTMGN YA ++ K++DP V F RVI+R
Sbjct: 55 -KLRPFVHEFLLEANKLFTMHVYTMGNSCYAQDVLKLIDPDKVYFGNRVITR-------- 105
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
E P +K L+ ++ VVI+DD++ VWPH+K NL+ + +Y YF R G S
Sbjct: 106 --EASPCNKTLDLLVADTRRVVIVDDTISVWPHHKRNLLQITKYIYF----RVDGTKWDS 159
Query: 1093 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVR 1133
E DE + G+LA+ L +E +HK F + LD D+R
Sbjct: 160 YAEEKKDESRKSGSLANVLKFLEDVHKRF--EEDLDSKDLR 198
>gi|297834668|ref|XP_002885216.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297331056|gb|EFH61475.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 296
Score = 125 bits (314), Expect = 1e-25, Method: Composition-based stats.
Identities = 87/263 (33%), Positives = 135/263 (51%), Gaps = 30/263 (11%)
Query: 887 EHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE 946
++L +G +AA T+R + + +KL LVLDLDHTLL+S + +
Sbjct: 55 DYLVQGLQLSHEAA---AFTKRFTTEFYCLNEKKLHLVLDLDHTLLHSIRVSILSETERY 111
Query: 947 ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 1006
++ + RE L++ + TKLRP + FL+ A+++F M++YTMG ++YA +
Sbjct: 112 LIEEACSTTRED----LWKL-DIDYLTKLRPFVHEFLKEANEMFTMYVYTMGTRVYAESL 166
Query: 1007 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 1066
K++DPK + F RVI+R + P K L+ VL E VVI+DD+ VW H+
Sbjct: 167 LKLIDPKRIYFGDRVITRDES----------PYVKTLDLVLADERGVVIVDDTRDVWTHH 216
Query: 1067 KLNLIVVERYTYFPCSRRQFGLLGP----SLLEIDHDERSEDGTLASSLGVIERLHKIFF 1122
K NL+ + Y YF + GP S E DE G LA+ L +++ +H FF
Sbjct: 217 KSNLVEINEYHYFRVN-------GPEESKSYTEEKRDESKNSGGLANVLKLLKEVHYGFF 269
Query: 1123 S-HQSLDDVDVRNILAAEQRKIL 1144
+ L+ DVR +L K+L
Sbjct: 270 RVKEELESQDVRFMLQEIDFKLL 292
>gi|308500103|ref|XP_003112237.1| CRE-FCP-1 protein [Caenorhabditis remanei]
gi|308268718|gb|EFP12671.1| CRE-FCP-1 protein [Caenorhabditis remanei]
Length = 664
Score = 125 bits (313), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 100/339 (29%), Positives = 171/339 (50%), Gaps = 46/339 (13%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-HM 969
++ + + RKL L++DLD T+++++ K D EK H+ + ++ H
Sbjct: 134 DETNLITTRKLVLLVDLDQTIIHTSD-------------KPMSADAEK-HKDITKYNLHS 179
Query: 970 GMWT-KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
++T KLRP FL + + ++EMH+ T G + YA +A++LDP LF R++SR
Sbjct: 180 RVYTTKLRPHTTEFLNKMAAMYEMHIVTYGQRQYAHRIAQILDPDARLFGQRILSR---D 236
Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
+ F K+++L+ + ++ VVIIDD VW +++ LI ++ Y +F ++ G
Sbjct: 237 ELFSAQH---KTRNLKALFPCGDNLVVIIDDRADVWQYSEA-LIQIKPYRFF----KEVG 288
Query: 1088 LLG---------PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD-----VDVR 1133
+ P +E D+ ED L V+ +H ++ L D +DV+
Sbjct: 289 DINAPKDSKEQMPVQIE---DDAHEDRVLEEIERVLTNIHDKYYEKHDLKDGDQALLDVK 345
Query: 1134 NILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS 1193
++ E+ K+L GC IVFS + P GE +++ QFGA + ++VTHVV
Sbjct: 346 EVIKEERHKVLDGCVIVFSGIVPTGEKLERT-DIYRLCVQFGATIVPEVVEEVTHVVGAR 404
Query: 1194 LGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
GT KV+ A G+ VV WV A + +A+E+ F +
Sbjct: 405 YGTTKVHQAHRLGKNVVTVQWVYACVEKWMKADEKQFEL 443
>gi|170578206|ref|XP_001894313.1| NLI interacting factor-like phosphatase family protein [Brugia
malayi]
gi|158599134|gb|EDP36825.1| NLI interacting factor-like phosphatase family protein [Brugia
malayi]
Length = 576
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 92/332 (27%), Positives = 166/332 (50%), Gaps = 37/332 (11%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 974
+ A KL L++DLD TL+++ H + E D + H ++ +TK
Sbjct: 141 LLKAHKLVLLVDLDQTLIHTTN-HTFNL----------ENDTDVLH---YKLKGTDFYTK 186
Query: 975 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
+RP FL R + L+EMH+ + G + YA +A+ LDP+ + F R++SR + F
Sbjct: 187 IRPHAHEFLRRMASLYEMHIISYGERQYAHRIAEFLDPEKIYFGHRILSR---DELFSA- 242
Query: 1035 ERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF--------PCSRRQ 1085
+ K+++++ + + +V+IDD VW ++ LI V+ Y +F P + +
Sbjct: 243 --MYKTRNMQALFPCGDHMIVMIDDRPDVWQYSD-ALIQVKPYRFFKEIGDINAPRNEKG 299
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV----DVRNILAAEQR 1141
+L S E D E +D TL V+ ++H F+ V D++ I++ ++
Sbjct: 300 EPILSGSYAEQDM-ESEDDETLEYIALVLTKVHSAFYELFDSAKVNRFPDLKGIISYLRK 358
Query: 1142 KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNW 1201
++L C IV S + PVG + +++ QFGAV T ++++ THV+A GT KV+
Sbjct: 359 QVLRECSIVLSGIVPVG-VDIKKTEVYRLCVQFGAVVTDNVNNSTTHVIAARWGTTKVHE 417
Query: 1202 ALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
A +V+P W+ + +A+E++F +
Sbjct: 418 AEKLSNIHIVNPKWLFTCVERWEKADEKEFEL 449
>gi|224091747|ref|XP_002309339.1| predicted protein [Populus trichocarpa]
gi|222855315|gb|EEE92862.1| predicted protein [Populus trichocarpa]
Length = 204
Score = 123 bits (308), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 115/266 (43%), Gaps = 78/266 (29%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
M K RP FL+ AS++F +++YT+G+ YA EMAK+LDP G F +V SR D
Sbjct: 1 MMIKSRPFARMFLKEASQMFGLYMYTLGDPAYALEMAKLLDPGGEFFNAKVTSRDD---- 56
Query: 1031 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
G +R K D+
Sbjct: 57 --GTQRHQKGHDV----------------------------------------------- 67
Query: 1091 PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSL-------DDVDVRNILAAEQRKI 1143
+ DE G LAS L + ++H IFF L D DVR +L +R +
Sbjct: 68 -----LKSDESESGGALASVLKALRKVHHIFFEGTLLQELEENPDGRDVRKVLKTVRRDV 122
Query: 1144 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 1203
L GC+IVFSRVFP + H LW+ EQ GA C+ GT+K AL
Sbjct: 123 LKGCKIVFSRVFPT-QFQADNHHLWRMVEQLGATCS------------TEAGTEKSRRAL 169
Query: 1204 STGRFVVHPGWVEASALLYRRANEQD 1229
+F+VHPGW+EA+ +++ E++
Sbjct: 170 KHNKFLVHPGWIEATNYFWQKQPEEN 195
>gi|15218405|ref|NP_175026.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
gi|91805923|gb|ABE65690.1| NLI interacting factor family protein [Arabidopsis thaliana]
gi|332193852|gb|AEE31973.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
Length = 255
Score = 122 bits (305), Expect = 1e-24, Method: Composition-based stats.
Identities = 83/232 (35%), Positives = 123/232 (53%), Gaps = 25/232 (10%)
Query: 914 KMFSA---RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE-EQDREKPHRHLFRFPHM 969
++FS +KL LVLDLDHTLL+S ++ +L + + QD + + + F
Sbjct: 43 QLFSVTKKKKLHLVLDLDHTLLHSVLVSDLSKREKYLLEETDSRQDLWRRNVDGYEF--- 99
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
KLRP + FL A+KLF MH+YTMG+ YA ++ K++DP V F RVI+R
Sbjct: 100 --IIKLRPFLHEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVITR----- 152
Query: 1030 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
E P +K L+ + + VVI+DD+V VWP +K NL+ + +Y YF G
Sbjct: 153 -----EASPFNKSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYIYFKVD----GTK 203
Query: 1090 GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 1141
S E DE +G+LA+ L +E +HK F + L D+R ++ QR
Sbjct: 204 WDSYAEAKKDESQSNGSLANVLKFLEVVHKRF--EEDLGFKDLRLLIPCRQR 253
>gi|116830952|gb|ABK28432.1| unknown [Arabidopsis thaliana]
Length = 256
Score = 122 bits (305), Expect = 1e-24, Method: Composition-based stats.
Identities = 83/232 (35%), Positives = 123/232 (53%), Gaps = 25/232 (10%)
Query: 914 KMFSA---RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE-EQDREKPHRHLFRFPHM 969
++FS +KL LVLDLDHTLL+S ++ +L + + QD + + + F
Sbjct: 43 QLFSVTKKKKLHLVLDLDHTLLHSVLVSDLSKREKYLLEETDSRQDLWRRNVDGYEF--- 99
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
KLRP + FL A+KLF MH+YTMG+ YA ++ K++DP V F RVI+R
Sbjct: 100 --IIKLRPFLHEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVITR----- 152
Query: 1030 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
E P +K L+ + + VVI+DD+V VWP +K NL+ + +Y YF G
Sbjct: 153 -----EASPFNKSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYIYFKVD----GTK 203
Query: 1090 GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 1141
S E DE +G+LA+ L +E +HK F + L D+R ++ QR
Sbjct: 204 WDSYAEAKKDESQSNGSLANVLKFLEVVHKRF--EEDLGFKDLRLLIPCRQR 253
>gi|308464266|ref|XP_003094401.1| hypothetical protein CRE_07009 [Caenorhabditis remanei]
gi|308247823|gb|EFO91775.1| hypothetical protein CRE_07009 [Caenorhabditis remanei]
Length = 754
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 98/337 (29%), Positives = 168/337 (49%), Gaps = 43/337 (12%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
++ + + RKL L++DLD T+++++ D+++ E+ ++ +L H
Sbjct: 230 DETNLITTRKLVLLVDLDQTIIHTS---------DKLMSADAEKHKDITKYNL----HSR 276
Query: 971 MWT-KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
++T KLRP FL + S ++EMH+ T G + YA +AK+LDP LF R++SR +
Sbjct: 277 VYTTKLRPHTTEFLNKMSAMYEMHIVTFGERKYALRIAKILDPDARLFGQRILSRNE--- 333
Query: 1030 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
+ ++K L ++ VVIIDD VW +++ LI ++ Y +F ++ G +
Sbjct: 334 -LSSAQHKTENKAL--FPCGDNLVVIIDDRADVWQYSEA-LIQIKPYRFF----KEVGDI 385
Query: 1090 G---------PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD-----VDVRNI 1135
P +E D+ ED L V+ +H ++ L D +DV+ +
Sbjct: 386 NAPKHSKEQMPVQIE---DDAHEDRVLEEIERVLTNIHNKYYEKHDLKDADQALLDVKEV 442
Query: 1136 LAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG 1195
+ E+ K+L GC IVFS + GE ++ QFGA + ++VTHVV G
Sbjct: 443 IKEERHKVLDGCVIVFSGIVSAGEKLERTE-IYHLCLQFGATIVPEVVEEVTHVVGARYG 501
Query: 1196 TDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
T+KV+ A G+ VV WV A + +A+E+ F +
Sbjct: 502 TEKVHQAHRLGKNVVTVQWVYACVEKWMKADEKQFEL 538
>gi|334185470|ref|NP_188594.3| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|332642744|gb|AEE76265.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 302
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/242 (33%), Positives = 131/242 (54%), Gaps = 25/242 (10%)
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
T+RL + + +KL LVLDLD TL++S + + ++ + RE + R
Sbjct: 74 TKRLITKFSCLNMKKLHLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVR 133
Query: 966 -------FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
H+ KLRP + FL+ A+++F M++YT G + YA + K++DPK + F
Sbjct: 134 GDPISITIEHL---VKLRPFLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFG 190
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
RVI+R + P +K L+ VL E VVI+DD+ + WP+NK NL+++ RY Y
Sbjct: 191 HRVITRNES----------PHTKTLDMVLADERGVVIVDDTRKAWPNNKSNLVLIGRYNY 240
Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS-HQSLDDVDVRNILA 1137
F R Q +L P E DE +G LA+ L +++ +H FF + ++ DVR+ILA
Sbjct: 241 F---RSQSRVLKPHSEE-KTDESENNGGLANVLKLLKGIHHKFFKVEEEVESQDVRDILA 296
Query: 1138 AE 1139
++
Sbjct: 297 SK 298
>gi|297835808|ref|XP_002885786.1| hypothetical protein ARALYDRAFT_899317 [Arabidopsis lyrata subsp.
lyrata]
gi|297331626|gb|EFH62045.1| hypothetical protein ARALYDRAFT_899317 [Arabidopsis lyrata subsp.
lyrata]
Length = 285
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 138/279 (49%), Gaps = 24/279 (8%)
Query: 864 TGTGSGPEAGPVGAHP---QSAWGDVEHLFEGYDDQQKA-AIQKERTRRLEEQKKMFSAR 919
G G +G +S W +++F G +A A+ K RT + +
Sbjct: 10 CGHWYGFHGVCIGCKSIVHKSQWRAFDYIFNGLQLSHEAVALTKSRT----TNNSCLNEK 65
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
KL LVLDLDHTLL+ K + ++++ RE + + KLRP +
Sbjct: 66 KLHLVLDLDHTLLHMKKVPCLSRAEMYLIQEACSVTREDIWKIRLLGDPIDRLIKLRPFV 125
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039
FL+ A+++F M++YT G + YA + +++DP + F RVI++ + P
Sbjct: 126 RDFLKEANEMFTMYVYTKGTRKYAKAVLELIDPNRLYFGDRVITKDES----------PH 175
Query: 1040 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 1099
K L+ VL E VVI+DD +WPH+K NLI + +Y YF S G S E D
Sbjct: 176 QKTLDLVLAEERGVVIVDDRRDIWPHHKSNLIEISKYKYFRVS----GQGSNSYSEKKTD 231
Query: 1100 ERSEDGTLASSLGVIERLHKIFF--SHQSLDDVDVRNIL 1136
E +DG LA+ L +++++H FF + L+ +DVR++L
Sbjct: 232 ESEKDGGLANVLKLLKQVHCRFFMVEEEKLESMDVRSLL 270
>gi|312066139|ref|XP_003136128.1| hypothetical protein LOAG_00540 [Loa loa]
Length = 577
Score = 118 bits (296), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 173/353 (49%), Gaps = 49/353 (13%)
Query: 901 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+ E R++ +++ + A KL L++DLD TL+++ H K ++D +
Sbjct: 126 VSDELARKIGSRDRELLLKAHKLVLLVDLDQTLIHTTN-HTF----------KVDKDTDV 174
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
H ++ +TK+RP FL R ++L+EMH+ + G + YA +A+ LDP + F
Sbjct: 175 LH---YKLKGTDFYTKIRPYAREFLRRMAELYEMHIISYGERQYAHRIAEFLDPDKIYFG 231
Query: 1019 GRVISRGDDGDPFDGDE---RVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVE 1074
R++SR DE + K+++++ + + +V+IDD VW ++ LI V+
Sbjct: 232 HRILSR---------DELFCAMYKTRNMQALFPCGDHMIVMIDDRPDVWQYSD-ALIQVK 281
Query: 1075 RYTYFP------CSRRQFG--LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS 1126
Y +F R + G +L S E D E +D TL V+ ++H F+ +
Sbjct: 282 PYRFFKEIGDINAPRYEKGEPILSGSYAEQDM-ESEDDETLEYVAVVLTKIHNAFY--EL 338
Query: 1127 LDDV------DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTK 1180
D D++ I++ ++++L C IV S + PVG + ++ QFGAV
Sbjct: 339 FDGAKINRFPDLKGIISYLRKQVLRDCSIVLSGIVPVG-VDIKKTEAYRLCMQFGAVVMD 397
Query: 1181 HIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
++ D THV+A GT KV+ A +V+P W+ + +A+E++F +
Sbjct: 398 NVSDSTTHVIAARWGTTKVHEAHKLPNIHIVNPKWLFTCVERWEKADEKEFEL 450
>gi|393909596|gb|EFO27947.2| hypothetical protein LOAG_00540 [Loa loa]
Length = 506
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/338 (27%), Positives = 168/338 (49%), Gaps = 41/338 (12%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
+++ + A KL L++DLD TL+++ H K ++D + H ++
Sbjct: 67 DRELLLKAHKLVLLVDLDQTLIHTTN-HTF----------KVDKDTDVLH---YKLKGTD 112
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
+TK+RP FL R ++L+EMH+ + G + YA +A+ LDP + F R++SR +
Sbjct: 113 FYTKIRPYAREFLRRMAELYEMHIISYGERQYAHRIAEFLDPDKIYFGHRILSR---DEL 169
Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP------CSR 1083
F + K+++++ + + +V+IDD VW ++ LI V+ Y +F R
Sbjct: 170 FCA---MYKTRNMQALFPCGDHMIVMIDDRPDVWQYSD-ALIQVKPYRFFKEIGDINAPR 225
Query: 1084 RQFG--LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV------DVRNI 1135
+ G +L S E D E +D TL V+ ++H F+ + D D++ I
Sbjct: 226 YEKGEPILSGSYAEQDM-ESEDDETLEYVAVVLTKIHNAFY--ELFDGAKINRFPDLKGI 282
Query: 1136 LAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG 1195
++ ++++L C IV S + PVG + ++ QFGAV ++ D THV+A G
Sbjct: 283 ISYLRKQVLRDCSIVLSGIVPVG-VDIKKTEAYRLCMQFGAVVMDNVSDSTTHVIAARWG 341
Query: 1196 TDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
T KV+ A +V+P W+ + +A+E++F +
Sbjct: 342 TTKVHEAHKLPNIHIVNPKWLFTCVERWEKADEKEFEL 379
>gi|15218404|ref|NP_175025.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
gi|117958727|gb|ABK59679.1| At1g43600 [Arabidopsis thaliana]
gi|332193851|gb|AEE31972.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
Length = 221
Score = 117 bits (294), Expect = 3e-23, Method: Composition-based stats.
Identities = 79/221 (35%), Positives = 116/221 (52%), Gaps = 22/221 (9%)
Query: 921 LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE-EQDREKPHRHLFRFPHMGMWTKLRPGI 979
L LVLDLDHTLL+S ++ +L + + QD + + + F KLRP +
Sbjct: 19 LHLVLDLDHTLLHSVLVSDLSKREKYLLEETDSRQDLWRRNVDGYEF-----IIKLRPFL 73
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039
FL A+KLF MH+YTMG+ YA ++ K++DP V F RVI+R E P
Sbjct: 74 HEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVITR----------EASPF 123
Query: 1040 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 1099
+K L+ + + VVI+DD+V VWP +K NL+ + +Y YF G S E D
Sbjct: 124 NKSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYVYFKVD----GTKWDSYAEAKKD 179
Query: 1100 ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 1140
E +G+LA+ L +E +HK F + L D+R ++ Q
Sbjct: 180 ESQSNGSLANVLKFLEDVHKRF--EEDLGFKDLRLLIPCRQ 218
>gi|297808347|ref|XP_002872057.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297317894|gb|EFH48316.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 302
Score = 117 bits (293), Expect = 4e-23, Method: Composition-based stats.
Identities = 84/253 (33%), Positives = 128/253 (50%), Gaps = 24/253 (9%)
Query: 887 EHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE 946
++L++G +A + T+R+ Q +KL LVLDLDHTL+++ K ++ E
Sbjct: 56 DYLYKGMHMSHEALV---FTKRVISQTSWLEDKKLHLVLDLDHTLVHTIKASQL--YESE 110
Query: 947 ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 1006
+E R+ R FP + KLRP + FL+ +++F M++YT G YA +
Sbjct: 111 KCLTEEVGSRKDLWRFNSGFPDESL-IKLRPFVHQFLKECNEMFSMYVYTKGGCDYAQVV 169
Query: 1007 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 1066
+++DP+ + F RVI+R + P K L+ VL E VVI+DD VWPH+
Sbjct: 170 LELIDPEKIYFGNRVITRRES----------PDLKTLDLVLADERGVVIVDDKCSVWPHD 219
Query: 1067 KLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF---S 1123
K NL+ + +Y YF F S + DE E G L L ++ +H FF S
Sbjct: 220 KKNLLQIAKYKYFGDQSCSF-----SECKNKRDESEEKGPLDIVLRFLKDVHNEFFCDWS 274
Query: 1124 HQSLDDVDVRNIL 1136
+ LD VDVR +L
Sbjct: 275 RKDLDSVDVRPLL 287
>gi|367047187|ref|XP_003653973.1| hypothetical protein THITE_2116513 [Thielavia terrestris NRRL 8126]
gi|347001236|gb|AEO67637.1| hypothetical protein THITE_2116513 [Thielavia terrestris NRRL 8126]
Length = 909
Score = 117 bits (292), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 122/226 (53%), Gaps = 28/226 (12%)
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQD 955
+RT R E Q+++ +RKL LV+DLD T++ + + +P H+ + K Q
Sbjct: 147 QRTER-ELQRRLLQSRKLSLVVDLDQTIIQACIDPTVGEWQRDPTNPNHESVKEVKSFQL 205
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
+ P R + + K+RPG+ FL+R S+L+EMH+YTMG + YA +A+V+DP+
Sbjct: 206 DDGPSDLARRCSY---YIKMRPGLEEFLKRISELYEMHVYTMGTRAYAQNVARVVDPQRK 262
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVE 1074
LF RVISR ++G+ F +K L + + + VVIIDD VWP N+ NLI V
Sbjct: 263 LFGNRVISRDENGNMF--------AKSLGRLFPVSTNMVVIIDDRSDVWPRNRPNLIKVS 314
Query: 1075 RYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKI 1120
Y +F + G + S L HD + D A++ GV + K+
Sbjct: 315 PYEFF----KGIGDINSSFLPKRHDLLTSD---AATNGVQNKGRKM 353
>gi|215794709|pdb|3EF0|A Chain A, The Structure Of Fcp1, An Essential Rna Polymerase Ii Ctd
Phosphatase
Length = 372
Score = 116 bits (290), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 113/397 (28%), Positives = 181/397 (45%), Gaps = 101/397 (25%)
Query: 908 RLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDP-----------VHDEILRKKEEQ 954
RLE + K++ ++L L++DLD T++++ VDP V+ ++LR
Sbjct: 4 RLESENVKRLRQEKRLSLIVDLDQTIIHAT----VDPTVGEWMSDPGNVNYDVLRDVRSF 59
Query: 955 D-REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+ +E P + + + K RPG+ FL++ S+L+E+H+YTMG K YA E+AK++DP
Sbjct: 60 NLQEGPSGYTSCY-----YIKFRPGLAQFLQKISELYELHIYTMGTKAYAKEVAKIIDPT 114
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIV 1072
G LF RV+SR D G K L + + S VV+IDD VW N NLI
Sbjct: 115 GKLFQDRVLSRDDSGS--------LAQKSLRRLFPCDTSMVVVIDDRGDVWDWNP-NLIK 165
Query: 1073 VERYTYF------------------------------------PCSRRQFGLL------G 1090
V Y +F P +++Q LL
Sbjct: 166 VVPYEFFVGIGDINSNFLSGNREALEEQNKERVTALELQKSERPLAKQQNALLEDEGKPT 225
Query: 1091 PS---LLEIDHD-ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRN----------IL 1136
PS L DH+ ER E V++ +H +++ + +D+ R+ I+
Sbjct: 226 PSHTLLHNRDHELERLEK--------VLKDIHAVYYEEE--NDISSRSGNHKHANVGLII 275
Query: 1137 AAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT 1196
++K+L GCR++FS V P+G + + + A FGA TH++A + T
Sbjct: 276 PKMKQKVLKGCRLLFSGVIPLG-VDVLSSDIAKWAMSFGAEVVLDFSVPPTHLIAAKIRT 334
Query: 1197 DKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
+KV A+S G VV W+ S ++R E D+ +
Sbjct: 335 EKVKKAVSMGNIKVVKLNWLTESLSQWKRLPESDYLL 371
>gi|388580688|gb|EIM21001.1| FCP1-like phosphatase [Wallemia sebi CBS 633.66]
Length = 510
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/397 (25%), Positives = 178/397 (44%), Gaps = 84/397 (21%)
Query: 893 YDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
YD+ Q+ + T + + KL L++DLD T++++ VDP +E+L
Sbjct: 46 YDEAQRIGKTSKHT--------LLKSSKLALIVDLDQTIIHAT----VDPTVNELL---- 89
Query: 953 EQDREKPHR------HLFRFPHMGM----------WTKLRPGIWTFLERASKLFEMHLYT 996
QD ++ H F+ G+ + K RPG+ FL+ +KLFEMH+YT
Sbjct: 90 -QDPTLVYKGALNDVHKFKLGDFGLVNHHEFGSWYFVKFRPGLMEFLDNMNKLFEMHVYT 148
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVI 1055
MG + YA + +++DP G F R++SR + G K L+ + + S VI
Sbjct: 149 MGTRSYALAICQLIDPSGKYFGERILSRDESGS--------FTQKSLQRLFPTDTSMCVI 200
Query: 1056 IDDSVRVWPHNKLNLIVVERYTYFPC--------SRRQFGLLGPSLLEIDHDERSEDGT- 1106
IDD VW + NL+ V + +F R++ + D +E+ +D
Sbjct: 201 IDDRADVWG-DSPNLVKVIPFEFFVGIGDINALNKRKKLRSNQNGDGDGDQEEQEQDSNE 259
Query: 1107 ---------------------LASSLG--------VIERLHKIFFSHQSL-DDVDVRNIL 1136
L+ ++ ++ ++H F++ ++ DD D + I+
Sbjct: 260 SLSKQADQRPLAKRQKEHQQILSETIDEELPRLSKILTQIHSNFYNFKNAGDDPDTKEII 319
Query: 1137 AAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT 1196
+RK+L G ++VFS V P+G + ++ A +FGA + +++VTHVVA GT
Sbjct: 320 PTLKRKVLHGLKLVFSSVIPLGMP-LEISGIYNLASKFGATIDHNYNEKVTHVVAAKKGT 378
Query: 1197 DKVNWA-LSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
KV A VV W+ S + + E+++ +
Sbjct: 379 AKVEDAKKGDSAHVVWSEWLLDSCAKWEKMPEENYYL 415
>gi|9294425|dbj|BAB02545.1| unnamed protein product [Arabidopsis thaliana]
Length = 314
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/239 (33%), Positives = 127/239 (53%), Gaps = 25/239 (10%)
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
T+RL + + +KL LVLDLD TL++S + + ++ + RE + R
Sbjct: 74 TKRLITKFSCLNMKKLHLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVR 133
Query: 966 -------FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
H+ KLRP + FL+ A+++F M++YT G + YA + K++DPK + F
Sbjct: 134 GDPISITIEHL---VKLRPFLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFG 190
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
RVI+R + P +K L+ VL E VVI+DD+ + WP+NK NL+++ RY Y
Sbjct: 191 HRVITRNES----------PHTKTLDMVLADERGVVIVDDTRKAWPNNKSNLVLIGRYNY 240
Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS-HQSLDDVDVRNIL 1136
F R Q +L P E DE +G LA+ L +++ +H FF + ++ DVR +L
Sbjct: 241 F---RSQSRVLKPHSEE-KTDESENNGGLANVLKLLKGIHHKFFKVEEEVESQDVRLLL 295
>gi|171680434|ref|XP_001905162.1| hypothetical protein [Podospora anserina S mat+]
gi|170939844|emb|CAP65069.1| unnamed protein product [Podospora anserina S mat+]
Length = 835
Score = 115 bits (288), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 104/179 (58%), Gaps = 20/179 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHR 961
E QK++ +RKL LV+DLD T++ + V +P +D + K Q + PH
Sbjct: 154 ELQKRLLESRKLSLVVDLDQTVIQACIDPTVGEWMKDPTNPNYDSVKNVKTFQLDDGPHA 213
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+ + + + K+RPG+ FL+R S ++E+H+YTMG + YA +A+V+DP+ LF RV
Sbjct: 214 VVRKCWY---YIKMRPGLEGFLKRISTMYELHVYTMGTRAYAQNVARVIDPEKKLFGNRV 270
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
ISR ++G+ + SK L+ + + + VVIIDD VWPHN+ NL+ V Y +F
Sbjct: 271 ISRDENGNMY--------SKSLQRLFPVSTNMVVIIDDRSDVWPHNRPNLVKVTPYEFF 321
>gi|15226925|ref|NP_178335.1| Haloacid dehalogenase-like hydrolase-like protein [Arabidopsis
thaliana]
gi|3894162|gb|AAC78512.1| hypothetical protein [Arabidopsis thaliana]
gi|330250469|gb|AEC05563.1| Haloacid dehalogenase-like hydrolase-like protein [Arabidopsis
thaliana]
Length = 302
Score = 115 bits (288), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/234 (34%), Positives = 122/234 (52%), Gaps = 21/234 (8%)
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
T+RL Q +KL LVLDLDHTL+++ K ++ I +E + R+ R
Sbjct: 72 TKRLISQTSWLEDKKLHLVLDLDHTLVHTIKVSQLSESEKYI--TEEVESRKDLRRFNTG 129
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
FP + KLR + FL+ +++F +++YT G YA + +++DP + F RVI+R
Sbjct: 130 FPEESL-IKLRSFVHQFLKECNEMFSLYVYTKGGYDYAQLVLEMIDPDKIYFGNRVITRR 188
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
+ P K L+ VL E +V++DD VWPH+K NL+ + RY YF Q
Sbjct: 189 ES----------PGFKTLDLVLADERGIVVVDDKSSVWPHDKKNLLQIARYKYFG---DQ 235
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNIL 1136
LL +I DE E G L ++L + +H+ FF S + LD VDVR +L
Sbjct: 236 SCLLSECKKKI--DESDEKGPLNTALRFLMDVHEEFFCDWSRKDLDSVDVRPLL 287
>gi|225194907|gb|ACN81954.1| C-terminal domain phosphatase-like 5 [Arabidopsis thaliana]
Length = 601
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/256 (31%), Positives = 137/256 (53%), Gaps = 28/256 (10%)
Query: 887 EHLFEGYDDQQKA-AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHD 945
+++F+G +A A+ K T +L + +KL LVLDLDHTLL++ +
Sbjct: 355 DYIFDGLQLSHEAVALTKCFTTKLS----CLNEKKLHLVLDLDHTLLHTVMVPSLSQAEK 410
Query: 946 EILRKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKL 1001
++ + R+ L++ +G TKLRP + FL+ A++ F M++YT G+++
Sbjct: 411 YLIEEAGSATRDD----LWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRV 466
Query: 1002 YATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVR 1061
YA ++ +++DPK + F RVI++ + P K L+ VL E VVI+DD+
Sbjct: 467 YAKQVLELIDPKKLYFGDRVITKTES----------PHMKTLDFVLAEERGVVIVDDTRN 516
Query: 1062 VWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIF 1121
VWP +K NL+ + +Y+YF R G E DE +G LA+ L +++ +H+ F
Sbjct: 517 VWPDHKSNLVDISKYSYF----RLKGQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRF 572
Query: 1122 FS-HQSLDDVDVRNIL 1136
F + L+ DVR++L
Sbjct: 573 FRVEEELESKDVRSLL 588
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 125/236 (52%), Gaps = 25/236 (10%)
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
T+RL + + +KL LVLDLD TL++S + + ++ + RE + R
Sbjct: 74 TKRLITKFSCLNMKKLHLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVR 133
Query: 966 -------FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
H+ KLRP + FL+ A+++F M++YT G + YA + K++DPK + F
Sbjct: 134 GDPISITIEHL---VKLRPFLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFG 190
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
RVI+R + P +K L+ VL E VVI+DD+ + WP+NK NL+++ RY Y
Sbjct: 191 HRVITRNES----------PHTKTLDMVLADERGVVIVDDTRKAWPNNKSNLVLIGRYNY 240
Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS-HQSLDDVDVR 1133
F R Q +L P E DE +G LA+ L +++ +H FF + ++ DVR
Sbjct: 241 F---RSQSRVLKPHSEE-KTDESENNGGLANVLKLLKGIHHKFFKVEEEVESQDVR 292
>gi|303280109|ref|XP_003059347.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459183|gb|EEH56479.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 136
Score = 115 bits (287), Expect = 2e-22, Method: Composition-based stats.
Identities = 60/122 (49%), Positives = 80/122 (65%), Gaps = 8/122 (6%)
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
KLRP FL AS + ++++YTMG+K YA EMAK+LDP G LF GRVI+ +
Sbjct: 1 KLRPRAREFLRAASAMCQLYVYTMGDKNYAREMAKILDPTGELFNGRVIA--------NS 52
Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1093
D ++KDL+ VLG E +V+I+DD+ RVWPHN NLI ++RY +FP S F G S+
Sbjct: 53 DSTCSRTKDLDIVLGAEGSVLIVDDTDRVWPHNLANLIRIDRYHFFPQSAAGFRQPGRSV 112
Query: 1094 LE 1095
LE
Sbjct: 113 LE 114
>gi|186510238|ref|NP_001118664.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|9294424|dbj|BAB02544.1| unnamed protein product [Arabidopsis thaliana]
gi|332642743|gb|AEE76264.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 307
Score = 114 bits (286), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 81/256 (31%), Positives = 137/256 (53%), Gaps = 28/256 (10%)
Query: 887 EHLFEGYDDQQKA-AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHD 945
+++F+G +A A+ K T +L + +KL LVLDLDHTLL++ +
Sbjct: 61 DYIFDGLQLSHEAVALTKCFTTKL----SCLNEKKLHLVLDLDHTLLHTVMVPSLSQAEK 116
Query: 946 EILRKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKL 1001
++ + R+ L++ +G TKLRP + FL+ A++ F M++YT G+++
Sbjct: 117 YLIEEAGSATRDD----LWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRV 172
Query: 1002 YATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVR 1061
YA ++ +++DPK + F RVI++ + P K L+ VL E VVI+DD+
Sbjct: 173 YAKQVLELIDPKKLYFGDRVITKTES----------PHMKTLDFVLAEERGVVIVDDTRN 222
Query: 1062 VWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIF 1121
VWP +K NL+ + +Y+YF R G E DE +G LA+ L +++ +H+ F
Sbjct: 223 VWPDHKSNLVDISKYSYF----RLKGQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRF 278
Query: 1122 FS-HQSLDDVDVRNIL 1136
F + L+ DVR++L
Sbjct: 279 FRVEEELESKDVRSLL 294
>gi|15237769|ref|NP_197738.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|9759085|dbj|BAB09563.1| unnamed protein product [Arabidopsis thaliana]
gi|332005790|gb|AED93173.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 302
Score = 112 bits (281), Expect = 1e-21, Method: Composition-based stats.
Identities = 79/234 (33%), Positives = 119/234 (50%), Gaps = 21/234 (8%)
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
T+ L Q +KL LVLDLD TL+++ K + I+ +E + R+ R
Sbjct: 72 TKGLISQTSWLEDKKLHLVLDLDQTLIHTIKTSLLYESEKYII--EEVESRKDIKRFNTG 129
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
FP + KLRP + FL+ +++F M++YT G YA + +++DP F RVI+R
Sbjct: 130 FPEESL-IKLRPFVHQFLKECNEMFSMYVYTKGGYDYARLVLEMIDPDKFYFGNRVITRR 188
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
+ P K L+ VL E +VI+DD+ VWPH+K NL+ + RY YF
Sbjct: 189 ES----------PGFKTLDLVLADERGIVIVDDTSSVWPHDKKNLLQIARYKYFGDKSCL 238
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNIL 1136
F + DE E G L ++L ++ +H+ FF S + LD VDVR +L
Sbjct: 239 FSEDKKKI-----DESDEKGPLNTALRFLKDVHEEFFYDWSKKDLDSVDVRPLL 287
>gi|340931931|gb|EGS19464.1| hypothetical protein CTHT_0049250 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 871
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/196 (35%), Positives = 109/196 (55%), Gaps = 22/196 (11%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVH 944
DQ + E +R+E+ Q+++ +RKL LV+DLD T++ + + +P H
Sbjct: 135 DQTNLRVGAEHAQRVEQELQRRLLQSRKLSLVVDLDQTIIQACIDPTVGEWQRDPTNPNH 194
Query: 945 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
D + K Q + P L R + K+RPG+ FL+R S+++E+H+YTMG + YA
Sbjct: 195 DAVKDVKSFQLDDGPS-ALAR--KCWYYIKMRPGLEGFLKRISEMYELHVYTMGTRAYAQ 251
Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 1063
+A+V+DP LF RVISR ++G+ + +K L+ + + + VVIIDD VW
Sbjct: 252 NVARVVDPDRKLFGNRVISRDENGNIY--------TKSLQRLFPVSTNMVVIIDDRSDVW 303
Query: 1064 PHNKLNLIVVERYTYF 1079
P N+ NLI V Y +F
Sbjct: 304 PRNRPNLIKVSPYEFF 319
>gi|9294260|dbj|BAB02162.1| unnamed protein product [Arabidopsis thaliana]
Length = 288
Score = 112 bits (280), Expect = 1e-21, Method: Composition-based stats.
Identities = 70/234 (29%), Positives = 114/234 (48%), Gaps = 19/234 (8%)
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
T+ L ++ +KL LVLDLDHTL++S K + ++++++ R+ ++ R
Sbjct: 62 TKHLTTLVSVYGRKKLHLVLDLDHTLIHSMKTSNLSKAEKYLIKEEKSGSRKDLRKYNNR 121
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
K RP + FL+ A+KLF M YT G Y + +++DP + F R+I+R
Sbjct: 122 L------VKFRPFVEEFLKEANKLFTMTAYTKGGSTYGQAVVRMIDPNKIYFGDRIITRK 175
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
+ P K L+ VL E +VI+D++ VWPH+K NL+ + Y YF +
Sbjct: 176 ES----------PDLKTLDLVLADERGIVIVDNTPNVWPHHKRNLLEITSYFYFKNDGKN 225
Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS---HQSLDDVDVRNIL 1136
S E DE L + L ++ +H FF+ + LD DVR ++
Sbjct: 226 MMRSRLSYAERKSDESRTKRALVNLLKFLKEVHNGFFTCGLEEELDFKDVRYLI 279
>gi|380472901|emb|CCF46552.1| FCP1-like phosphatase, partial [Colletotrichum higginsianum]
Length = 740
Score = 110 bits (274), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 68/186 (36%), Positives = 101/186 (54%), Gaps = 32/186 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR-------------KKEEQDR 956
E QK++ RKL LV+DLD T++++ ++P E + KK + +
Sbjct: 152 ETQKRLLRQRKLSLVVDLDQTIIHAC----IEPTVGEWMEDPSNPNYEAVKDVKKFQLND 207
Query: 957 EKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
E P + G W K+RPG+ FLER ++L+E+H+YTMG + YA +AK++DP+
Sbjct: 208 EGPRGMVTS----GCWYYIKMRPGLAEFLERVAELYELHVYTMGTRAYALNIAKIVDPQQ 263
Query: 1015 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVV 1073
LF RVISR ++G SK L+ + + + VVIIDD VWP N+ NLI V
Sbjct: 264 KLFGNRVISRDENGSMI--------SKSLQRLFPVNTNMVVIIDDRADVWPSNRPNLIKV 315
Query: 1074 ERYTYF 1079
Y +F
Sbjct: 316 VPYDFF 321
>gi|408390401|gb|EKJ69801.1| hypothetical protein FPSE_10001 [Fusarium pseudograminearum CS3096]
Length = 765
Score = 109 bits (273), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 97/179 (54%), Gaps = 19/179 (10%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
E QK++ RKL LV+DLD T++++ + +P HD + K Q + R
Sbjct: 148 ENQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPSNPNHDAVKDVKSFQLNDDGPR 207
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+ + KLRPG+ FLE SK++E+H+YTMG + YA +AK++DP LF RV
Sbjct: 208 GVT--SGCTYYIKLRPGLMEFLEEVSKMYELHVYTMGTRAYALNIAKIVDPDKKLFGNRV 265
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
ISR ++G SK L+ + + + VVIIDD VWP N+ NLI V Y +F
Sbjct: 266 ISRDENGS--------ITSKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316
>gi|320591286|gb|EFX03725.1| RNA polymerase 2 ctd phosphatase [Grosmannia clavigera kw1407]
Length = 923
Score = 109 bits (273), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 103/185 (55%), Gaps = 32/185 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH----RHLFR 965
E Q+++ S RKL LV+DLD T++++ +DP E +QD P+ + + R
Sbjct: 159 ELQQRLLSQRKLSLVVDLDQTIIHAC----IDPTIGEW-----QQDPSNPNYEALKDVRR 209
Query: 966 FP--------HMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
F G W K+RP + FLE+ S ++E+H+YTMG + YAT +A+++DP
Sbjct: 210 FQLEEGFQGLARGCWYYIKMRPHLTEFLEKISTMYELHVYTMGTRTYATNIAQIVDPNQK 269
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVE 1074
LF RVISR ++G+ +K L+ + + + + VIIDD VWP+N+ NLI V
Sbjct: 270 LFGNRVISRDENGNII--------AKSLQRLFPVSTNMAVIIDDRADVWPYNRHNLIKVN 321
Query: 1075 RYTYF 1079
Y +F
Sbjct: 322 PYDFF 326
>gi|367032510|ref|XP_003665538.1| hypothetical protein MYCTH_2309412 [Myceliophthora thermophila ATCC
42464]
gi|347012809|gb|AEO60293.1| hypothetical protein MYCTH_2309412 [Myceliophthora thermophila ATCC
42464]
Length = 913
Score = 109 bits (273), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 67/196 (34%), Positives = 108/196 (55%), Gaps = 33/196 (16%)
Query: 899 AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+A+Q +RT + E Q+++ +RKL LV+DLD T++ + +DP E ++D
Sbjct: 142 SAVQAQRTEQ-ELQRRLLKSRKLSLVVDLDQTIIQAC----IDPTVGEW-----QKDPTN 191
Query: 959 PHRHL------FRFP--------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
P+ L F+ + K+RPG+ FL+R ++++E+H+YTMG + YA
Sbjct: 192 PNHELAKEVKSFQLDDGPTDLARRCWYYIKMRPGLQDFLKRIAEMYELHVYTMGTRAYAQ 251
Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVRVW 1063
+A+V+DP LF RVISR ++G+ F +K L + + + +V IIDD VW
Sbjct: 252 NVARVVDPDKKLFGNRVISRDENGNIF--------AKSLHRLFPVSTHMVAIIDDRSDVW 303
Query: 1064 PHNKLNLIVVERYTYF 1079
P N+ NLI V Y +F
Sbjct: 304 PRNRPNLIKVSPYEFF 319
>gi|46126951|ref|XP_388029.1| hypothetical protein FG07853.1 [Gibberella zeae PH-1]
Length = 765
Score = 109 bits (273), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 97/179 (54%), Gaps = 19/179 (10%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
E QK++ RKL LV+DLD T++++ + +P HD + K Q + R
Sbjct: 148 ENQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPSNPNHDAVKDVKSFQLNDDGPR 207
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+ + KLRPG+ FLE SK++E+H+YTMG + YA +AK++DP LF RV
Sbjct: 208 GVT--SGCTYYIKLRPGLMEFLEEVSKMYELHVYTMGTRAYALNIAKIVDPDKKLFGNRV 265
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
ISR ++G SK L+ + + + VVIIDD VWP N+ NLI V Y +F
Sbjct: 266 ISRDENGS--------ITSKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316
>gi|310791724|gb|EFQ27251.1| FCP1-like phosphatase [Glomerella graminicola M1.001]
Length = 860
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 101/186 (54%), Gaps = 32/186 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR-------------KKEEQDR 956
E QK++ RKL LV+DLD T++++ ++P E + KK + +
Sbjct: 152 ETQKRLLRQRKLSLVVDLDQTIIHAC----IEPTVGEWMEDPSNPNYQAVKDVKKFQLND 207
Query: 957 EKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
E P + G W K+RPG+ FLE+ ++L+E+H+YTMG + YA +AK++DP
Sbjct: 208 EGPRGMVTS----GCWYYIKMRPGLAEFLEKVAELYELHVYTMGTRAYALNIAKIVDPHQ 263
Query: 1015 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVV 1073
LF RVISR ++G SK L+ + + + VVIIDD VWP+N+ NLI V
Sbjct: 264 KLFGNRVISRDENGSMI--------SKSLQRLFPVNTNMVVIIDDRADVWPNNRPNLIKV 315
Query: 1074 ERYTYF 1079
Y +F
Sbjct: 316 VPYDFF 321
>gi|346326901|gb|EGX96497.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Cordyceps militaris
CM01]
Length = 780
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/196 (35%), Positives = 103/196 (52%), Gaps = 21/196 (10%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVH 944
DQ + + +R+E QK++ RKL LV+DLD T++++ V +P H
Sbjct: 131 DQTGLLVSENVAQRVEHDTQKRLLRQRKLSLVVDLDQTIIHACIEPTVGEWQRDPSNPNH 190
Query: 945 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
+ + Q ++ R L + KLRPG+ FLE SK++E+H+YTMG + YA
Sbjct: 191 SAVKDVRSFQLKDDGPRGLA--SGCTYYIKLRPGLRDFLEEVSKMYELHVYTMGTRAYAL 248
Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 1063
+AK++DP LF RVISR ++G +K L + + + VVIIDD VW
Sbjct: 249 NIAKIVDPDRKLFGNRVISRDENGS--------ITAKSLARLFPVSTDMVVIIDDRADVW 300
Query: 1064 PHNKLNLIVVERYTYF 1079
P NK NLI V Y +F
Sbjct: 301 PMNKANLIKVAAYDFF 316
>gi|402080254|gb|EJT75399.1| RNA polymerase II subunit A domain phosphatase [Gaeumannomyces
graminis var. tritici R3-111a-1]
Length = 850
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/194 (35%), Positives = 104/194 (53%), Gaps = 27/194 (13%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEIL 948
+KAA + E E QK++ RKL LV+DLD T++++ + +P H+ +
Sbjct: 150 KKAATKTE----FELQKRLLDQRKLILVVDLDQTIIHACIEPTIGDWQRDPTNPNHEAVK 205
Query: 949 RKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 1006
K Q + R L G W K+RPG+ FLE+ + ++E+H+YTMG + YA +
Sbjct: 206 DVKSFQLNDDGPRGLAS----GCWYYIKMRPGLVDFLEKIATMYELHVYTMGTRAYAMNI 261
Query: 1007 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPH 1065
AK++DP LF RVISR ++G +K L+ + + + VVIIDD VWP
Sbjct: 262 AKIVDPDQKLFGNRVISRDENGS--------MTAKSLQRLFPVSTRMVVIIDDRADVWPR 313
Query: 1066 NKLNLIVVERYTYF 1079
N+ NLI V Y +F
Sbjct: 314 NRPNLIKVVPYDFF 327
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 68/146 (46%), Gaps = 17/146 (11%)
Query: 1103 EDGTLASSLGVIERLHKIFF---------SHQSLDDV----DVRNILAAEQRKILAGCRI 1149
+D LA + +LH+ F+ + +S DD DV N+L + K+L G RI
Sbjct: 453 DDSELAYLEQHLTQLHEAFYRAHDAEVASAKKSADDQTNPPDVGNVLDGLKAKVLRGTRI 512
Query: 1150 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTH-VVANSLG-TDKVNWALST-G 1206
V S + PVG + + A FGA + +VTH VV+NS T K+ A G
Sbjct: 513 VLSGLVPVG-VDARQSEIGLQARSFGATIRGKVSKRVTHLVVSNSRARTQKMVEAERIPG 571
Query: 1207 RFVVHPGWVEASALLYRRANEQDFAI 1232
+V+ W+ S ++ +E +A+
Sbjct: 572 IKMVNQDWLINSLSQWKHLDETPYAV 597
>gi|302404507|ref|XP_003000091.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Verticillium albo-atrum VaMs.102]
gi|261361273|gb|EEY23701.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Verticillium albo-atrum VaMs.102]
Length = 755
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/198 (34%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVH 944
DQ + + R E Q+++ RKL LV+DLD T++++ V +P +
Sbjct: 135 DQTGLMVSNDMAARAEHDAQRRLLRQRKLSLVVDLDQTIIHACIEPTVGEWMNDPENPNY 194
Query: 945 DEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
D + ++ Q ++ R + + G W K+RPG+ FLER ++L+E+H+YTMG + Y
Sbjct: 195 DAVKDVQKFQLNDEGPRGVTQ----GCWYYIKMRPGLREFLERVAELYELHVYTMGTRAY 250
Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVR 1061
A +AK++DP+ LF RVISR ++G SK L+ + + + VVIIDD
Sbjct: 251 ALNIAKIVDPQQKLFGNRVISRDENGS--------ITSKSLQRLFPVSTNMVVIIDDRAD 302
Query: 1062 VWPHNKLNLIVVERYTYF 1079
VWP N+ NLI V Y +F
Sbjct: 303 VWPRNRPNLIKVVPYDFF 320
>gi|346975758|gb|EGY19210.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Verticillium dahliae VdLs.17]
Length = 818
Score = 107 bits (267), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
DQ + + R E Q+++ RKL LV+DLD T++++ ++P E + E
Sbjct: 135 DQTGLMVSNDMAARAEHDAQRRLLRQRKLSLVVDLDQTIIHAC----IEPTVGEWMNDPE 190
Query: 953 E------QDREKPHRHLF--RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
+D EK + R G W K+RPG+ FLE+ ++L+E+H+YTMG + Y
Sbjct: 191 NPNYDAVKDVEKFQLNDEGPRGVTQGCWYYIKMRPGLREFLEKVAELYELHVYTMGTRAY 250
Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVR 1061
A +AK++DP+ LF RVISR ++G SK L+ + + + VVIIDD
Sbjct: 251 ALNIAKIVDPQQKLFGNRVISRDENGS--------ITSKSLQRLFPVSTNMVVIIDDRAD 302
Query: 1062 VWPHNKLNLIVVERYTYF 1079
VWP N+ NLI V Y +F
Sbjct: 303 VWPRNRPNLIKVVPYDFF 320
>gi|429854785|gb|ELA29772.1| RNA polymerase ii ctd phosphatase [Colletotrichum gloeosporioides
Nara gc5]
Length = 829
Score = 107 bits (267), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 69/194 (35%), Positives = 107/194 (55%), Gaps = 26/194 (13%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSA------KFHE--VDPVHDEILRKKEEQDREKPHRHL 963
QK++ RKL LV+DLD T++++ ++ E +P ++ + K+ Q ++ R +
Sbjct: 154 QKRLLRQRKLSLVVDLDQTIIHACIEPTVGEWMEDPTNPNYNAVKDVKKFQLNDEGPRGV 213
Query: 964 FRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
G W K+RPG+ FLE+ S+L+E+H+YTMG + YA +A+++DP LF RV
Sbjct: 214 VT---SGCWYYIKMRPGLKEFLEKISELYELHVYTMGTRAYAMNIAQIVDPDRKLFGNRV 270
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
ISR ++G SK L+ + + + VVIIDD VWP N+ NLI V Y +F
Sbjct: 271 ISRDENGSMI--------SKSLQRLFPVNTNMVVIIDDRADVWPRNRPNLIKVVPYDFF- 321
Query: 1081 CSRRQFGLLGPSLL 1094
R G + S L
Sbjct: 322 ---RGIGDINSSFL 332
>gi|116179414|ref|XP_001219556.1| hypothetical protein CHGG_00335 [Chaetomium globosum CBS 148.51]
gi|88184632|gb|EAQ92100.1| hypothetical protein CHGG_00335 [Chaetomium globosum CBS 148.51]
Length = 828
Score = 107 bits (266), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 107/190 (56%), Gaps = 22/190 (11%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVH 944
DQ + +R E+ Q+++ +RKL LV+DLD T++ + + +P H
Sbjct: 135 DQTNLTVSASHAQRTEQELQRRLLVSRKLSLVVDLDQTIIQACIDPTVGDWQKDPTNPNH 194
Query: 945 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
+ + K Q + P + + + + K+RPG+ +FL+R ++++E+H+YTMG + YA
Sbjct: 195 ESVKSVKSFQLDDGPTQAANQCSY---YIKMRPGLESFLKRIAQMYELHVYTMGTRAYAQ 251
Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVRVW 1063
+A+V+DP LF RVISR ++G + +KDL+ + + + +V IIDD VW
Sbjct: 252 NVARVVDPDKKLFGNRVISRDENGSIY--------AKDLQRLFPISTHMVAIIDDRSDVW 303
Query: 1064 PHNKLNLIVV 1073
P+N+ NLI V
Sbjct: 304 PNNRANLIKV 313
>gi|322706326|gb|EFY97907.1| RNA Polymerase II CTD phosphatase Fcp1 [Metarhizium anisopliae ARSEF
23]
Length = 807
Score = 106 bits (264), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 96/177 (54%), Gaps = 19/177 (10%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHRHL 963
QK++ RKL LV+DLD T++++ + E +P H+ + K Q + R L
Sbjct: 150 QKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQKDESNPNHEAVKDVKSFQLNDDGPRGL 209
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
+ KLRPG+ FLE + ++E+H+YTMG + YA +A+++DP LF RVIS
Sbjct: 210 A--SGCTYYIKLRPGLQEFLEEIATMYELHVYTMGTRAYALNIARIVDPDRKLFGNRVIS 267
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
R ++G SK L+ + + + VVIIDD VWP N+ NLI V Y +F
Sbjct: 268 RDENGS--------ITSKSLQRLFPVSTNMVVIIDDRADVWPRNRPNLIKVVPYDFF 316
>gi|164429292|ref|XP_958446.2| hypothetical protein NCU11408 [Neurospora crassa OR74A]
gi|157073422|gb|EAA29210.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 868
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 116/223 (52%), Gaps = 42/223 (18%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
DQ + + + ++ E Q+++ RKL LV+DLD T++++ +DP E
Sbjct: 135 DQTHLTVSETQAQKTENALQRRLLQHRKLSLVVDLDQTIIHAC----IDPTVGEW----- 185
Query: 953 EQDREKPH----RHLFRFP----------HMGMWTKLRPGIWTFLERASKLFEMHLYTMG 998
++D P+ R++ F + + K+RPG+ FL++ S ++E+H+YTMG
Sbjct: 186 QKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWYYIKMRPGLEDFLKKISTMYELHVYTMG 245
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 1057
+ YA +A+++DP LF RVISR ++G+ + +K L+ + + + VVIID
Sbjct: 246 TRAYAQNVARIVDPDKKLFGNRVISRDENGNMY--------AKSLQRLFPVSTKMVVIID 297
Query: 1058 DSVRVWPHNKLNLIVVERYTYFPCSR--------RQFGLLGPS 1092
D VWP N+ NLI V Y +F +Q GLL PS
Sbjct: 298 DRADVWPRNRPNLIKVSPYDFFKGIGDINSGFLPKQQGLLTPS 340
>gi|336466789|gb|EGO54953.1| hypothetical protein NEUTE1DRAFT_84976 [Neurospora tetrasperma FGSC
2508]
gi|350288620|gb|EGZ69856.1| hypothetical protein NEUTE2DRAFT_160171 [Neurospora tetrasperma FGSC
2509]
Length = 867
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 116/223 (52%), Gaps = 42/223 (18%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
DQ + + + ++ E Q+++ RKL LV+DLD T++++ +DP E
Sbjct: 135 DQTHLTVSETQAQKTENALQRRLLQHRKLSLVVDLDQTIIHAC----IDPTVGEW----- 185
Query: 953 EQDREKPH----RHLFRFP----------HMGMWTKLRPGIWTFLERASKLFEMHLYTMG 998
++D P+ R++ F + + K+RPG+ FL++ S ++E+H+YTMG
Sbjct: 186 QKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWYYIKMRPGLEDFLKKISTMYELHVYTMG 245
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 1057
+ YA +A+++DP LF RVISR ++G+ + +K L+ + + + VVIID
Sbjct: 246 TRAYAQNVARIVDPDKKLFGNRVISRDENGNMY--------AKSLQRLFPVSTKMVVIID 297
Query: 1058 DSVRVWPHNKLNLIVVERYTYFPCSR--------RQFGLLGPS 1092
D VWP N+ NLI V Y +F +Q GLL PS
Sbjct: 298 DRADVWPRNRPNLIKVSPYDFFKGIGDINSGFLPKQQGLLTPS 340
>gi|342878347|gb|EGU79693.1| hypothetical protein FOXB_09806 [Fusarium oxysporum Fo5176]
Length = 769
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 98/179 (54%), Gaps = 19/179 (10%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
E QK++ RKL LV+DLD T++++ K +P ++ + ++ Q + R
Sbjct: 148 ENQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWKNDPTNPNYEAVKDVRDFQLNDDGPR 207
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L + KLRPG+ FL+ SK++E+H+YTMG + YA +AK++DP LF RV
Sbjct: 208 GLT--SGCTYYIKLRPGLMEFLDEVSKMYELHVYTMGTRAYALNIAKIVDPDQKLFGNRV 265
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
ISR ++G +K L+ + + + VVIIDD VWP N+ NLI V Y +F
Sbjct: 266 ISRDENGS--------ITAKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316
>gi|302889251|ref|XP_003043511.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256724428|gb|EEU37798.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 765
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 97/179 (54%), Gaps = 20/179 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
E QK++ RKL LV+DLD T++++ + +P H + K Q + P R
Sbjct: 148 ESQKRLLRQRKLTLVVDLDQTIIHACIEPTIGEWQRDPTNPNHQAVKDVKSFQLDDGP-R 206
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L + KLRPG+ FLE SK++E+H+YTMG + YA +A+++DP LF RV
Sbjct: 207 GLA--SGCTYYIKLRPGLAEFLEEISKMYELHVYTMGTRAYALNIARIVDPDKKLFGNRV 264
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
ISR ++G SK L+ + + + VVIIDD VWP N+ NLI V Y +F
Sbjct: 265 ISRDENGS--------ITSKSLQRLFPVSTDMVVIIDDRADVWPLNRPNLIKVVPYDFF 315
>gi|336259270|ref|XP_003344437.1| hypothetical protein SMAC_08633 [Sordaria macrospora k-hell]
gi|380087533|emb|CCC05319.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 878
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 110/202 (54%), Gaps = 34/202 (16%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
DQ + + + ++ E Q+++ RKL LV+DLD T++++ +DP E
Sbjct: 135 DQTHLTVSETQAQKTENALQRRLLQHRKLSLVVDLDQTIIHAC----IDPTVGEW----- 185
Query: 953 EQDREKPH----RHLFRFP----------HMGMWTKLRPGIWTFLERASKLFEMHLYTMG 998
++D P+ R++ F + + K+RPG+ FL++ S ++E+H+YTMG
Sbjct: 186 QKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWYYIKMRPGLEDFLKKISTMYELHVYTMG 245
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 1057
+ YA +A+++DP+ LF RVISR ++G+ + +K L+ + + + VVIID
Sbjct: 246 TRAYAQNVARIVDPEKKLFGNRVISRDENGNMY--------AKSLQRLFPVSTKMVVIID 297
Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
D VWP N+ NLI V Y +F
Sbjct: 298 DRADVWPRNRPNLIKVSPYDFF 319
>gi|358390781|gb|EHK40186.1| hypothetical protein TRIATDRAFT_89336 [Trichoderma atroviride IMI
206040]
Length = 768
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 97/179 (54%), Gaps = 19/179 (10%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
E QK++ RKL LV+DLD T++++ + + +P H+ + K Q + R
Sbjct: 148 ESQKRLLRQRKLSLVVDLDQTIIHACIEPTVGEWQRDKANPNHEAVKDVKSFQLNDDGPR 207
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L + KLRPG+ FLE S ++E+H+YTMG + YA +A+++DP LF RV
Sbjct: 208 GLA--SGCTYYIKLRPGLHEFLETVSTMYELHVYTMGTRAYALNIARIVDPDKKLFGNRV 265
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
ISR ++G +K L+ + + + VVIIDD VWP N+ NLI V Y +F
Sbjct: 266 ISRDENGS--------ITAKSLQRLFPVSTDMVVIIDDRSDVWPMNRPNLIKVVPYDFF 316
>gi|378731871|gb|EHY58330.1| protein phosphatase [Exophiala dermatitidis NIH/UT8656]
Length = 856
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 77/237 (32%), Positives = 120/237 (50%), Gaps = 34/237 (14%)
Query: 854 AVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ- 912
+ TN T +G E P D HL I KE R++E+
Sbjct: 102 GMCTNCGKDMTTVQAGSETTDADRAPIRMTHDTPHL----------TISKEEAARIDEEA 151
Query: 913 -KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE-----KPHRHLFRF 966
+++ S+RKL LV+DLD T++++A VDP E + K+ + + + + +
Sbjct: 152 KRRLLSSRKLSLVVDLDQTIIHAA----VDPTIAEWQKDKDNPNYDAVKDVRSFQLIDDG 207
Query: 967 PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
P M G W KLRPG+ FLE S+L+EMH+YTMG + YA ++A ++DP+ F R++S
Sbjct: 208 PGMRGCWYYIKLRPGLTEFLEHISQLYEMHIYTMGTRQYAQQIAAIVDPERKFFGDRILS 267
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
R + G +K+LE + +++ VVIIDD VW + NLI V + +F
Sbjct: 268 RDESGSMV--------AKNLERLFPVDTKMVVIIDDRGDVWKWSA-NLIRVRPFDFF 315
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 56/103 (54%), Gaps = 1/103 (0%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
D++ ++ A + K+L+G IVFS V P+G + + A+ FGA T + VTHVV
Sbjct: 527 DIKKVMPAMKMKVLSGVTIVFSGVLPLG-TDIQSADISTWAKTFGATITDKVGRGVTHVV 585
Query: 1191 ANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
A GT KV A+ G VV W+ S +R+ +E+ + ++
Sbjct: 586 AARPGTAKVKQAVKRGIKVVGTAWLIESMQQWRKLDEKPYLLE 628
>gi|281206665|gb|EFA80851.1| putative tfiif-interacting component of the c-terminal domain
phosphatase [Polysphondylium pallidum PN500]
Length = 881
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/184 (36%), Positives = 97/184 (52%), Gaps = 21/184 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSA---KFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
E K++ RKL LVLD+DHT++++ F EV P I D EK +
Sbjct: 266 ENAKRLIKQRKLSLVLDIDHTIIHAIMEPHFMEV-PYWRNI-------DCEKENIRSITL 317
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
+M + KLRP ++ FLE +K FE+H+YTMG + YA E+AK++D K LF R++SR D
Sbjct: 318 GNMKYYIKLRPFLYKFLEDVNKKFELHIYTMGTRNYALEIAKLIDEKQELFKERILSRDD 377
Query: 1027 DGD-PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
D F +R+ D S V+I+DD VW +K NL+ + Y YF +
Sbjct: 378 TTDMSFKTLQRLFPCDD--------SMVLIVDDRSDVWKRSK-NLVQISPYLYFVGCKDM 428
Query: 1086 FGLL 1089
LL
Sbjct: 429 VNLL 432
Score = 79.7 bits (195), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 74/131 (56%), Gaps = 2/131 (1%)
Query: 1104 DGTLASSLGVIERLHKIFFS-HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANP 1162
D L + + +H+I++ + + + V ++L+ ++KIL G +VFS V+P+ +
Sbjct: 548 DKHLLTVWNQLSEVHRIYYEEYDKGNKLHVADVLSIVKKKILNGVNLVFSGVYPL-QLPA 606
Query: 1163 HLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLY 1222
H PL AE+ GA I + THVVA GT KV+ A+S G +V+ W+E SA +
Sbjct: 607 HRQPLRLLAEELGATVQNDITNTTTHVVAARKGTSKVHKAISKGLKIVNQNWIEQSAFQW 666
Query: 1223 RRANEQDFAIK 1233
+R +E DF ++
Sbjct: 667 QRLDEADFPVQ 677
>gi|157823025|ref|NP_001099601.1| RNA polymerase II subunit A C-terminal domain phosphatase [Rattus
norvegicus]
gi|149015915|gb|EDL75222.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 969
Score = 104 bits (260), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 101/181 (55%), Gaps = 21/181 (11%)
Query: 901 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 960
+Q E+ R E+Q+++ RKL L++DLD TL+++ + H + I H
Sbjct: 160 MQAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------H 207
Query: 961 RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
L R M + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R
Sbjct: 208 FQLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHR 266
Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
++SR + DPF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 267 ILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 319
Query: 1080 P 1080
P
Sbjct: 320 P 320
Score = 45.1 bits (105), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ L++ D+R I+ + K+LA ++FS + FPV +
Sbjct: 595 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 654
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A VV P W
Sbjct: 655 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPEW 709
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 710 LWSCLERWDKVEEQLFPL 727
>gi|148677457|gb|EDL09404.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_a [Mus musculus]
Length = 956
Score = 104 bits (259), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 101/181 (55%), Gaps = 21/181 (11%)
Query: 901 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 960
+Q E+ R E+Q+++ RKL L++DLD TL+++ + H + I H
Sbjct: 160 MQAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------H 207
Query: 961 RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
L R M + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R
Sbjct: 208 FQLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHR 266
Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
++SR + DPF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 267 ILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 319
Query: 1080 P 1080
P
Sbjct: 320 P 320
Score = 45.4 bits (106), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ L++ D+R I+ + K+LA ++FS + FPV +
Sbjct: 582 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 641
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A VV P W
Sbjct: 642 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPDW 696
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 697 LWSCLERWDKVEEQLFPL 714
>gi|400603434|gb|EJP71032.1| FCP1-like phosphatase [Beauveria bassiana ARSEF 2860]
Length = 774
Score = 104 bits (259), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 94/177 (53%), Gaps = 19/177 (10%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
QK++ RKL LV+DLD T++++ V +P H + + Q + R L
Sbjct: 150 QKRLLRHRKLSLVVDLDQTIIHACIEPTVGEWQRDPSNPNHSAVKDVRSFQLNDDGPRGL 209
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
+ KLRPG+ FLE SK++E+H+YTMG + YA +AK++DP LF RVIS
Sbjct: 210 A--SGCTYYIKLRPGLSEFLEEISKMYELHVYTMGTRAYALNIAKIVDPDRKLFGNRVIS 267
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
R ++G SK L + + + VVIIDD VWP N+ NLI V Y +F
Sbjct: 268 RDENGS--------ITSKSLARLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316
>gi|351695852|gb|EHA98770.1| hypothetical protein GW7_03722 [Heterocephalus glaber]
Length = 963
Score = 104 bits (259), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 97/172 (56%), Gaps = 20/172 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 170 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HFQLGRGEPM 218
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 219 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 277
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
PF K+ +L+ + +S V IIDD VW NLI V++Y YFP
Sbjct: 278 PFS------KTGNLKNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 322
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 66/138 (47%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ L++ D+R I+ + K+LAG ++FS + FPV +
Sbjct: 594 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLAGVTVIFSGLHPTNFPVEKTR 653
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ THV+A GT+KV+ A VV P W
Sbjct: 654 EHYH-----ATALGAKVLTQLVLSPDAPDRATHVIAARAGTEKVHQAQECRHLHVVSPDW 708
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 709 LWSCLERWDKVEEQLFPL 726
>gi|348555132|ref|XP_003463378.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Cavia porcellus]
Length = 970
Score = 103 bits (258), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
Q E+ R E+Q+++ RKL L++DLD TL+++ + H + I H
Sbjct: 165 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 212
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L R M + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R+
Sbjct: 213 QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 271
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+SR + DPF K+ +L + +S V IIDD VW NLI V++Y YFP
Sbjct: 272 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ L++ D+R I+ + K+LA ++FS + FPV +
Sbjct: 597 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVTLIFSGLHPTNFPVEKTR 656
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A VV P W
Sbjct: 657 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPDW 711
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 712 LWSCLERWDKVEEQLFPL 729
>gi|297741470|emb|CBI32601.3| unnamed protein product [Vitis vinifera]
Length = 147
Score = 103 bits (258), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 82/143 (57%), Gaps = 11/143 (7%)
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 1056
MG + YA EM KVLDP+ V F+ VIS+ D K L+ VLG +SAV+I+
Sbjct: 1 MGEQFYALEMVKVLDPRTVYFSSSVISQADSTQR--------HQKGLDVVLGPKSAVLIL 52
Query: 1057 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIER 1116
DD+ R W ++K NLI++ERY +F S QFG SL E+ DE DG LA+ L V+++
Sbjct: 53 DDTERAWKNHKDNLILMERYHFFASSCHQFGFHCKSLSELKSDESEPDGALATILKVLQQ 112
Query: 1117 LHKIFFSHQSLDDV---DVRNIL 1136
H F + D+ DVR +L
Sbjct: 113 THSTLFDPELSDNFSGRDVRQVL 135
>gi|148677459|gb|EDL09406.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_c [Mus musculus]
Length = 1000
Score = 103 bits (258), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
Q E+ R E+Q+++ RKL L++DLD TL+++ + H + I H
Sbjct: 205 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 252
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L R M + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R+
Sbjct: 253 QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 311
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+SR + DPF K+ +L + +S V IIDD VW NLI V++Y YFP
Sbjct: 312 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 364
Score = 45.4 bits (106), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ L++ D+R I+ + K+LA ++FS + FPV +
Sbjct: 626 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 685
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A VV P W
Sbjct: 686 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPDW 740
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 741 LWSCLERWDKVEEQLFPL 758
>gi|74140094|dbj|BAE33777.1| unnamed protein product [Mus musculus]
Length = 960
Score = 103 bits (258), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
Q E+ R E+Q+++ RKL L++DLD TL+++ + H + I H
Sbjct: 165 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 212
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L R M + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R+
Sbjct: 213 QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 271
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+SR + DPF K+ +L + +S V IIDD VW NLI V++Y YFP
Sbjct: 272 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324
Score = 45.4 bits (106), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ L++ D+R I+ + K+LA ++FS + FPV +
Sbjct: 586 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 645
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A VV P W
Sbjct: 646 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPDW 700
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 701 LWSCLERWDKVEEQLFPL 718
>gi|34328280|ref|NP_080571.2| RNA polymerase II subunit A C-terminal domain phosphatase [Mus
musculus]
gi|46395722|sp|Q7TSG2.1|CTDP1_MOUSE RecName: Full=RNA polymerase II subunit A C-terminal domain
phosphatase; AltName: Full=TFIIF-associating CTD
phosphatase
gi|31419683|gb|AAH53435.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Mus musculus]
Length = 960
Score = 103 bits (258), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
Q E+ R E+Q+++ RKL L++DLD TL+++ + H + I H
Sbjct: 165 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 212
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L R M + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R+
Sbjct: 213 QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 271
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+SR + DPF K+ +L + +S V IIDD VW NLI V++Y YFP
Sbjct: 272 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324
Score = 45.4 bits (106), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ L++ D+R I+ + K+LA ++FS + FPV +
Sbjct: 586 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 645
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A VV P W
Sbjct: 646 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPDW 700
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 701 LWSCLERWDKVEEQLFPL 718
>gi|339254478|ref|XP_003372462.1| conserved hypothetical protein [Trichinella spiralis]
gi|316967111|gb|EFV51594.1| conserved hypothetical protein [Trichinella spiralis]
Length = 683
Score = 103 bits (257), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 93/367 (25%), Positives = 166/367 (45%), Gaps = 64/367 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+ K + S +KL L++DLD TL+++++ ++ D + + P+
Sbjct: 221 EDSKNLLSQKKLALLVDLDLTLIHTSE-------------TSDDSDALDVYHYQMEGPNS 267
Query: 970 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ T+LRP FL++ ++ FE+H+ T GN+ YA ++ K+LDP VLF R++SR
Sbjct: 268 PWYHTRLRPYARYFLKKINEYFELHIITHGNRKYAEKVVKMLDPNNVLFGDRILSR---D 324
Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF-------PC 1081
+ FD + + P K L G + V IIDD VW + + N++ V Y +F
Sbjct: 325 ECFDPNMKAPNLKAL--FPGGDDLVCIIDDREDVWNYAE-NVVRVRPYRFFKHTDDFNAA 381
Query: 1082 SRRQFGLLGPSLL--EIDHDERSE---------------------DGTLASSLGVIERLH 1118
+ + G + + L + DE D L ++ R+H
Sbjct: 382 TLAELGNIIETELAKNVAGDEEDAGKDEVKLWLEKADEVPTDNDPDNYLVYLFFLLRRIH 441
Query: 1119 KIFFSHQSL-----DDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQ 1173
+ F++ + L + ++ ++ A + + R VF+ + P +A + ++Q
Sbjct: 442 ETFYNVRKLTGNPKKTLSLKTVMNALRENVFKNLRFVFTGLVPADQAITD-SIFYYRSKQ 500
Query: 1174 FGAVCTK----HIDDQ---VTHVVANSLGTDKVNWALSTGR-FVVHPGWVEASALLYRRA 1225
FGAV K H D+ TH++A L T+KV A +G +V P W + R
Sbjct: 501 FGAVVQKEVVVHDGDETLPTTHLIAGKLDTEKVARARKSGSVIIVSPAWFWTCVERWCRV 560
Query: 1226 NEQDFAI 1232
E+D+ +
Sbjct: 561 PEKDYQL 567
>gi|429963056|gb|ELA42600.1| FCP1-like phosphatase, phosphatase domain-containing protein
[Vittaforma corneae ATCC 50505]
Length = 445
Score = 103 bits (257), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 106/210 (50%), Gaps = 43/210 (20%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLD 928
G + PV H + +F+ ++ +K +QK R ++L E+KKM L+LDLD
Sbjct: 45 GTDLVPVLHHT-------DRVFQTSEEARK--LQKIRNKQLNEEKKMI------LILDLD 89
Query: 929 HTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASK 988
T+L++ + ++D F + KLRP + FLE+ SK
Sbjct: 90 QTILHTTLW-KIDC------------------DFTFSISSTMFYVKLRPHLNRFLEKISK 130
Query: 989 LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 1048
+FE+H+YTMG + Y TE+ K +DP G+ F R++SR ++ + K +E +
Sbjct: 131 MFEIHIYTMGTREYVTEICKAIDPNGIYFGDRIVSRNENFNEL--------KKSIERITC 182
Query: 1049 MESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
+ VVIIDD VW ++K NL+++ + Y
Sbjct: 183 ISRNVVIIDDRADVWNYSK-NLVLIRPFWY 211
>gi|115396432|ref|XP_001213855.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114193424|gb|EAU35124.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 820
Score = 103 bits (257), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 75/202 (37%), Positives = 111/202 (54%), Gaps = 33/202 (16%)
Query: 893 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
+D+ +KE TR E+ K+ + S RKL LV+DLD T++++ VDP E +
Sbjct: 131 HDNTALTVSEKEATRVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM--- 183
Query: 952 EEQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
+D+E P H+ L F+ P M G W KLRPG+ TFLE ++LFE+H+YTMG
Sbjct: 184 --EDKENPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLETFLENVAELFELHIYTMG 241
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 1057
+ YA +A ++DP LF R++SR + G +K+L + +++ VVIID
Sbjct: 242 TRAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIID 293
Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
D VW + NLI V Y +F
Sbjct: 294 DRGDVWRWSP-NLIKVSPYDFF 314
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 54/104 (51%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV++I+ +RK+L G +VFS V P+G + LW A+ FGAV ++ I+ + TH+
Sbjct: 500 DVKDIMPYIKRKVLGGVVLVFSGVLPLGTDTQNADISLW--AKSFGAVISQKINMKTTHL 557
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ S ++ +E + +
Sbjct: 558 VAGRNRTAKVREATRYPNIKIVTTQWLLDSLTQWKHLSEDPYLL 601
>gi|358383388|gb|EHK21054.1| hypothetical protein TRIVIDRAFT_90991 [Trichoderma virens Gv29-8]
Length = 758
Score = 103 bits (257), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 68/196 (34%), Positives = 101/196 (51%), Gaps = 21/196 (10%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVH 944
DQ + K +R E QK++ RKL LV+DLD T++++ + +P H
Sbjct: 131 DQTGLMVSKNVAKRAEHDTQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPTNPNH 190
Query: 945 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
+ + K Q + R L + KLRPG+ FLE S +E+H+YTMG + YA
Sbjct: 191 EAVKDVKSFQLNDDGPRGLA--SGCTYYIKLRPGLQEFLEAVSTKYELHVYTMGTRAYAL 248
Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 1063
+A+++DP LF RVISR ++G +K L+ + + + VVIIDD VW
Sbjct: 249 NIARIVDPDRKLFGNRVISRDENGS--------ITAKSLQRLFPVSTDMVVIIDDRADVW 300
Query: 1064 PHNKLNLIVVERYTYF 1079
P N+ NLI V Y +F
Sbjct: 301 PMNRPNLIKVVPYDFF 316
>gi|344242866|gb|EGV98969.1| hypothetical protein I79_008270 [Cricetulus griseus]
Length = 848
Score = 103 bits (256), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 103/185 (55%), Gaps = 31/185 (16%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
Q E+ R E+Q+++ RKL L++DLD TL+++ E+Q + ++
Sbjct: 46 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTT----------------EQQCPQMSNK 88
Query: 962 HLFRFPHMG-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+F F +G + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ L
Sbjct: 89 GIFHF-QLGRGEPMLHTRLRPHCRDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKL 147
Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVER 1075
F+ R++SR + DPF K+ +L + +S V IIDD VW NLI V++
Sbjct: 148 FSHRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKK 200
Query: 1076 YTYFP 1080
Y YFP
Sbjct: 201 YVYFP 205
Score = 44.7 bits (104), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 62/138 (44%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ + L++ D+R I+ + K+LA ++FS + FPV +
Sbjct: 474 ILVRVHTDYYTKYDRYLKKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 533
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V D+ TH++A GT+KV A VV P W
Sbjct: 534 EHYH-----ATALGAKVLTQLVLNPDAPDRATHLIAARAGTEKVRQAQECRHLHVVSPDW 588
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 589 LWSCLERWDKVEEQLFPL 606
>gi|334325963|ref|XP_001374906.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Monodelphis domestica]
Length = 1208
Score = 103 bits (256), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 395 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 443
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 444 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 502
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW + NLI V++Y YF
Sbjct: 503 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKYAP-NLITVKKYVYF 546
Score = 48.1 bits (113), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 54/114 (47%), Gaps = 16/114 (14%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGAVCTKHI---- 1182
D+R I+ + K+LA I+FS V FP+ H H A GA K++
Sbjct: 857 DIRKIVPELKSKVLADVTIIFSGVCPTNFPIERTREHYH-----ATALGAKINKNLILSA 911
Query: 1183 DD--QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 1233
DD + TH++A GT+KV A VV+P W+ + + + EQ F +K
Sbjct: 912 DDPNKATHLIAARAGTEKVRQAQECKHLHVVNPDWLWSCLERWDKVEEQLFPLK 965
>gi|406865754|gb|EKD18795.1| FCP1-like phosphatase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 863
Score = 102 bits (255), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 98/181 (54%), Gaps = 23/181 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE--KPHRHLF--- 964
E Q+++ RKL LV+DLD T++++ ++P E R K + E K +
Sbjct: 155 ELQRRLLKHRKLSLVVDLDQTIIHAC----IEPTVGEWQRDKNSPNYEAVKDVKSFQLND 210
Query: 965 ---RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
R G W K+RPG+ FL S+L+E+H+YTMG + YA +AK++DP LF
Sbjct: 211 DGPRGLASGCWYYIKMRPGLAEFLAHISELYELHVYTMGTRAYAINIAKIVDPDKKLFGD 270
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTY 1078
R+ISR ++G+ +K L + +++ VVIIDD VWP N+ NLI V Y +
Sbjct: 271 RIISRDENGN--------VTAKSLARLFPVDTKMVVIIDDRADVWPQNRPNLIKVVPYDF 322
Query: 1079 F 1079
F
Sbjct: 323 F 323
>gi|50838820|ref|NP_001002873.1| RNA polymerase II subunit A C-terminal domain phosphatase [Danio
rerio]
gi|49618915|gb|AAT68042.1| RNA polymerase II CTD phosphatase [Danio rerio]
Length = 947
Score = 102 bits (255), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 159 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQRMSNKGIF-----------HFQLGRGEPM 207
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KLFE+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 208 -LHTRLRPHCKDFLEKIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 266
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L+ + +S V IIDD VW NLI V++Y YF
Sbjct: 267 PFS------KTGNLKNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYIYF 310
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 64/140 (45%), Gaps = 25/140 (17%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 1165
++ R+H ++S +S + D+R I+ + + LAG ++FS ++P +
Sbjct: 556 ILVRIHNEYYSRYEAFLKKESSESPDIRKIVPELKGRTLAGTTVIFSGLYPTN------Y 609
Query: 1166 PL-----WQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPG 1213
P+ W + GA K + ++ TH++A GT+KV A VV+P
Sbjct: 610 PMERTREWYHGKALGAKLCKTLVLNAKDPNRTTHLIAARAGTEKVRQAQGCKHLHVVNPD 669
Query: 1214 WVEASALLYRRANEQDFAIK 1233
W+ A + R EQ F +K
Sbjct: 670 WLWACLERWERVEEQLFPLK 689
>gi|354479392|ref|XP_003501894.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Cricetulus griseus]
Length = 978
Score = 102 bits (255), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 99/177 (55%), Gaps = 30/177 (16%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ E+Q + ++ +F F +
Sbjct: 183 EDQQRLHRNRKLVLMVDLDQTLIHTT----------------EQQCPQMSNKGIFHF-QL 225
Query: 970 G-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
G + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR
Sbjct: 226 GRGEPMLHTRLRPHCRDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSR 285
Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+ DPF K+ +L + +S V IIDD VW NLI V++Y YFP
Sbjct: 286 DECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 335
Score = 43.9 bits (102), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 62/138 (44%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ + L++ D+R I+ + K+LA ++FS + FPV +
Sbjct: 604 ILVRVHTDYYTKYDRYLKKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 663
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V D+ TH++A GT+KV A VV P W
Sbjct: 664 EHYH-----ATALGAKVLTQLVLNPDAPDRATHLIAARAGTEKVRQAQECRHLHVVSPDW 718
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 719 LWSCLERWDKVEEQLFPL 736
>gi|395830784|ref|XP_003788497.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Otolemur garnettii]
Length = 1290
Score = 102 bits (255), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 96/172 (55%), Gaps = 20/172 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCRDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
PF K+ +L + +S V IIDD VW NLI V++Y YFP
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 52/114 (45%), Gaps = 16/114 (14%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGAVCTKHI---- 1182
D+R I+ + K+LA I+FS + FP+ + H H A GA +
Sbjct: 603 DIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGAKILTQLVLNP 657
Query: 1183 DD--QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 1233
DD + TH++A GT+KV A VV P W+ + + + EQ F +K
Sbjct: 658 DDPGRATHLIAARAGTEKVLQAQGCEHLHVVSPDWLWSCLERWDKVEEQLFPLK 711
>gi|296222911|ref|XP_002757404.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Callithrix jacchus]
Length = 1053
Score = 102 bits (255), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F + + + D+R I+ + K+LA I+FS + FP+ + H H A GA
Sbjct: 611 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 665
Query: 1177 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
V + D+ TH++A GT+KV A G VV+P W+ + + + EQ
Sbjct: 666 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 725
Query: 1230 FAIK 1233
F ++
Sbjct: 726 FPLR 729
>gi|391345370|ref|XP_003746962.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Metaseiulus occidentalis]
Length = 475
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 169/387 (43%), Gaps = 89/387 (22%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
+++ ++ + +KL L++DLD TL+++ +PV+D+I H FR P
Sbjct: 21 DDELRLLTQKKLVLLVDLDQTLIHTTS----EPVYDKI-----------KGVHHFRLPSS 65
Query: 970 G-MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
W T++RPG FL + S+LFE+H+ T G + YA +A +LDP F R++SR +
Sbjct: 66 NNAWYHTRIRPGTEDFLRKISQLFELHIVTFGARPYANHIASLLDPGKKYFQYRILSRDE 125
Query: 1027 DGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
+P K+ +L+ + + V IIDD VW NLI V+ Y +F R
Sbjct: 126 CFNP------QSKTANLKSLFPCGDQMVCIIDDREDVWNFAS-NLIAVKPYVFF---RGA 175
Query: 1086 FGLLGPSLLEIDHDE--RSEDGTLASSL-------------------GVIER-------- 1116
+ P+ L D SE GT ++ L G+IE
Sbjct: 176 GDINAPAGLLADCHALPASEGGTCSTVLSHRNPEALRADREVVACLQGLIEHTCGATDGF 235
Query: 1117 -------------------LHKIFF------SHQSLDDV----DVRNILAAEQRKILAGC 1147
+H+ +F H+ + D++ ++ ++K+L
Sbjct: 236 IDYEDTDDYLLHLEDSLRTVHRAYFELYEQMKHEKHGEASSIPDLKTVIPYVRQKVLKDV 295
Query: 1148 RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR 1207
IVF+ FP+ + P ++ A GA K + +VTH+VA GT KV A
Sbjct: 296 VIVFTGCFPINQ-RPESAKIFLVAVSLGAKVQKELSKEVTHLVAARPGTAKVQQARKFRS 354
Query: 1208 F-VVHPGWVEASALLYRRANEQDFAIK 1233
VV P W+ + A + +++E F +K
Sbjct: 355 IKVVSPDWLWSCAERWEKSSEALFPLK 381
>gi|417412899|gb|JAA52807.1| Putative rna polymerase ii subunit a c-terminal domain phosphatase,
partial [Desmodus rotundus]
Length = 845
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + IL H L R M
Sbjct: 66 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIL-----------HFQLGRGEPM 114
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 115 -LHTRLRPHCRQFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 173
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 174 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 217
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 65/138 (47%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ + D+R I+ + ++LA I+FS + FPV +
Sbjct: 474 ILARVHSDYYARYDRYLRGETPEAPDIRKIVPELKSRVLADVAIIFSGLHPTNFPVEKTR 533
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V D+ TH++A GT+KV A + GR VV+P W
Sbjct: 534 EHYH-----ATALGARILTQLVLDPDAPDRATHLIAARAGTEKVRQAQACGRLHVVNPDW 588
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 589 LWSCLERWDKVEEQLFPL 606
>gi|410911388|ref|XP_003969172.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Takifugu rubripes]
Length = 905
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 102/182 (56%), Gaps = 21/182 (11%)
Query: 899 AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+A Q E+ R E+Q+++ +KL L++DLD TL+++ + H + IL
Sbjct: 156 SAEQAEQLGR-EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCQRMSNKGIL---------- 204
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
H L R M + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+
Sbjct: 205 -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 262
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
R++SR + DPF K+ +L + +S V IIDD VW NL+ V++Y
Sbjct: 263 HRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLVTVKKYV 315
Query: 1078 YF 1079
YF
Sbjct: 316 YF 317
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 66/140 (47%), Gaps = 25/140 (17%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 1165
V+ER+H +++ ++ + D+R I+ + K L G IVFS ++P +
Sbjct: 571 VLERVHAEYYARYEAYLGKEAPESPDIRKIVPELKSKTLEGATIVFSGLYPTN------Y 624
Query: 1166 PLWQTAEQF-----GAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPG 1213
P+ +T E + GA ++ + Q TH++A GT+KV A VV+P
Sbjct: 625 PIEKTREYYHAKALGATISRSLILSTKGPGQTTHLIAARAGTEKVRQAQGCKHLRVVNPD 684
Query: 1214 WVEASALLYRRANEQDFAIK 1233
W+ + + R EQ + +K
Sbjct: 685 WLWSCLERWERVEEQLYPLK 704
>gi|291414979|ref|XP_002723734.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) phosphatase, subunit 1-like [Oryctolagus
cuniculus]
Length = 940
Score = 102 bits (254), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 100/179 (55%), Gaps = 21/179 (11%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
Q E+ R E+Q+++ RKL L++DLD TL+++ + H + IL H
Sbjct: 146 QAEKIAR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIL-----------HF 193
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L R M + T+LRP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R+
Sbjct: 194 QLGRGEPM-LHTRLRPHCKDFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 252
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+SR + DPF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 253 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 304
Score = 46.2 bits (108), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 65/139 (46%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LAG I FS + FP+ +
Sbjct: 565 ILARVHADYYTRYDRYLNREVEEAPDMRRIVPELKSKVLAGVVITFSGLHPANFPIEKTR 624
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V D+ TH++A GT+KV A VV+P W
Sbjct: 625 EHYH-----ATALGAKILTRLVLNPDAPDRATHLIAARAGTEKVRQAQECRHLHVVNPDW 679
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 680 LWSCLERWDKVEEQLFPLR 698
>gi|326916917|ref|XP_003204751.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Meleagris gallopavo]
Length = 1003
Score = 102 bits (254), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 183 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 231
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 232 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 290
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 291 PFS------KTGNLRDLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 334
Score = 49.3 bits (116), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ + ++++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 622 ILVRVHTDYYTKYDKYLKKEIEEIPDIRKIVPELKSKVLADVTIIFSGLYPTNFPIEKTR 681
Query: 1162 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA K++ D+ TH++A GT+KV A VV+P W
Sbjct: 682 EHYH-----ATALGAKIVKNLVLSADDPDKATHLIAARTGTEKVRQAQDCKDLHVVNPDW 736
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F +K
Sbjct: 737 LWSCLERWDKVEEQLFPLK 755
>gi|123401628|ref|XP_001301902.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
gi|121883137|gb|EAX88972.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
Length = 461
Score = 102 bits (254), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 163/352 (46%), Gaps = 40/352 (11%)
Query: 900 AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFH---EVDPVHDEILRKKEEQDR 956
+ ++ + + LEE++++ A+KL LV+DLD TL+++ + EVD + ++ D
Sbjct: 45 SFEEAKRKNLEEEQRLIDAKKLSLVIDLDKTLIDTTEVRNRAEVDAI--------KKLDP 96
Query: 957 EKPHRHLFRFP-HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
F F + + + RP + FL + F+M +YT+ + YA + +DP+
Sbjct: 97 AATEDDFFEFNMNQNLLIRYRPHVRQFLASIAPYFDMQIYTLASPAYAHAILSKIDPEDK 156
Query: 1016 LFAGRVISRGDDG-----DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVW--PHNK 1067
LF R+ SR + + + K+++ + + V+++DDS VW +NK
Sbjct: 157 LFKNRIFSRTAEDFAMIKEAMRNQTDIVNKKNIKKIFPYSDKLVLVLDDSPEVWFCDNNK 216
Query: 1068 L--NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ 1125
L L+ ++RY+YF RQ GP+ + + D L V+ +H +F+ +
Sbjct: 217 LFKGLVQIKRYSYFT---RQ----GPNSPPTVNPDYVNDDILIQMRSVLIDVHDMFYKNY 269
Query: 1126 SLDDVDVRNILAAEQRK--ILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID 1183
++ V I+ QRK + G FS G + + AE+FGA+
Sbjct: 270 DPEESHV--IMTLHQRKAQVFEGKTFYFS-----GLSEDDTMTFTRLAEEFGALVVDSFT 322
Query: 1184 DQVTHVVANSLGT-DKVNWALST-GRFVVHPGWVEASALLYRRANEQDFAIK 1233
TH++ GT D++ A+ G ++++ W+ + Y R E ++I+
Sbjct: 323 PYTTHIIVGEGGTDDQIQKAMEYRGVYIIYLKWLFECFIQYARIEESLYSIQ 374
>gi|320164786|gb|EFW41685.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 877
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 96/171 (56%), Gaps = 12/171 (7%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
E++ + ++KL L++DLD TL+++ +V P + LR E +E + L PH+
Sbjct: 222 EKESLLQSKKLVLIVDLDQTLIHAVVSSQV-PWIGQFLRDNVELQKEIFNFSLPNHPHL- 279
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
+ KLRPG FL +A+KLFE+H++TMG+++YA+ +A VLDP G LF R++SR +
Sbjct: 280 YYIKLRPGAREFLAQATKLFELHIFTMGSRMYASRVAAVLDPDGALFGSRIMSRDESKSA 339
Query: 1031 -FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
F K L + + V ++DD + VW N+I + Y YF
Sbjct: 340 NF-------KHTQLSQLFPSGHNMVAVLDDRIDVWARLG-NVIQISPYEYF 382
>gi|440638319|gb|ELR08238.1| hypothetical protein GMDG_03040 [Geomyces destructans 20631-21]
Length = 1765
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 105/198 (53%), Gaps = 25/198 (12%)
Query: 895 DQQKAAIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
DQ ++ ++ R EEQ +++ RKL LV+DLD T++++ ++P E R
Sbjct: 134 DQTLLSVSQDEASRAEEQLQRRLLKNRKLSLVVDLDQTIIHAC----IEPTIGEWQRDPT 189
Query: 953 EQDRE--------KPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
+ E + H R G W K+RPG+ FL ++ +E+H+YTMG + Y
Sbjct: 190 SPNYEAVKDVKSFQLHDDGPRGLASGCWYYIKMRPGLAHFLTTIAEKYELHVYTMGTRAY 249
Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVR 1061
A E+AK++DP+ LF R+ISR ++G +K L + +++ VVIIDD
Sbjct: 250 AQEIAKIVDPEHKLFGDRIISRDENGS--------LTAKTLSRLFPVDTKMVVIIDDRAD 301
Query: 1062 VWPHNKLNLIVVERYTYF 1079
VWP N+ NLI V Y +F
Sbjct: 302 VWPRNRSNLIKVVPYDFF 319
>gi|194214772|ref|XP_001496059.2| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Equus caballus]
Length = 868
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 80 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 128
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 129 -LHTRLRPHCKEFLEKTAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 187
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 188 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 231
Score = 49.3 bits (116), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 61/124 (49%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F S + + D+R I+ + K+LA I+FS + FP+ + H H A GA
Sbjct: 510 FLSGEIQEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 564
Query: 1177 -VCTKHIDD-----QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
+ T+ + D + TH+VA GT+KV A G+ VV+P W+ + + + EQ
Sbjct: 565 KILTQLVLDPNDPNRATHLVAARAGTEKVRQAQECGQLHVVNPDWLWSCLERWDKVEEQL 624
Query: 1230 FAIK 1233
F ++
Sbjct: 625 FPLR 628
>gi|344269798|ref|XP_003406734.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase-like [Loxodonta africana]
Length = 972
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 178 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 226
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 227 -LHTRLRPHCKEFLEKVAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 285
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 286 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 329
Score = 43.9 bits (102), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 59/124 (47%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
+ + ++ + D+R I+ + K+LA I+FS + FPV + H H A GA
Sbjct: 608 YLNKETEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPVEKTREHYH-----ATALGA 662
Query: 1177 VCTKHI----DD--QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
++ DD + TH++A GT+KV A VV+P W+ + + + EQ
Sbjct: 663 RILTNLVLSPDDPNRATHLIAARAGTEKVRQAQECRHLHVVNPDWLWSCLERWDKVEEQL 722
Query: 1230 FAIK 1233
F ++
Sbjct: 723 FPLR 726
>gi|402903417|ref|XP_003914562.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
isoform 1 [Papio anubis]
Length = 965
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 48.5 bits (114), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F + + + D+R I+ + K+LA I+FS + FP+ + H H A GA
Sbjct: 615 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 669
Query: 1177 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
V + D+ TH++A GT+KV A G VV+P W+ + + + EQ
Sbjct: 670 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 729
Query: 1230 FAIK 1233
F ++
Sbjct: 730 FPLR 733
>gi|402903419|ref|XP_003914563.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
isoform 2 [Papio anubis]
Length = 871
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 47.4 bits (111), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F + + + D+R I+ + K+LA I+FS + FP+ + H H A GA
Sbjct: 615 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 669
Query: 1177 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
V + D+ TH++A GT+KV A G VV+P W+ + + + EQ
Sbjct: 670 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 729
Query: 1230 FAIK 1233
F ++
Sbjct: 730 FPLR 733
>gi|297702856|ref|XP_002828379.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase [Pongo abelii]
Length = 962
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 48.1 bits (113), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 597 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 656
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 657 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 711
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 712 LWSCLERWDKVEEQLFPLR 730
>gi|332850750|ref|XP_001144243.2| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
isoform 2 [Pan troglodytes]
Length = 1026
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 47.8 bits (112), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729
>gi|5326898|gb|AAD42088.1| RNA polymerase II CTD phosphatase [Homo sapiens]
Length = 961
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDRKDVWKFAP-NLITVKKYVYF 323
Score = 48.5 bits (114), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729
>gi|119587034|gb|EAW66630.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_c [Homo sapiens]
Length = 948
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 48.5 bits (114), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729
>gi|395511850|ref|XP_003760164.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Sarcophilus harrisii]
Length = 1267
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 453 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 501
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 502 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 560
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 561 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 604
Score = 46.6 bits (109), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 54/114 (47%), Gaps = 16/114 (14%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGAVCTKHI---- 1182
D+R I+ + K+LA I+FS V +P+ H H A GA K++
Sbjct: 914 DIRKIVPELKSKVLADVTIIFSGVCPTNYPIERTREHYH-----ATALGAKINKNLILNA 968
Query: 1183 DD--QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 1233
DD + TH++A GT+KV A VV+P W+ + + + EQ F +K
Sbjct: 969 DDPNKATHLIAARAGTEKVRQAQECKHLHVVNPDWLWSCLERWDKVEEQLFPLK 1022
>gi|363730338|ref|XP_418905.3| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Gallus gallus]
Length = 958
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 137 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 185
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 186 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 244
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 245 PFS------KTGNLRDLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 288
Score = 49.3 bits (116), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ + ++++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 576 ILVRVHTDYYTKYDKYLKKEIEEIPDIRKIVPELKSKVLADVTIIFSGLYPTNFPIEKTR 635
Query: 1162 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA K++ D+ TH++A GT+KV A VV+P W
Sbjct: 636 EHYH-----ATALGAKIVKNLVLSADDPDKATHLIAARTGTEKVRQAQDCKDLHVVNPDW 690
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F +K
Sbjct: 691 LWSCLERWDKVEEQLFPLK 709
>gi|340518072|gb|EGR48314.1| predicted protein [Trichoderma reesei QM6a]
Length = 594
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 96/179 (53%), Gaps = 19/179 (10%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
+ QK++ RKL LV+DLD T++++ + +P H+ + K Q + R
Sbjct: 148 DTQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPTNPNHEAVKDVKSFQLNDDGPR 207
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L + KLRPG+ FLE S +E+H+YTMG + YA +A+++DP LF RV
Sbjct: 208 GLAS--GCTYYIKLRPGLKEFLEAVSTKYELHVYTMGTRAYALNIARIVDPDKKLFGNRV 265
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
ISR ++G +K L+ + + + VVIIDD VWP+N+ NLI V Y +F
Sbjct: 266 ISRDENGS--------ITAKSLQRLFPVSTDMVVIIDDRADVWPNNRPNLIKVAPYDFF 316
>gi|109122558|ref|XP_001088601.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
isoform 2 [Macaca mulatta]
Length = 964
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 48.5 bits (114), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F + + + D+R I+ + K+LA I+FS + FP+ + H H A GA
Sbjct: 616 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 670
Query: 1177 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
V + D+ TH++A GT+KV A G VV+P W+ + + + EQ
Sbjct: 671 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 730
Query: 1230 FAIK 1233
F ++
Sbjct: 731 FPLR 734
>gi|67188445|ref|NP_004706.3| RNA polymerase II subunit A C-terminal domain phosphatase isoform 1
[Homo sapiens]
gi|327478586|sp|Q9Y5B0.3|CTDP1_HUMAN RecName: Full=RNA polymerase II subunit A C-terminal domain
phosphatase; AltName: Full=TFIIF-associating CTD
phosphatase
gi|119587032|gb|EAW66628.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_a [Homo sapiens]
Length = 961
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 48.5 bits (114), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729
>gi|3769521|gb|AAC64549.1| serine phosphatase FCP1a [Homo sapiens]
Length = 842
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 53 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204
Score = 45.1 bits (105), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 66/139 (47%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 477 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 536
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 537 DDYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 591
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 592 LWSCLERWDKVEEQLFPLR 610
>gi|410294550|gb|JAA25875.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Pan troglodytes]
Length = 961
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 48.5 bits (114), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729
>gi|321267522|ref|NP_001189433.1| RNA polymerase II subunit A C-terminal domain phosphatase isoform 3
[Homo sapiens]
Length = 842
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 53 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204
Score = 48.9 bits (115), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 477 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 536
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 537 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 591
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 592 LWSCLERWDKVEEQLFPLR 610
>gi|389637610|ref|XP_003716438.1| RNA polymerase II subunit A domain phosphatase [Magnaporthe oryzae
70-15]
gi|351642257|gb|EHA50119.1| RNA polymerase II subunit A domain phosphatase [Magnaporthe oryzae
70-15]
gi|440471327|gb|ELQ40350.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Magnaporthe oryzae Y34]
gi|440487323|gb|ELQ67117.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Magnaporthe oryzae P131]
Length = 866
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 92/177 (51%), Gaps = 16/177 (9%)
Query: 909 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV-----DPVHDEILRKKEEQDREKPHRHL 963
LE QK++ + RKL LV+DLD T++ +A + DP + KE + E P
Sbjct: 159 LEMQKRLVAQRKLVLVVDLDQTVIQTACEPTIGEWQKDPSNPNYEALKEVRSFELPSEDG 218
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
R + + K RPG FL + S LFEMH+YTM + YA + +++DPK LF RVIS
Sbjct: 219 PRRNYT-YYVKCRPGTHEFLNKVSNLFEMHVYTMATRAYAEHILRIIDPKKNLFGNRVIS 277
Query: 1024 RGDDGDPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
R ++ +R+ P S + V +IDD VWP N+ N+I V Y ++
Sbjct: 278 RNENKGIEKTLQRIFPTSTKM---------VAVIDDRTDVWPQNRSNVIKVVPYNFY 325
>gi|410215194|gb|JAA04816.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Pan troglodytes]
gi|410254644|gb|JAA15289.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Pan troglodytes]
gi|410331971|gb|JAA34932.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Pan troglodytes]
Length = 961
Score = 101 bits (252), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 48.5 bits (114), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729
>gi|397467065|ref|XP_003805250.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Pan paniscus]
Length = 842
Score = 101 bits (252), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 53 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204
Score = 48.9 bits (115), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 477 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 536
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 537 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 591
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 592 LWSCLERWDKVEEQLFPLR 610
>gi|402903421|ref|XP_003914564.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
isoform 3 [Papio anubis]
Length = 846
Score = 101 bits (252), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 53 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F + + + D+R I+ + K+LA I+FS + FP+ + H H A GA
Sbjct: 496 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 550
Query: 1177 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
V + D+ TH++A GT+KV A G VV+P W+ + + + EQ
Sbjct: 551 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 610
Query: 1230 FAIK 1233
F ++
Sbjct: 611 FPLR 614
>gi|431907029|gb|ELK11148.1| RNA polymerase II subunit A C-terminal domain phosphatase [Pteropus
alecto]
Length = 918
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + IL H L R M
Sbjct: 145 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQRMSNKGIL-----------HFQLGRGEPM 193
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 194 -LHTRLRPHCREFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 252
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 253 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 296
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 64/138 (46%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ + D+R I+ + ++LA I+FS + FPV +
Sbjct: 553 ILARVHSDYYARYDRYLRGEAPEAPDIRKIVPELKSRVLADVAIIFSGLHPTNFPVEKTR 612
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V D+ TH++A GT+KV A GR VV+P W
Sbjct: 613 EHYH-----ATALGAKILTQLVLDPDAPDRATHLIAARAGTEKVRQAQECGRLHVVNPDW 667
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + R EQ F +
Sbjct: 668 LWSCLERWDRVEEQLFPL 685
>gi|39645774|gb|AAH63447.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Homo sapiens]
Length = 867
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 47.8 bits (112), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729
>gi|67188550|ref|NP_430255.2| RNA polymerase II subunit A C-terminal domain phosphatase isoform 2
[Homo sapiens]
gi|119587035|gb|EAW66631.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_d [Homo sapiens]
Length = 867
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 47.8 bits (112), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729
>gi|355702027|gb|EHH29380.1| RNA polymerase II subunit A C-terminal domain phosphatase, partial
[Macaca mulatta]
Length = 861
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 67 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 115
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 116 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 174
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 175 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 218
Score = 48.5 bits (114), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F + + + D+R I+ + K+LA I+FS + FP+ + H H A GA
Sbjct: 511 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 565
Query: 1177 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
V + D+ TH++A GT+KV A G VV+P W+ + + + EQ
Sbjct: 566 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 625
Query: 1230 FAIK 1233
F ++
Sbjct: 626 FPLR 629
>gi|426386293|ref|XP_004059621.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
isoform 1 [Gorilla gorilla gorilla]
gi|426386295|ref|XP_004059622.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
isoform 2 [Gorilla gorilla gorilla]
Length = 842
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 53 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 477 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 536
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 537 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 591
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 592 LWSCLERWDKVEEQLFPLR 610
>gi|355755122|gb|EHH58989.1| RNA polymerase II subunit A C-terminal domain phosphatase, partial
[Macaca fascicularis]
Length = 861
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 67 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 115
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 116 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 174
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 175 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 218
Score = 48.5 bits (114), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F + + + D+R I+ + K+LA I+FS + FP+ + H H A GA
Sbjct: 511 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 565
Query: 1177 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
V + D+ TH++A GT+KV A G VV+P W+ + + + EQ
Sbjct: 566 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 625
Query: 1230 FAIK 1233
F ++
Sbjct: 626 FPLR 629
>gi|123490666|ref|XP_001325656.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
gi|121908559|gb|EAY13433.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
Length = 474
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 155/349 (44%), Gaps = 37/349 (10%)
Query: 900 AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP 959
+ ++ R R L+E++++ A+KL LV+DLD TL+++ + + H E+ E + P
Sbjct: 45 SFEEARNRNLQEEQRLIDAKKLSLVIDLDKTLIDTTEVRD----HSEV----EAIKKLDP 96
Query: 960 HR---HLFRFP-HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
H F F + + + RP + FL + F++ +YT+ YA + +DP
Sbjct: 97 HATEDDFFEFNMNQNLLIRYRPHVREFLASIAPYFDLQIYTLALPSYAHAILSKIDPDDK 156
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-------ESAVVIIDDSVRVW--PHN 1066
LF R+ SR + +E + D+ + + V+++DDS VW N
Sbjct: 157 LFKNRIFSRTAEDFAMLREEAMRNRTDIVHKKNIKKLFPYSDKLVLVLDDSPEVWYCDDN 216
Query: 1067 KL--NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH 1124
KL L+ ++RY+YF RQ GP+ + + ED L V+ +H +F+ +
Sbjct: 217 KLFKGLVQIKRYSYFT---RQ----GPNFPPTVNPDYVEDDILIQMRSVLIEVHDLFYKN 269
Query: 1125 QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDD 1184
+ V L + ++ G FS G ++ AE+FGA+
Sbjct: 270 YDPEQSHVIMTLHQRKAQVFEGKTFYFS-----GLSDADARSFTYLAEEFGALVVDSFTP 324
Query: 1185 QVTHVVANSLGTD-KVNWALS-TGRFVVHPGWVEASALLYRRANEQDFA 1231
TH++ G D +V AL G +V++ W+ + Y R E +A
Sbjct: 325 YTTHIIVGEGGADEEVQKALKYNGVYVIYLKWLFECFIQYARLEESTYA 373
>gi|119587036|gb|EAW66632.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_e [Homo sapiens]
Length = 748
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 53 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204
Score = 48.9 bits (115), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 477 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 536
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 537 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 591
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 592 LWSCLERWDKVEEQLFPLR 610
>gi|441603466|ref|XP_004087808.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase [Nomascus leucogenys]
Length = 1236
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323
Score = 47.8 bits (112), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
+ S + + D+R I+ + K+LA I+FS + FP+ + H H A GA
Sbjct: 612 YLSKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 666
Query: 1177 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
V + D+ TH++A GT+KV A G VV+P W+ + + + EQ
Sbjct: 667 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 726
Query: 1230 FAIK 1233
F ++
Sbjct: 727 FPLR 730
>gi|47217775|emb|CAG05997.1| unnamed protein product [Tetraodon nigroviridis]
Length = 979
Score = 100 bits (250), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 101/182 (55%), Gaps = 21/182 (11%)
Query: 899 AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+A Q E+ R E+Q+++ +KL L++DLD TL+++ + H + I
Sbjct: 156 SAEQAEQLGR-EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCHRMSNKGIF---------- 204
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
H L R M + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+
Sbjct: 205 -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 262
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
R++SR + DPF K+ +L + +S V IIDD VW NLI V++Y
Sbjct: 263 HRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYV 315
Query: 1078 YF 1079
YF
Sbjct: 316 YF 317
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 67/140 (47%), Gaps = 25/140 (17%)
Query: 1113 VIERLHKIFF-------SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 1165
V+ER+H ++ ++ + D+R I+ + K L G IVFS ++P +
Sbjct: 571 VLERVHAEYYVRYEAYQRKEASESPDIRKIVPELKSKTLEGATIVFSGLYPTN------Y 624
Query: 1166 PLWQTAEQF-----GAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPG 1213
P+ +T E + GA ++++ + TH++A GT+KV A + VV+P
Sbjct: 625 PIEKTREYYHAKALGAKISRNLVLSSKDPGRTTHLIAARAGTEKVRQAQGCKQLQVVNPD 684
Query: 1214 WVEASALLYRRANEQDFAIK 1233
W+ + + R EQ + +K
Sbjct: 685 WLWSCLERWERVEEQLYPLK 704
>gi|147770504|emb|CAN75676.1| hypothetical protein VITISV_003260 [Vitis vinifera]
Length = 205
Score = 100 bits (250), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 79/139 (56%), Gaps = 8/139 (5%)
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 1056
MG + YA EM KVLDP+ V F+ VIS+ D K L+ VLG +S V+I+
Sbjct: 1 MGEQFYALEMVKVLDPRTVYFSSSVISQADSTQR--------HQKGLDVVLGPKSXVLIL 52
Query: 1057 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIER 1116
DD+ R W ++K NLI++ERY +F S QFG SL E+ DE DG LA+ L V+++
Sbjct: 53 DDTERAWKNHKDNLILMERYHFFASSCHQFGFHCKSLSELKSDESEPDGALATILKVLQQ 112
Query: 1117 LHKIFFSHQSLDDVDVRNI 1135
H F + D+ R++
Sbjct: 113 THSTLFDPELSDNFSGRDV 131
>gi|449493392|ref|XP_002190004.2| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Taeniopygia guttata]
Length = 871
Score = 100 bits (250), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 53 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 102 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 161 PF------SKTGNLRDLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ + ++++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 492 ILVRVHTDYYTKYDKYLKKEIEEIPDIRKIVPELKSKVLADVTIIFSGLYPTNFPIEKTR 551
Query: 1162 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA K++ D+ TH++A GT+KV A VV+P W
Sbjct: 552 EHYH-----ATALGAKIVKNLVLSADDPDKATHLIAARTGTEKVRQAQDCKDLHVVNPDW 606
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F +K
Sbjct: 607 LWSCLERWDKVEEQLFPLK 625
>gi|119491655|ref|XP_001263322.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Neosartorya
fischeri NRRL 181]
gi|119411482|gb|EAW21425.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Neosartorya
fischeri NRRL 181]
Length = 824
Score = 100 bits (250), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 69/197 (35%), Positives = 109/197 (55%), Gaps = 23/197 (11%)
Query: 893 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
+D+ +KE TR E+ K+ + + RKL LV+DLD T++++ VDP E + K
Sbjct: 131 HDNTALTVSEKEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDK 186
Query: 952 EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
+ + E + + + P M G W KLRPG+ +FL+ S+LFE+H+YTMG + YA
Sbjct: 187 DNPNHEALSDVRAFQLVDEGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGTRAYA 246
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 1062
+A ++DP LF R++SR + G +K+L+ + +++ VVIIDD V
Sbjct: 247 QHIAGIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298
Query: 1063 WPHNKLNLIVVERYTYF 1079
W + NLI V Y +F
Sbjct: 299 WRWSP-NLIKVSPYDFF 314
Score = 47.8 bits (112), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV++I+ + ++L G +VFS V P+G + LW A+ FGAV + I+ + TH+
Sbjct: 500 DVKDIMPRIKHRVLGGVVLVFSGVLPLGTDTQNADISLW--AKSFGAVISTKINMRTTHL 557
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ S ++R E+ + +
Sbjct: 558 VAGRNRTAKVREATRYPNIKIVTTQWLLDSLTQWKRLEEEPYLL 601
>gi|403268140|ref|XP_003926140.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Saimiri boliviensis boliviensis]
Length = 937
Score = 100 bits (249), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 147 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 195
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 196 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 254
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 255 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 298
Score = 48.5 bits (114), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F + + + D+R I+ + K+LA I+FS + FP+ + H H A GA
Sbjct: 587 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 641
Query: 1177 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
V + D+ TH++A GT+KV A G VV+P W+ + + + EQ
Sbjct: 642 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 701
Query: 1230 FAIK 1233
F ++
Sbjct: 702 FPLR 705
>gi|62858037|ref|NP_001017022.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Xenopus (Silurana) tropicalis]
Length = 570
Score = 100 bits (249), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 93/171 (54%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q ++ RKL L++DLD TL+++ + H I H L R M
Sbjct: 170 EDQLRLHRNRKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 218
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KLFE+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 219 -LHTRLRPHCKEFLEKIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 277
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P+ K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 278 PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 321
>gi|89269074|emb|CAJ81904.1| ctd (carboxy terminal domain rna polymerase 2 polypeptide a)
phosphatase subunit 1 [Xenopus (Silurana) tropicalis]
Length = 567
Score = 100 bits (249), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 93/171 (54%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q ++ RKL L++DLD TL+++ + H I H L R M
Sbjct: 167 EDQLRLHRNRKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 215
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KLFE+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 216 -LHTRLRPHCKEFLEKIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 274
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P+ K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 275 PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 318
>gi|444518074|gb|ELV11938.1| RNA polymerase II subunit A C-terminal domain phosphatase [Tupaia
chinensis]
Length = 876
Score = 100 bits (249), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 17 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCAQMSNRGIF-----------HFQLGRGEPM 65
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 66 -LHTRLRPHCKDFLEKVAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 124
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 125 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 168
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 55/114 (48%), Gaps = 16/114 (14%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA------VCTK 1180
D+R I+ + K+LA I+FS + FPV + H H A GA V +
Sbjct: 444 DIRRIVPELRSKVLADVAILFSGLHPTNFPVEKTREHYH-----ATALGAKILTQLVLSP 498
Query: 1181 HIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 1233
D+ TH++A GT+KV A +G VV PGW+ + + + EQ F ++
Sbjct: 499 DAPDRATHLIAARAGTEKVRQAQESGHVHVVSPGWLWSCLERWDKVEEQLFPLR 552
>gi|358418617|ref|XP_003583993.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Bos taurus]
Length = 864
Score = 100 bits (249), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 21/179 (11%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
Q E+ R E+Q+++ RKL L++DLD TL+++ + H + I H
Sbjct: 162 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 209
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L R M + T+LRP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R+
Sbjct: 210 QLGRGEPM-LHTRLRPHCKEFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 268
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+SR + DPF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 269 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 320
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 61/124 (49%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F S +S + D+R I+ + K+LA I+FS + FPV H H A GA
Sbjct: 588 FLSGESPEAPDIRKIVPELRSKVLADVAIIFSGLHPTNFPVERTREHYH-----ARALGA 642
Query: 1177 -VCTKHIDD-----QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
+ T+ + D + TH++A GT+KV A G+ VV+P W+ + + R EQ
Sbjct: 643 RILTQLVLDPDDPARPTHLIAARAGTEKVRQAQECGQLHVVNPDWLWSCLERWDRVEEQL 702
Query: 1230 FAIK 1233
F ++
Sbjct: 703 FPLR 706
>gi|345324709|ref|XP_001509122.2| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Ornithorhynchus anatinus]
Length = 1168
Score = 100 bits (249), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 101/182 (55%), Gaps = 22/182 (12%)
Query: 901 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+ E+ ++L E+Q+++ RKL L++DLD TL+++ + H + I
Sbjct: 167 VSSEQAKQLGREDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF---------- 216
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
H L R M + T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+
Sbjct: 217 -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 274
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
R++SR + DPF K+ +L + +S V IIDD VW NLI V++Y
Sbjct: 275 HRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYV 327
Query: 1078 YF 1079
YF
Sbjct: 328 YF 329
Score = 47.0 bits (110), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 65/139 (46%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ Q + +V D+R I+ + K+LA I+FS + FP+
Sbjct: 588 ILVRVHTDYYAKYDKYLRQEIQEVPDIRKIVPELKSKVLADVTIIFSGLYPTNFPMERTR 647
Query: 1162 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA K++ ++ TH++A GT+KV A VV+P W
Sbjct: 648 EHYH-----ATALGAKINKNLILSADNPNRATHLIAARAGTEKVRQAQDCKHLHVVNPDW 702
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F +K
Sbjct: 703 LWSCLERWDKVEEQLFPLK 721
>gi|348512639|ref|XP_003443850.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Oreochromis niloticus]
Length = 998
Score = 100 bits (248), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ +KL L++DLD TL+++ + H + I H L R M
Sbjct: 166 EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCQRMSNKGIF-----------HFQLGRGEPM 214
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 215 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 273
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 274 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYIYF 317
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 67/140 (47%), Gaps = 25/140 (17%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 1165
V+ER+H +++ ++ + D+R I+ + K L G IVFS ++P +
Sbjct: 591 VLERIHAEYYARYEGYLRKEASEMPDIRKIVPELKSKTLEGTMIVFSGLYPTN------Y 644
Query: 1166 PLWQTAEQF-----GAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPG 1213
P+ +T E + GA K++ ++ TH++A GT+KV A VV+P
Sbjct: 645 PMERTREYYHAKALGAKIGKNLILSAQNPNRTTHLIAARAGTEKVRQAQGCKHLHVVNPD 704
Query: 1214 WVEASALLYRRANEQDFAIK 1233
W+ + + R EQ + +K
Sbjct: 705 WLWSCLERWERVEEQLYPLK 724
>gi|73945347|ref|XP_533365.2| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
isoform 1 [Canis lupus familiaris]
Length = 933
Score = 100 bits (248), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 169 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 217
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T++RP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 218 -LHTRVRPHCREFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 276
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 277 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 320
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ S + D+R I+ + ++LA I+FS + FP+ +
Sbjct: 572 ILARVHSDYYAKYDRYLRGDSQEAPDIRKIVPELKSRVLADVAIIFSGLHPTNFPIEKTR 631
Query: 1162 PHLHPLWQTAEQFGA-VCTKHI-----DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA + T+ + D+ TH++A GT+KV A G+ VV+P W
Sbjct: 632 EHYH-----ATALGAKILTQLVLDPDNPDRATHLIAARAGTEKVRQAQECGQLHVVNPDW 686
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + R EQ F ++
Sbjct: 687 LWSCLERWDRVEEQLFPLR 705
>gi|432884093|ref|XP_004074439.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Oryzias latipes]
Length = 1129
Score = 100 bits (248), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 162 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 210
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 211 -LHTRLRPHCKEFLEKTAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 269
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 270 PFS------KTGNLRYLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 313
Score = 44.3 bits (103), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 34/140 (24%), Positives = 65/140 (46%), Gaps = 25/140 (17%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 1165
V+ER+H +++ ++ D+R I+ + K L IVFS ++P +
Sbjct: 557 VLERIHAEYYARYEAYLRKETAQMPDIRKIVPELKSKTLEDATIVFSGLYPTN------Y 610
Query: 1166 PLWQTAEQF-----GAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPG 1213
P+ +T E + GA ++++ + TH++A GT+KV VV+P
Sbjct: 611 PMDRTREYYHAKALGAKVSRNLILNPQDPGRTTHLIAARAGTEKVRQGQGYKHLHVVNPD 670
Query: 1214 WVEASALLYRRANEQDFAIK 1233
W+ + + R +EQ + +K
Sbjct: 671 WLWSCLERWERVDEQLYPLK 690
>gi|359079164|ref|XP_003587804.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Bos taurus]
Length = 994
Score = 99.8 bits (247), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 21/179 (11%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
Q E+ R E+Q+++ RKL L++DLD TL+++ + H + I H
Sbjct: 162 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 209
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L R M + T+LRP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R+
Sbjct: 210 QLGRGEPM-LHTRLRPHCKEFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 268
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+SR + DPF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 269 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 320
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 61/124 (49%), Gaps = 16/124 (12%)
Query: 1121 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 1176
F S +S + D+R I+ + K+LA I+FS + FPV H H A GA
Sbjct: 588 FLSGESPEAPDIRKIVPELRSKVLADVAIIFSGLHPTNFPVERTREHYH-----ARALGA 642
Query: 1177 -VCTKHIDD-----QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
+ T+ + D + TH++A GT+KV A G+ VV+P W+ + + R EQ
Sbjct: 643 RILTQLVLDPDDPARPTHLIAARAGTEKVRQAQECGQLHVVNPDWLWSCLERWDRVEEQL 702
Query: 1230 FAIK 1233
F ++
Sbjct: 703 FPLR 706
>gi|350634686|gb|EHA23048.1| hypothetical protein ASPNIDRAFT_197473 [Aspergillus niger ATCC 1015]
Length = 824
Score = 99.8 bits (247), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 34/201 (16%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + ++ R+EE ++++ + RKL LV+DLD T++++ VDP E +
Sbjct: 132 DNTTLTVSEQEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183
Query: 953 EQDREKPHRHL------FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
QD+E P+ F+ P M G W KLRPG+ +FL+ S+++E+H+YTMG
Sbjct: 184 -QDKENPNYQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSEMYELHIYTMGT 242
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
+ YA +A ++DP LF R++SR + G +K+L + +++ VVIIDD
Sbjct: 243 RSYAQHIASIIDPDRKLFGDRILSRDESGSLV--------AKNLHRLFPVDTKMVVIIDD 294
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW N NLI V Y +F
Sbjct: 295 RGDVWRWNP-NLIKVSPYDFF 314
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV+ I+ + +IL G +VFS V P+G + LW A+ FGAV I+ + TH+
Sbjct: 502 DVKIIMPNMKHRILGGVTLVFSGVLPLGTDTQNADISLW--AKSFGAVIASKINMKTTHL 559
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A + +V W+ ++R E+ + +
Sbjct: 560 VAGRNRTAKVREATRYPKVKIVTTQWLLDCLTQWKRLAEEPYLL 603
>gi|358372260|dbj|GAA88864.1| RNA Polymerase II CTD phosphatase Fcp1 [Aspergillus kawachii IFO
4308]
Length = 825
Score = 99.8 bits (247), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 34/201 (16%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + ++ R+EE ++++ + RKL LV+DLD T++++ VDP E +
Sbjct: 132 DNTTLTVSEQEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183
Query: 953 EQDREKPHRHL------FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
QD+E P+ F+ P M G W KLRPG+ +FL+ S+++E+H+YTMG
Sbjct: 184 -QDKENPNYQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSEMYELHIYTMGT 242
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
+ YA +A ++DP LF R++SR + G +K+L + +++ VVIIDD
Sbjct: 243 RSYAQHIASIIDPDRKLFGDRILSRDESGSLV--------AKNLHRLFPVDTKMVVIIDD 294
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW N NLI V Y +F
Sbjct: 295 RGDVWRWNP-NLIKVSPYDFF 314
Score = 45.8 bits (107), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV+ I+ + +IL G +VFS V P+G + LW A+ FGAV I+ + TH+
Sbjct: 501 DVKIIMPQMKHRILGGVTLVFSGVLPLGTDTQNADISLW--AKSFGAVIASKINMKTTHL 558
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A + +V W+ ++R E+ + +
Sbjct: 559 VAGRNRTAKVREATRYPKVKIVTTQWLLDCLTQWKRLAEEPYLL 602
>gi|426253911|ref|XP_004020634.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase, partial [Ovis aries]
Length = 820
Score = 99.8 bits (247), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 21/179 (11%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
Q E+ R E+Q+++ RKL L++DLD TL+++ + H + I H
Sbjct: 74 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 121
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L R M + T+LRP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R+
Sbjct: 122 QLGRGEPM-LHTRLRPHCKEFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 180
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+SR + DPF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 181 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 232
>gi|345568228|gb|EGX51125.1| hypothetical protein AOL_s00054g501 [Arthrobotrys oligospora ATCC
24927]
Length = 854
Score = 99.4 bits (246), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 103/204 (50%), Gaps = 36/204 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR---- 965
E +K++ SA+KL LV+DLD T++ + VDP E D P+ H +
Sbjct: 149 EAKKRLLSAKKLSLVVDLDQTIIQAT----VDPTVGEW-----RDDPSNPNYHAVKDVEA 199
Query: 966 FPHM-------GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
F + G W KLRPG+ FL SK++E H+YTMG + YA +AK++DP+G +
Sbjct: 200 FQLLDEGAGGRGCWYYVKLRPGLKRFLSNISKIYECHIYTMGTRAYAMSIAKIVDPEGSI 259
Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVER 1075
F R++SR + G SK LE + +++ VVIIDD VW + NLI V
Sbjct: 260 FGERILSRDESGS--------LTSKSLERLFPVDTKMVVIIDDRGDVWKWSD-NLIKVTP 310
Query: 1076 YTYFPCSRRQFGLLGPSLLEIDHD 1099
Y +F G + S L HD
Sbjct: 311 YDFFVG----IGDINSSFLPKRHD 330
Score = 48.1 bits (113), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 51/106 (48%), Gaps = 7/106 (6%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
D+++I+ +R++ IVFS V P+G + + Q FGA+ I VTHVV
Sbjct: 529 DIKHIMPRMKRRVFDNIVIVFSGVIPLG-VDVQSSDVAQWGRAFGAMVVDGISAAVTHVV 587
Query: 1191 ANSLGTDKVNWALSTGRF----VVHPGWVEASALLYRRANEQDFAI 1232
A T KV A + +F +V P W S ++ + +E F I
Sbjct: 588 AARTRTAKVRTAAT--KFPHIRIVTPQWFFQSITMWHKLDETPFLI 631
>gi|30962890|gb|AAH52576.1| CTDP1 protein, partial [Homo sapiens]
Length = 874
Score = 99.4 bits (246), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 85 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 133
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 134 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 192
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P + K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 193 P------ISKTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 236
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 509 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 568
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 569 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 623
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 624 LWSCLERWDKVEEQLFPLR 642
>gi|432105445|gb|ELK31660.1| RNA polymerase II subunit A C-terminal domain phosphatase [Myotis
davidii]
Length = 823
Score = 99.4 bits (246), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + + IL H L R M
Sbjct: 53 EDQQRLHRNRKLVLMVDLDQTLIHTTEQQCQQMSNKGIL-----------HFQLGRGEPM 101
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 102 -LHTRLRPHCREFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 65/138 (47%), Gaps = 23/138 (16%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ + D+R I+ + ++LA I+FS + FPV +
Sbjct: 453 ILARVHSDYYARYDRYLRGETQEAPDIRKIVPELKSRVLADVAIIFSGLYPTNFPVEKTR 512
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V D+ TH++A GT+KV A + G+ VV+P W
Sbjct: 513 EHYH-----ATALGARILTQLVLDPDAPDRATHLIAAKAGTEKVRQAQACGQLHVVNPDW 567
Query: 1215 VEASALLYRRANEQDFAI 1232
+ + + + EQ F +
Sbjct: 568 LWSCLERWDKVEEQLFPL 585
>gi|327270066|ref|XP_003219812.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Anolis carolinensis]
Length = 965
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 96/171 (56%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H ++ +R H L R M
Sbjct: 155 EDQERLRRNRKLVLMVDLDQTLIHTTEQH-----------CQQMSNRGIFHYQLGRGEPM 203
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LD + LF+ R++SR + D
Sbjct: 204 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAAFLDSEKKLFSHRILSRDECID 262
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 263 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 306
Score = 49.7 bits (117), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 65/135 (48%), Gaps = 15/135 (11%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 1165
++ R+H ++S + D++ D+R I+ +RK+LAG I+FS ++P A
Sbjct: 589 ILVRVHADYYSKYDKYVRKETDEIPDIRKIVPELKRKVLAGVTILFSGLYPTNFAIERTR 648
Query: 1166 PLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEAS 1218
+ A GA K + ++ TH++A GT+KV + +V+P W+ +
Sbjct: 649 ENYH-ATALGAKIAKTLILDENDPNKTTHLIAARAGTEKVRQGQTCKSLHIVNPDWLWSC 707
Query: 1219 ALLYRRANEQDFAIK 1233
+ + EQ F +K
Sbjct: 708 LERWDKVEEQLFPLK 722
>gi|355681363|gb|AER96784.1| CTD phosphatase, subunit 1 [Mustela putorius furo]
Length = 819
Score = 99.0 bits (245), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 53 EDQERLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T++RP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 102 -LHTRVRPHCREFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204
Score = 48.1 bits (113), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 64/139 (46%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ S + D+R I+ + ++LA I+FS + FPV +
Sbjct: 455 ILARVHSDYYARYDRHLLGDSPEAPDIRKIVPELKSRVLADVVIIFSGLHPTNFPVEKTR 514
Query: 1162 PHLHPLWQTAEQFGAVCTKHID------DQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA + ++ TH++A GT+KV A G+ VV+P W
Sbjct: 515 EHYH-----ATALGAKVLTQLALDPGSPNRATHLIAARAGTEKVRQAQECGQLHVVNPDW 569
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + R EQ F ++
Sbjct: 570 LWSCLERWDRVEEQLFPLR 588
>gi|156050785|ref|XP_001591354.1| hypothetical protein SS1G_07980 [Sclerotinia sclerotiorum 1980]
gi|154692380|gb|EDN92118.1| hypothetical protein SS1G_07980 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 806
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 102/198 (51%), Gaps = 25/198 (12%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
DQ + + + EE Q+++ RKL LV+DLD T++++ ++P E R
Sbjct: 134 DQTHLTVSHDEASKAEEELQRRLLKNRKLSLVVDLDQTIIHAC----IEPTVGEWQRDVN 189
Query: 953 EQDRE--KPHRHLF------RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
+ E K R R G W K+RPG+ FL + S+++E+H+YTMG + Y
Sbjct: 190 SPNYEAVKDVRSFQLNDDGPRGLASGCWYYIKMRPGLAEFLTKISEMYELHVYTMGTRAY 249
Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVR 1061
A +AK++DP LF R+ISR ++G+ +K L + + +V IIDD
Sbjct: 250 ALSIAKIVDPGKKLFGDRIISRDENGN--------VTAKSLARLFPQSTHMVAIIDDRAD 301
Query: 1062 VWPHNKLNLIVVERYTYF 1079
VWP N+ NLI V Y +F
Sbjct: 302 VWPMNRPNLIKVVPYDFF 319
>gi|391867600|gb|EIT76846.1| TFIIF-interacting CTD phosphatase [Aspergillus oryzae 3.042]
Length = 820
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 110/201 (54%), Gaps = 34/201 (16%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + ++ R+EE ++++ S RKL LV+DLD T++++ VDP E +
Sbjct: 132 DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183
Query: 953 EQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
+D++ P H+ L F+ P M G W KLRPG+ +FL+ S+LFE+H+YTMG
Sbjct: 184 -EDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGT 242
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
+ YA +A ++DP LF R++SR + G +K+L + +++ VVIIDD
Sbjct: 243 RAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDD 294
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW + NLI V Y +F
Sbjct: 295 RGDVWRWSP-NLIKVSPYDFF 314
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D+++I+ +R+IL G +VFS V P+G + LW A+ FGAV ++ I+ + TH+
Sbjct: 500 DIKDIMPQIKRQILGGVILVFSGVLPLGTDTQNADISLW--AKSFGAVISQKINVKTTHL 557
Query: 1190 VANSLGTDKVNWALS-TGRFVVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A T +V W+ ++ +E+ + +
Sbjct: 558 VAGRNRTAKVREATRYTNVKIVTTQWLLDCLTQWKWLDEEPYLL 601
>gi|242781762|ref|XP_002479866.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
stipitatus ATCC 10500]
gi|218720013|gb|EED19432.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
stipitatus ATCC 10500]
Length = 822
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/197 (35%), Positives = 107/197 (54%), Gaps = 23/197 (11%)
Query: 893 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
+D+ Q+E TR E+ K+ + ++++L LV+DLD T++++ VDP E K
Sbjct: 131 HDNTALTVSQREATRVEEDAKRRLLASKRLSLVVDLDQTIIHAT----VDPTVGEWKEDK 186
Query: 952 EEQDREK-PHRHLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
+ E F+ P M G W KLRPG+ +FL+ SKL+E+H+YTMG + YA
Sbjct: 187 NNPNHEAVKDVRAFQLTDDGPGMRGCWYYIKLRPGLESFLQNISKLYELHIYTMGTRAYA 246
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 1062
+A ++DP LF R++SR + G +K+L+ + +++ VVIIDD V
Sbjct: 247 QNIANIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298
Query: 1063 WPHNKLNLIVVERYTYF 1079
W N NLI V Y +F
Sbjct: 299 WKWNP-NLIKVSPYDFF 314
Score = 43.5 bits (101), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 6/105 (5%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGE--ANPHLHPLWQTAEQFGAVCTKHIDDQVTH 1188
DV+ I+ ++K+L +VFS V P+G N + LW A+ FGA + I + TH
Sbjct: 504 DVKVIMPQIKQKVLGDVVLVFSGVLPLGTDLQNADI-SLW--AKSFGATISPRIGSKTTH 560
Query: 1189 VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
+VA T KV A R +V W+ S ++ +E+ + I
Sbjct: 561 LVAGRNRTAKVREATRYPRIKIVTTQWLLDSLTQWKHMDEEPYLI 605
>gi|83767703|dbj|BAE57842.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 820
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 110/201 (54%), Gaps = 34/201 (16%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + ++ R+EE ++++ S RKL LV+DLD T++++ VDP E +
Sbjct: 132 DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183
Query: 953 EQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
+D++ P H+ L F+ P M G W KLRPG+ +FL+ S+LFE+H+YTMG
Sbjct: 184 -EDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGT 242
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
+ YA +A ++DP LF R++SR + G +K+L + +++ VVIIDD
Sbjct: 243 RAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDD 294
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW + NLI V Y +F
Sbjct: 295 RGDVWRWSP-NLIKVSPYDFF 314
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D+++I+ +R+IL G +VFS V P+G + LW A+ FGAV ++ I+ + TH+
Sbjct: 500 DIKDIMPQIKRQILGGVILVFSGVLPLGTDTQNADISLW--AKSFGAVISQKINVKTTHL 557
Query: 1190 VANSLGTDKVNWALS-TGRFVVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A T +V W+ ++ +E+ + +
Sbjct: 558 VAGRNRTAKVREATRYTNVKIVTTQWLLDCLTQWKWLDEEPYLL 601
>gi|238486788|ref|XP_002374632.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus flavus
NRRL3357]
gi|220699511|gb|EED55850.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus flavus
NRRL3357]
Length = 698
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 110/201 (54%), Gaps = 34/201 (16%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + ++ R+EE ++++ S RKL LV+DLD T++++ VDP E +
Sbjct: 10 DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 61
Query: 953 EQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
+D++ P H+ L F+ P M G W KLRPG+ +FL+ S+LFE+H+YTMG
Sbjct: 62 -EDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGT 120
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
+ YA +A ++DP LF R++SR + G +K+L + +++ VVIIDD
Sbjct: 121 RAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDD 172
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW + NLI V Y +F
Sbjct: 173 RGDVWRWSP-NLIKVSPYDFF 192
Score = 47.8 bits (112), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D+++I+ +R+IL G +VFS V P+G + LW A+ FGAV ++ I+ + TH+
Sbjct: 378 DIKDIMPQIKRQILGGVILVFSGVLPLGTDTQNADISLW--AKSFGAVISQKINVKTTHL 435
Query: 1190 VANSLGTDKVNWALS-TGRFVVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A T +V W+ ++ +E+ + +
Sbjct: 436 VAGRNRTAKVREATRYTNVKIVTTQWLLDCLTQWKWLDEEPYLL 479
>gi|70999518|ref|XP_754478.1| RNA Polymerase II CTD phosphatase Fcp1 [Aspergillus fumigatus Af293]
gi|66852115|gb|EAL92440.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
fumigatus Af293]
Length = 827
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 109/197 (55%), Gaps = 23/197 (11%)
Query: 893 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
+D+ +KE TR E+ K+ + + RKL LV+DLD T++++ VDP E + K
Sbjct: 131 HDNTALTVSEKEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDK 186
Query: 952 EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
+ + + + + + P M G W KLRPG+ +FL+ S+LFE+H+YTMG + YA
Sbjct: 187 DNPNHDALGDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGTRAYA 246
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRV 1062
+A ++DP LF R++SR + G +K+L+ + +++ VVIIDD V
Sbjct: 247 QHIAGIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298
Query: 1063 WPHNKLNLIVVERYTYF 1079
W + NLI V Y +F
Sbjct: 299 WRWSP-NLIKVSPYDFF 314
Score = 47.4 bits (111), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV++I+ + ++L G +VFS V P+G + LW A+ FGAV + I+ + TH+
Sbjct: 500 DVKDIMPRIKHRVLGGVVLVFSGVLPLGTDTQNADISLW--AKSFGAVISTKINMRTTHL 557
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ S ++R E+ + +
Sbjct: 558 VAGRNRTAKVREATRYPNIKIVTTQWLLDSLTQWKRLEEEPYLL 601
>gi|159127495|gb|EDP52610.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
fumigatus A1163]
Length = 827
Score = 98.2 bits (243), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 109/197 (55%), Gaps = 23/197 (11%)
Query: 893 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
+D+ +KE TR E+ K+ + + RKL LV+DLD T++++ VDP E + K
Sbjct: 131 HDNTALTVSEKEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDK 186
Query: 952 EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
+ + + + + + P M G W KLRPG+ +FL+ S+LFE+H+YTMG + YA
Sbjct: 187 DNPNHDALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGTRAYA 246
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRV 1062
+A ++DP LF R++SR + G +K+L+ + +++ VVIIDD V
Sbjct: 247 QHIAGIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298
Query: 1063 WPHNKLNLIVVERYTYF 1079
W + NLI V Y +F
Sbjct: 299 WRWSP-NLIKVSPYDFF 314
Score = 47.4 bits (111), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV++I+ + ++L G +VFS V P+G + LW A+ FGAV + I+ + TH+
Sbjct: 500 DVKDIMPRIKHRVLGGVVLVFSGVLPLGTDTQNADISLW--AKSFGAVISTKINMRTTHL 557
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ S ++R E+ + +
Sbjct: 558 VAGRNRTAKVREATRYPNIKIVTTQWLLDSLTQWKRLEEEPYLL 601
>gi|347836062|emb|CCD50634.1| similar to FCP1-like phosphatase [Botryotinia fuckeliana]
Length = 832
Score = 98.2 bits (243), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 102/198 (51%), Gaps = 25/198 (12%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
DQ + + + EE Q+++ RKL LV+DLD T++++ ++P E R
Sbjct: 134 DQTHLTVSLDEASKAEEELQRRLLKNRKLSLVVDLDQTIIHAC----IEPTVGEWQRDVN 189
Query: 953 EQDRE--KPHRHLF------RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
+ E K R R G W K+RPG+ FL + S+++E+H+YTMG + Y
Sbjct: 190 SPNYEAVKDVRSFQLNDDGPRGLASGCWYYIKMRPGLAEFLAKVSEMYELHVYTMGTRAY 249
Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVR 1061
A +AK++DP LF R+ISR ++G+ +K L + + +V IIDD
Sbjct: 250 ALNIAKIVDPGKKLFGDRIISRDENGN--------VTAKSLARLFPQSTHMVAIIDDRAD 301
Query: 1062 VWPHNKLNLIVVERYTYF 1079
VWP N+ NLI V Y +F
Sbjct: 302 VWPMNRPNLIKVVPYDFF 319
Score = 40.4 bits (93), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 1142 KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS--LGTDKV 1199
++L GC IV S + P+G + + Q E FG K + +VTHVVA+S T KV
Sbjct: 534 QVLDGCVIVMSGLVPLG-VDLMRSEIAQQIESFGGKIHKKVSKRVTHVVASSQKTRTQKV 592
Query: 1200 NWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 1233
A +V W+ S +++ +E ++ ++
Sbjct: 593 REAAKYPHIKIVTQQWLTQSMSKWKKEDESEYLVE 627
>gi|410977919|ref|XP_003995346.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase [Felis catus]
Length = 960
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 95/171 (55%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++Q+++ RKL L++DLD TL+++ + H + I H L R M
Sbjct: 191 QDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 239
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T++RP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 240 -LHTRVRPHCREFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 298
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
PF K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 299 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 342
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 64/134 (47%), Gaps = 13/134 (9%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ S + D+R I+ + ++LA I+FS + FP+ +
Sbjct: 598 ILARVHSDYYAKYDRYLRGDSQEPPDIRKIVPELKSRVLADVAIIFSGLHPTNFPIEKTR 657
Query: 1162 PHLHPLWQTAEQFGAVCTKHID-DQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASA 1219
H H A+ + D D+ TH++A GT+KV A G+ VV+P W+ +
Sbjct: 658 EHYHATALGAKILTQLVLDPDDPDRATHLIAARAGTEKVRQAQECGQLHVVNPDWLWSCL 717
Query: 1220 LLYRRANEQDFAIK 1233
+ R EQ F ++
Sbjct: 718 ERWDRVEEQLFPLR 731
>gi|19115680|ref|NP_594768.1| CTD phosphatase Fcp1 [Schizosaccharomyces pombe 972h-]
gi|26393804|sp|Q9P376.1|FCP1_SCHPO RecName: Full=RNA polymerase II subunit A C-terminal domain
phosphatase; AltName: Full=CTD phosphatase fcp1
gi|9588462|emb|CAC00553.1| CTD phosphatase Fcp1 [Schizosaccharomyces pombe]
Length = 723
Score = 97.4 bits (241), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 72/216 (33%), Positives = 108/216 (50%), Gaps = 44/216 (20%)
Query: 890 FEGYDDQQKAAIQK-----------ERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAK 936
+ GY D +A I E RLE + K++ ++L L++DLD T++++
Sbjct: 121 YMGYSDMARANISMTHNTGDLTVSLEEASRLESENVKRLRQEKRLSLIVDLDQTIIHAT- 179
Query: 937 FHEVDP-----------VHDEILRKKEEQD-REKPHRHLFRFPHMGMWTKLRPGIWTFLE 984
VDP V+ ++LR + +E P + + + K RPG+ FL+
Sbjct: 180 ---VDPTVGEWMSDPGNVNYDVLRDVRSFNLQEGPSGYTSCY-----YIKFRPGLAQFLQ 231
Query: 985 RASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLE 1044
+ S+L+E+H+YTMG K YA E+AK++DP G LF RV+SR D G K L
Sbjct: 232 KISELYELHIYTMGTKAYAKEVAKIIDPTGKLFQDRVLSRDDSGS--------LAQKSLR 283
Query: 1045 GVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ + S VV+IDD VW N NLI V Y +F
Sbjct: 284 RLFPCDTSMVVVIDDRGDVWDWNP-NLIKVVPYEFF 318
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 66/133 (49%), Gaps = 14/133 (10%)
Query: 1113 VIERLHKIFFSHQSLDDVDVRN----------ILAAEQRKILAGCRIVFSRVFPVGEANP 1162
V++ +H +++ + +D+ R+ I+ ++K+L GCR++FS V P+G +
Sbjct: 452 VLKDIHAVYYEEE--NDISSRSGNHKHANVGLIIPKMKQKVLKGCRLLFSGVIPLG-VDV 508
Query: 1163 HLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALL 1221
+ + A FGA TH++A + T+KV A+S G VV W+ S
Sbjct: 509 LSSDIAKWAMSFGAEVVLDFSVPPTHLIAAKIRTEKVKKAVSMGNIKVVKLNWLTESLSQ 568
Query: 1222 YRRANEQDFAIKP 1234
++R E D+ + P
Sbjct: 569 WKRLPESDYLLYP 581
>gi|317144011|ref|XP_001819844.2| RNA polymerase II subunit A C-terminal domain phosphatase
[Aspergillus oryzae RIB40]
Length = 799
Score = 97.1 bits (240), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 102/188 (54%), Gaps = 29/188 (15%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + ++ R+EE ++++ S RKL LV+DLD T++++ VDP E +
Sbjct: 132 DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183
Query: 953 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
+D++ P+ + LRPG+ +FL+ S+LFE+H+YTMG + YA +A ++DP
Sbjct: 184 -EDKDNPNHQAL--------SDLRPGLESFLQNVSELFELHIYTMGTRAYAQHIASIIDP 234
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 1071
LF R++SR + G +K+L + +++ VVIIDD VW + NLI
Sbjct: 235 DRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDDRGDVWRWSP-NLI 285
Query: 1072 VVERYTYF 1079
V Y +F
Sbjct: 286 KVSPYDFF 293
Score = 48.1 bits (113), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D+++I+ +R+IL G +VFS V P+G + LW A+ FGAV ++ I+ + TH+
Sbjct: 479 DIKDIMPQIKRQILGGVILVFSGVLPLGTDTQNADISLW--AKSFGAVISQKINVKTTHL 536
Query: 1190 VANSLGTDKVNWALS-TGRFVVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A T +V W+ ++ +E+ + +
Sbjct: 537 VAGRNRTAKVREATRYTNVKIVTTQWLLDCLTQWKWLDEEPYLL 580
>gi|212526776|ref|XP_002143545.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
marneffei ATCC 18224]
gi|210072943|gb|EEA27030.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
marneffei ATCC 18224]
Length = 829
Score = 96.7 bits (239), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 110/202 (54%), Gaps = 33/202 (16%)
Query: 893 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
+D+ Q+E TR E+ K+ + ++++L LV+DLD T++++ VDP E
Sbjct: 131 HDNTALTVSQREATRVEEDAKRRLLASKRLSLVVDLDQTIIHAT----VDPTVGEW---- 182
Query: 952 EEQDREKPHR------HLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
++D+ P+ F+ P M G W KLRPG+ +FL+ S+L+E+H+YTMG
Sbjct: 183 -KEDKNNPNHDAVKDVRAFQLTDDGPGMRGCWYYIKLRPGLESFLQNISELYELHIYTMG 241
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 1057
+ YA +A ++DP LF R++SR + G +K+L+ + +++ VVIID
Sbjct: 242 TRAYAQHIANIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIID 293
Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
D VW N NLI V Y +F
Sbjct: 294 DRGDVWKWNP-NLIKVSPYDFF 314
Score = 48.5 bits (114), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 6/105 (5%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGE--ANPHLHPLWQTAEQFGAVCTKHIDDQVTH 1188
DV+ ++ + K+L G +VFS V P+G N + LW A+ FGAV + I + TH
Sbjct: 504 DVKVVMPQIKAKVLGGVVLVFSGVLPLGTDLQNADI-SLW--AKSFGAVISSRIGSKTTH 560
Query: 1189 VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
+VA T KV A R +V W+ S ++ NE+ + I
Sbjct: 561 LVAGRNRTAKVREATRYPRIKIVTTQWLLDSLTNWKHMNEEPYLI 605
>gi|449675210|ref|XP_002161785.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Hydra magnipapillata]
Length = 718
Score = 96.7 bits (239), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 67/183 (36%), Positives = 96/183 (52%), Gaps = 30/183 (16%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
+ E+ + +EQ+ + ARKL LV+DLD TL+++ V+P K +
Sbjct: 137 EAEKLAKYDEQQ-LLRARKLVLVVDLDMTLIHTT----VEPT-------------PKNTK 178
Query: 962 HLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
+F F G TKLRPG FLE SK +E+H++TMG++LYA +AK LDP G F
Sbjct: 179 DVFSFKLPGHQYEYHTKLRPGARKFLESISKFYELHIFTMGSRLYAHTVAKCLDPDGKFF 238
Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERY 1076
A R+ SR + + F K DL+ + + V IIDD VW + NLI V+ Y
Sbjct: 239 AHRIRSRDEFINSF------SKFHDLKALFPCGDHMVCIIDDREDVWNYAP-NLITVKPY 291
Query: 1077 TYF 1079
+F
Sbjct: 292 KFF 294
Score = 49.3 bits (116), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 67/156 (42%), Gaps = 44/156 (28%)
Query: 1113 VIERLHKIFFSHQSLDDV------------------------DVRNILAAEQRKILAGCR 1148
++ER+H F+S +LD V D+R IL +R+ L GC
Sbjct: 395 ILERVHYSFYS--TLDAVKAKGGAASKDTEITKFCTFGTMNPDIRVILPELRRQTLKGCN 452
Query: 1149 IVFSRVFPVGEANPHLHPL-----WQTAEQFGAVCTKHI----DD--QVTHVVANSLGTD 1197
IVF+ V P PL W+TA GA T + +D + THVVA GT
Sbjct: 453 IVFTGVIPTN------CPLEKSKAWKTAVSLGARVTSEVVGKEEDGLRTTHVVAARHGTH 506
Query: 1198 KVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
K + A + +V+P W+ S+ + A E F +
Sbjct: 507 KAHKAYKSPDINLVNPNWLWCSSERWEWAEESIFPV 542
>gi|148236185|ref|NP_001090168.1| CTD phosphatase [Xenopus laevis]
gi|13487713|gb|AAK27686.1| CTD phosphatase [Xenopus laevis]
Length = 980
Score = 96.7 bits (239), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 93/171 (54%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q ++ +KL L++DLD TL+++ + H I H L R M
Sbjct: 165 EDQFRLHRNKKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 213
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 214 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 272
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P+ K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 273 PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 316
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 63/139 (45%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ + +D V D+R I+ + K+L I FS + FP+
Sbjct: 601 ILVRVHTDYYAKYDRYLKKEVDSVPDIRKIVPELKSKVLENVIISFSGLHPTNFPIERTR 660
Query: 1162 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H A GA K++ D+ TH++A GT+KV A + VV+P W
Sbjct: 661 EFYH-----ARALGASIHKNLILKPDDPDRTTHLIAARAGTEKVRKAQNCKHLHVVNPEW 715
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F +K
Sbjct: 716 LWSCLERWEKVEEQLFPLK 734
>gi|121705758|ref|XP_001271142.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
clavatus NRRL 1]
gi|119399288|gb|EAW09716.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
clavatus NRRL 1]
Length = 826
Score = 96.3 bits (238), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 109/197 (55%), Gaps = 23/197 (11%)
Query: 893 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
+D+ ++E TR E+ K+ + + +KL LV+DLD T++++ VDP E + K
Sbjct: 131 HDNTSLTVSEREATRVEEDAKRRLLANKKLSLVVDLDQTIIHAT----VDPTVREWMEDK 186
Query: 952 EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
+ + E + + + P M G W KLRPG+ +FL+ ++LFE+H+YTMG + YA
Sbjct: 187 DNPNHEALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVAELFELHIYTMGTRAYA 246
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 1062
+A ++DP LF R++SR + G +K+L+ + +++ VVIIDD V
Sbjct: 247 QHIAAIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298
Query: 1063 WPHNKLNLIVVERYTYF 1079
W + NLI V Y +F
Sbjct: 299 WRWSP-NLIKVSPYDFF 314
Score = 46.6 bits (109), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV++I+ + ++L G +VFS V P+G + LW + FGAV + I+ + TH+
Sbjct: 499 DVKDIMPRIKHRVLGGVVLVFSGVLPLGTDTQNADISLW--TKSFGAVISTKINRKTTHL 556
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ + ++R +E+ + +
Sbjct: 557 VAGRNRTAKVREATRYPNIKIVTTQWLLDTLTQWKRLDEEPYLL 600
>gi|148227040|ref|NP_001081726.1| FCP1 serine phosphatase [Xenopus laevis]
gi|62185667|gb|AAH92306.1| Fcp1 protein [Xenopus laevis]
Length = 979
Score = 96.3 bits (238), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 93/171 (54%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q ++ +KL L++DLD TL+++ + H I H L R M
Sbjct: 165 EDQFRLHRNQKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 213
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 214 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 272
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P+ K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 273 PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 316
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ + +DDV D+RNI+ + K+L I FS + FP+
Sbjct: 598 ILVRVHTDYYAKYDRYLKKEVDDVPDIRNIVPELKSKVLENVIITFSGLHPTNFPIERTR 657
Query: 1162 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H A GA +K++ D+VTH++A GT+KV A + VV+P W
Sbjct: 658 EFYH-----ARALGANISKNLILKPDDSDRVTHLIAARSGTEKVRQAQNCKHLHVVNPEW 712
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F +K
Sbjct: 713 LWSCLERWEKVEEQLFPLK 731
>gi|428183780|gb|EKX52637.1| hypothetical protein GUITHDRAFT_101798 [Guillardia theta CCMP2712]
Length = 749
Score = 96.3 bits (238), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 99/181 (54%), Gaps = 33/181 (18%)
Query: 915 MFSARKLCLVLDLDHTLLNSAKFHEVDP---VHDEILRKKEEQDREKPHRHLFRFPHMGM 971
MF A LVLDLDHTLL H P + + I++ EQ ++ H+ +
Sbjct: 113 MFGA----LVLDLDHTLL-----HTTLPRTEMEEMIMQTLHEQCKDV---HVLQVSAARY 160
Query: 972 WTKLRPGIWTFLERASKLFEMHLYT--MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+TKLRPGI FL S+LFE+++YT MG++ YA +A +LD G +F GR+ISR D D
Sbjct: 161 YTKLRPGIRNFLSEMSRLFELYIYTAGMGSQQYAEAVAHMLDESGRMFRGRIISRDDYTD 220
Query: 1030 PFDGDERVPKSKDLEGVLGME---SAVVIIDDSVRVWPH-------NKLNLIVVERYTYF 1079
+ K L+ V ++ + V+I+DD+ W H ++ NLI V++Y+++
Sbjct: 221 V------SLEHKKLDKVFPIDEHRALVIILDDNAETWDHQYSDGRNSQENLIQVDKYSFW 274
Query: 1080 P 1080
P
Sbjct: 275 P 275
Score = 45.8 bits (107), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 53/108 (49%), Gaps = 15/108 (13%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQ-----FGAVCTKHIDDQ 1185
D+R +L + Q ++L+G +I FS V PH PL + E GA C + I
Sbjct: 482 DIRELLRSAQSRVLSGLKIAFSGVI------PHSFPLLSSREGRLASLLGAQCCEEIAG- 534
Query: 1186 VTH--VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 1230
+TH VV + T+KV ++ G +V P W+ A A + + +E F
Sbjct: 535 ITHLVVVIRTGLTEKVIESIRQGDVEIVSPEWLYACASRWEKVSEDKF 582
>gi|303317134|ref|XP_003068569.1| NLI interacting factor-like phosphatase family protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240108250|gb|EER26424.1| NLI interacting factor-like phosphatase family protein [Coccidioides
posadasii C735 delta SOWgp]
gi|320038484|gb|EFW20419.1| RNA Polymerase II CTD phosphatase Fcp1 [Coccidioides posadasii str.
Silveira]
Length = 868
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 104/196 (53%), Gaps = 24/196 (12%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + K+ R+EE ++++ S+RKL LV+DLD T++++ VDP E K
Sbjct: 132 DNASLTVSKDEATRVEEDAKRRLLSSRKLSLVVDLDQTIIHAT----VDPTVAEWQEDKT 187
Query: 953 EQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
+ E + + + P M G W KLRPG+ FL S L+E+H+YTMG + YA
Sbjct: 188 NPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEDFLRSISSLYELHIYTMGTRAYAQ 247
Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+A ++DP +F R++SR + G +K+L+ + +++ VVIIDD VW
Sbjct: 248 NIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDVW 299
Query: 1064 PHNKLNLIVVERYTYF 1079
+ NLI V Y +F
Sbjct: 300 NWSD-NLIRVHPYDFF 314
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D++ I+ +R+IL +VFS V P+ + + LW A+ FGA+ T+ ID + TH+
Sbjct: 510 DIKIIMPQIKRRILEAVVVVFSGVIPLDKDTQNAEISLW--AKSFGAIITQKIDSRTTHL 567
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A + +V W+ S ++R +E+ + +
Sbjct: 568 VAGRNRTAKVREATRYPKIKIVTVQWLLDSLTQWKRLDEEPYLV 611
>gi|392870961|gb|EAS32809.2| FCP1-like phosphatase, phosphatase domain-containing protein
[Coccidioides immitis RS]
Length = 868
Score = 95.5 bits (236), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 105/201 (52%), Gaps = 34/201 (16%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + K+ R+EE ++++ S+RKL LV+DLD T++++ VDP E
Sbjct: 132 DNASLTVSKDEATRVEEDAKRRLLSSRKLSLVVDLDQTIIHAT----VDPTVAEW----- 182
Query: 953 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
++D+ P+ + P M G W KLRPG+ FL S L+E+H+YTMG
Sbjct: 183 QEDKTNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEDFLRSISSLYELHIYTMGT 242
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
+ YA +A ++DP +F R++SR + G +K+L+ + +++ VVIIDD
Sbjct: 243 RAYAQNIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 294
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW + NLI V Y +F
Sbjct: 295 RGDVWNWSD-NLIRVHPYDFF 314
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D++ I+ +R+IL +VFS V P+ + + LW A+ FGA+ T+ ID + TH+
Sbjct: 510 DIKIIMPQIKRRILEAVVVVFSGVIPLDKDTQNAEISLW--AKSFGAIITQKIDSRTTHL 567
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A + +V W+ S ++R +E+ + +
Sbjct: 568 VAGRNRTAKVREATRYPKIKIVTVQWLLDSLTQWKRLDEEPYLV 611
>gi|213403530|ref|XP_002172537.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Schizosaccharomyces japonicus yFS275]
gi|212000584|gb|EEB06244.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Schizosaccharomyces japonicus yFS275]
Length = 723
Score = 95.5 bits (236), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 103/207 (49%), Gaps = 30/207 (14%)
Query: 890 FEGYDDQQKAAIQKER-------TRRLEEQK--KMFSARKLCLVLDLDHTLLNSAKFHEV 940
+ G+ D +A I RRLE + ++ ++L L++DLD T++++ V
Sbjct: 121 YMGFSDLSRATINMTHGSGGLTEARRLETETAIRLQKQKRLSLIVDLDQTIIHAT----V 176
Query: 941 DPVHDEILRKKEEQDRE---KPHRHLFRFPHMGM----WTKLRPGIWTFLERASKLFEMH 993
DP E ++ + + H R G + K RPG+ FL SKL+E+H
Sbjct: 177 DPTVGEWMKDPNNVNYKVLRDVHYFYLREGTSGYTSCYYIKPRPGLQEFLHNVSKLYELH 236
Query: 994 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SA 1052
+YTMG K YATE+AKV+DP G LF RV+SR D G+ K + + + S
Sbjct: 237 IYTMGTKAYATEVAKVIDPDGELFQDRVLSRDDSGN--------LTQKSIRRLFPCDTSM 288
Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
VV+IDD VW + NLI V + +F
Sbjct: 289 VVVIDDRGDVWNWSS-NLIKVYPFEFF 314
Score = 48.5 bits (114), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 56/105 (53%), Gaps = 2/105 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DV I+ ++++L+ CRI+FS + P+G + + + A FGA + D TH+V
Sbjct: 451 DVGLIIPEMKKRVLSHCRILFSGIIPLG-VDVITSDIAKWAMSFGAHVLLDLKDNPTHLV 509
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 1234
A+ + T+KV AL VV+ W+ S ++ E++F + P
Sbjct: 510 ASKVQTEKVRRALEIQTVKVVNLDWLLHSMSQWKALPEENFLLYP 554
>gi|119187277|ref|XP_001244245.1| hypothetical protein CIMG_03686 [Coccidioides immitis RS]
Length = 839
Score = 95.5 bits (236), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 105/201 (52%), Gaps = 34/201 (16%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + K+ R+EE ++++ S+RKL LV+DLD T++++ VDP E
Sbjct: 103 DNASLTVSKDEATRVEEDAKRRLLSSRKLSLVVDLDQTIIHAT----VDPTVAEW----- 153
Query: 953 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
++D+ P+ + P M G W KLRPG+ FL S L+E+H+YTMG
Sbjct: 154 QEDKTNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEDFLRSISSLYELHIYTMGT 213
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
+ YA +A ++DP +F R++SR + G +K+L+ + +++ VVIIDD
Sbjct: 214 RAYAQNIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 265
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW + NLI V Y +F
Sbjct: 266 RGDVWNWSD-NLIRVHPYDFF 285
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D++ I+ +R+IL +VFS V P+ + + LW A+ FGA+ T+ ID + TH+
Sbjct: 481 DIKIIMPQIKRRILEAVVVVFSGVIPLDKDTQNAEISLW--AKSFGAIITQKIDSRTTHL 538
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A + +V W+ S ++R +E+ + +
Sbjct: 539 VAGRNRTAKVREATRYPKIKIVTVQWLLDSLTQWKRLDEEPYLV 582
>gi|148236996|ref|NP_001087852.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Xenopus laevis]
gi|51950264|gb|AAH82378.1| MGC81710 protein [Xenopus laevis]
Length = 977
Score = 95.5 bits (236), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 93/171 (54%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q ++ +K+ L++DLD TL+++ + H I H L R M
Sbjct: 163 EDQLRLHRNKKVVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 211
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 212 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 270
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P+ K+ +L + +S V IIDD VW NLI V++Y YF
Sbjct: 271 PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 314
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 63/139 (45%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ + +D V D+R I+ + K+L I FS + FP+
Sbjct: 597 ILVRVHTDYYAKYDRYLKKEVDSVPDIRKIVPELKSKVLENVIISFSGLHPTNFPIERTR 656
Query: 1162 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H A GA K++ D+ TH++A GT+KV A + VV+P W
Sbjct: 657 EFYH-----ARALGASIHKNLILKPDDPDRTTHLIAARAGTEKVRKAQNCKHLHVVNPEW 711
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F +K
Sbjct: 712 LWSCLERWEKVEEQLFPLK 730
>gi|50306333|ref|XP_453140.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49642274|emb|CAH00236.1| KLLA0D01595p [Kluyveromyces lactis]
Length = 719
Score = 95.5 bits (236), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/204 (36%), Positives = 103/204 (50%), Gaps = 38/204 (18%)
Query: 895 DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK---- 950
+QQ +++ RL E+KK L LV+DLD T+++ VDP E +R
Sbjct: 138 EQQAETLERSSLTRLREEKK------LVLVVDLDQTVIHCG----VDPTIGEWMRDPKNP 187
Query: 951 --KEEQD------REKPHRHLFRF----PHMGMW--TKLRPGIWTFLERASKLFEMHLYT 996
K QD ++P F F P W KLRPG+ F E S FEMH+YT
Sbjct: 188 NYKALQDVKSFTLEDEPIIPSFYFGPKPPARKSWYYVKLRPGLKEFFEAVSPHFEMHIYT 247
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVI 1055
M + YA E+AK++DP G LF R++SR ++G +K LE + M +S VV+
Sbjct: 248 MATRSYAHEIAKIIDPTGELFGDRILSRDENGS--------LTTKSLERLFPMDQSMVVV 299
Query: 1056 IDDSVRVWPHNKLNLIVVERYTYF 1079
IDD VW + NLI V Y++F
Sbjct: 300 IDDRGDVWNWFE-NLIKVVPYSFF 322
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 70/158 (44%), Gaps = 12/158 (7%)
Query: 1080 PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS---LDDVDVRNIL 1136
P + Q L +LL D DE L ++ R+HK +F+ + + D++ +L
Sbjct: 420 PLAELQKHLHNQTLLIDDDDE------LPHLSQILLRVHKEYFNQYTEAPKNPPDIKYLL 473
Query: 1137 AAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT 1196
+ K+ GC VFS + P+ E + + + FGA + I+ + TH++ + T
Sbjct: 474 PQMKYKVFQGCHFVFSGLIPL-ETDVRKADIVLWTDMFGATTSSDINYKTTHIITTTSRT 532
Query: 1197 DKVNWALSTG--RFVVHPGWVEASALLYRRANEQDFAI 1232
K A S +VHP W+ + + R E + +
Sbjct: 533 FKARLAKSFNPDIKIVHPDWLFECLVQWERVKESPYEL 570
>gi|255712225|ref|XP_002552395.1| KLTH0C03894p [Lachancea thermotolerans]
gi|238933774|emb|CAR21957.1| KLTH0C03894p [Lachancea thermotolerans CBS 6340]
Length = 745
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 102/203 (50%), Gaps = 36/203 (17%)
Query: 895 DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA---KFHE-----VDPVHDE 946
++Q A I+K + L E KK L LV+DLD T+++ HE +P +D
Sbjct: 146 ERQAATIEKTAQKHLREHKK------LVLVVDLDQTVIHCGVDPTIHEWANDPSNPNYDA 199
Query: 947 ILRKKEEQDREKPHRHLFRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTM 997
+ K E P F +MG + KLRPG+ F ++ + FE+H+YTM
Sbjct: 200 LKNVKTFSLDEDPILPPF---YMGPRPPPRKCQYYVKLRPGLQEFFDKIAPHFELHIYTM 256
Query: 998 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVII 1056
+ YA E+AK++DPKG LF R++SR ++G K LE + M +S VVII
Sbjct: 257 ATRAYALEIAKIIDPKGELFGDRILSRDENGS--------LTHKSLERLFPMDQSMVVII 308
Query: 1057 DDSVRVWPHNKLNLIVVERYTYF 1079
DD VW + NLI V Y +F
Sbjct: 309 DDRGDVWSWCE-NLIKVVPYNFF 330
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 30/134 (22%), Positives = 60/134 (44%), Gaps = 5/134 (3%)
Query: 1103 EDGTLASSLGVIERLHKIFFSHQSLDDV--DVRNILAAEQRKILAGCRIVFSRVFPVGEA 1160
+D L ++ +HK ++ + D++ ++ + ++ C VFS + P+G
Sbjct: 443 DDDELPHLSNILLNVHKTYYDKLAEQGQPPDIKTLMPQMKSQVFKRCHFVFSGLIPLG-T 501
Query: 1161 NPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRFVVHPGWVEAS 1218
N + FGA + +D+ THV+ + GT K A + V+HP W+
Sbjct: 502 NVRQADIVLWTSMFGATTSVDVDEATTHVITKTPGTFKARLAKTYNPNIKVIHPDWIFEC 561
Query: 1219 ALLYRRANEQDFAI 1232
+ + R NE+ + +
Sbjct: 562 LVKWVRVNEKPYEL 575
>gi|440804367|gb|ELR25244.1| FCP1like phosphatase, phosphatase subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 930
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 102/185 (55%), Gaps = 27/185 (14%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE------------ILRKKEEQDREKPH 960
+++ +A+KL LVLDLD TL+++ + EV+ + L + P
Sbjct: 142 ERLTAAKKLSLVLDLDQTLVHATQDAEVETLFGTDAAEAKGGSITCALPNPPAGPEDVPA 201
Query: 961 RHLFRF-----PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
HL+RF PH + KLRP + FL LFE+H+YTMG++ YA ++A+++DP+
Sbjct: 202 AHLYRFTLEGNPHK-FYLKLRPHLEEFLMGVKDLFELHIYTMGSRSYARKVAQIIDPEQK 260
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVE 1074
LF ++SR + G+ V K+L+ + + +S V+IIDD V VW +K NLI +E
Sbjct: 261 LFRENIVSRDECGN-------VMNLKNLQRIFPVDDSMVMIIDDRVDVWGTSK-NLIKIE 312
Query: 1075 RYTYF 1079
Y +F
Sbjct: 313 PYYFF 317
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 56/101 (55%), Gaps = 2/101 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
D++ L ++R++L G I FS +FP G + P PLW+ +E+FGA C+ + TH+V
Sbjct: 600 DIKYCLHVQRRRVLEGVHICFSSIFPTG-SKPESTPLWRLSEEFGACCSNVFTPETTHLV 658
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 1230
A + T+KV A G +VH W+ S +RR E +
Sbjct: 659 ALNERTEKVKLAHERGGVHIVHLEWLLESIRTWRRPEEARY 699
>gi|189211133|ref|XP_001941897.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187977990|gb|EDU44616.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 774
Score = 94.7 bits (234), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 96/178 (53%), Gaps = 21/178 (11%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
+K++ SARKL L++DLD T++++ + +P HD + K+ Q + ++
Sbjct: 152 KKRLLSARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
+ K+RPG+ F +R SKL+EMH+YTM + YA +AK++DP+ F R++S
Sbjct: 209 SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 1080
R ++ K K L + +A+ VIIDD VW ++ +L+ V + +FP
Sbjct: 269 RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317
>gi|330930047|ref|XP_003302870.1| hypothetical protein PTT_14854 [Pyrenophora teres f. teres 0-1]
gi|311321498|gb|EFQ89046.1| hypothetical protein PTT_14854 [Pyrenophora teres f. teres 0-1]
Length = 803
Score = 94.7 bits (234), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 96/178 (53%), Gaps = 21/178 (11%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
+K++ SARKL L++DLD T++++ + +P HD + K+ Q + ++
Sbjct: 152 KKRLLSARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
+ K+RPG+ F +R SKL+EMH+YTM + YA +AK++DP+ F R++S
Sbjct: 209 SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 1080
R ++ K K L + +A+ VIIDD VW ++ +L+ V + +FP
Sbjct: 269 RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317
>gi|330799899|ref|XP_003287978.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
gi|325082002|gb|EGC35499.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
Length = 730
Score = 94.7 bits (234), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/168 (36%), Positives = 91/168 (54%), Gaps = 14/168 (8%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K++ RKL LVLDLDHTL+++ ++ + R +++ D H P M
Sbjct: 129 KRLIKERKLSLVLDLDHTLIHAVTEQGLNSSPNWKNRNRKDYD---IHNITVNGP-MTYC 184
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD-DGDPF 1031
K RP + FLE +K FE+H+YTMG + YA E+AK++DP LF R++SR D +G F
Sbjct: 185 IKKRPHLNDFLENVNKNFELHIYTMGTRNYANEIAKLIDPDQTLFKERILSRDDGNGINF 244
Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+R+ D S V+I+DD VW +K NLI + Y +F
Sbjct: 245 KTLQRLFPCDD--------SMVLIVDDRSDVWKKSK-NLIQISPYVFF 283
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 72/131 (54%), Gaps = 2/131 (1%)
Query: 1104 DGTLASSLGVIERLHKIFFSHQSLDDV-DVRNILAAEQRKILAGCRIVFSRVFPVGEANP 1162
D L L + ++H +++ + ++ V ++ +++IL IVFS V+P+G
Sbjct: 406 DCHLLIVLDKLTKIHDLYYKSKDKNEKPHVIQMVDIVKKEILKDQFIVFSGVYPLGTP-V 464
Query: 1163 HLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLY 1222
+ PL AE+FGA I + THV+A GT KVN A+S G VV P W+ S ++
Sbjct: 465 NKQPLRYLAEEFGASVENDITSKTTHVIAQRKGTSKVNKAISKGLKVVSPQWLVESTRIW 524
Query: 1223 RRANEQDFAIK 1233
+RA+E DF ++
Sbjct: 525 QRADENDFKLE 535
>gi|428672202|gb|EKX73116.1| conserved hypothetical protein [Babesia equi]
Length = 739
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 99/200 (49%), Gaps = 37/200 (18%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAK--------FHEVDPVHDE--ILRKKEEQDREKPHRHL 963
K+ RKLCLVLDLD+TLL+++ E+D + + I + + D E +
Sbjct: 240 KVLQKRKLCLVLDLDNTLLHASSQKLPSDVYVDEIDFLSKDADIFKDVQYNDDEGTLKLR 299
Query: 964 FRFPHMGMWT---------------KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
+F + T KLRPG++ FL+ S FE++L+TMG K +A+ K
Sbjct: 300 KKFESSIIQTMVYNESETMCCKSYFKLRPGVFKFLKEMSAKFELYLFTMGTKQHASSSLK 359
Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNK 1067
+LDPK + F R+ R D +R+ PK K+L V+I+DD+ VW N
Sbjct: 360 ILDPKRIYFGNRIFCRNDSRSSMKSLDRIFPKHKNL---------VLIVDDTEHVWTCN- 409
Query: 1068 LNLIVVERYTYFP-CSRRQF 1086
L LI + Y +FP S QF
Sbjct: 410 LGLIKIHPYFFFPDLSYLQF 429
>gi|301118528|ref|XP_002906992.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262108341|gb|EEY66393.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 735
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 112/254 (44%), Gaps = 63/254 (24%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
++ A+KL LVLDLDHTLL++ + VD V EI
Sbjct: 265 RRQLGAKKLSLVLDLDHTLLHAVR---VDDVVSEI------------------------- 296
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
+ + L+++ +YT G +LYA ++ ++DP F R+++R D D
Sbjct: 297 ------------KQTVLYDLFIYTHGTRLYAEKIVNIIDPDETYFKNRIVARTDTPDMLH 344
Query: 1033 GDERV--PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
++ P D S ++++DD + VW N+ N+ ++E Y YF C+ G
Sbjct: 345 KSLKLLFPSCDD--------SMILVLDDRIDVWKENEGNVFLIEPYHYFKCTSEINNASG 396
Query: 1091 PSL--LEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ-------- 1140
+ +E E SED LA S V+ +H+ F++ + +R A EQ
Sbjct: 397 RGVAGMEDSEAEASEDSHLAQSTTVLRHVHEAFYAGH---ETGMRGTTAEEQMGGHGRDV 453
Query: 1141 RKILAGCRIVFSRV 1154
+ IL+ C +SRV
Sbjct: 454 KGILSACARSWSRV 467
>gi|430812451|emb|CCJ30145.1| unnamed protein product [Pneumocystis jirovecii]
Length = 741
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 105/211 (49%), Gaps = 34/211 (16%)
Query: 890 FEGYDDQQKAAIQ-----------KERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAK 936
F G+ D +A IQ KE RLE + +++ KL L++DLD T+L++
Sbjct: 151 FTGFLDSTRATIQMSHDATKLTVSKEEATRLERETMERLLKEMKLSLIVDLDQTILHAT- 209
Query: 937 FHEVDPVHDEILRK---KEEQDREKPHRHLFRFPHMGM----WTKLRPGIWTFLERASKL 989
VDP+ E L K + + + + G+ + K+RPG+ FLE SKL
Sbjct: 210 ---VDPIVGEWLSNPSSKHYLAVQDVQKFCLKENNSGIGNWYYVKMRPGLEQFLENISKL 266
Query: 990 FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 1049
+EMH+YTMG + YA +A ++D F R++SR + G K+++ + +
Sbjct: 267 YEMHIYTMGTRAYAASIAHLIDKDKKYFGDRILSRDESGS--------TTRKNIQRLFPV 318
Query: 1050 E-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ S VVIIDD VW + NLI V Y +F
Sbjct: 319 DTSMVVIIDDRADVWQWSP-NLIKVTPYEFF 348
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 56/107 (52%), Gaps = 2/107 (1%)
Query: 1127 LDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV 1186
++ +D++ I+ + +L G +VFS + P+G + + Q A FGA +K+I + V
Sbjct: 510 INKLDIKVIMPMLKSNVLKGTNLVFSGIVPMG-MDVLSSNIAQWAINFGAKVSKNISNDV 568
Query: 1187 THVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
TH++A + T KV AL +V W+ S ++R E D+ +
Sbjct: 569 THLIATKIRTTKVKKALQQKNIKIVSIDWLLHSISHWKRLPESDYQL 615
>gi|281204241|gb|EFA78437.1| hypothetical protein PPL_09089 [Polysphondylium pallidum PN500]
Length = 1252
Score = 94.0 bits (232), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 129/305 (42%), Gaps = 47/305 (15%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV-ISRGDDGDP 1030
+ K+RP TFL+ LF + L+++ +K Y +M +++DP LF + I D P
Sbjct: 934 YVKIRPYTITFLKTLYPLFNITLFSLNHKSYVNKMVEIIDPSKTLFKNIITIESFGDNIP 993
Query: 1031 FDGDERVPKS----KDLEGVLGMES--AVVIIDDSVRVWPHNKLNLIVVERYTYF----- 1079
R P S + + ++S ++V+IDD +W + NLI+VER+ +F
Sbjct: 994 KQQTNR-PYSLFTPSNFSSIFKIDSSESIVVIDDREDIWRQFRDNLIMVERFIHFSKDVE 1052
Query: 1080 ---------PCS------RRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKI---- 1120
P S + F L PS+ ++ R + ++ +I
Sbjct: 1053 HSKITTYHYPFSIQDLSKQYHFPKLEPSIPYLEIFSRLLKIIQCEYMSRVQETQQINNNN 1112
Query: 1121 -----------FFSHQSLDD--VDVRNILAAEQRKILAGCRIVFSRVFP--VGEANPHLH 1165
Q++DD R I+ ++ +L C IVFS +FP + A
Sbjct: 1113 NNNSSSGKQQLMAGAQTIDDNLSQSRTIMKEIRQSVLMDCNIVFSGIFPKQIDSAKLVTT 1172
Query: 1166 PLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRA 1225
+ Q E FGA I D TH++ GT KV A+ G VVH W+ S + +
Sbjct: 1173 RIVQLTESFGAKVQMDITDNTTHLIYIKEGTSKVLQAMKCGIKVVHFAWLRDSIYHWEKM 1232
Query: 1226 NEQDF 1230
+E ++
Sbjct: 1233 DESNY 1237
>gi|156837042|ref|XP_001642557.1| hypothetical protein Kpol_1068p9 [Vanderwaltozyma polyspora DSM
70294]
gi|156113100|gb|EDO14699.1| hypothetical protein Kpol_1068p9 [Vanderwaltozyma polyspora DSM
70294]
Length = 745
Score = 94.0 bits (232), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 93/192 (48%), Gaps = 42/192 (21%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------ 965
+K++ +KL LV+DLD T+++ VDP E + D P+ R
Sbjct: 156 KKRLIREKKLILVVDLDQTVIHCG----VDPTIAEW-----KNDPTNPNFETLRDVKSFV 206
Query: 966 ------FPHMGM-----------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
P M M + K+RPG+ F E SKL+EMH+YTM + YA E+AK
Sbjct: 207 LEEEPILPPMYMGPKPPTHKCWYYVKIRPGLKEFFEEVSKLYEMHIYTMATRSYAQEIAK 266
Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNK 1067
++DP G LFA R++SR ++G K LE + +S VV+IDD VW
Sbjct: 267 IIDPDGTLFADRILSRNENGS--------LTHKSLERLFPTDQSMVVVIDDRGDVWNWCP 318
Query: 1068 LNLIVVERYTYF 1079
NLI V Y +F
Sbjct: 319 -NLIKVTPYNFF 329
Score = 46.6 bits (109), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 35/147 (23%), Positives = 65/147 (44%), Gaps = 10/147 (6%)
Query: 1080 PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD-DVDVRNILAA 1138
P + Q + LL D DE L +++ + + ++ D ++ ++ ++
Sbjct: 428 PLAELQKHMHNQQLLTDDDDE------LFYLKDILKNVQQKYYEELKTDKEIQIQTLMPK 481
Query: 1139 EQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT-- 1196
++K+L GC VFS + P+G N + FGA + I + THV+ + T
Sbjct: 482 LKKKVLEGCNFVFSGLIPLG-TNIQKADIVLWTNMFGAQSSADITEDTTHVITKTPHTYK 540
Query: 1197 DKVNWALSTGRFVVHPGWVEASALLYR 1223
K+ A + V+HP WV L ++
Sbjct: 541 AKIAKAFKSDIKVLHPDWVFECLLNWK 567
>gi|407929624|gb|EKG22436.1| BRCT domain-containing protein [Macrophomina phaseolina MS6]
Length = 861
Score = 93.6 bits (231), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 96/179 (53%), Gaps = 22/179 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE-----KPHRHLF 964
E ++++ S +KL LV+DLD T++++ VDP E + E + E + + L
Sbjct: 150 EAKRRLLSNKKLSLVVDLDQTIIHAT----VDPTVAEWQKDPENPNYEAVKDVQSFQLLD 205
Query: 965 RFPH-MGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
P G W KLRPG+ FLE SK++E+H+YTMG + YA +AK++DP +F R+
Sbjct: 206 NGPGGRGCWYYIKLRPGLREFLENISKVYELHIYTMGTRAYAQNIAKIVDPNRKIFGDRI 265
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+SR + G K L + +++ VVIIDD VW + NLI V Y +F
Sbjct: 266 LSRDESGS--------LTVKTLHRIFPVDTKMVVIIDDRGDVWSWSN-NLIKVTPYDFF 315
Score = 43.5 bits (101), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 52/100 (52%), Gaps = 5/100 (5%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 1189
D+ I+ +R +L G IVFS V P+G + +W A+ FGA ++ I+ + THV
Sbjct: 527 DIGLIMPEMKRSVLKGVSIVFSGVVPLGTNIQASEYAVW--AKSFGAKVSETINKKTTHV 584
Query: 1190 V-ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 1227
V A + T KV A + +V W++ + ++R +E
Sbjct: 585 VAARNRRTQKVRQAARHPQIKIVTVDWLKECFVTWQRVSE 624
>gi|398396164|ref|XP_003851540.1| hypothetical protein MYCGRDRAFT_44229 [Zymoseptoria tritici IPO323]
gi|339471420|gb|EGP86516.1| hypothetical protein MYCGRDRAFT_44229 [Zymoseptoria tritici IPO323]
Length = 822
Score = 93.2 bits (230), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 101/198 (51%), Gaps = 28/198 (14%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL------ 963
E +K++ +RKL LV+DLD T++ + V+P E + D P+
Sbjct: 155 ERRKRLLDSRKLSLVVDLDQTIIQA----NVEPTIGEW-----KNDPTNPNWKALQDVCQ 205
Query: 964 FRFPHMG---MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
F+ G + KLRPG+ FL S+L+E+H+YTMG + YA +AK++DP +F R
Sbjct: 206 FQLADDGRTWYYVKLRPGLKDFLRDMSELYELHIYTMGTRAYADNIAKIVDPDRKVFGDR 265
Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
++SR ++G K+L+ + ++ VVIIDD VW H NLI V + +F
Sbjct: 266 ILSRDENGS--------MTVKNLKRLFHADTRMVVIIDDRADVW-HWTPNLIKVNAFEFF 316
Query: 1080 PCSRRQFGLLGPSLLEID 1097
P G+ P E++
Sbjct: 317 PGVGDINGMFLPKRQELE 334
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 63/116 (54%), Gaps = 10/116 (8%)
Query: 1125 QSLDDV----DVRNILAAEQRKILAGCRIVFSRVFPVGE-ANPHLHPLWQTAEQFGAVCT 1179
+S+D+V D I+ + + K+LAG IVFS V P+G+ N + +W A FGA T
Sbjct: 501 RSIDEVHSIPDAAPIMQSMKAKVLAGVHIVFSGVIPLGQDLNTNDTAIW--ARSFGAKIT 558
Query: 1180 KHIDDQVTHVVAN-SLGTDKVNWAL-STGRF-VVHPGWVEASALLYRRANEQDFAI 1232
++I + THV+A+ T KV A +GR +V W+ + + + + +E + I
Sbjct: 559 ENITKKTTHVIASPERRTAKVRQAAKKSGRIAIVSQNWLSSCFMQWEKVDESPYRI 614
>gi|342320998|gb|EGU12936.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Rhodotorula
glutinis ATCC 204091]
Length = 817
Score = 93.2 bits (230), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 114/252 (45%), Gaps = 78/252 (30%)
Query: 890 FEGYDDQQKAAIQK-----------ERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAK 936
+ G+ D +A I E RLE+ ++ A+KL L++DLD T++++
Sbjct: 115 YTGFSDTSRATISMAHDIGGLTVSLEEAHRLEKATTARLLDAKKLSLIVDLDQTIVHAT- 173
Query: 937 FHEVDPVHDEIL--------------RKKEEQD-----REKPHRHL-FRFPHM------- 969
VDP E L ++ + QD R KP ++ R +
Sbjct: 174 ---VDPTVGEWLQDPKNPNYKALEGVKRFKLQDESPATRNKPKKYRKIRIKQVDPSKGEE 230
Query: 970 ----------------GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
G W K+RPG+ FL+R ++++EMH+YTMG + YA+E+ KV+D
Sbjct: 231 ADDESSEDEEEDEDDGGCWYYIKMRPGLPDFLKRVAEMYEMHVYTMGTRAYASEVCKVID 290
Query: 1012 PKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW---PHNK 1067
P G LF GR++SR + G K L+ + ++ VVIIDD VW PH
Sbjct: 291 PDGGLFGGRILSRDESGS--------MTRKSLQRLFPCDTNMVVIIDDRADVWDGSPH-- 340
Query: 1068 LNLIVVERYTYF 1079
L+ V Y +F
Sbjct: 341 --LVKVIPYEFF 350
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 2/115 (1%)
Query: 1119 KIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVC 1178
+ + H + DV +I+ A + + L +VFS + +G + P W+ A FGA C
Sbjct: 486 RFYEQHDAKGAADVADIIPAMKAQTLRDTHLVFSGLVALG-SRPEDSEYWKLARTFGARC 544
Query: 1179 TKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
+ + TH+VAN GT KV+ A R +V+P W+ S + R E + +
Sbjct: 545 SADLSSSTTHLVANGWGTAKVHSAQRNPRIKIVYPEWLLESVARWTRVPEDAYVL 599
>gi|449299873|gb|EMC95886.1| hypothetical protein BAUCODRAFT_71386 [Baudoinia compniacensis UAMH
10762]
Length = 790
Score = 92.8 bits (229), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 99/183 (54%), Gaps = 30/183 (16%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEI--LRKKEEQDREKP 959
E ++++ +RKL LV+DLD T++++ + E +P H + +RK + D
Sbjct: 150 EAKRRLIKSRKLSLVVDLDQTIIHATVDPTVAEWQADETNPNHAAVKGVRKFQLVDDGPG 209
Query: 960 HRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
R G W KLRPG+ FL+ S+ +E+H+YTM + YA E+AK++DP LF
Sbjct: 210 GR--------GTWYYIKLRPGLSDFLQLVSQYYELHIYTMATRAYAEEIAKLVDPGRKLF 261
Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
A R++SR ++G SK L+ + +++ VVIIDD VW + NL+ V Y
Sbjct: 262 ANRILSRDENGS--------MNSKSLKRLFPVDTKMVVIIDDRGDVWSWSP-NLVKVSAY 312
Query: 1077 TYF 1079
+F
Sbjct: 313 DFF 315
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 58/113 (51%), Gaps = 8/113 (7%)
Query: 1127 LDDV----DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHI 1182
LDD+ D I+ + + ++L+G +VFS V P+G NP H L A FGA +
Sbjct: 524 LDDIENIPDAAAIMTSMKSRVLSGVHLVFSGVVPLG-VNPLNHDLAIWARSFGANVQVKV 582
Query: 1183 DDQVTHVVAN-SLGTDKVNWALSTGR--FVVHPGWVEASALLYRRANEQDFAI 1232
+ TH++++ T KV AL G +V+ W+ A +++ +E+ + I
Sbjct: 583 SRRTTHLISSPDRRTAKVRQALKKGSRIAIVNQNWLYACFSQWKKVDEEPYRI 635
>gi|452004576|gb|EMD97032.1| hypothetical protein COCHEDRAFT_1163398 [Cochliobolus heterostrophus
C5]
Length = 803
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 96/178 (53%), Gaps = 21/178 (11%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
++++ +ARKL L++DLD T++++ + +P HD + K+ Q + ++
Sbjct: 152 KRRLLNARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
+ K+RPG+ F +R SKL+EMH+YTM + YA +AK++DP+ F R++S
Sbjct: 209 SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 1080
R ++ K K L + +A+ VIIDD VW ++ +L+ V + +FP
Sbjct: 269 RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 59/117 (50%), Gaps = 13/117 (11%)
Query: 1125 QSLDDV--DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKH 1181
+++DDV DV I+ + ++L G +VFS + P+G + LW ++ FGA T
Sbjct: 503 RAVDDVIPDVAEIMPRIKSEVLDGAVVVFSGIIPLGVDVLSSDFALWISS--FGAEVTTS 560
Query: 1182 IDDQVTHVVAN-SLGTDKVNWALSTGRF----VVHPGWVEASALLYRRANEQDFAIK 1233
++ + THV+AN T KV A R+ +V+P W+ + +E + I+
Sbjct: 561 VNRRTTHVIANPDRKTTKVKRA---ARYPHIKIVNPEWMFQCCTRWEHVDEMPYLIE 614
>gi|402220046|gb|EJU00119.1| hypothetical protein DACRYDRAFT_81791 [Dacryopinax sp. DJM-731 SS1]
Length = 855
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 101/219 (46%), Gaps = 46/219 (21%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL---- 948
D + E RLE Q ++ +RKL LV+DLD T++ + VDP E +
Sbjct: 136 DAASITVSPEVAARLEHESQIRLLGSRKLSLVVDLDQTIIQAT----VDPTVGEWIDQGR 191
Query: 949 --RKKEEQDREKPH----RHLFRF---------------------PHMGMWTKLRPGIWT 981
+ E R+ P+ R + RF + K RPG+
Sbjct: 192 AWEEGREGARKNPNWEALRDVGRFRLSEERKVVNGRGGKVIRSKREDTAYYIKPRPGLHA 251
Query: 982 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD-PFDGDERVPKS 1040
FL R S+L+EMH+YTMG + YA+++ +++DP G LF RV+SR + G F R+
Sbjct: 252 FLSRLSELYEMHVYTMGTRSYASQVVRLIDPLGNLFGSRVLSRDESGSLTFKNLTRLFPC 311
Query: 1041 KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
S+ VIIDD VW ++ NL+ V Y +F
Sbjct: 312 NT--------SSAVIIDDRADVWDLSRANLVKVVPYDFF 342
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 74/141 (52%), Gaps = 11/141 (7%)
Query: 1102 SEDGTLASSLGVIERLHKIFFSHQSLDDVD--------VRNILAAEQRKILAGCRIVFSR 1153
++D L+ L ++E +H+ +F L D V I+ + ++L+G +VFS
Sbjct: 510 NDDHELSRVLRILEEIHRRYFQQYDLYVADKNTRKEPSVMEIIPEMKAEVLSGVHLVFSS 569
Query: 1154 VFPVGEANPHLHP-LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHP 1212
+ P+ PH + LW+ A QFGA C + +VTHVVA GT+KV ++ G +V+P
Sbjct: 570 LIPID--MPHQNTDLWRQALQFGAACYTRVAREVTHVVAAKRGTEKVRQGVARGCKIVNP 627
Query: 1213 GWVEASALLYRRANEQDFAIK 1233
W S + R E+++ ++
Sbjct: 628 YWFMDSVAAWERKKEEEYPLE 648
>gi|255936731|ref|XP_002559392.1| Pc13g09690 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584012|emb|CAP92038.1| Pc13g09690 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 792
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 103/188 (54%), Gaps = 25/188 (13%)
Query: 903 KERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
+E TR E+ K+ + ++R+L LV+DLD T++++ VDP E K+ + E R
Sbjct: 114 REATRVEEDAKRRLLASRRLTLVVDLDQTIIHAT----VDPTVGEWREDKQNPNHEAV-R 168
Query: 962 HLFRF------PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
+ +F P M G W KLRPG+ FL+ ++++E+H+YTMG + YA + ++DP
Sbjct: 169 DVRQFQLIDDGPGMRGCWYYIKLRPGLEEFLQNVAEIYELHIYTMGTRAYAQHIVDIIDP 228
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 1071
LF R++SR + G KDL+ + +++ VVIIDD +W + NLI
Sbjct: 229 TRKLFGDRILSRDESGS--------LTVKDLQRLFPVDTKMVVIIDDRGDIWRWSP-NLI 279
Query: 1072 VVERYTYF 1079
V Y +F
Sbjct: 280 KVSPYDFF 287
Score = 46.6 bits (109), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 64/132 (48%), Gaps = 8/132 (6%)
Query: 1104 DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG--EAN 1161
DG +A+ G ERLH D+++I+ +R+IL G +VFS V P+G N
Sbjct: 445 DGRVAALRG--ERLHSRDKDMDLKSVPDIKDIMPRIKRRILGGVVLVFSGVLPLGIDFQN 502
Query: 1162 PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASAL 1220
+ LW A+ FG + ++ + TH+VA T KV A +V W+ S +
Sbjct: 503 ADI-SLW--AKSFGVTISSRVNARTTHLVAGRNRTAKVREATRYPNVKIVTTQWLVDSLV 559
Query: 1221 LYRRANEQDFAI 1232
+R +E+ + +
Sbjct: 560 QWRHVDEEPYLL 571
>gi|451853161|gb|EMD66455.1| hypothetical protein COCSADRAFT_112846 [Cochliobolus sativus ND90Pr]
Length = 803
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 96/178 (53%), Gaps = 21/178 (11%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
++++ +ARKL L++DLD T++++ + +P HD + K+ Q + ++
Sbjct: 152 KRRLLNARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
+ K+RPG+ F +R SKL+EMH+YTM + YA +AK++DP+ F R++S
Sbjct: 209 SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 1080
R ++ K K L + +A+ VIIDD VW ++ +L+ V + +FP
Sbjct: 269 RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317
Score = 43.1 bits (100), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 59/117 (50%), Gaps = 13/117 (11%)
Query: 1125 QSLDDV--DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKH 1181
+++DDV DV I+ + ++L G +VFS + P+G + LW ++ FGA T
Sbjct: 503 RAMDDVIPDVAEIMPRIKSEVLDGAVVVFSGIIPLGVDVLSSDFALWISS--FGAEVTTS 560
Query: 1182 IDDQVTHVVAN-SLGTDKVNWALSTGRF----VVHPGWVEASALLYRRANEQDFAIK 1233
++ + THV+AN T KV A R+ +V+P W+ + +E + I+
Sbjct: 561 VNRRTTHVIANPDRKTTKVKRA---ARYPHIKIVNPEWMFQCCTRWEHVDETPYLIE 614
>gi|403161615|ref|XP_003321927.2| hypothetical protein PGTG_03464 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375171855|gb|EFP77508.2| hypothetical protein PGTG_03464 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 423
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 80/148 (54%), Gaps = 9/148 (6%)
Query: 1094 LEIDHDERSEDGTLASSLGVIERL-------HKIFFSHQSLDDVDVRNILAAEQRKILAG 1146
++ DH R E L + RL H F+S S+++ D R +++A + +L G
Sbjct: 48 VDFDHPLRPEQAVLVDRDNELSRLKLILGDIHSGFYSQDSIEEADARAVISAIKHDVLHG 107
Query: 1147 CRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG 1206
+ FS ++P+ EA W+ AEQFGA C H+ +VTH++A LGT KVN ALS
Sbjct: 108 LHLAFSSLWPM-EAVSEQQYAWKLAEQFGARCYTHLSPKVTHLIAAKLGTSKVNLALSRP 166
Query: 1207 RF-VVHPGWVEASALLYRRANEQDFAIK 1233
+V P W+ +ALL+ R +E ++ +
Sbjct: 167 NVSIVRPKWLYDAALLWTRPSEAGYSWR 194
>gi|296419837|ref|XP_002839498.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635659|emb|CAZ83689.1| unnamed protein product [Tuber melanosporum]
Length = 896
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 100/196 (51%), Gaps = 24/196 (12%)
Query: 895 DQQKAAIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE------ 946
D + ++ RLEE+ +++ ++KL LV+DLD T++++ VDP +
Sbjct: 132 DSMGLTVSQDEATRLEEETKRRLLKSKKLSLVVDLDQTIIHAT----VDPTVGDWKNDPF 187
Query: 947 ILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
+ + +D + G W K+RPG+ FLE S+L+E+H+YTMG + YA
Sbjct: 188 CINHESVKDVQAFKLDEDIIGGRGTWYYVKMRPGLKEFLEHISQLYELHIYTMGTRAYAM 247
Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+ K++DP G +F RV+SR + G K L + +++ VVIIDD VW
Sbjct: 248 SVKKIVDPDGRIFGERVLSRDESGS--------MTQKSLHRIFPVDTKMVVIIDDRGDVW 299
Query: 1064 PHNKLNLIVVERYTYF 1079
+ NL+ V Y +F
Sbjct: 300 KWSD-NLVKVRPYDFF 314
>gi|215794710|pdb|3EF1|A Chain A, The Structure Of Fcp1, An Essential Rna Polymerase Ii Ctd
Phosphatase
Length = 442
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 101/195 (51%), Gaps = 33/195 (16%)
Query: 900 AIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDP-----------VHDE 946
+ E RLE + K++ ++L L++ LD T++++ VDP V+ +
Sbjct: 4 TVSLEEASRLESENVKRLRQEKRLSLIVXLDQTIIHAT----VDPTVGEWMSDPGNVNYD 59
Query: 947 ILRKKEEQD-REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATE 1005
+LR + +E P + + + K RPG+ FL++ S+L+E+H+YTMG K YA E
Sbjct: 60 VLRDVRSFNLQEGPSGYTSCY-----YIKFRPGLAQFLQKISELYELHIYTMGTKAYAKE 114
Query: 1006 MAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWP 1064
+AK++DP G LF RV+SR D G K L + + S VV+IDD VW
Sbjct: 115 VAKIIDPTGKLFQDRVLSRDDSGS--------LAQKSLRRLFPCDTSMVVVIDDRGDVWD 166
Query: 1065 HNKLNLIVVERYTYF 1079
N NLI V Y +F
Sbjct: 167 WNP-NLIKVVPYEFF 180
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 65/131 (49%), Gaps = 14/131 (10%)
Query: 1113 VIERLHKIFFSHQSLDDVDVRN----------ILAAEQRKILAGCRIVFSRVFPVGEANP 1162
V++ +H +++ + +D+ R+ I+ ++K+L GCR++FS V P+G +
Sbjct: 314 VLKDIHAVYYEEE--NDISSRSGNHKHANVGLIIPKMKQKVLKGCRLLFSGVIPLG-VDV 370
Query: 1163 HLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALL 1221
+ + A FGA TH++A + T+KV A+S G VV W+ S
Sbjct: 371 LSSDIAKWAMSFGAEVVLDFSVPPTHLIAAKIRTEKVKKAVSMGNIKVVKLNWLTESLSQ 430
Query: 1222 YRRANEQDFAI 1232
++R E D+ +
Sbjct: 431 WKRLPESDYLL 441
>gi|258563858|ref|XP_002582674.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237908181|gb|EEP82582.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 897
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 103/196 (52%), Gaps = 24/196 (12%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + K+ R+EE ++++ ++RKL LV+DLD T++++ VDP E K
Sbjct: 157 DNASLTVSKDEATRVEEDAKRRLLASRKLSLVVDLDQTIIHAT----VDPTVAEWREDKT 212
Query: 953 EQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
+ E + + + P M G W KLRPG+ FL+ S L+E+H+YTM + YA
Sbjct: 213 NPNHEAVKNVRSFQLIDDGPGMRGCWYYIKLRPGLEEFLKNISSLYELHIYTMATRAYAQ 272
Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+A ++DP +F R++SR + G +K+L + +++ VVIIDD VW
Sbjct: 273 NIANIVDPDRKIFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDDRGDVW 324
Query: 1064 PHNKLNLIVVERYTYF 1079
+ NLI V Y +F
Sbjct: 325 KWSD-NLIRVFPYDFF 339
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 55/105 (52%), Gaps = 6/105 (5%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGE--ANPHLHPLWQTAEQFGAVCTKHIDDQVTH 1188
D++ I+ +R+IL +VFS V P+G N + LW A+ FGA+ T ID + TH
Sbjct: 535 DIKIIMPQIKRRILEAVVMVFSGVLPLGTDLQNADI-SLW--AKSFGAIITNRIDSRTTH 591
Query: 1189 VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
+VA T KV A + +V W+ S ++R +E+ + +
Sbjct: 592 LVAGRNRTAKVREATRYSKIKIVTVQWLLDSLTQWKRLDEEQYLV 636
>gi|453084575|gb|EMF12619.1| hypothetical protein SEPMUDRAFT_149240 [Mycosphaerella populorum
SO2202]
Length = 848
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 99/192 (51%), Gaps = 32/192 (16%)
Query: 902 QKERTRRLEE-QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 960
Q E T+ EE ++++ +R+L LV+DLD T++++ VDP E + D P+
Sbjct: 142 QDEATKTDEEGKRRLLDSRRLSLVVDLDQTIIHAC----VDPSIGEW-----QNDPSNPN 192
Query: 961 RHLFRFPH----------MGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
R + W K RPG+ +FL+ S+L+EMH+YTMG + YA +AK
Sbjct: 193 YDALRDVQAFQLRDDNKPVATWYYIKQRPGLQSFLKGLSELYEMHIYTMGTRTYAEGVAK 252
Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNK 1067
++DP G +F R+++R + G K K L+ + +S VVIIDD VW
Sbjct: 253 IIDPDGRVFGDRIVTRTESGS--------DKEKSLKRLFPTDSKMVVIIDDRADVWRWIS 304
Query: 1068 LNLIVVERYTYF 1079
NL+ V + +F
Sbjct: 305 -NLVKVNVFEFF 315
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 63/132 (47%), Gaps = 10/132 (7%)
Query: 1103 EDGTLASSLGVIERLHKIFFSHQSLDDV-----DVRNILAAEQRKILAGCRIVFSRVFPV 1157
E + S G + L + +S DD D ++ + ++LA C IVFS V P+
Sbjct: 506 EKQAMGSKGGRVSELRPGLSTKRSADDELQHIPDAAVVMDDLKCRVLADCHIVFSGVVPL 565
Query: 1158 G-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVAN-SLGTDKVNWALSTGRF-VVHPGW 1214
G + N H LW A+ FGA ++HI + THV+A+ T KV A R VV W
Sbjct: 566 GVDPNNHDAALW--AKSFGASVSQHITKKTTHVIASPERKTAKVRQAAKRPRIAVVSQHW 623
Query: 1215 VEASALLYRRAN 1226
+ A +++ +
Sbjct: 624 LHACFSQWKKVD 635
>gi|425767354|gb|EKV05928.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Penicillium
digitatum PHI26]
gi|425779797|gb|EKV17828.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Penicillium
digitatum Pd1]
Length = 817
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 103/188 (54%), Gaps = 23/188 (12%)
Query: 902 QKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE--- 957
++E TR E+ K+ + ++R+L LV+DLD T++++ VDP E K+ + E
Sbjct: 140 EREATRVEEDAKRRLLASRRLTLVVDLDQTIIHAT----VDPTVGEWREDKQNPNHEAVK 195
Query: 958 --KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
+ + + P M G W KLRPG+ FL+ ++++E+H+YTMG + YA + ++DP
Sbjct: 196 DVRQFQLIDDGPGMRGCWYYIKLRPGLEEFLQNVAEIYELHIYTMGTRAYAQHIVDIIDP 255
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 1071
LF R++SR + G KDL+ + +++ VVIIDD +W + NLI
Sbjct: 256 TRKLFGDRILSRDESGS--------LTVKDLQRLFPVDTKMVVIIDDRGDIWRWSP-NLI 306
Query: 1072 VVERYTYF 1079
V Y +F
Sbjct: 307 KVSPYDFF 314
Score = 47.0 bits (110), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 6/105 (5%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVG--EANPHLHPLWQTAEQFGAVCTKHIDDQVTH 1188
DV++I+ +R+IL G +VFS V P+G N + LW A+ FG + I+ + TH
Sbjct: 495 DVKDIMPQIKRRILGGVVLVFSGVLPLGIDFQNADI-SLW--AKSFGVTISSRINTRTTH 551
Query: 1189 VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
+VA T KV A +V W+ + + +R +E+ + +
Sbjct: 552 LVAGRNRTAKVREATRYPNIKIVTTQWLVDALVQWRHVDEEPYLL 596
>gi|67524889|ref|XP_660506.1| hypothetical protein AN2902.2 [Aspergillus nidulans FGSC A4]
gi|40744297|gb|EAA63473.1| hypothetical protein AN2902.2 [Aspergillus nidulans FGSC A4]
gi|259486161|tpe|CBF83781.1| TPA: CTD phosphatase-related (Eurofung) [Aspergillus nidulans FGSC
A4]
Length = 829
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 105/201 (52%), Gaps = 34/201 (16%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + + R+EE ++++ + RKL LV+DLD T++++A VDP E +
Sbjct: 132 DNTALTVSEREAIRVEEDAKRRLLANRKLSLVVDLDQTIIHAA----VDPTIGEWM---- 183
Query: 953 EQDREKPHR------HLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
D++ P+ F+ P M G W KLRPG+ FLE ++++E+H+YTMG
Sbjct: 184 -ADKDNPNHAAVSDVRAFQLVDDGPGMRGCWYYVKLRPGLEEFLENVAEMYELHIYTMGT 242
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
+ YA +A ++DP LF R++SR + G K+L + +++ VVIIDD
Sbjct: 243 RSYAQAIANIIDPDRKLFGDRILSRDESGS--------LSVKNLHRIFPVDTKMVVIIDD 294
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW + NLI V Y +F
Sbjct: 295 RGDVWRWSP-NLIKVIPYDFF 314
Score = 41.6 bits (96), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 27/103 (26%), Positives = 52/103 (50%), Gaps = 2/103 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DV++IL A +R+I+ +V S ++ + E ++ L Q A+ FG V I + TH++
Sbjct: 494 DVKDILPAIKRRIMGNVHVVHSGLWKLNEDIANIE-LAQHAKTFGVVFADRITHKTTHLL 552
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
+ T K A+ + +V W+ S L ++ +E + +
Sbjct: 553 SAGKRTAKFQEAMQRPKIKIVRKEWLVDSLLQWKHLDEGPYLV 595
>gi|66824241|ref|XP_645475.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
gi|60473594|gb|EAL71535.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
Length = 782
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 92/168 (54%), Gaps = 14/168 (8%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K++ +KL LVLDLDHT++++ + + + +++++ H P M
Sbjct: 128 KRLLMEKKLSLVLDLDHTVIHAVTEQGFNSSPE---WRNKDKNKNGIHTITVNGP-MNYC 183
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD-DGDPF 1031
K RP + FL +K++E+H+YTMG + YA E+AK++DP+ +F R++SR D +G F
Sbjct: 184 IKKRPHLVKFLTEVNKIYELHIYTMGTRNYANEIAKLIDPESSIFKERILSRDDGNGINF 243
Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+R+ D S V+I+DD VW +K NLI + Y YF
Sbjct: 244 KSLQRLFPCDD--------SMVLIVDDRSDVWKKSK-NLIQISPYVYF 282
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 74/130 (56%), Gaps = 2/130 (1%)
Query: 1104 DGTLASSLGVIERLHKIFF-SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANP 1162
D L L ++++H++++ S V +++ ++ IL G IVFS V+P+G
Sbjct: 394 DCHLLIILDKLQKIHRLYYESKDKGGSPHVIDLVEIVKKDILKGTYIVFSGVYPLGTP-I 452
Query: 1163 HLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLY 1222
PL AE+FG+V I+++ THV+A GT KVN ALS G V+ P W+ S ++
Sbjct: 453 QKQPLRWLAEEFGSVVQNDINNETTHVIAQRKGTSKVNKALSKGLKVISPQWLIESTRIW 512
Query: 1223 RRANEQDFAI 1232
+ A+E DF++
Sbjct: 513 QMADEDDFSL 522
>gi|412985958|emb|CCO17158.1| RNA Polymerase II CTD phosphatase Fcp1 [Bathycoccus prasinos]
Length = 490
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 162/368 (44%), Gaps = 67/368 (18%)
Query: 920 KLCLVLDLDHTLLNS---AKFHEVDP-----------VHDEILRKKEEQDREKPHRHLFR 965
KL LVLDLD TLL+S KF +P + + +K E + P + F
Sbjct: 101 KLPLVLDLDSTLLHSVEKTKFLFPNPGESNTSEEEMKIIKQAQKKIESRLESSPDK--FF 158
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA-KVLDPKGVLFAGRVISR 1024
+ + +TK+RP FL S+++E+++ T G++ YA +A +VLDP G F R ++R
Sbjct: 159 YVNDQYFTKIRPQARRFLSELSEMYELYIVTAGSQAYAEAIANQVLDPLGKYF-NRDVNR 217
Query: 1025 GDDGDPFDG------DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
++ D R D + G ES ++++D +W + ++ V+ Y Y
Sbjct: 218 IKGMKQWNSEVNQWVDVRTKIVND--ALEGAESVTIVVEDKPEMW-DGECAVMQVKPYYY 274
Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH------QSLDDVD- 1131
FP S + L + D E+++ + + L + +H++ F Q+ D
Sbjct: 275 FPESLEELKLSHFYNM-TDESEKNDSYLVDNILPRLRNVHRMMFREAVPHFLQNGGSADG 333
Query: 1132 -----VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHI---- 1182
V +L+ E++ IL C I F+ VF + E P +W+ A+ GA +
Sbjct: 334 QYWPCVSELLSLERKSILRNCFICFTNVFKIHEV-PKDSQIWKEAKNLGATVQETFIDNA 392
Query: 1183 -----------DDQVTHVVANSLG-----------TDKVNWALSTGRFVVHPGWVEASAL 1220
DD+ THV+ G T K+N A R +V ++ SA
Sbjct: 393 DIAKRTESIVKDDRTTHVIVGEHGKRNKATGLVLPTSKINQARRNNRPLVDCSFIAESAY 452
Query: 1221 LYRRANEQ 1228
L++ A+E+
Sbjct: 453 LWKPADEK 460
>gi|393240595|gb|EJD48120.1| hypothetical protein AURDEDRAFT_85955 [Auricularia delicata TFB-10046
SS5]
Length = 796
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 74/250 (29%), Positives = 110/250 (44%), Gaps = 73/250 (29%)
Query: 890 FEGYDDQQKAAIQK-----------ERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAK 936
+ GY D +A IQ E RR+E + +++ RKL L++DLD T++++
Sbjct: 123 YTGYSDSARANIQMTHLAGGPTVSLEEARRIEHETAERLLKNRKLSLIVDLDQTIVHAT- 181
Query: 937 FHEVDPVHDEILRKK---EEQDREKPH--------------------RHLFRF------P 967
VDP E + + EE KP R + RF P
Sbjct: 182 ---VDPTVGEWIAQGQAWEEYQARKPSESTTPEPDAPPEPNANWEALRDVRRFTLAHDGP 238
Query: 968 HMG-----------------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 1010
H+ + K RPG+ FLE S+ +EMH+YTMG + YA ++ +
Sbjct: 239 HLNHKHPWKGKEKEDEHGCLYYIKPRPGLQAFLEAISQKYEMHVYTMGTRAYAEKVCAAI 298
Query: 1011 DPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLN 1069
DP G +F R++SR + G +K LE + + S VVIIDD VW + N
Sbjct: 299 DPDGRMFGRRILSRDESGS--------LTAKSLERLFPCDTSMVVIIDDRSDVWDRSP-N 349
Query: 1070 LIVVERYTYF 1079
L+ V RY +F
Sbjct: 350 LVEVVRYDFF 359
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DVR ++ A + + AG +FS + P+ E P P+W+ A +FGA+C + ++THV+
Sbjct: 510 DVRKLIGAIKAQTFAGMHFLFSSLIPL-EDKPEESPIWKQAREFGAICHSEVSPRLTHVI 568
Query: 1191 ANSLGTDKVNWALSTGRFVVHPGWVE 1216
T KV+ A G H WV+
Sbjct: 569 TAKRSTAKVDAARRRGE--AHIVWVQ 592
>gi|365991295|ref|XP_003672476.1| hypothetical protein NDAI_0K00420 [Naumovozyma dairenensis CBS 421]
gi|343771252|emb|CCD27233.1| hypothetical protein NDAI_0K00420 [Naumovozyma dairenensis CBS 421]
Length = 778
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/187 (34%), Positives = 95/187 (50%), Gaps = 32/187 (17%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK------------KEEQDREKP 959
+K++ +KL LV+DLD T+++ VDP E R KE E+P
Sbjct: 203 KKRLRDDKKLILVVDLDQTVIHCG----VDPTIGEWKRDPTNPNFETLKDVKEFALEEEP 258
Query: 960 HRHLFRF----PHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
L P W K+RPG+ F ++ + LFEMH+YTM + YA+E+AK++DP
Sbjct: 259 ILPLMYMGPKPPARKCWYYVKVRPGLKDFFQKVAPLFEMHIYTMATRAYASEIAKIIDPT 318
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
G LF R++SR ++G +K LE + +S V+IIDD VW + NLI
Sbjct: 319 GDLFGNRILSRDENGS--------LTTKSLERLFPTDQSMVIIIDDRGDVWNWSP-NLIK 369
Query: 1073 VERYTYF 1079
V Y +F
Sbjct: 370 VIPYNFF 376
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 68/147 (46%), Gaps = 13/147 (8%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFF-----SHQSLDDV--DVRNILAAEQRKILAGC 1147
HD+R +D L + ++H ++ ++++ D + D++ I+ ++ +L C
Sbjct: 490 HDQRLLIDDDEELYHLKDTLSKIHSKYYEMLQKANENKDSIEPDIKYIMPQMKQTVLQDC 549
Query: 1148 RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST-- 1205
VFS + P+G N + FGA T ID+ THV+ + GT K A S
Sbjct: 550 HFVFSGLIPLG-TNIQKSDIVIWTNMFGARSTADIDENTTHVITKTPGTYKARLAKSYDP 608
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAI 1232
+VHP WV + ++ E+ + +
Sbjct: 609 DIKIVHPDWVFECLMRWKHVEEKPYEL 635
>gi|358057984|dbj|GAA96229.1| hypothetical protein E5Q_02893 [Mixia osmundae IAM 14324]
Length = 760
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 101/204 (49%), Gaps = 43/204 (21%)
Query: 895 DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
+Q+ A ++ T RL + A+KL L++DLD T++ + VDP + +R
Sbjct: 183 EQEAARLEDASTTRLRK------AKKLSLIVDLDQTIIQAT----VDPTVGDWMR----- 227
Query: 955 DREKPHRH------LFRFPHM----------GMWT--KLRPGIWTFLERASKLFEMHLYT 996
D P+ +F+ G W KLRPG+ FL + + L+EMH+YT
Sbjct: 228 DGTNPNHSALKDVCVFKLGTQEDKEVVADVDGCWYYLKLRPGLQAFLRKMADLYEMHVYT 287
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAV-VI 1055
MG + YA + +++DP G F+ R++SR + G K LE + ++++ VI
Sbjct: 288 MGTRSYAMAVCRIIDPDGTYFSTRILSRDESGS--------LTRKSLERLFPCDTSMAVI 339
Query: 1056 IDDSVRVWPHNKLNLIVVERYTYF 1079
IDD VW H NL+ VE + +F
Sbjct: 340 IDDRSDVW-HWSPNLVKVEPFEFF 362
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 68/119 (57%), Gaps = 4/119 (3%)
Query: 1113 VIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAE 1172
++ ++H F++ QS DV + I+ + +L GC+I FS + P+G NP +W+ A+
Sbjct: 488 LLLKVHTDFYAEQSSQDVKL--IIPNIKESVLRGCKIAFSSMIPLG-TNPEAADIWKLAK 544
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 1230
FGA C+ ++ + TH+VA + GT KV A VV W+ SA L++R E+++
Sbjct: 545 MFGAYCSSDVNSKTTHLVARNPGTVKVQQAQKRQDLHVVWSNWLVRSAALWQRQPEREY 603
>gi|256073745|ref|XP_002573189.1| rna polymerase II ctd phosphatase [Schistosoma mansoni]
gi|360045501|emb|CCD83049.1| putative rna polymerase II ctd phosphatase [Schistosoma mansoni]
Length = 1345
Score = 90.9 bits (224), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 99/176 (56%), Gaps = 27/176 (15%)
Query: 909 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
L++Q+ + +ARKL L++DLD T++++ DP Q + + H +R P
Sbjct: 139 LQDQQSLLAARKLVLLVDLDQTIIHTTN----DP-----------QAFKYKNVHRYRLPG 183
Query: 969 --MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
+ T+LRP + L+ S+ ++MH+ T GN++YA ++A ++DPK F+ R++SR +
Sbjct: 184 SPLVYHTRLRPHLEKVLDCLSQYYQMHICTFGNRVYAHQLASMIDPKRRYFSQRILSRDE 243
Query: 1027 DGDPFDGDERVPKSKDLEGVL--GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+P V KS +L+ + G+ + V IIDD VW + NLI V+ Y +FP
Sbjct: 244 CFNP------VTKSANLKALFPRGL-NLVCIIDDRGEVWDWSS-NLIHVKPYRFFP 291
>gi|328713585|ref|XP_001947680.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Acyrthosiphon pisum]
Length = 736
Score = 90.9 bits (224), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 94/170 (55%), Gaps = 20/170 (11%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
++K++ +KL L++DLD TL+++ D + + I ++ H L+
Sbjct: 134 DEKRLLGDKKLVLLVDLDQTLIHTTN----DNIPNNI--------KDIHHFQLYGPNSPW 181
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
T+LRPG + FL S+L+E+H+ T G + YA + +LDPKG LF+ RV+SR + +P
Sbjct: 182 YHTRLRPGTYNFLSSISELYELHICTFGARNYAHTITHILDPKGKLFSHRVLSRDECFNP 241
Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K+ +L+G+ ++ V IIDD VW + LNLI V+ Y +F
Sbjct: 242 N------SKTGNLKGLFPCGDNMVCIIDDREDVWDY-ALNLIHVKPYHFF 284
Score = 40.8 bits (94), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 65/145 (44%), Gaps = 19/145 (13%)
Query: 1103 EDGTLASSLGVIERLHKIFFSH---------QSLDDVDVRNILAAEQRKILAGCRIVFSR 1153
ED L +++ LHK +F +++ D++ I+ + KILAG ++VFS
Sbjct: 373 EDDYLLYLEDILKNLHKEYFKEYDQNYELYGNNVEVPDMKRIIPTYRSKILAGKKLVFSG 432
Query: 1154 V----FPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF- 1208
+ P+ E+ ++ A GA T++I TH+VA GT K + A
Sbjct: 433 LVPTPVPLTESRA-----YKVARLLGAEVTENIKPDSTHLVAVRQGTLKASAARKRPDIK 487
Query: 1209 VVHPGWVEASALLYRRANEQDFAIK 1233
V P W+ A + E+ F ++
Sbjct: 488 TVTPEWLWLCAERWEHVEERLFPLR 512
>gi|363752479|ref|XP_003646456.1| hypothetical protein Ecym_4610 [Eremothecium cymbalariae DBVPG#7215]
gi|356890091|gb|AET39639.1| hypothetical protein Ecym_4610 [Eremothecium cymbalariae DBVPG#7215]
Length = 751
Score = 90.9 bits (224), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 95/187 (50%), Gaps = 32/187 (17%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE--KPHRH--LFRFP 967
QK++ ARKL LV+DLD T+++ VDP E + + + E K R L P
Sbjct: 159 QKQLREARKLVLVVDLDQTVIHCG----VDPTIGEWSKDPDNPNYESLKDVRSFSLHEEP 214
Query: 968 -----HMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+MG + KLRPG+ F + FE+H+YTM + YA E+AK++DP
Sbjct: 215 VLPPFYMGPKPPTRKCWYYVKLRPGLQDFFSNIAPHFELHIYTMATRTYALEIAKIIDPD 274
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
G LF R++SR ++G K LE + M +S VVIIDD VW + NLI
Sbjct: 275 GTLFGDRILSRDENGS--------LTQKSLERLFPMDQSMVVIIDDRGDVWNWCE-NLIK 325
Query: 1073 VERYTYF 1079
V Y +F
Sbjct: 326 VVPYDFF 332
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/147 (23%), Positives = 67/147 (45%), Gaps = 13/147 (8%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFFSHQSLDDV-------DVRNILAAEQRKILAGC 1147
H+ER +D L ++ R+H ++ + D+++++ + + + GC
Sbjct: 438 HNERLLIDDDDELPHLTDILLRVHSAYYDQLEKNKHGATELSPDIKHLIPSLKEIVFKGC 497
Query: 1148 RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR 1207
VFS + P+ N + FGA T +ID TH++ + GT K A S R
Sbjct: 498 NFVFSGLIPL-HTNIDRADIVLWTNTFGAKTTINIDYNTTHLITKTPGTMKARLASSFNR 556
Query: 1208 --FVVHPGWVEASALLYRRANEQDFAI 1232
+VHP W+ + ++R +E+ + +
Sbjct: 557 DIKIVHPDWIFECIVNWKRVDEKPYEL 583
>gi|452981165|gb|EME80925.1| hypothetical protein MYCFIDRAFT_115122, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 770
Score = 90.5 bits (223), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 98/181 (54%), Gaps = 29/181 (16%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF----R 965
E ++++ +R+L LV+DLD T+++++ V+P E + D P+ +
Sbjct: 151 EGKRRLLQSRRLSLVVDLDQTIIHAS----VEPTIAEW-----QNDPSNPNYEALQDVQK 201
Query: 966 F------PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
F P+ + K RPG+ FL S+++EMH+YTMG + YA +AK++DP+ +F
Sbjct: 202 FQLDDDKPNTWYYIKPRPGLKQFLSTLSEIYEMHIYTMGTRAYAESVAKIIDPEKKIFGD 261
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTY 1078
R++SR + G +K+L+ + +++ VVIIDD VW H NLI V + +
Sbjct: 262 RILSRNESGS--------MTAKNLKRLFPVDTRMVVIIDDRADVW-HWTSNLIKVNVFEF 312
Query: 1079 F 1079
F
Sbjct: 313 F 313
Score = 49.3 bits (116), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 61/115 (53%), Gaps = 8/115 (6%)
Query: 1125 QSLDDV----DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTK 1180
+S+DD+ D I+ + K+L+G +VFS V P+G + H H A+ FGA ++
Sbjct: 506 RSVDDLENIPDAAAIMDRMKAKVLSGVHLVFSGVVPLG-VDIHSHDTAVWAKSFGATVSE 564
Query: 1181 HIDDQVTHVVAN-SLGTDKVNWAL-STGRF-VVHPGWVEASALLYRRANEQDFAI 1232
+I + THV+A+ T KV A +GR +V W+ A +++ +E + I
Sbjct: 565 NITKKTTHVIASPERRTAKVRQAAKKSGRIAIVSQHWLHACFAQWKKVDENMYRI 619
>gi|358253094|dbj|GAA51983.1| RNA polymerase II subunit A C-terminal domain phosphatase [Clonorchis
sinensis]
Length = 1535
Score = 89.7 bits (221), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 100/191 (52%), Gaps = 31/191 (16%)
Query: 909 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
L++++ + +ARKL L++DLD T+L++ DP + R K R+
Sbjct: 175 LQDEQSLLAARKLVLLVDLDETVLHTTN----DP---QAYRYKNVS------RYCLPGSP 221
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ T RP + L+R SK ++MH+ T GN++YA ++A ++DPK F+ R++SR +
Sbjct: 222 LVYHTSFRPHLKAVLDRLSKYYQMHICTFGNRMYAHQLAGMIDPKRRYFSHRILSRDECF 281
Query: 1029 DPFDGDERVPKSKDLEGVL--GMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF--PC 1081
+P V KS +L+ + G+ + V IIDD VW PH LI V+ Y +F C
Sbjct: 282 NP------VTKSANLKALFPRGL-NLVCIIDDRGEVWEWSPH----LIQVKPYRFFQDAC 330
Query: 1082 SRRQFGLLGPS 1092
+ F PS
Sbjct: 331 DTKHFAWSSPS 341
>gi|452840538|gb|EME42476.1| hypothetical protein DOTSEDRAFT_73343 [Dothistroma septosporum NZE10]
Length = 855
Score = 89.7 bits (221), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 58/184 (31%), Positives = 98/184 (53%), Gaps = 32/184 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH----RHLFR 965
E ++++ AR+L LV+DLD T++++ V+P E + D P+ + + +
Sbjct: 152 EAKRRLLEARRLSLVVDLDQTVIHAC----VEPTIGEW-----QSDPTNPNHEAVKDVCK 202
Query: 966 F---------PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
F P + KLRPG+ FL S+ +EMH+YTMG + YA +AK++DP +
Sbjct: 203 FQLADDAPGRPGTWYYIKLRPGLKEFLTTMSQYYEMHIYTMGTRAYAENIAKIIDPDRSV 262
Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVER 1075
F R++SR + G ++K+L+ + +++ VVIIDD VW NLI V+
Sbjct: 263 FGDRILSRDESGS--------MQAKNLKRLFPVDTKMVVIIDDRADVWSWIS-NLIKVKV 313
Query: 1076 YTYF 1079
+ +F
Sbjct: 314 FEFF 317
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 6/106 (5%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 1189
D I++ +RK+LAG +VFS V P+G + H +W A FGA +++I + TH+
Sbjct: 535 DAAAIMSDMKRKVLAGVHLVFSGVVPLGVDIQSHDMAVW--ARSFGAKVSENIGKKTTHI 592
Query: 1190 VAN-SLGTDKVNWALSTGR--FVVHPGWVEASALLYRRANEQDFAI 1232
+A+ T KV A G +V GW+ ++R +E + I
Sbjct: 593 IASPERRTAKVRQAAKKGNRIAIVGQGWLFECFSQWKRLDENPYRI 638
>gi|410076480|ref|XP_003955822.1| hypothetical protein KAFR_0B03910 [Kazachstania africana CBS 2517]
gi|372462405|emb|CCF56687.1| hypothetical protein KAFR_0B03910 [Kazachstania africana CBS 2517]
Length = 724
Score = 89.7 bits (221), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 98/186 (52%), Gaps = 30/186 (16%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHRHL 963
+K++ + +KL LV+DLD T+++ K +P +D + + E+P +
Sbjct: 161 KKRLRNEKKLVLVVDLDQTVIHCGVDPTIGEWKSDPNNPNYDTLKDVQMFALEEEP---V 217
Query: 964 FRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
F +MG + K+RPG+ F ++ + LFEMH+YTM + YA E+ K++DP G
Sbjct: 218 LPFMYMGPKPTPRKCWYYVKVRPGLKEFFKKVAPLFEMHIYTMATRAYALEITKIIDPTG 277
Query: 1015 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 1073
LF R++SR ++G SK LE + +S V+IIDD VW + NLI V
Sbjct: 278 ELFGNRILSRDENGS--------LTSKSLERLFPTDQSMVIIIDDRGDVWNWSP-NLIKV 328
Query: 1074 ERYTYF 1079
Y++F
Sbjct: 329 VPYSFF 334
Score = 48.9 bits (115), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 35/145 (24%), Positives = 70/145 (48%), Gaps = 11/145 (7%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFFS---HQSLDD--VDVRNILAAEQRKILAGCRI 1149
H++R +D L + ++H ++S ++SLD+ D++ I+ ++ + GC
Sbjct: 441 HNQRLLIDDDDELYFLKDTLAKVHNEYYSILENKSLDESGADIKLIMPKLKQSVFEGCHF 500
Query: 1150 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGR 1207
VFS + P+ + T+ FGA + I++ THV+ + GT K A ++
Sbjct: 501 VFSGLIPLHTDIRRADIVVWTS-MFGAESSSEINEVTTHVITKTPGTFKARLAKSFNSDI 559
Query: 1208 FVVHPGWVEASALLYRRANEQDFAI 1232
+VHP WV + + +E+ + +
Sbjct: 560 KIVHPDWVFECLIRWTHVDEKPYTL 584
>gi|367004465|ref|XP_003686965.1| hypothetical protein TPHA_0I00240 [Tetrapisispora phaffii CBS 4417]
gi|357525268|emb|CCE64531.1| hypothetical protein TPHA_0I00240 [Tetrapisispora phaffii CBS 4417]
Length = 732
Score = 89.7 bits (221), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 93/192 (48%), Gaps = 42/192 (21%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------ 965
+ ++ ++KL LV+DLD T+++ VDP E + D P+ R
Sbjct: 159 KSRLIRSKKLILVVDLDQTVIHCG----VDPTISEW-----KNDPSNPNYETLRNVKSFV 209
Query: 966 ------FPHMGM-----------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
P M M + K+RPG+ F E+ + ++EMH+YTM + YA E+AK
Sbjct: 210 LEEEAILPPMYMGPKPPVHKCSYYVKVRPGLKEFFEKVAPIYEMHIYTMATRAYAEEIAK 269
Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNK 1067
++DP G LF R++SR ++G K LE + +S VVIIDD VW +
Sbjct: 270 IIDPDGSLFGNRILSRDENGS--------LTHKSLERLFPTDQSMVVIIDDRGDVWNWSP 321
Query: 1068 LNLIVVERYTYF 1079
NLI V Y +F
Sbjct: 322 -NLIKVTPYNFF 332
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 10/141 (7%)
Query: 1101 RSEDGTLASSLGVIERLHKIFFSHQSL-----DDVDVRNILAAEQRKILAGCRIVFSRVF 1155
R +D L+ ++ + F+ L DD ++ ++ +R + C VFS +
Sbjct: 448 RDDDDELSYLKEILATVQSEFYKALELKNGNSDDTSIQELMPKLKRSVFKDCNFVFSGLI 507
Query: 1156 PVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG--RFVVHP 1212
P+G + LW + FGA T +I + THV+ + GT K A S VVHP
Sbjct: 508 PLGTDIQKADIVLW--TKMFGASSTSNITENTTHVITKTPGTLKARIAKSFNPSIKVVHP 565
Query: 1213 GWVEASALLYRRANEQDFAIK 1233
WV ++ +E+ F +K
Sbjct: 566 DWVFECLSTWKHVDEKSFELK 586
>gi|320581076|gb|EFW95298.1| RNA Pol II CTD phosphatase component, putative [Ogataea
parapolymorpha DL-1]
Length = 743
Score = 89.7 bits (221), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/249 (31%), Positives = 117/249 (46%), Gaps = 54/249 (21%)
Query: 885 DVEHL-FEGYDDQQKAAIQ--------KERTRRLEE-----QKKMFSARKLCLVLDLDHT 930
+VE L + G++D+ +A I K T+ E +++ KL LV+DLD T
Sbjct: 122 NVEDLDYTGFNDKDRAPISMSHGTTNLKVSTKEAENIERSSTQRLLKEEKLSLVVDLDQT 181
Query: 931 LLNSAKFHEVDPVHDEILRKK----------------EEQDREKPHRHLFRFPHMGMW-- 972
++++ VDP E + EE+ P+ + P W
Sbjct: 182 VIHAT----VDPTVGEWMSDPTNPNYESIKDVRSFCLEEEPILPPNYKGPKPPSHKRWYY 237
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
KLRPG+ FLE+ SKL+E+H+YTM + YA +AK++DP G+ F R++SR + G
Sbjct: 238 VKLRPGLQEFLEKVSKLYELHIYTMATRSYAKSIAKIIDPDGIYFGDRILSRDESGS--- 294
Query: 1033 GDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF--------PCSR 1083
K L+ + ++ S VV+IDD VW + NLI V Y +F
Sbjct: 295 -----LTQKTLKRLFPVDTSMVVVIDDRGDVWNWSP-NLIKVVPYDFFVGIGDINSSFLP 348
Query: 1084 RQFGLLGPS 1092
RQ LLGPS
Sbjct: 349 RQSTLLGPS 357
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 73/146 (50%), Gaps = 13/146 (8%)
Query: 1097 DHDERSEDGTLASSLG-VIERLHKIFF-----SHQSLDDVDVRNILAAEQRKILAGCRIV 1150
+H+ +D +LG + R+H F+ S ++L DV ++I+ + ++ + G +
Sbjct: 496 EHNLLCDDDNELETLGQALVRIHNEFYFEVEKSSKTLPDV--KDIMTSMKQLVFQGYVFL 553
Query: 1151 FSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GR 1207
S V P+G +W A+ FGA + + VTHV+ + GT KV A S
Sbjct: 554 LSGVLPLGTPLETADIVIW--AKSFGASFVANYNRSVTHVICKNPGTFKVRLAKSIDPNV 611
Query: 1208 FVVHPGWVEASALLYRRANEQDFAIK 1233
VV+P W+ + L+ + E+D+A+K
Sbjct: 612 HVVNPDWLFSCIALWEKVPEEDYAMK 637
>gi|403222664|dbj|BAM40795.1| uncharacterized protein TOT_030000057 [Theileria orientalis strain
Shintoku]
Length = 656
Score = 89.7 bits (221), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/178 (33%), Positives = 87/178 (48%), Gaps = 27/178 (15%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAK--------FHEVDPVHDEILRKKEEQDREKPHRH 962
E K RKLCLVLDLD+TL+++ ++ +L+ + E + +
Sbjct: 188 EITKYLEDRKLCLVLDLDNTLVHATSQSPPADIDVETIEISSSSVLKTIVYNETETSYCN 247
Query: 963 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
F KLRPGI+ F SK +++ L+TMG + +A ++LDP+GV F RV
Sbjct: 248 SF--------FKLRPGIFKFFRSVSKRYKLFLFTMGTRQHAQSALRILDPQGVYFGNRVF 299
Query: 1023 SRGDDGDPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
R D +R+ P K+L V+++DDS VW +KL LI V Y YF
Sbjct: 300 CRNDSRSCMKSLDRLFPNHKNL---------VLVMDDSEYVWT-SKLALIKVHPYYYF 347
>gi|6689545|emb|CAB65510.1| FCP1 serine phosphatase [Xenopus laevis]
Length = 867
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 91/171 (53%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q ++ +KL L++DLD TL+++ + H I H L R M
Sbjct: 53 EDQFRLHRNQKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 101
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+ T+LRP FLE+ +KL+E+H++T G++LYA +A LDP+ LF+ R++SR + D
Sbjct: 102 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P+ K+ +L + +S V IIDD VW NLI V++ F
Sbjct: 161 PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKMCIF 204
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ + +DDV D+RNI+ + K+L I FS + FP+
Sbjct: 486 ILVRVHTDYYAKYDRILKKEVDDVPDIRNIVPELKSKVLENVIITFSGLHPTNFPIERTR 545
Query: 1162 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H A GA +K++ D+VTH++A GT+KV A + VV+P W
Sbjct: 546 EFYH-----ARALGANISKNLILKPDDSDRVTHLIAARSGTEKVRQAQNCKHLHVVNPEW 600
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F +K
Sbjct: 601 LWSCLERWEKVEEQLFPLK 619
>gi|353236741|emb|CCA68729.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Piriformospora indica DSM 11827]
Length = 782
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/221 (30%), Positives = 103/221 (46%), Gaps = 43/221 (19%)
Query: 886 VEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHD 945
++H EG Q+ A + ER E ++ RKL L++DLD T+L++ DP
Sbjct: 126 MDHGAEGPLLSQEVAAKIER----ENTDRLLKNRKLSLIVDLDQTILHAT----FDPTVG 177
Query: 946 EILRKKEEQDREKPHRHL-------------------FRFP----HMG----MWTKLRPG 978
E ++ K+ ++ + F+ P HMG + K RPG
Sbjct: 178 EWIKAKDAFEKRRSTTPPDHDPPPESVNWPALEDVISFQLPSDHGHMGHSERYYVKPRPG 237
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
+ F+ S+L+EMH+YTMG + YA + LDP G F RV+SR + G +RV
Sbjct: 238 LQRFMNNLSELYEMHVYTMGVRSYANAICAALDPSGAWFGSRVLSRNE-----SGSDRVK 292
Query: 1039 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L +S VV+IDD VW + NL+ V + +F
Sbjct: 293 NLKRL--FPSDQSMVVVIDDRADVWNWSP-NLVRVIPFEFF 330
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 59/104 (56%), Gaps = 2/104 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DV ++ + K LAG +VFS + P+ + P P+W+ A +FGA C I+ QVTH+V
Sbjct: 462 DVLGVIPGIKAKTLAGVHLVFSGILPL-DGRPERQPIWKAALEFGATCHVDINPQVTHLV 520
Query: 1191 ANSLGTDKVNWALSTGR-FVVHPGWVEASALLYRRANEQDFAIK 1233
N LGT K + A + G FVV+ W S + + R E ++ ++
Sbjct: 521 TNKLGTVKADKAFAQGNIFVVNIKWFNDSLIKWERQPEANYLME 564
>gi|198438317|ref|XP_002131972.1| PREDICTED: similar to MGC81710 protein [Ciona intestinalis]
Length = 895
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 97/178 (54%), Gaps = 25/178 (14%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-H 968
+++ ++ KL L++DLD TL+++ + + EE+D F F H
Sbjct: 134 QDKSRLHKLNKLVLLVDLDQTLIHTTQNQAFAAMC------SEEKD-------FFTFQLH 180
Query: 969 MG---MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
++TKLRP FL+ SK +E+ + T G++LYA ++A+ +DPK FA R++SR
Sbjct: 181 KNEPTLYTKLRPYCREFLQEISKCYELQVVTFGSRLYAHKIAEFIDPKKKFFANRILSRD 240
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1082
+ +P + KS +L + +S V IIDD VW + NL++V++Y+YFP S
Sbjct: 241 ECINP------MKKSGNLRHLFPCGDSMVCIIDDRDDVWS-SAPNLVMVKKYSYFPGS 291
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 1189
D+R ++ + K+L GC IV + + P +A PH+H A Q GA +D+ TH+
Sbjct: 573 DLRGVVPFLRSKVLYGCCIVLTGIIPNNFKAAPHMHRAHIVARQLGAAINSTVDENTTHL 632
Query: 1190 VANSLGTDKVNWALSTGRFVVHP---GWVEASALLYRRANEQDF 1230
+ GT K AL G+ +H W+ A + + R +E+ F
Sbjct: 633 IGAKKGTAKYQDALKMGKKKIHMVSIEWLWACSDRWERVSEKLF 676
>gi|194886507|ref|XP_001976627.1| GG19916 [Drosophila erecta]
gi|190659814|gb|EDV57027.1| GG19916 [Drosophila erecta]
Length = 876
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V D I + H L+ PH
Sbjct: 193 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 239
Query: 970 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ T+LRPG FLER S+L+E+H+ T G + YA +A++LDP+G F+ R++SR
Sbjct: 240 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 296
Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 297 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 344
Score = 41.2 bits (95), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 60/123 (48%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ +HK F+S ++ + D++ I+ + ++L G +VFS + P + +
Sbjct: 554 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRCEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 612
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A+ GA +I ++TH+VA + GT KVN A VV+ W+ A + E+
Sbjct: 613 AKSLGAEVKPNIGKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 672
Query: 1230 FAI 1232
F +
Sbjct: 673 FPL 675
>gi|195489702|ref|XP_002092848.1| GE11441 [Drosophila yakuba]
gi|194178949|gb|EDW92560.1| GE11441 [Drosophila yakuba]
Length = 879
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 110/213 (51%), Gaps = 32/213 (15%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V D I + H L+ PH
Sbjct: 196 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 242
Query: 970 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ T+LRPG FLER S+L+E+H+ T G + YA +A++LDP+G F+ R++SR
Sbjct: 243 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 299
Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP------- 1080
+ F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 300 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFFQHTGDINA 355
Query: 1081 ---CSRRQFGLLGPSLLEIDHDERSEDGTLASS 1110
S+ + G EI + +D T +SS
Sbjct: 356 PPGLSKHELDGEGVDFKEITEKQEDKDKTESSS 388
Score = 40.8 bits (94), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 60/123 (48%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ +HK F+S ++ + D++ I+ + ++L G +VFS + P + +
Sbjct: 557 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRCEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 615
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A+ GA +I ++TH+VA + GT KVN A VV+ W+ A + E+
Sbjct: 616 AKSLGAEVKPNIGKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 675
Query: 1230 FAI 1232
F +
Sbjct: 676 FPL 678
>gi|24762673|ref|NP_611934.1| Fcp1 [Drosophila melanogaster]
gi|7291810|gb|AAF47230.1| Fcp1 [Drosophila melanogaster]
Length = 880
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V D I + H L+ PH
Sbjct: 197 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 243
Query: 970 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ T+LRPG FLER S+L+E+H+ T G + YA +A++LDP+G F+ R++SR
Sbjct: 244 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 300
Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 301 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 348
Score = 44.7 bits (104), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 61/123 (49%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ +HK F+S ++ + D++ I+ + ++L G +VFS + P + +
Sbjct: 558 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRSEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 616
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A+ GA +ID ++TH+VA + GT KVN A VV+ W+ A + E+
Sbjct: 617 AKSLGAEVKPNIDKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 676
Query: 1230 FAI 1232
F +
Sbjct: 677 FPL 679
>gi|195586452|ref|XP_002082988.1| GD24941 [Drosophila simulans]
gi|194194997|gb|EDX08573.1| GD24941 [Drosophila simulans]
Length = 877
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V D I + H L+ PH
Sbjct: 194 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 240
Query: 970 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ T+LRPG FLER S+L+E+H+ T G + YA +A++LDP+G F+ R++SR
Sbjct: 241 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 297
Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 298 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 345
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 60/123 (48%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ +HK F+S ++ + D++ I+ + ++L G +VFS + P + +
Sbjct: 555 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRSEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 613
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A+ GA +I ++TH+VA + GT KVN A VV+ W+ A + E+
Sbjct: 614 AKSLGAEVKPNIGKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 673
Query: 1230 FAI 1232
F +
Sbjct: 674 FPL 676
>gi|195353179|ref|XP_002043083.1| GM11819 [Drosophila sechellia]
gi|194127171|gb|EDW49214.1| GM11819 [Drosophila sechellia]
Length = 874
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V D I + H L+ PH
Sbjct: 191 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 237
Query: 970 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ T+LRPG FLER S+L+E+H+ T G + YA +A++LDP+G F+ R++SR
Sbjct: 238 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 294
Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 295 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 342
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 60/123 (48%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ +HK F+S ++ + D++ I+ + ++L G +VFS + P + +
Sbjct: 552 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRSEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 610
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A+ GA +I ++TH+VA + GT KVN A VV+ W+ A + E+
Sbjct: 611 AKSLGAEVKPNIGKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 670
Query: 1230 FAI 1232
F +
Sbjct: 671 FPL 673
>gi|255540897|ref|XP_002511513.1| conserved hypothetical protein [Ricinus communis]
gi|223550628|gb|EEF52115.1| conserved hypothetical protein [Ricinus communis]
Length = 161
Score = 88.6 bits (218), Expect = 2e-14, Method: Composition-based stats.
Identities = 59/170 (34%), Positives = 91/170 (53%), Gaps = 18/170 (10%)
Query: 989 LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 1048
+FEM++YT +++ A +M LDP F R+I R +G V K+ + VLG
Sbjct: 1 MFEMYVYTSSSQVNARKMMSFLDPANRYFNSRLIVR-------EGST-VMALKNPDVVLG 52
Query: 1049 MESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLA 1108
E AVVI+DD WP +K N+I VE+Y YF ++ G SL E DE + +A
Sbjct: 53 HERAVVILDDRKSAWPMHKANVINVEKYNYFASNQSDPGSKSKSLAERKKDEHTR--VMA 110
Query: 1109 SSLGVIERLHKIFFSHQSLDDV-------DVRNILAAEQRKILAGCRIVF 1151
+ L ++ ++H+ FF + L+ + DVR ++ + KIL GC+I F
Sbjct: 111 AYLRILRKIHRQFFDPK-LEAIVTAGAARDVREVMRMVRAKILKGCKICF 159
>gi|21483550|gb|AAM52750.1| SD01014p [Drosophila melanogaster]
Length = 896
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V D I + H L+ PH
Sbjct: 213 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 259
Query: 970 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ T+LRPG FLER S+L+E+H+ T G + YA +A++LDP+G F+ R++SR
Sbjct: 260 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 316
Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 317 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 364
Score = 44.3 bits (103), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 61/123 (49%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ +HK F+S ++ + D++ I+ + ++L G +VFS + P + +
Sbjct: 574 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRSEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 632
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A+ GA +ID ++TH+VA + GT KVN A VV+ W+ A + E+
Sbjct: 633 AKSLGAEVKPNIDKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 692
Query: 1230 FAI 1232
F +
Sbjct: 693 FPL 695
>gi|396499223|ref|XP_003845421.1| similar to RNA polymerase II subunit A C-terminal domain phosphatase
[Leptosphaeria maculans JN3]
gi|312222002|emb|CBY01942.1| similar to RNA polymerase II subunit A C-terminal domain phosphatase
[Leptosphaeria maculans JN3]
Length = 887
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/188 (29%), Positives = 95/188 (50%), Gaps = 21/188 (11%)
Query: 902 QKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKE 952
QKE E+ KK + A+KL L++DLD T++++ + +P H + K+
Sbjct: 224 QKEANAAEEDAKKRLLGAKKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHGAV---KD 280
Query: 953 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
+ + ++ + K RPG+ F +R SKL+EMH+YTM + YA + K++DP
Sbjct: 281 VEGFQLADDNVSNVAANWYYVKKRPGLEDFFKRMSKLYEMHVYTMATRAYAQAVCKIIDP 340
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV 1072
F R++SR ++ K+K L + + VVIIDD VW ++ +L+
Sbjct: 341 DRRYFGDRILSRDEN--------YTDKTKSLSRLFQNTTMVVIIDDRADVWQYSP-HLVR 391
Query: 1073 VERYTYFP 1080
V + +FP
Sbjct: 392 VPVFNFFP 399
>gi|21914376|gb|AAM81360.1|AF522873_3 RNA polymerase II C-terminal domain phosphatase component
[Leptosphaeria maculans]
Length = 804
Score = 88.2 bits (217), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 91/177 (51%), Gaps = 20/177 (11%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
+K++ A+KL L++DLD T++++ + +P H + K+ + + ++
Sbjct: 152 KKRLLGAKKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHGAV---KDVEGFQLADDNV 208
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
+ K RPG+ F +R SKL+EMH+YTM + YA + K++DP F R++S
Sbjct: 209 SNVAANWYYVKKRPGLEDFFKRMSKLYEMHVYTMATRAYAQAVCKIIDPDRRYFGDRILS 268
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
R ++ K+K L + + VVIIDD VW ++ +L+ V + +FP
Sbjct: 269 RDEN--------YTDKTKSLSRLFQNTTMVVIIDDRADVWQYSP-HLVRVPVFNFFP 316
>gi|367009794|ref|XP_003679398.1| hypothetical protein TDEL_0B00580 [Torulaspora delbrueckii]
gi|359747056|emb|CCE90187.1| hypothetical protein TDEL_0B00580 [Torulaspora delbrueckii]
Length = 713
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/187 (32%), Positives = 92/187 (49%), Gaps = 32/187 (17%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK---------KEEQDREKPHRH 962
+ ++ ++KL LV+DLD T+++ VDP E R K+ Q
Sbjct: 152 KTRLRESKKLVLVVDLDQTVIHCG----VDPTIGEWKRDSSNPNYEALKDVQSFALDEEP 207
Query: 963 LFRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+ +MG + K+RPG+ F ++ + LFEMH+YTM + YA E+AK++DP
Sbjct: 208 ILPLLYMGPKPPVRKCWYYVKVRPGLKEFFDKVAPLFEMHIYTMATRAYALEIAKIIDPD 267
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
G LF R++SR ++G K LE + +S VV+IDD VW NLI
Sbjct: 268 GSLFGDRILSRDENGS--------ITQKSLERLFPTDQSMVVVIDDRGDVWNWCP-NLIK 318
Query: 1073 VERYTYF 1079
V Y +F
Sbjct: 319 VVPYNFF 325
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 66/144 (45%), Gaps = 11/144 (7%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFFSHQSLDD---VDVRNILAAEQRKILAGCRIVF 1151
H++R +D L ++ ++H ++ D D++ +L + K+ C VF
Sbjct: 432 HNQRLLVDDDDELNYLREILAKVHDEYYKTLGQDSEKKADIKTLLPLMKSKVFEDCHFVF 491
Query: 1152 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDK--VNWALSTGRF 1208
S + P+G N LW FGA + IDD THV+ + GT K + A +
Sbjct: 492 SGLIPLGTNINRADIVLW--TNMFGAQSSPEIDDSTTHVITRTPGTYKARIAKAFNENIK 549
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
VVHP WV + ++ +E+ + +
Sbjct: 550 VVHPDWVFECLVSWKHVDEKPYEL 573
>gi|164658688|ref|XP_001730469.1| hypothetical protein MGL_2265 [Malassezia globosa CBS 7966]
gi|159104365|gb|EDP43255.1| hypothetical protein MGL_2265 [Malassezia globosa CBS 7966]
Length = 364
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 96/212 (45%), Gaps = 47/212 (22%)
Query: 901 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV-----HD-------- 945
+ +E R+ E+ + + RKL L++DLD T+++ VDP HD
Sbjct: 22 VSREEAIRMDSEDTRHLIEQRKLALIVDLDQTIIHVT----VDPTVKEWAHDPKNPNWCM 77
Query: 946 --------------EILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKL 989
+ + E D+ G W KLRPG+ FL+ S +
Sbjct: 78 LKDVVAFQLGSDGKTVSHQPERMDQHDVKSFATDGDENGCWYYVKLRPGLQAFLQSVSPM 137
Query: 990 FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG--DERVPKSKDLEGVL 1047
+EMH+YTMG + YA + +++DP G LF R++SR ++G+ P S D+
Sbjct: 138 YEMHVYTMGTRSYADCICRIVDPDGHLFGARILSRDENGNEVQKSLSRLFPISTDM---- 193
Query: 1048 GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
VV+IDD VW + NLI VE Y +F
Sbjct: 194 -----VVVIDDRADVWSWSP-NLIKVEPYEFF 219
>gi|195029035|ref|XP_001987380.1| GH21892 [Drosophila grimshawi]
gi|193903380|gb|EDW02247.1| GH21892 [Drosophila grimshawi]
Length = 889
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 92/173 (53%), Gaps = 24/173 (13%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V D I K H +
Sbjct: 174 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI----------KGIYHFQLYGPQ 219
Query: 970 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
W T+LRPG FLER S+L+E+H+ T G + YA +A++LDP G F+ R++SR
Sbjct: 220 SPWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 276
Query: 1028 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ K+ +L+ + +S V IIDD VW NLI V+ Y +F
Sbjct: 277 DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVWSMAS-NLIQVKPYHFF 325
Score = 43.9 bits (102), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 60/119 (50%), Gaps = 4/119 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ +HK F++ ++ + D++ ++ + ++L G +VFS + P + +
Sbjct: 566 ILRNIHKRFYAIYDETTEIPDLKIVVPKIRSEVLRGQNLVFSGLVPT-QMKMEQSRAYFI 624
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQ 1228
A+ GA +I+ +TH+VA + GT KVN A VV+ W+ A A + +E+
Sbjct: 625 AKSLGAEVKSNINKDITHLVAVNAGTYKVNAAKKEANIRVVNANWLWACAERWEHVDER 683
>gi|366991271|ref|XP_003675401.1| hypothetical protein NCAS_0C00420 [Naumovozyma castellii CBS 4309]
gi|342301266|emb|CCC69032.1| hypothetical protein NCAS_0C00420 [Naumovozyma castellii CBS 4309]
Length = 725
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 97/195 (49%), Gaps = 42/195 (21%)
Query: 909 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR--- 965
L + ++ +KL LV+DLD T+++ VDP E + D + P+ +
Sbjct: 157 LNVRTRLRKEKKLVLVVDLDQTVIHCG----VDPTIGEW-----KNDPKNPNFETLKDVK 207
Query: 966 ---------FPHMGM-----------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATE 1005
P + M + K+RPG+ FLE+ + LFEMH+YTM + YA+E
Sbjct: 208 QFSLEEEPILPTLYMGPKPPLRKCWYYVKVRPGLKEFLEKIAPLFEMHIYTMATRAYASE 267
Query: 1006 MAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWP 1064
+AK++DP G LF R++SR ++G +K LE + +S V++IDD VW
Sbjct: 268 IAKIIDPNGDLFGDRILSRDENGS--------MTTKSLERLFPTDQSMVIVIDDRGDVWN 319
Query: 1065 HNKLNLIVVERYTYF 1079
+ NLI V Y +F
Sbjct: 320 WSP-NLIKVVPYNFF 333
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 68/144 (47%), Gaps = 11/144 (7%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFFSHQSLD---DVDVRNILAAEQRKILAGCRIVF 1151
HD+R +D L + R+H +++ D D++ I+ A ++ +L CR VF
Sbjct: 442 HDQRLLIDDDDELYHLKDTLVRIHDKYYNVLKKDGELKADIKLIMPAMKQNVLDNCRFVF 501
Query: 1152 SRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG--RF 1208
S + P+G +W FGA+ T ID++ THV+ + GT K A S
Sbjct: 502 SGLIPLGTDIQRADIVIWTNT--FGAISTSDIDEKTTHVITKTPGTYKARLAKSFNPKIK 559
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
++HP WV ++ +E+ + +
Sbjct: 560 ILHPDWVFECLTSWKHVDEKPYEL 583
>gi|302306421|ref|NP_982820.2| ABL127Wp [Ashbya gossypii ATCC 10895]
gi|299788508|gb|AAS50644.2| ABL127Wp [Ashbya gossypii ATCC 10895]
gi|374106022|gb|AEY94932.1| FABL127Wp [Ashbya gossypii FDAG1]
Length = 728
Score = 87.8 bits (216), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 64/187 (34%), Positives = 90/187 (48%), Gaps = 32/187 (17%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK------------- 958
Q K+ ARKL LV+DLD T+++ VDP E + + E
Sbjct: 157 QLKLREARKLVLVVDLDQTVIHCG----VDPTIGEWSKDPNNPNYEALKDVQSFSLDEEP 212
Query: 959 ---PHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
P + P W KLRPG+ F + + FE+H+YTM + YA E+AK++DP
Sbjct: 213 VLPPFYMGPKPPTRKCWYYVKLRPGLKEFFAKIAPHFELHIYTMATRAYALEIAKIIDPD 272
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
G LF R++SR ++G K LE + M +S VV+IDD VW + NLI
Sbjct: 273 GKLFGDRILSRDENGS--------LTQKSLERLFPMDQSMVVVIDDRGDVWNWCE-NLIK 323
Query: 1073 VERYTYF 1079
V Y +F
Sbjct: 324 VVPYDFF 330
Score = 47.8 bits (112), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 47/104 (45%), Gaps = 3/104 (2%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
D++ ++ + + GCR VFS + P+ N + FGA T ++D TH++
Sbjct: 474 DIKILMPKLKETVFEGCRFVFSGLIPL-HTNIERADIVLWTNMFGASTTANLDYNTTHLI 532
Query: 1191 ANSLGTDKVNWA--LSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
+ GT K A + +VHP W+ + + R E + +
Sbjct: 533 TRTPGTMKARLAKSFNPAIKIVHPDWIFECLVGWERVEESPYEL 576
>gi|336374248|gb|EGO02585.1| hypothetical protein SERLA73DRAFT_102556 [Serpula lacrymans var.
lacrymans S7.3]
Length = 811
Score = 87.4 bits (215), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/217 (29%), Positives = 100/217 (46%), Gaps = 40/217 (18%)
Query: 890 FEGYDDQQKAAIQKER-----TRRLEEQKK--------MFSARKLCLVLDLDHTLLNSAK 936
+ G+ D +A+IQ T LEE +K + ++RKL L++DLD T++++
Sbjct: 117 YTGFSDASRASIQMTHSAFGPTVSLEEAQKIEKETADHLLNSRKLSLIVDLDQTIVHAT- 175
Query: 937 FHEVDPV-------------HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFL 983
VDP + E L+ + K + + K RPG FL
Sbjct: 176 ---VDPTVATDSESDDECNPNWEALKDVRKFQLVKGKQKFIENEGCMYYIKPRPGWQHFL 232
Query: 984 ERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDL 1043
+ +EMH+YTMG + YA E+ +DP G +F GR++SR + G K L
Sbjct: 233 HSIANKYEMHVYTMGTRAYAEEVCAAIDPDGTIFGGRILSRDESGS--------LTQKSL 284
Query: 1044 EGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ + + S VVIIDD VW + NL+ V Y +F
Sbjct: 285 QRLFPCDTSMVVIIDDRADVWEWSP-NLVKVIPYDFF 320
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 2/103 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DV I+ + +++ L G I+FS V P+ + P +W+ AE FGA C + ++THVV
Sbjct: 494 DVTKIIPSMRKETLDGIHILFSSVIPL-DTKPETTEIWKVAEMFGAQCCTELSSRITHVV 552
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
A GT KV+ A G +V W S L+ +E + +
Sbjct: 553 AAKHGTVKVDAARKRGGIKIVWLSWFTDSIALWSHQDETPYLL 595
>gi|317027693|ref|XP_001399857.2| RNA polymerase II subunit A C-terminal domain phosphatase
[Aspergillus niger CBS 513.88]
Length = 800
Score = 87.0 bits (214), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 58/188 (30%), Positives = 97/188 (51%), Gaps = 32/188 (17%)
Query: 895 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + ++ R+EE ++++ + RKL LV+DLD T++++ VDP E + KE
Sbjct: 132 DNTTLTVSEQEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDKE 187
Query: 953 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
+ + R W + +FL+ S+++E+H+YTMG + YA +A ++DP
Sbjct: 188 NPNYQASER----------WLE------SFLQNVSEMYELHIYTMGTRSYAQHIASIIDP 231
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 1071
LF R++SR + G +K+L + +++ VVIIDD VW N NLI
Sbjct: 232 DRKLFGDRILSRDESGSLV--------AKNLHRLFPVDTKMVVIIDDRGDVWRWNP-NLI 282
Query: 1072 VVERYTYF 1079
V Y +F
Sbjct: 283 KVSPYDFF 290
Score = 45.4 bits (106), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV+ I+ + +IL G +VFS V P+G + LW A+ FGAV I+ TH+
Sbjct: 478 DVKIIMPNMKHRILGGVTLVFSGVLPLGTDTQNADIXLW--AKSFGAVIASKINMXTTHL 535
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A + +V W+ ++R E+ + +
Sbjct: 536 VAGRNRTXKVREATRYPKVKIVTTQWLLDCLTQWKRLAEEPYLL 579
>gi|269860082|ref|XP_002649764.1| carboxy-terminal domain (CTD) phosphatase [Enterocytozoon bieneusi
H348]
gi|220066823|gb|EED44294.1| carboxy-terminal domain (CTD) phosphatase [Enterocytozoon bieneusi
H348]
Length = 409
Score = 87.0 bits (214), Expect = 6e-14, Method: Composition-based stats.
Identities = 48/166 (28%), Positives = 84/166 (50%), Gaps = 28/166 (16%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
+++ +KL L LDLD TL+++ +KP F+ ++ +
Sbjct: 98 ELYHNKKLILFLDLDQTLIHATL-------------------SKKPCNFSFKLHNIEFFI 138
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
K RPG+ FL + S+ FE H+YTMG + YA + K+LDP + F R+++R ++ F
Sbjct: 139 KKRPGLDKFLSKLSRFFEFHVYTMGTREYANYICKILDPNKIFFGDRIVTRTENNKMF-- 196
Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K LE + + V+I+DD V VW + N+ +++ + Y+
Sbjct: 197 ------KKYLERITNFSNNVIILDDRVDVWGFSP-NVFLIKPFYYY 235
>gi|291234950|ref|XP_002737409.1| PREDICTED: RNA polymerase II ctd phosphatase, putative-like
[Saccoglossus kowalevskii]
Length = 896
Score = 86.7 bits (213), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 96/172 (55%), Gaps = 21/172 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+++++ +RKL ++DLD T++++ +D V + + ++ H L+ P
Sbjct: 169 EDEQRLLKSRKLVCIVDLDQTIIHTT----MDNVPENL--------KDVYHFQLWSGPQY 216
Query: 970 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ T++RP FLE+ SKL+E+H++T G +LYA +A +DP LF+ R++SR
Sbjct: 217 PWFHTRIRPKCKEFLEKISKLYELHIFTFGARLYAHMIAGFIDPDKKLFSHRIVSR---D 273
Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ FD K+ +L+ + ++ V IIDD VW + N+I V+ Y YF
Sbjct: 274 ECFDAS---SKTANLQAIFPCGDNMVCIIDDREDVW-NFAPNMIHVKPYHYF 321
Score = 47.4 bits (111), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 48/104 (46%), Gaps = 12/104 (11%)
Query: 1141 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTK----HIDDQ------VTHVV 1190
R++L G I+FS VFP +P W+ A+ GA + D+ THVV
Sbjct: 576 RQVLKGTNILFSGVFPTN-MSPEKSRAWKVAQTLGANVQSSFVPKLKDKTNAATATTHVV 634
Query: 1191 ANSLGTDKVNWALST-GRFVVHPGWVEASALLYRRANEQDFAIK 1233
A GT KV A T G +V P W+ + R +E+ F +K
Sbjct: 635 AAKAGTVKVKQAQCTRGIHIVTPEWLWCCYDRWERVDERIFRLK 678
>gi|254586061|ref|XP_002498598.1| ZYRO0G14168p [Zygosaccharomyces rouxii]
gi|238941492|emb|CAR29665.1| ZYRO0G14168p [Zygosaccharomyces rouxii]
Length = 764
Score = 86.3 bits (212), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 92/187 (49%), Gaps = 32/187 (17%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDP----------------VHDEILRKKEEQD 955
++++ ++KL LV+DLD T+++ VDP + D + EE+
Sbjct: 156 KQRLRQSKKLVLVVDLDQTVIHCG----VDPTIGEWKKDPSNPNYETLKDVQMFSLEEEP 211
Query: 956 REKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
P R P W K+RPG+ F + + L+EMH+YTM + YA E+AK++DP
Sbjct: 212 IVPPMYMGPRLPERKCWYFVKVRPGLREFFAQLAPLYEMHIYTMATRTYALEIAKIIDPD 271
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
G LF R++SR ++G K LE + +S V++IDD VW NLI
Sbjct: 272 GSLFGDRILSRDENGS--------LTQKSLERLFPTDQSMVIVIDDRGDVWNWCP-NLIK 322
Query: 1073 VERYTYF 1079
V Y +F
Sbjct: 323 VVPYNFF 329
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/136 (24%), Positives = 61/136 (44%), Gaps = 3/136 (2%)
Query: 1099 DERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG 1158
D+ E L +L I R + + + D+++++ + ++ C VFS + P+G
Sbjct: 443 DDDDELYYLKDTLAKIHRKYYKLLDEGNENSADIQSLVPTMKGQVFKDCYFVFSGLIPLG 502
Query: 1159 EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRFVVHPGWVE 1216
N + FGA+ + ID+ THV+ + GT K A + VVHP WV
Sbjct: 503 -TNIRRADIVLWTGMFGAITSSEIDENTTHVITKTPGTYKARIAKGFNENVKVVHPDWVF 561
Query: 1217 ASALLYRRANEQDFAI 1232
+ + +E+ + +
Sbjct: 562 ECLVTWTHVDEKPYEL 577
>gi|195121496|ref|XP_002005256.1| GI20391 [Drosophila mojavensis]
gi|193910324|gb|EDW09191.1| GI20391 [Drosophila mojavensis]
Length = 880
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/173 (32%), Positives = 93/173 (53%), Gaps = 24/173 (13%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V D I K H +
Sbjct: 178 DDTRRLLTDRKLVLLVDLDQTVIHTTN----DTVPDNI----------KGIYHFQLYGPQ 223
Query: 970 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
W T+LRPG FLE+ S+L+E+H+ T G + YA +A++LDP G F+ R++SR
Sbjct: 224 SPWYHTRLRPGTAEFLEKMSELYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 280
Query: 1028 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 281 DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 329
Score = 42.4 bits (98), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 61/123 (49%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ +HK F++ ++ + D++ I+ + ++L G +VFS + P + +
Sbjct: 556 ILRNIHKRFYAIYDETTEIPDLKIIVPKIRCEVLRGQVLVFSGLVPT-QMKLEQSRAYFI 614
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A+ GA +I VTH+VA + GT KVN A VV+ W+ A A + +E+
Sbjct: 615 AKSLGAEVQSNISKDVTHLVAVNAGTYKVNAAKKEPNIKVVNANWLWACAERWEHVDERL 674
Query: 1230 FAI 1232
F +
Sbjct: 675 FPL 677
>gi|167520468|ref|XP_001744573.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776904|gb|EDQ90522.1| predicted protein [Monosiga brevicollis MX1]
Length = 858
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 91/171 (53%), Gaps = 18/171 (10%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E K+ ARKL L+LDLD TL++S +D + LR E + H F
Sbjct: 56 ENANKLLEARKLILILDLDKTLIHST----IDSIASHWLR---EGVYDIFH---FDLGKH 105
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
+TK+RPG+ FLE +EMH+YTMG + YA + +++DP F+ R++++ D+
Sbjct: 106 TYYTKVRPGLHAFLEDLYPYYEMHIYTMGRRNYAERILRIIDPSNRFFSTRILTQ-DESF 164
Query: 1030 PFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ K+K+L+ +L G +S VI+DD VW + N++ Y +F
Sbjct: 165 SIEN-----KAKNLDALLPGGDSMAVILDDLPAVWDF-QTNVVPALPYEFF 209
Score = 47.4 bits (111), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 9/98 (9%)
Query: 1144 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHI-------DDQVTHVVANSLGT 1196
LAGC ++F+ + P G+ + H W+ A GA + +VTHV+A GT
Sbjct: 592 LAGCILLFTGIIPKGQ-DVATHRAWRQAVAMGARVVDEVKFASILMSRRVTHVIARKAGT 650
Query: 1197 DKVNWALST-GRFVVHPGWVEASALLYRRANEQDFAIK 1233
+K+N AL G F+V W+E + RA E + ++
Sbjct: 651 EKLNQALDYPGVFLVSLRWLEDTFHQGARAKESKYPLE 688
>gi|448097224|ref|XP_004198617.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
gi|359380039|emb|CCE82280.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
Length = 830
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/203 (33%), Positives = 95/203 (46%), Gaps = 48/203 (23%)
Query: 901 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 960
I+ T RL E+KK L LV+DLD T++++ VDP E + D P+
Sbjct: 152 IETNTTDRLVEEKK------LILVVDLDQTVIHAT----VDPTVGEW-----QSDPSNPN 196
Query: 961 RHLFR---------------------FPHMGMW--TKLRPGIWTFLERASKLFEMHLYTM 997
+ P W K+RPG+ FLE+ SKL+EMH+YTM
Sbjct: 197 YKAVKDVKSFCLEEESIAPLGWEGPKLPATKCWYYVKVRPGLEQFLEQISKLYEMHIYTM 256
Query: 998 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVII 1056
+ YA E+AK++DP G F R++SR + G K+L+ + + +S V II
Sbjct: 257 ATRNYALEIAKIIDPNGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVAII 308
Query: 1057 DDSVRVWPHNKLNLIVVERYTYF 1079
DD VW NLI V Y +F
Sbjct: 309 DDRGDVWQWEN-NLIKVVPYDFF 330
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 31/135 (22%), Positives = 61/135 (45%), Gaps = 17/135 (12%)
Query: 1114 IERLHKIFFSHQSLDDV----DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLW 1168
+ER+H +++ + + D+ I+ +R L G I+FS + P+G N +W
Sbjct: 523 LERIHNAYYTDLKVAENGQRPDLTEIIPNLKRTCLEGITILFSGILPLGTNYNNADIVIW 582
Query: 1169 QTAEQFGAVCTKHIDDQVTHVVANSLGTDK----VNWALSTGRFV------VHPGWVEAS 1218
+QFG + +VTH+VA T + + R + V+P W+
Sbjct: 583 --CKQFGVKVVNEVVPEVTHIVARDPSTSNYKTGFTFKVRVAREILPSAKLVNPDWLFMC 640
Query: 1219 ALLYRRANEQDFAIK 1233
+++ +E+D+ I+
Sbjct: 641 LSSWKKVDEEDYLIQ 655
>gi|8778093|gb|AAF79202.1| CTD phosphatase-like protein [Emericella nidulans]
Length = 409
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 99/188 (52%), Gaps = 34/188 (18%)
Query: 908 RLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR---- 961
R+EE ++++ + RKL LV+DLD T++++A VDP E + D++ P+
Sbjct: 42 RVEEDAKRRLLANRKLSLVVDLDQTIIHAA----VDPTIGEWM-----ADKDNPNHAPVS 92
Query: 962 --HLFRFPHMG-------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
F+ G + KLRPG+ FL+ + ++E+H+YTMG + YA +A ++DP
Sbjct: 93 DVRAFQLVDDGPGMRGLLVLCKLRPGLEEFLKNVADMYELHIYTMGTRSYAQAIANIIDP 152
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLI 1071
LF R++SR + G K+L + +++ VVIIDD VW + NLI
Sbjct: 153 DRKLFGDRILSRDESGS--------LSVKNLHRIFPVDTKMVVIIDDRGDVWRWSP-NLI 203
Query: 1072 VVERYTYF 1079
V Y +F
Sbjct: 204 KVIPYDFF 211
>gi|198460927|ref|XP_001361849.2| GA11510 [Drosophila pseudoobscura pseudoobscura]
gi|198137180|gb|EAL26428.2| GA11510 [Drosophila pseudoobscura pseudoobscura]
Length = 873
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/173 (32%), Positives = 93/173 (53%), Gaps = 24/173 (13%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V + I K H +
Sbjct: 180 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPENI----------KGIYHFQLYGPQ 225
Query: 970 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
W T+LRPG FLER S+L+E+H+ T G + YA +A++LDP G F+ R++SR
Sbjct: 226 SPWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 282
Query: 1028 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 283 DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 331
>gi|340377687|ref|XP_003387360.1| PREDICTED: hypothetical protein LOC100639785 [Amphimedon
queenslandica]
Length = 913
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/187 (34%), Positives = 93/187 (49%), Gaps = 26/187 (13%)
Query: 898 KAAIQKERTRRLEEQKK--MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
+ + K+ +RL K + RKL L++DLD TL +H I R E
Sbjct: 125 QVKVNKKEAQRLGNLDKECLLKNRKLALIIDLDQTL-----------IHTSIDRNIE--- 170
Query: 956 REKPHRHLFRFP-HMGMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
R P H F P H ++ +LRP + FL S+ +E+H+ TMG + YA + K+LD +
Sbjct: 171 RGLPDVHSFTLPGHSCVYHCRLRPYVREFLNHISQYYELHVATMGTRDYADAITKILDQE 230
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
LF+ RVISR + DP K+ L+ V + V I+DD VW H + NLI
Sbjct: 231 KKLFSHRVISRNELLDPHS------KAVRLKSVFPCGDEMVAIMDDRGDVWGH-RPNLIH 283
Query: 1073 VERYTYF 1079
V+ Y +F
Sbjct: 284 VKAYVFF 290
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 64/147 (43%), Gaps = 28/147 (19%)
Query: 1113 VIERLHKIFFSH-------------------QSLDDVDVRNILAAEQRKILAGCRIVFSR 1153
++ER+H+ F+ +SL D++ I+ + ++ + +G R++F+
Sbjct: 604 LLERIHETFYKEYDEMKREEEENKGGRDTGSESLSSPDLKVIIPSLRKSVFSGLRLLFTG 663
Query: 1154 VFPVGEANPHLHPLWQTAEQFGA------VCTKHIDDQ-VTHVVANSLGTDKVNWALST- 1205
+ P NP W T FG V T DD+ TH++ TDK AL
Sbjct: 664 IIPTN-MNPEKSREWNTTRAFGGEIHSTLVTTPAPDDEHTTHIIVGKKDTDKYKQALRIP 722
Query: 1206 GRFVVHPGWVEASALLYRRANEQDFAI 1232
G +V P W A ++R +E ++ +
Sbjct: 723 GIEIVSPEWFWDCAERWKRLDEDEYRL 749
>gi|195440020|ref|XP_002067857.1| GK12500 [Drosophila willistoni]
gi|194163942|gb|EDW78843.1| GK12500 [Drosophila willistoni]
Length = 657
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/172 (32%), Positives = 92/172 (53%), Gaps = 22/172 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ DPV + I K H +
Sbjct: 181 DDTRRLLNDRKLVLLVDLDQTIIHTTN----DPVPENI----------KGIHHFQLYGSQ 226
Query: 970 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
W T LRPG FLER S+++E+H+ T G + YA +A+++DP+G LF+ R++SR
Sbjct: 227 SPWYHTCLRPGTTQFLERMSQMYELHICTFGARKYAHMIAQLIDPEGKLFSHRILSR--- 283
Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ ++ K L + V IIDD VW + NLI V+ Y +F
Sbjct: 284 DECFNATSKMDNLKAL--FPNGDKMVCIIDDREDVW-NMATNLIQVKPYHFF 332
>gi|226288832|gb|EEH44344.1| RNA polymerase II C-terminal domain phosphatase component
[Paracoccidioides brasiliensis Pb18]
Length = 920
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 98/201 (48%), Gaps = 34/201 (16%)
Query: 895 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + K R+EE K L V+DLD T++++ VDP E
Sbjct: 103 DNSSLTVSKSEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 153
Query: 953 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
+QDR+ P+ + P M G W KLRPG+ FL+ S L+E+H+YTMG
Sbjct: 154 QQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLQEFLQEISALYELHIYTMGT 213
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
+ YA +A ++DP +F R++SR + G +K+L+ + +++ VVIIDD
Sbjct: 214 RAYAQNIAAIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 265
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW + NLI V Y +F
Sbjct: 266 RGDVWKWSD-NLIKVSPYDFF 285
>gi|295671060|ref|XP_002796077.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284210|gb|EEH39776.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 829
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 98/201 (48%), Gaps = 34/201 (16%)
Query: 895 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + K R+EE K L V+DLD T++++ VDP E
Sbjct: 228 DNSSLTVSKSEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 278
Query: 953 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
+QDR+ P+ + P M G W KLRPG+ FL+ S L+E+H+YTMG
Sbjct: 279 QQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLQEFLQEISALYELHIYTMGT 338
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDD 1058
+ YA +A ++DP +F R++SR + G +K+L+ + +++ VVIIDD
Sbjct: 339 RAYAQNIATIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 390
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW + NLI V Y +F
Sbjct: 391 RGDVWKWSD-NLIKVSPYDFF 410
>gi|406602036|emb|CCH46356.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Wickerhamomyces ciferrii]
Length = 720
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/221 (31%), Positives = 105/221 (47%), Gaps = 42/221 (19%)
Query: 901 IQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK------- 951
I K +++E+ K + KL LV+DLD T++++ VDP E + +
Sbjct: 153 ISKSEAQKVEQLMTKNLIKENKLILVVDLDQTVIHAT----VDPTIGEWMNDQSNPNFPS 208
Query: 952 ---------EEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNK 1000
EE+ P R P W K+RPG+ FL+R +K++E+H+YTMG K
Sbjct: 209 LKDVQYFSLEEEPILPPGYQGPRPPTHKRWYYVKMRPGLEDFLKRIAKIYELHIYTMGTK 268
Query: 1001 LYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
YA +AK++DP G F R++SR + G K LE + + S VVIIDD
Sbjct: 269 EYARSIAKIIDPDGEYFGERILSRDESGS--------LTQKSLERLFPTDTSMVVIIDDR 320
Query: 1060 VRVWPHNKLNLIVVERYTYF--------PCSRRQFGLLGPS 1092
VW + +LI V + +F +Q LLGP+
Sbjct: 321 GDVWNWSD-HLIKVVPFDFFVGIGDINSNFLPKQKSLLGPT 360
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 9/144 (6%)
Query: 1096 IDHDERSEDGTLASSLGVIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSR 1153
ID +D L + +H+ F+S ++ DV+ IL ++ ++ G VFS
Sbjct: 462 IDRLLYDDDDELVGLEKALNEIHETFYSILKKNQTHPDVKEILPQKKHEVFGGFNFVFSG 521
Query: 1154 VFPVGE--ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRFV 1209
+FP+G N H+ +W + FGA + I + THV+ S T K A + V
Sbjct: 522 LFPIGGNINNEHI-VIW--VKSFGASVSTDITMETTHVITKSPHTYKARLAKTMIPNVKV 578
Query: 1210 VHPGWVEASALLYRRANEQDFAIK 1233
VHP W+ S +R A++ F IK
Sbjct: 579 VHPNWLFDSMSNWRPADDIPFEIK 602
>gi|195429765|ref|XP_002062928.1| GK19439 [Drosophila willistoni]
gi|194159013|gb|EDW73914.1| GK19439 [Drosophila willistoni]
Length = 827
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 57/173 (32%), Positives = 93/173 (53%), Gaps = 24/173 (13%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V D I K H +
Sbjct: 180 DDTRRLLADRKLVLLVDLDQTVIHTTN----DVVPDNI----------KGIYHFQLYGPQ 225
Query: 970 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
W T+LRPG FL+R S L+E+H+ T G + YA +A++LDP+G F+ R++SR
Sbjct: 226 SPWYHTRLRPGTADFLDRMSHLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR--- 282
Query: 1028 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 283 DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 331
Score = 40.4 bits (93), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 60/123 (48%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ +HK F++ ++ + D++ I+ + ++L G +VFS + P + +
Sbjct: 505 ILRNIHKRFYAIYDETREIPDLKIIVPKIRCEVLRGQNLVFSGLVPT-QMQLEQSRAYFI 563
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A+ GA +I +TH+VA + GT KVN A VV+ W+ A A + E+
Sbjct: 564 AKSLGAEVQSNIGKDITHLVAVNAGTYKVNAAKRDEHIKVVNANWLWACAERWEHVEEKL 623
Query: 1230 FAI 1232
F +
Sbjct: 624 FPL 626
>gi|332029822|gb|EGI69691.1| RNA polymerase II subunit A C-terminal domain phosphatase [Acromyrmex
echinatior]
Length = 749
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 92/177 (51%), Gaps = 32/177 (18%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH-RHLFRFPH 968
E+++++ + RKL L++DLD T++++ D P+ + +F F
Sbjct: 145 EDEQRLLTDRKLVLLVDLDQTIVHTT------------------NDNIPPNLKDVFHFQL 186
Query: 969 MGM---W--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
G+ W T+LRP FL S+L+E+H+ T G ++YA +A +LD GVLF+ R++S
Sbjct: 187 YGLNSPWYHTRLRPNTRHFLSEMSRLYELHICTFGARIYAHTVASLLDKDGVLFSHRILS 246
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
R + DP K+ +L+ + + V IIDD VW NL+ V+ Y +F
Sbjct: 247 RDECFDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 296
Score = 47.0 bits (110), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 61/123 (49%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFSHQSLDDV--DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ R+H F++ ++ +R+I+ + ++L G + FS + P + H ++
Sbjct: 443 ILRRIHTEFYATLDQENTRKSLRDIIPRVRSQVLKGLCLTFSGLIPTHQ-KLHQSRAYKV 501
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL-STGRFVVHPGWVEASALLYRRANEQD 1229
A FGA T+ + ++ TH+VA GT K N A T +V+ W+ A + +E+
Sbjct: 502 ARAFGAEVTQDLTEKTTHLVAIRKGTAKANAARKDTNIKIVNSDWLWTCAERWEHVDERL 561
Query: 1230 FAI 1232
F +
Sbjct: 562 FPL 564
>gi|448111257|ref|XP_004201796.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
gi|359464785|emb|CCE88490.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
Length = 830
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 90/190 (47%), Gaps = 42/190 (22%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR-------- 965
++ +KL LV+DLD T++++ VDP E + D P+ +
Sbjct: 159 RLVDEKKLILVVDLDQTVIHAT----VDPTVGEW-----QSDPSNPNYKAVKDVKSFCLE 209
Query: 966 -------------FPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 1010
P W K+RPG+ FLE+ SKL+EMH+YTM + YA E+AK++
Sbjct: 210 EESIAPLGWEGPKLPATKCWYYVKVRPGLEEFLEQISKLYEMHIYTMATRNYALEIAKII 269
Query: 1011 DPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLN 1069
DP G F R++SR + G K+L+ + + +S V IIDD VW N
Sbjct: 270 DPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVAIIDDRGDVWQWEN-N 320
Query: 1070 LIVVERYTYF 1079
LI V Y +F
Sbjct: 321 LIKVVPYDFF 330
Score = 44.3 bits (103), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 62/134 (46%), Gaps = 17/134 (12%)
Query: 1114 IERLHKIFFSH----QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLW 1168
+ER+H +++ ++ + D+ I+ +RK L G I+FS + P+G N +W
Sbjct: 524 LERIHNAYYTDLKEARNGERPDLTEIIPNLKRKCLEGITILFSGILPLGTNYNNADIVIW 583
Query: 1169 QTAEQFGAVCTKHIDDQVTHVVANSLGTDK----VNWALSTGRFV------VHPGWVEAS 1218
+QFG + +VTH+VA T + + R V V+P W+
Sbjct: 584 --CKQFGVRVVNEVVPEVTHIVARDPSTSNYRTGFTFKVRVARKVLPSAKLVNPDWLFMC 641
Query: 1219 ALLYRRANEQDFAI 1232
+++ +E+D+ I
Sbjct: 642 LSSWKKVDEEDYLI 655
>gi|403217618|emb|CCK72111.1| hypothetical protein KNAG_0J00280 [Kazachstania naganishii CBS 8797]
Length = 742
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 90/188 (47%), Gaps = 43/188 (22%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR----------- 965
+A+KL LV+DLD T+++ VDP E R D P+ R
Sbjct: 176 AAQKLVLVVDLDQTVVHCG----VDPTIGEWKR-----DPRNPNYEALRDVQSFALEEEP 226
Query: 966 ---FPHMG----------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
F ++G + K+RPG+ F +R + LFEMH+YTM + YA E+AK++DP
Sbjct: 227 ILPFLYVGGKRPAPRKCWYYVKVRPGLKQFFKRLAPLFEMHIYTMATRAYALEIAKIIDP 286
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLI 1071
LF R++SR ++G K LE + +S V +IDD VW + NLI
Sbjct: 287 DKSLFGDRILSRDENGS--------LTHKSLERLFPTDQSMVTVIDDRGDVW-NWCANLI 337
Query: 1072 VVERYTYF 1079
V Y +F
Sbjct: 338 KVVPYNFF 345
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/120 (23%), Positives = 59/120 (49%), Gaps = 5/120 (4%)
Query: 1117 LHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQF 1174
LHK ++S ++ D ++++I+ + ++ C VFS + P+ N + +F
Sbjct: 486 LHKRYYSTLKENPQDTNIQSIMTQLKTRVFQDCHFVFSGLIPLN-TNVQAADIVIWTNKF 544
Query: 1175 GAVCTKHIDDQVTHVVANSLGTDKVNW--ALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
GA T +D+ TH++ + GT+K A + +VHP W+ + + R + + + +
Sbjct: 545 GATTTNDLDETTTHLITKTPGTNKARLAKAFNPDIKIVHPDWIFECLIKWERVDVKPYLL 604
>gi|169600911|ref|XP_001793878.1| hypothetical protein SNOG_03310 [Phaeosphaeria nodorum SN15]
gi|160705543|gb|EAT90041.2| hypothetical protein SNOG_03310 [Phaeosphaeria nodorum SN15]
Length = 810
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 95/180 (52%), Gaps = 17/180 (9%)
Query: 908 RLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEV-----DPVHDEILRKKEEQDREKPH 960
R EE +K++ +++KL L++DLD T++++ V DP + K+ + +
Sbjct: 146 RAEEDTKKRLLNSKKLTLIVDLDQTVIHTTCERTVAEWQADPENPNYEAVKDVKGFQLAD 205
Query: 961 RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
+L + K+RPG+ F ++ SKL+EMH+YTM + YA + K++DP F R
Sbjct: 206 DNLSNVAANWYYVKMRPGLKEFFDKMSKLYEMHVYTMATRAYAQAIMKIIDPDRKYFGDR 265
Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
++SR ++ K K+L + +A VVIIDD VW ++ +L+ V + +F
Sbjct: 266 ILSRDEN--------YTDKLKNLTRLFYQNTAMVVIIDDRADVWQYSP-HLVRVPVFNFF 316
Score = 44.7 bits (104), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 56/114 (49%), Gaps = 7/114 (6%)
Query: 1125 QSLDDV--DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKH 1181
++L D+ DV I+ + ++L G +VFS + P+G + + LW FGA T
Sbjct: 507 RALSDIIPDVAEIMPRIKEEVLDGVVVVFSGIIPLGVDVQTSDYALW--IRSFGAEVTTS 564
Query: 1182 IDDQVTHVVAN-SLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 1233
++ + THV+AN T KV A +V+P W+ + +E ++I+
Sbjct: 565 VNRRTTHVIANPDRKTTKVKQAARYQHIKIVNPEWMFQCCTRWEHVDETPYSIE 618
>gi|297830092|ref|XP_002882928.1| hypothetical protein ARALYDRAFT_897808 [Arabidopsis lyrata subsp.
lyrata]
gi|297328768|gb|EFH59187.1| hypothetical protein ARALYDRAFT_897808 [Arabidopsis lyrata subsp.
lyrata]
Length = 295
Score = 84.0 bits (206), Expect = 5e-13, Method: Composition-based stats.
Identities = 73/254 (28%), Positives = 114/254 (44%), Gaps = 48/254 (18%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH----LFRFPH 968
K +KL LVL+L T +S F L KE+ + K + R
Sbjct: 53 KNSLEKKKLHLVLNLYGTFFDSQAF--------PCLSNKEKYLKGKVNSRNDLWQTRIRG 104
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ KLRP + FL A+KLF +H+ T+ YA + K+LDP + F R+IS
Sbjct: 105 HDVLIKLRPFVHEFLREANKLFILHVTTLCIPEYADFVLKLLDPHQLYFGNRIISL---- 160
Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW-PHNKLNLIVVERYTYFPCSRRQFG 1087
+ V K L+ VL E V+I+DD VW P N+ NL+ + Y+YF ++++
Sbjct: 161 -----SKHVIWEKTLDQVLVGEREVIILDDRYDVWSPENRSNLLQITTYSYFKATKKRNS 215
Query: 1088 LLG-------------------------PSLLEIDHDERSEDGTLASSLGVIERLHKIFF 1122
+ G S E DE +DG LA++L + ++H+ FF
Sbjct: 216 IDGGMFQNLFKYFLKIFSRDDDNLLSDSNSYSEERKDESVDDGALANALRFLFKIHQDFF 275
Query: 1123 SHQ-SLDDVDVRNI 1135
+H S +D+ R++
Sbjct: 276 NHHYSENDIYKRDV 289
>gi|321262398|ref|XP_003195918.1| carboxy-terminal domain (CTD) phosphatase; Fcp1p [Cryptococcus gattii
WM276]
gi|317462392|gb|ADV24131.1| Carboxy-terminal domain (CTD) phosphatase, putative; Fcp1p
[Cryptococcus gattii WM276]
Length = 952
Score = 83.6 bits (205), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 68/113 (60%), Gaps = 12/113 (10%)
Query: 970 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
G W TK RPG+ FL+ S+L+EMH+YTMG + YA + KV+DP G +F GR++SR +
Sbjct: 304 GRWYFTKPRPGLQKFLDEMSQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDES 363
Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
G F SK+L+ + + S VV+IDD VW + NL+ V Y +F
Sbjct: 364 GS-F-------SSKNLKRLFPTDTSMVVVIDDRSDVW-GDCPNLVKVVPYDFF 407
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 61/124 (49%), Gaps = 11/124 (8%)
Query: 1116 RLHKIFFSHQSLD-------DVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLW 1168
R +K F + + D DV I+ + ++L GC +VFS + P EA+P +W
Sbjct: 610 RFYKAFDALEGWDPKKALPMSCDVEFIIPEMKAEVLDGCSLVFSGMIPR-EADPSTTTIW 668
Query: 1169 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRFVVHPGWVEASALLYRRAN 1226
QTAE FGA+ T + + TH+V L T+K W G VV W S L+ R +
Sbjct: 669 QTAESFGALITPSLTSRTTHLVTALLNTEKT-WRAGKMEGVKVVWAQWFWDSVALWERQD 727
Query: 1227 EQDF 1230
E+ +
Sbjct: 728 EEKY 731
>gi|50552035|ref|XP_503492.1| YALI0E03278p [Yarrowia lipolytica]
gi|49649361|emb|CAG79071.1| YALI0E03278p [Yarrowia lipolytica CLIB122]
Length = 750
Score = 83.6 bits (205), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 96/193 (49%), Gaps = 34/193 (17%)
Query: 907 RRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR--KKEEQDREKPHRH 962
+RLEE K++ RKL LV+DLD T+++ VDP E + D K R
Sbjct: 155 QRLEEGSTKQLLKQRKLILVVDLDQTVIHVT----VDPTVGEWKKDPSNPNYDAVKDVR- 209
Query: 963 LFRFPHMGM---------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA 1007
+F M M + KLRP + FLE S+ +E+H+YTM + YA +A
Sbjct: 210 VFSLEEMTMVSYDGGKPVPQLCYYYVKLRPHLKEFLEVVSEKYELHIYTMATRAYAKAIA 269
Query: 1008 KVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVRVWPHN 1066
+++DP G F R++SR + G K L+ + +++++V IIDD VW +
Sbjct: 270 EIIDPDGRYFGDRILSRDESGS--------LTQKSLQRLFPVDTSMVAIIDDRGDVWKWS 321
Query: 1067 KLNLIVVERYTYF 1079
K NLI V Y +F
Sbjct: 322 K-NLIRVVPYDFF 333
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 4/96 (4%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV++IL++ + GC +FS + P+G H +W ++FGA + + QVTHV
Sbjct: 539 DVQDILSSLTYPVFGGCVFLFSGIIPIGVNIQHADISMW--VKKFGATVVEDVSKQVTHV 596
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRR 1224
+A S T KV A R V+ W+ ++R
Sbjct: 597 IAASGNTRKVRQASRYKRIKFVYISWIFDCISQWKR 632
>gi|328874143|gb|EGG22509.1| hypothetical protein DFA_04637 [Dictyostelium fasciculatum]
Length = 397
Score = 83.6 bits (205), Expect = 7e-13, Method: Composition-based stats.
Identities = 87/349 (24%), Positives = 148/349 (42%), Gaps = 50/349 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K+ L++++DH L +S K E + E + K + + K RP
Sbjct: 57 KMNLIINIDHILFHSTKNPESNETQGESVIKCVVDESNT------------YYVKFRPYA 104
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039
TFL+ LF + L+++ +K Y ++ ++LD +F ++ISR G+ ++V K
Sbjct: 105 ATFLQSLQPLFNLILFSLYSKSYVFKLIELLDLNNNIFK-QIISRESFGESLP-KQQVGK 162
Query: 1040 SKDLEGV---------LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
L + ++ I+DD +W + NLI ER+TYF + GL
Sbjct: 163 PYALWNTPSHFTKIFKISAHESLAILDDREDIWRQFRDNLISPERFTYFTKEDDENGLSS 222
Query: 1091 PSLLEIDHDERSEDGTLA---SSLGVIERLHKIFFS----------------HQSLDDVD 1131
E S+ T +S+ +E ++ S QS DD +
Sbjct: 223 TINYPFSIQELSKQFTFPRFNASIPYLEIFSRLLISIQNEYMSRLQETSLQQQQSADDEN 282
Query: 1132 -----VRNILAAEQRKILAGCRIVFSRVFP--VGEANPHLHPLWQTAEQFGAVCTKHIDD 1184
+ ++ ++ +L C IVFS +FP + H + Q AE FGA + I
Sbjct: 283 NNFNQSKAVVKELRQCVLMDCNIVFSGIFPKQIDATKLHQTRIVQMAESFGAQVHQDITP 342
Query: 1185 QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
TH++ GT KV A+ G+ VVH W+ S + + NE +++I
Sbjct: 343 TTTHLIFIKEGTSKVIQAVKQGQVKVVHFSWLRDSLYNWEKMNESNYSI 391
>gi|300176006|emb|CBK22223.2| unnamed protein product [Blastocystis hominis]
Length = 680
Score = 83.2 bits (204), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 142/314 (45%), Gaps = 48/314 (15%)
Query: 909 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
L QK++ +AR+L LV DLD+TL+ + DP R P+ H +F
Sbjct: 9 LVNQKRLLAARRLGLVFDLDNTLMEQSD----DP------RCSVAPSFGIPNIHFIQFKR 58
Query: 969 MGMWTK----LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
+K LRP + + L SK +E+ +YT G + YA + + +DPK LF RVI+R
Sbjct: 59 NNQLSKHTIILRPEVQSILTELSKYYELSIYTNGVRTYAQAIIESIDPKHQLFGSRVIAR 118
Query: 1025 GDDGDPFDG---DERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
D D + + +P SKD+ VL G+E V++DDSV VW + ++ + ++ ++
Sbjct: 119 DDVPDNSETNFFNNFLPASKDISFVLPGLERLGVVVDDSVEVWKDRAI-VLHIPKFCFWR 177
Query: 1081 CSRRQFGLLGP---SLLE-----IDHDERSEDGTLASSLGVIE----RLHKIFFSHQSLD 1128
+ + G ++ E I++D S+ +++ ++H+ F++
Sbjct: 178 SFLKCYETGGKKVETMFEAVGWIINND----------SMNIVKDTLVQIHQQFYARAQQR 227
Query: 1129 D-----VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID 1183
D V ++ + + I F F + +P L+ +Q G +
Sbjct: 228 DTGVPSTSVGEVIGQLRASLFRNTLIYFDETF--SKRDPKSQYLFSLVKQMGGKIAEAYT 285
Query: 1184 DQVTHVVANSLGTD 1197
VTHVV+ + T+
Sbjct: 286 PDVTHVVSAGVETE 299
>gi|302497759|ref|XP_003010879.1| hypothetical protein ARB_02918 [Arthroderma benhamiae CBS 112371]
gi|291174424|gb|EFE30239.1| hypothetical protein ARB_02918 [Arthroderma benhamiae CBS 112371]
Length = 1048
Score = 83.2 bits (204), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ F+ P M G W
Sbjct: 347 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 397
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL+ S L+E+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 398 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 456
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW ++ NLI V Y +F
Sbjct: 457 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 498
>gi|150866706|ref|XP_001386384.2| hypothetical protein PICST_63097 [Scheffersomyces stipitis CBS 6054]
gi|149387962|gb|ABN68355.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 790
Score = 83.2 bits (204), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 95/202 (47%), Gaps = 38/202 (18%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
+ A I++ T RL E+KK L LV+DLD T++++ VDP
Sbjct: 148 EAAKIEQSTTDRLNEEKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 197
Query: 943 --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
+ D EE+ P R W K+RPG+ FLE L+EMH+YTM
Sbjct: 198 PAIKDVKTFCLEEEAIVPPGWTGPRLAPTKCWYYVKVRPGLSDFLEEIVNLYEMHIYTMA 257
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 1057
+ YA +AK++DP G F R++SR + G K+L+ + + +S VVIID
Sbjct: 258 TRNYALAIAKIIDPTGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 309
Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
D +W NLI V Y +F
Sbjct: 310 DRGDIWQWES-NLIKVVPYDFF 330
>gi|260949511|ref|XP_002619052.1| hypothetical protein CLUG_00211 [Clavispora lusitaniae ATCC 42720]
gi|238846624|gb|EEQ36088.1| hypothetical protein CLUG_00211 [Clavispora lusitaniae ATCC 42720]
Length = 776
Score = 83.2 bits (204), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 94/202 (46%), Gaps = 38/202 (18%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
+ I++ T RL KK L LV+DLD T++++ VDP
Sbjct: 145 EATKIEQSSTERLAADKK------LILVVDLDQTVIHAT----VDPTVGEWQRDPQNPNY 194
Query: 943 --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
V D L EE+ P R P W KLRPG+ FL SKL+E+H+YTM
Sbjct: 195 PFVKDVQLFSLEEEPIVPPGWVGPRPPPTKCWYYVKLRPGLKEFLAEVSKLYELHIYTMA 254
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 1057
+ YA +A ++DP G F R++SR + G K+L + + +S VVIID
Sbjct: 255 TRNYALAIASIIDPDGKYFGDRILSRDESGS--------LTHKNLRRLFPVDQSMVVIID 306
Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
D VW + NLI V Y +F
Sbjct: 307 DRGDVW-QWEANLIKVVPYDFF 327
>gi|443696103|gb|ELT96883.1| hypothetical protein CAPTEDRAFT_23527, partial [Capitella teleta]
Length = 562
Score = 83.2 bits (204), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 96/174 (55%), Gaps = 27/174 (15%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
+++++ +KL L++DLD TL+++ +D++ ++ H F+ H
Sbjct: 132 DEQRLIRDKKLVLLVDLDQTLIHTT--------NDKVPANLKDV-------HHFQLHHGR 176
Query: 971 --MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
+W TK RPG FLER SKL+E+H+ T G ++YA +AK+LDP G F+ R++SR +
Sbjct: 177 NLLWYHTKFRPGTEKFLERISKLYELHICTFGVRMYAHTIAKLLDPDGKYFSHRILSRDE 236
Query: 1027 DGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+P K+ +L+ + +S V IIDD VW + +L+ V+ Y +F
Sbjct: 237 CFNP------TSKTGNLKALFPCGDSMVCIIDDREDVWRFSP-SLVHVKPYLFF 283
Score = 47.4 bits (111), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 66/141 (46%), Gaps = 15/141 (10%)
Query: 1104 DGTLASSLGVIERLHKIFFS--HQSLDDV-------DVRNILAAEQRKILAGCRIVFSRV 1154
D L ++ R+H +F +Q ++ D++ +L ++++L G I+FS +
Sbjct: 423 DNYLVYLEDILSRIHTAYFQFYNQMIEKKAEDKQLPDIKTVLPYVKKRVLRGVNILFSGM 482
Query: 1155 FPVGEANPHLHPLWQTAEQFGAVCTKHI----DDQVTHVVANSLGTDKVNWALST-GRFV 1209
P+ + N + A+ GA + +D+ THVVA GT K+N A G +
Sbjct: 483 IPINK-NYEKSRAYIVAKSLGANIQTSLETEGEDRTTHVVAARDGTQKINDARKMKGVHI 541
Query: 1210 VHPGWVEASALLYRRANEQDF 1230
V+ W+ A + R +EQ +
Sbjct: 542 VNADWLWTCAYRWERVDEQLY 562
>gi|326477486|gb|EGE01496.1| RNA Polymerase II CTD phosphatase Fcp1 [Trichophyton equinum CBS
127.97]
Length = 866
Score = 83.2 bits (204), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ F+ P M G W
Sbjct: 163 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL+ S L+E+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 214 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 272
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW ++ NLI V Y +F
Sbjct: 273 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 314
Score = 45.8 bits (107), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 51/104 (49%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV+N++ +R L +VFS V P+G + LW A+ FGA T I+ + TH+
Sbjct: 528 DVKNLMPEIKRVALGSVILVFSGVLPLGTDTQNADISLW--AKSFGATITSKINSRTTHL 585
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A + +V W+ + ++ +E+ + +
Sbjct: 586 VAGRNRTAKVREATRYTKIKIVTVQWLLDCMIQWKHLDEEPYLV 629
>gi|405966173|gb|EKC31485.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Crassostrea gigas]
Length = 837
Score = 83.2 bits (204), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 92/173 (53%), Gaps = 25/173 (14%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
++ ++ RKL L++DLD TL+++ + + P ++ + F+ H
Sbjct: 138 DEDRLLRTRKLVLLVDLDQTLIHTTNDN-IPPNLKDV--------------YHFQLSHGN 182
Query: 971 M--W--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
M W T++RP FLE SKL+E+H+ T G+++YA +AK LDP G F+ R++SR
Sbjct: 183 MMPWYHTRIRPRTEKFLENVSKLYELHICTFGSRMYAHIIAKFLDPDGKYFSHRILSR-- 240
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ + ++ K L +S V IIDD VW + NLI V+ Y +F
Sbjct: 241 -DECFNQNSKMANLKAL--FPCGDSMVCIIDDREDVWNFSP-NLIHVKPYRFF 289
>gi|302657133|ref|XP_003020296.1| hypothetical protein TRV_05607 [Trichophyton verrucosum HKI 0517]
gi|291184115|gb|EFE39678.1| hypothetical protein TRV_05607 [Trichophyton verrucosum HKI 0517]
Length = 865
Score = 83.2 bits (204), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ F+ P M G W
Sbjct: 163 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL+ S L+E+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 214 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 272
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW ++ NLI V Y +F
Sbjct: 273 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 314
>gi|326475449|gb|EGD99458.1| RNA Polymerase II CTD phosphatase [Trichophyton tonsurans CBS 112818]
Length = 866
Score = 83.2 bits (204), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ F+ P M G W
Sbjct: 163 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL+ S L+E+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 214 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 272
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW ++ NLI V Y +F
Sbjct: 273 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 314
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 51/104 (49%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV+N++ +R L +VFS V P+G + LW A+ FGA T I+ + TH+
Sbjct: 528 DVKNLMPEIKRVALGSVILVFSGVLPLGTDTQNADISLW--AKSFGATITSKINSRTTHL 585
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A + +V W+ + ++ +E+ + +
Sbjct: 586 VAGRNRTAKVREATRYTKIKIVTVQWLLDCMIQWKHLDEEPYLV 629
>gi|327296037|ref|XP_003232713.1| RNA Polymerase II CTD phosphatase [Trichophyton rubrum CBS 118892]
gi|326465024|gb|EGD90477.1| RNA Polymerase II CTD phosphatase [Trichophyton rubrum CBS 118892]
Length = 836
Score = 83.2 bits (204), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ F+ P M G W
Sbjct: 134 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 184
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL+ S L+E+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 185 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 243
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW ++ NLI V Y +F
Sbjct: 244 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 285
>gi|344233336|gb|EGV65209.1| hypothetical protein CANTEDRAFT_104476 [Candida tenuis ATCC 10573]
gi|344233337|gb|EGV65210.1| hypothetical protein CANTEDRAFT_104476 [Candida tenuis ATCC 10573]
Length = 788
Score = 83.2 bits (204), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/203 (32%), Positives = 98/203 (48%), Gaps = 38/203 (18%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP------------- 942
++ A I++ T RL +Q RKL LV+DLD T++++ VDP
Sbjct: 147 EEAAKIEQNSTTRLTQQ------RKLILVVDLDQTVIHAT----VDPTVGEWQSDPSNPN 196
Query: 943 ---VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTM 997
V D EE+ P+ + W KLRPG+ FL ++++EMH+YTM
Sbjct: 197 YRAVKDVQSFCLEEEPITPPNWSGPKLSPTKCWYYVKLRPGLEEFLREMAEIYEMHIYTM 256
Query: 998 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVII 1056
+ YA +AK++DP+G F R++SR + G K+L+ + + +S V II
Sbjct: 257 ATRNYALAIAKIIDPEGEYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVAII 308
Query: 1057 DDSVRVWPHNKLNLIVVERYTYF 1079
DD VW NLI V Y +F
Sbjct: 309 DDRGDVWQWED-NLIKVVPYDFF 330
>gi|348665920|gb|EGZ05748.1| hypothetical protein PHYSODRAFT_566275 [Phytophthora sojae]
Length = 684
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 81/170 (47%), Gaps = 42/170 (24%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
++ A+KL LVLDLDHTLL++ + VD V EI P GM
Sbjct: 266 RRQLGAKKLSLVLDLDHTLLHAVR---VDDVVGEI-------------------PKSGML 303
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
S L+++ +YT G +LYA ++ K++DP F R+++R D D
Sbjct: 304 --------------SALYDLFIYTHGTRLYAEQIVKIIDPDESYFKNRIVARTDTPDMLH 349
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1082
KS L +S ++++DD + VW N+ N+ ++E Y YF C+
Sbjct: 350 ------KSLKLLFPSCDDSMILVLDDRIDVWKENEGNVFLIEPYHYFKCT 393
>gi|194757423|ref|XP_001960964.1| GF11242 [Drosophila ananassae]
gi|190622262|gb|EDV37786.1| GF11242 [Drosophila ananassae]
Length = 854
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 92/173 (53%), Gaps = 24/173 (13%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++ +++ + RKL L++DLD T++++ D V + I K H +
Sbjct: 191 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPENI----------KGIYHFQLYGPQ 236
Query: 970 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
W T+LRPG FLE S+L+E+H+ T G + YA +A++LDP G F+ R++SR
Sbjct: 237 SPWYHTRLRPGTAEFLESMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 293
Query: 1028 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 294 DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 342
>gi|190346120|gb|EDK38128.2| hypothetical protein PGUG_02226 [Meyerozyma guilliermondii ATCC 6260]
Length = 732
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 92/191 (48%), Gaps = 42/191 (21%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------- 965
+++ S RKL LV+DLD T++++ VDP E + D P+ +
Sbjct: 110 ERLTSERKLILVVDLDQTVIHAT----VDPTVGEW-----QSDPSNPNYRAVKDVRSFCL 160
Query: 966 -----------FPHM-----GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 1009
P M + K+RPG+ FL+R S+L+EMH+YTM + YA +A +
Sbjct: 161 EEDPIAPPGWSGPKMTPTKCWYYVKVRPGLEDFLKRVSQLYEMHVYTMATRNYALAIAHI 220
Query: 1010 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKL 1068
+DP G F R++SR + G K+L + + +S VVIIDD VW K
Sbjct: 221 IDPDGRYFGDRILSRDESGS--------LTHKNLRRLFPVDQSMVVIIDDRGDVWQWEK- 271
Query: 1069 NLIVVERYTYF 1079
NLI V Y +F
Sbjct: 272 NLIKVVPYEFF 282
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 66/145 (45%), Gaps = 15/145 (10%)
Query: 1103 EDGTLASSLGVIERLHKIFF-------SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVF 1155
+D L S + R+H ++ +SL + D+ I+ + + K L G ++FS +
Sbjct: 462 DDNELESLNDALVRVHDEYYRIWDSYNEDKSLTNPDLTTIIPSLKSKCLEGIVVLFSGIL 521
Query: 1156 PVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVAN--SLG---TDKVNWALST--GRF 1208
G ++P + +QFG + +VTHV+ S G T KV A T
Sbjct: 522 RWG-SDPQKADIVIWCQQFGVKVVNEVYPEVTHVICREPSAGGGLTFKVRVAKKTLPNVH 580
Query: 1209 VVHPGWVEASALLYRRANEQDFAIK 1233
+V P W+ A + R +E+ +AI+
Sbjct: 581 IVTPDWLFACMSKWTRVDEKPYAIE 605
>gi|357601986|gb|EHJ63229.1| putative RNA polymerase II subunit A C-terminal domain phosphatase
[Danaus plexippus]
Length = 683
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 92/175 (52%), Gaps = 26/175 (14%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF-RFP- 967
E+ ++ RKL L++DLD TL+++ + P ++L H F R P
Sbjct: 134 EDADRLLKDRKLVLLVDLDQTLVHTTN-DNIPPNIKDVL-------------HFFLRGPG 179
Query: 968 HMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
+ G W T+LRP FLE A+K +E+H+ T G + YA + ++LDP+ F+ R++SR
Sbjct: 180 NQGRWCHTRLRPKTHEFLESAAKNYELHVCTFGARQYAHAITELLDPQKKFFSHRILSR- 238
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ FD KS +L+ + ++ V IIDD VW H NLI V Y++F
Sbjct: 239 --DECFDAR---TKSANLKALFPCGDNMVCIIDDREDVWRHAS-NLIQVRPYSFF 287
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 65/134 (48%), Gaps = 14/134 (10%)
Query: 1113 VIERLHKIFFS-HQSLDDV-------DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHL 1164
+++R+H F+ + +++ D++ I+ + ++LAG +VFS + P +
Sbjct: 395 ILKRIHNHFYDIYDKMENSENEKSIPDLKYIIPEVKSQVLAGSSLVFSGLVPTHQ-RLET 453
Query: 1165 HPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR-----FVVHPGWVEASA 1219
+Q A+ GA T+ D+ TH+VA GT KVN + G VV P W+ A
Sbjct: 454 SRAYQVAKTLGAEVTQDFTDKTTHLVAMRAGTAKVNASKKLGEDKSKIHVVTPEWLWTCA 513
Query: 1220 LLYRRANEQDFAIK 1233
+ R E+ + ++
Sbjct: 514 ERWERVEEKLYPLQ 527
>gi|315051428|ref|XP_003175088.1| RNA polymerase II subunit A domain phosphatase [Arthroderma gypseum
CBS 118893]
gi|311340403|gb|EFQ99605.1| RNA polymerase II subunit A domain phosphatase [Arthroderma gypseum
CBS 118893]
Length = 867
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ F+ P M G W
Sbjct: 163 VVDLDQTIIHAT----VDPTVGEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL+ S L+E+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 214 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDRKIFGDRILSRDESGS- 272
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW ++ NLI V Y +F
Sbjct: 273 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVTPYDFF 314
>gi|297819964|ref|XP_002877865.1| hypothetical protein ARALYDRAFT_906617 [Arabidopsis lyrata subsp.
lyrata]
gi|297323703|gb|EFH54124.1| hypothetical protein ARALYDRAFT_906617 [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 100/207 (48%), Gaps = 41/207 (19%)
Query: 919 RKLCLVLDLDHTLLN---SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT-- 973
RKL LVL L+HTL++ +K E+D H +L + + R+ LFR + ++
Sbjct: 96 RKLHLVLSLEHTLIDLISVSKLSEIDRYH--LLEEADSGSRDD----LFRLANESFYSSD 149
Query: 974 ---KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
K RP + FL A K+F MH+YT A ++ K+LDP + F R+I+ D
Sbjct: 150 ALVKFRPFVREFLREAEKIFTMHVYTNYGPGLAKKVVKLLDPHMIYFGNRIITSKDS--- 206
Query: 1031 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
+GD K LE VL V+I+D R+W N+I + +Y YF
Sbjct: 207 -NGD-----LKSLELVLAEPRGVLIVDYDHRLWKSPGHNVIFMSKYVYFK---------- 250
Query: 1091 PSLLEIDHDERSEDGTLASSLGVIERL 1117
EI +EDG LA +L +++++
Sbjct: 251 ----EIS----NEDGVLAKTLNLLKKI 269
>gi|384488044|gb|EIE80224.1| hypothetical protein RO3G_04929 [Rhizopus delemar RA 99-880]
Length = 433
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 77/133 (57%), Gaps = 17/133 (12%)
Query: 900 AIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE 957
+ + RLE++ K++ +RKL L+LDLD T+++++ DP I K E+ R+
Sbjct: 9 TVSRSEAERLEKENAKRLLESRKLSLILDLDQTIVHAS----CDP---RISHWKNEEIRQ 61
Query: 958 KPHRHLFRFPH--MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
F P + KLRPG+ FL+ L+++H+YTMG K YA +A+ +DP+G
Sbjct: 62 ------FTLPKSPTMYYIKLRPGLREFLKEIENLYDLHIYTMGTKDYAKAVAREMDPEGS 115
Query: 1016 LFAGRVISRGDDG 1028
LF R++SR ++G
Sbjct: 116 LFKERILSRDENG 128
>gi|444319376|ref|XP_004180345.1| hypothetical protein TBLA_0D03260 [Tetrapisispora blattae CBS 6284]
gi|387513387|emb|CCH60826.1| hypothetical protein TBLA_0D03260 [Tetrapisispora blattae CBS 6284]
Length = 768
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 89/185 (48%), Gaps = 42/185 (22%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------F 966
+KL LV+DLD T+++ VDP E + D + P+ +
Sbjct: 181 KKLILVVDLDQTVIHCG----VDPTIGEW-----KNDPKNPNYETLKDVRSFSLDEEPIL 231
Query: 967 P--HMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
P +MG + K+RPG+ F + + L+EMH+YTM + YA E+AK++DP G
Sbjct: 232 PPSYMGPRPPVRKCWYYVKVRPGLKEFFAKIAPLYEMHIYTMATRAYALEIAKIIDPDGS 291
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVE 1074
LF R++SR ++G K LE + +S V+IIDD VW NLI V
Sbjct: 292 LFGSRILSRDENGS--------LTQKSLERLFPTDQSMVIIIDDRGDVWNWCN-NLIKVI 342
Query: 1075 RYTYF 1079
Y +F
Sbjct: 343 PYNFF 347
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 60/136 (44%), Gaps = 8/136 (5%)
Query: 1103 EDGTLASSLGVIERLHKIFFSHQSLD---DVDVRNILAAEQRKILAGCRIVFSRVFPVG- 1158
+D L ++ +H F+ D V ++ ++ + + +G VFS + P+G
Sbjct: 485 DDDELFYLKDILTEVHDTFYEQLDEDKNESVSIQTLMPRLKFSVFSGYNFVFSGLIPLGT 544
Query: 1159 EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNW--ALSTGRFVVHPGWVE 1216
+ LW FGA T ID+ THV+ + GT K A + VVHP W+
Sbjct: 545 DIRRADIVLW--TNMFGANSTADIDENTTHVITKTAGTYKARLAKAFNPEIKVVHPDWIF 602
Query: 1217 ASALLYRRANEQDFAI 1232
+ ++R E+ + +
Sbjct: 603 ECLVSWKRMPEKPYEL 618
>gi|297792863|ref|XP_002864316.1| hypothetical protein ARALYDRAFT_918545 [Arabidopsis lyrata subsp.
lyrata]
gi|297310151|gb|EFH40575.1| hypothetical protein ARALYDRAFT_918545 [Arabidopsis lyrata subsp.
lyrata]
Length = 142
Score = 82.4 bits (202), Expect = 1e-12, Method: Composition-based stats.
Identities = 52/150 (34%), Positives = 78/150 (52%), Gaps = 15/150 (10%)
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 1056
MG++ YA + K++DP+ V F RVI+R + P K L+ VL E VVI+
Sbjct: 1 MGDRDYAKNVLKLIDPEKVYFGDRVITRNES----------PYIKTLDLVLADECGVVIV 50
Query: 1057 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIER 1116
DD+ +VWP +K NL+ + +Y YF R+ S E DE DG+L + L VI+
Sbjct: 51 DDTAQVWPDHKRNLLEITKYNYFSDKTRRDVKYSKSYAEEKRDEGRNDGSLGNVLKVIKE 110
Query: 1117 LHKIFFS-----HQSLDDVDVRNILAAEQR 1141
+++ FFS +D DVR + +R
Sbjct: 111 VYERFFSGGVEKELDIDSKDVRLLFHDARR 140
>gi|325087549|gb|EGC40859.1| RNA polymerase II C-terminal domain phosphatase component
[Ajellomyces capsulatus H88]
Length = 885
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR----------FPHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ + P M G W
Sbjct: 134 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCW 184
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL S LFE+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 185 YYIKLRPGLEEFLRNISTLFELHIYTMGTRAYAQHIASIVDPDRKIFGDRILSRDESGS- 243
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW NLI V Y +F
Sbjct: 244 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVVPYDFF 285
Score = 45.1 bits (105), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 52/104 (50%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D++ I+ +R +L +VFS V P+G + LW A+ FGA+ ++ I + TH+
Sbjct: 482 DIKVIMPQIKRPVLQSVDLVFSGVLPLGTDTQNADISLW--AKSFGAIISQKISSKTTHL 539
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ S ++R +E+ + +
Sbjct: 540 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRLDEEPYLV 583
>gi|327358124|gb|EGE86981.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis ATCC
18188]
Length = 839
Score = 82.4 bits (202), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR----------FPHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ + P M G W
Sbjct: 62 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMRGCW 112
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL S LFE+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 113 YYIKLRPGLEEFLREISTLFELHIYTMGTRAYAQHIANIVDPDRKIFGDRILSRDESGS- 171
Query: 1031 FDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW NLI V Y +F
Sbjct: 172 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVLPYDFF 213
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D++ I+ +R +L +VFS V P+G + LW A+ FGA + I + TH+
Sbjct: 410 DIKVIMPQIKRPVLNAVVLVFSGVLPLGTDTQNADISLW--AKSFGAAISTKISSRTTHL 467
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ S ++R +E+ + +
Sbjct: 468 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRIDEEPYLV 511
>gi|225556539|gb|EEH04827.1| RNA polymerase II C-terminal domain phosphatase component
[Ajellomyces capsulatus G186AR]
Length = 871
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR----------FPHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ + P M G W
Sbjct: 134 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCW 184
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL S LFE+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 185 YYIKLRPGLEEFLRNISTLFELHIYTMGTRAYAQHIASIVDPDRKIFGDRILSRDESGS- 243
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW NLI V Y +F
Sbjct: 244 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVVPYDFF 285
Score = 44.3 bits (103), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 51/104 (49%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D++ I+ +R +L +VFS V P+G + LW A+ FGA+ + I + TH+
Sbjct: 482 DIKVIMPQIKRPVLQSVDLVFSGVLPLGTDTQNADISLW--AKSFGAIISHKISSKTTHL 539
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ S ++R +E+ + +
Sbjct: 540 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRLDEEPYLV 583
>gi|294658166|ref|XP_460501.2| DEHA2F03102p [Debaryomyces hansenii CBS767]
gi|202952923|emb|CAG88814.2| DEHA2F03102p [Debaryomyces hansenii CBS767]
Length = 795
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 92/198 (46%), Gaps = 38/198 (19%)
Query: 901 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK--------- 951
I+ T RL +KK L LV+DLD T++++ VDP E
Sbjct: 152 IEHNTTDRLSREKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNYPAVK 201
Query: 952 -------EEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
EE P + P W KLRPG+ FL AS L+EMH+YTM + Y
Sbjct: 202 NVRSFCLEEDPIAPPGWTGPKLPPSKCWYYVKLRPGLEEFLRSASDLYEMHIYTMATRNY 261
Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVR 1061
A +AK++DP+G F R++SR + G K+L+ + + +S VVIIDD
Sbjct: 262 ALAIAKIIDPEGEYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIIDDRGD 313
Query: 1062 VWPHNKLNLIVVERYTYF 1079
VW NLI V Y +F
Sbjct: 314 VWQWEN-NLIKVVPYDFF 330
>gi|254568460|ref|XP_002491340.1| hypothetical protein [Komagataella pastoris GS115]
gi|238031137|emb|CAY69060.1| hypothetical protein PAS_chr2-1_0845 [Komagataella pastoris GS115]
gi|328352145|emb|CCA38544.1| hypothetical protein PP7435_Chr2-0862 [Komagataella pastoris CBS
7435]
Length = 733
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 98/210 (46%), Gaps = 46/210 (21%)
Query: 913 KKMFSARKLCLVLDLDHTLL---------------NSAKFHEVDPVHDEILRKK----EE 953
K++ +KL LV+DLD T++ N+A + V V L+++ E
Sbjct: 165 KRLLKEKKLSLVVDLDQTVIHATVDPTVGEWMKDPNNANYPAVKDVRSFSLKEEVILPEN 224
Query: 954 QDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
+KP P W KLRP + FLE S+ +E+H+YTM + YA E+AK++D
Sbjct: 225 YVGQKP-------PATVCWYYVKLRPHLREFLEHVSERYELHIYTMATRQYAKEIAKIID 277
Query: 1012 PKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNL 1070
P F R++SR + G K L+ + ++ S VV+IDD VW + NL
Sbjct: 278 PDEKYFGDRILSRDESGS--------LTQKSLQRLFPVDTSMVVVIDDRGDVWNWSS-NL 328
Query: 1071 IVVERYTYF--------PCSRRQFGLLGPS 1092
I V Y +F RQ LLGPS
Sbjct: 329 IKVVPYDFFVGIGDINSSFLPRQHALLGPS 358
Score = 48.1 bits (113), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 42/175 (24%), Positives = 77/175 (44%), Gaps = 12/175 (6%)
Query: 1069 NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERS----EDGTLASSLGVIERLHKIFFSH 1124
NL + ++ + P ++ Q L E DH+ + +D L + + R+H F+
Sbjct: 447 NLEIEQQQSERPLAKLQENLEKIVQTESDHESANLLFDDDNELETLEQALIRIHNEFYFE 506
Query: 1125 QSLDDV---DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTK 1180
V DV++IL + ++ + + S + P+G + N +W A FGA
Sbjct: 507 YDDSKVGNPDVKDILNSMKQLVFKEYTFLLSGILPLGTKLNSADIVIW--ARSFGATVVA 564
Query: 1181 HIDDQVTHVVANSLGTDKVNWA--LSTGRFVVHPGWVEASALLYRRANEQDFAIK 1233
D VTHV+ + GT KV A L +V P W+ + + +E ++ ++
Sbjct: 565 DYDKSVTHVITRNTGTFKVKLAKTLDPNVKIVDPNWLFKCISFWDKVDEDEYLLE 619
>gi|154284394|ref|XP_001542992.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150406633|gb|EDN02174.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 654
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR----------FPHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ + P M G W
Sbjct: 89 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCW 139
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL S LFE+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 140 YYIKLRPGLEEFLRNISTLFELHIYTMGTRAYAQHIASIVDPDRKIFGDRILSRDESGS- 198
Query: 1031 FDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW NLI V Y +F
Sbjct: 199 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVVPYDFF 240
Score = 44.3 bits (103), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 52/104 (50%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D++ I+ +R +L +VFS V P+G + LW A+ FGA+ ++ I + TH+
Sbjct: 437 DIKVIMPQIKRPVLQSVDLVFSGVLPLGTDTQNADISLW--AKSFGAIISQKISSKTTHL 494
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ S ++R +E+ + +
Sbjct: 495 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRLDEEPYLV 538
>gi|58271496|ref|XP_572904.1| protein phosphatase [Cryptococcus neoformans var. neoformans JEC21]
gi|134115316|ref|XP_773956.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256584|gb|EAL19309.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229163|gb|AAW45597.1| protein phosphatase, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 955
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 67/113 (59%), Gaps = 12/113 (10%)
Query: 970 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
G W TK RPG+ FL+ +L+EMH+YTMG + YA + KV+DP G +F GR++SR +
Sbjct: 303 GRWYFTKPRPGLQRFLDEMCQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDES 362
Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
G F SK+L+ + + S VV+IDD VW + NL+ V Y +F
Sbjct: 363 GS-F-------SSKNLKRLFPTDTSMVVVIDDRSDVW-GDCPNLVKVVPYDFF 406
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 11/124 (8%)
Query: 1116 RLHKIFFSHQSLD-------DVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLW 1168
R +K F + D DV I+ + ++L GC +VFS + P EANP +W
Sbjct: 616 RFYKAFDAIDGWDPKKALPMSCDVEFIIPEMKAEVLDGCNLVFSGMIPR-EANPSTTAIW 674
Query: 1169 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRFVVHPGWVEASALLYRRAN 1226
QTAE FGA+ T + + TH+V L T+K W G VV W S L+ R +
Sbjct: 675 QTAESFGALITPSLTPRTTHLVTALLNTEKT-WRAGKMEGVKVVWAEWFWDSVALWERQD 733
Query: 1227 EQDF 1230
E+ +
Sbjct: 734 EEKY 737
>gi|296810642|ref|XP_002845659.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Arthroderma otae CBS 113480]
gi|238843047|gb|EEQ32709.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Arthroderma otae CBS 113480]
Length = 832
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 90/170 (52%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ F+ P M G W
Sbjct: 134 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 184
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL+ S L+E+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 185 YYIKLRPGLEEFLKVVSSLYELHIYTMGTRAYAQNVANIVDPDRKIFGDRILSRDESGS- 243
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L + +++ VVIIDD VW ++ NLI V Y +F
Sbjct: 244 -------LTAKNLHRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVTPYDFF 285
Score = 42.4 bits (98), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV+ ++ +R L +VFS V P+G + LW A+ FGA T I+ + TH+
Sbjct: 491 DVKVLMPEIKRAALESVVLVFSGVLPLGTDTQNADISLW--AKSFGATITSKINSRTTHL 548
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A + +V W+ ++R +E+ + +
Sbjct: 549 VAGRNRTAKVREATRYPKVKIVTVQWLVDCMTQWKRLDEEPYLV 592
>gi|239606973|gb|EEQ83960.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
ER-3]
Length = 901
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR----------FPHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ + P M G W
Sbjct: 134 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMRGCW 184
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL S LFE+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 185 YYIKLRPGLEEFLREISTLFELHIYTMGTRAYAQHIANIVDPDRKIFGDRILSRDESGS- 243
Query: 1031 FDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW NLI V Y +F
Sbjct: 244 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVLPYDFF 285
Score = 42.0 bits (97), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D++ I+ +R +L +VFS V P+G + LW A+ FGA + I + TH+
Sbjct: 482 DIKVIMPQIKRPVLNAVVLVFSGVLPLGTDTQNADISLW--AKSFGAAISTKISSRTTHL 539
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ S ++R +E+ + +
Sbjct: 540 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRIDEEPYLV 583
>gi|146421209|ref|XP_001486555.1| hypothetical protein PGUG_02226 [Meyerozyma guilliermondii ATCC 6260]
Length = 732
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 95/184 (51%), Gaps = 28/184 (15%)
Query: 913 KKMFSARKLCLVLDLDHTLLN-----SAKFHEVDPVHDEILRKKE------EQDREKPHR 961
+++ S RKL LV+DLD T+++ + ++DP++ K+ E+D P
Sbjct: 110 ERLTSERKLILVVDLDQTVIHATVDPTVGEWQLDPLNPNYRAVKDVRSFCLEEDPIAPPG 169
Query: 962 HLFRFPHM-----GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+ P M + K+RPG+ FL+R S+L+EMH+YTM + YA +A ++DP G
Sbjct: 170 --WSGPKMTPTKCWYYVKVRPGLEDFLKRVSQLYEMHVYTMATRNYALAIAHIIDPDGRY 227
Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVER 1075
F R++SR + G K+L + ++ VVIIDD VW K NLI V
Sbjct: 228 FGDRILSRDESGS--------LTHKNLRRLFPVDQLMVVIIDDRGDVWQWEK-NLIKVVP 278
Query: 1076 YTYF 1079
Y +F
Sbjct: 279 YEFF 282
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 66/145 (45%), Gaps = 15/145 (10%)
Query: 1103 EDGTLASSLGVIERLHKIFF-------SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVF 1155
+D L S + R+H ++ +SL + D+ I+ + + K L G ++FS +
Sbjct: 462 DDNELESLNDALVRVHDEYYRIWDSYNEDKSLTNPDLTTIIPSLKSKCLEGIVVLFSGIL 521
Query: 1156 PVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVAN--SLG---TDKVNWALST--GRF 1208
G ++P + +QFG + +VTHV+ S G T KV A T
Sbjct: 522 RWG-SDPQKADIVIWCQQFGVKVVNEVYPEVTHVICREPSAGGGLTFKVRVAKKTLPNVH 580
Query: 1209 VVHPGWVEASALLYRRANEQDFAIK 1233
+V P W+ A + R +E+ +AI+
Sbjct: 581 IVTPDWLFACMSKWTRVDEKPYAIE 605
>gi|261194090|ref|XP_002623450.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
SLH14081]
gi|239588464|gb|EEQ71107.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
SLH14081]
Length = 901
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 32/170 (18%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR----------FPHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ + P M G W
Sbjct: 134 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMRGCW 184
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
KLRPG+ FL S LFE+H+YTMG + YA +A ++DP +F R++SR + G
Sbjct: 185 YYIKLRPGLEEFLREISTLFELHIYTMGTRAYAQHIANIVDPDRKIFGDRILSRDESGS- 243
Query: 1031 FDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K+L+ + +++ VVIIDD VW NLI V Y +F
Sbjct: 244 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVLPYDFF 285
Score = 42.4 bits (98), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
D++ I+ +R +L +VFS V P+G + LW A+ FGA + I + TH+
Sbjct: 482 DIKVIMPQIKRPVLNAVVLVFSGVLPLGTDTQNADISLW--AKSFGAAISTKISSRTTHL 539
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A +V W+ S ++R +E+ + +
Sbjct: 540 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRVDEEPYLV 583
>gi|392578708|gb|EIW71836.1| hypothetical protein TREMEDRAFT_67978 [Tremella mesenterica DSM 1558]
Length = 944
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 65/109 (59%), Gaps = 10/109 (9%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+TK RPG+ FLE +KL+EMH+YTMG + YA + ++DP+G F GR++SR D
Sbjct: 354 FTKPRPGLAKFLEEMNKLYEMHVYTMGTRTYAEAIVGIVDPEGKYFGGRILSRDDS---- 409
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
R +K+L+ + + S VV+IDD VW + NL+ V Y +F
Sbjct: 410 ----RNFTTKNLKRLFPTDTSMVVVIDDRADVW-GDCPNLVKVRPYDFF 453
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 71/155 (45%), Gaps = 21/155 (13%)
Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS--HQSLDDVDV------RNILAAE 1139
LL P+ E+D E+ ++ R+H+ F++ + L DVD+ I+
Sbjct: 621 LLDPNDDELDRVEK-----------LLRRIHRKFYNAYDRRLSDVDIPLACDCELIIPEI 669
Query: 1140 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 1199
+ ++ GC VFS + + P WQ AE FGA C + + TH + + GT+KV
Sbjct: 670 KAQVFDGCYFVFSGII-ARDVEPETTSHWQWAEMFGARCQPTLTRKTTHCITTNAGTEKV 728
Query: 1200 NWALST-GRFVVHPGWVEASALLYRRANEQDFAIK 1233
A G +V W S L+ R E +F+++
Sbjct: 729 YQASKLPGCKIVWVQWFYQSLSLWERQPEDEFSVE 763
>gi|299470348|emb|CBN78397.1| Similar to RNA Polymerase II CTD phosphatase Fcp1, putative
[Ectocarpus siliculosus]
Length = 985
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 95/209 (45%), Gaps = 30/209 (14%)
Query: 884 GDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHE---- 939
GD + + R + + ++ +++KL LVLDLD+TLL+ + +
Sbjct: 223 GDTHQVLMKGGKMMSVTAEGRRMMHMNKSGRLLNSKKLSLVLDLDNTLLHCSDHPDAGRV 282
Query: 940 VDPVHDEILRKKEEQDREKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTM 997
V P D I H R P+ + KLRPG+ FL +A+ +FEM +YT
Sbjct: 283 VVPGVDGI--------------HALRLPNQQREYYIKLRPGLRRFLAQAATMFEMTIYTA 328
Query: 998 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL--GMESAVVI 1055
G YA +A VLDP LF GR S D +K LE + G++ A +I
Sbjct: 329 GTSQYADAVASVLDPDRSLFQGRHFSTCYTPDLGR------NTKSLERIFPNGLDMA-LI 381
Query: 1056 IDDSVRVWPHNKL-NLIVVERYTYFPCSR 1083
+DD VW + NL++V Y +F R
Sbjct: 382 VDDRDDVWRGEQAKNLLLVRPYKFFVGQR 410
Score = 73.6 bits (179), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 64/135 (47%), Gaps = 6/135 (4%)
Query: 1104 DGTLASSLGVIERLHKIFFSHQSLDDVDVR---NILAAEQRKILAGCRIVFSRVFPV--G 1158
D L + +E +H F++ ++ + R LA + ++L G R+VFS V PV
Sbjct: 682 DPQLDCTFKTLEAVHGAFYAPENSNHGQPRAAAGFLAKVRLRVLTGVRMVFSGVIPVSGA 741
Query: 1159 EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST-GRFVVHPGWVEA 1217
A+P H LW AE GA + I THVVA LGT K L G FVVH W+
Sbjct: 742 PADPRTHRLWMMAESHGATVERDIGRHTTHVVAVRLGTAKTKTGLRMPGVFVVHLDWLMN 801
Query: 1218 SALLYRRANEQDFAI 1232
S RR E F +
Sbjct: 802 SVWHCRRERETMFLL 816
>gi|383859139|ref|XP_003705054.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like isoform 1 [Megachile rotundata]
Length = 760
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 88/171 (51%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+++++ + RKL L++DLD T++++ + P ++ H L+
Sbjct: 142 EDEQRLLNDRKLALLVDLDQTIVHTTN-DNIPPNMKDVY-----------HYQLYGPNSP 189
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
T+LRP FL S+L+E+H+ T G + YA +A +LD G+LF+ R++SR + D
Sbjct: 190 WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVASLLDKDGILFSNRILSRDECFD 249
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P K+ +L+ + + V IIDD VW NL+ V+ Y +F
Sbjct: 250 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 293
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 71/156 (45%), Gaps = 19/156 (12%)
Query: 1095 EIDHDERSEDGTLASSLGVI---------------ERLHKIFFSHQSLDD--VDVRNILA 1137
EI D +SED TL+ +I R+H F++ + + +R+I+
Sbjct: 410 EITQDTKSEDKTLSGENNIIDEDDDDYLLYLEDILRRIHAEFYATKEKESGRSSLRDIIP 469
Query: 1138 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 1197
+ ++L G I FS + P + H ++ A FGA ++ + D+ TH+VA GT
Sbjct: 470 RVRAQVLKGVHITFSGLIPTHQK-IHQSRAYKVARAFGAEVSQELTDKTTHLVAIRPGTA 528
Query: 1198 KVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
K N A +V+P W+ A + +E+ F +
Sbjct: 529 KANAAKKNLNIKIVNPDWLWTCAERWEHVDERLFPL 564
>gi|365758888|gb|EHN00710.1| Fcp1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 677
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 88/191 (46%), Gaps = 48/191 (25%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------F 966
+KL LV+DLD T+++ VDP E + D P+ R
Sbjct: 123 KKLILVVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVL 173
Query: 967 PHMGM-----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 1009
P M M + K+RPG+ F ++ + LFEMH+YTM + YA ++AK+
Sbjct: 174 PLMYMNEDGSVLKPPPVRKCWYYVKVRPGLKEFFDKVAPLFEMHIYTMATRAYAIQIAKI 233
Query: 1010 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKL 1068
+DP G LF R++SR ++G +K L + +S VV+IDD VW
Sbjct: 234 VDPTGELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP- 284
Query: 1069 NLIVVERYTYF 1079
NLI V Y +F
Sbjct: 285 NLIKVVPYNFF 295
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/144 (21%), Positives = 68/144 (47%), Gaps = 11/144 (7%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 1151
HD++ +D L +G + +H ++ S + + ++ I+ + ++K+ C VF
Sbjct: 402 HDQKLLVDDDDELYYLMGTLSNIHNTYYRMISQSNEPEPNLMEIIPSLKQKVFQNCHFVF 461
Query: 1152 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 1208
S + P+G + +W + FGA T ID TH++ + T K A +
Sbjct: 462 SGLIPLGTDIQRSDIVIWTST--FGATSTSDIDYLTTHLITKNPSTYKARLAKKFNPEIK 519
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
++HP W+ + +++ +E+ + +
Sbjct: 520 IIHPDWIFECLVNWKKVDEKPYTL 543
>gi|399215912|emb|CCF72600.1| unnamed protein product [Babesia microti strain RI]
Length = 545
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 83/171 (48%), Gaps = 22/171 (12%)
Query: 895 DQQKAAIQKERTRRLEEQK--KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
++ +I R++EE + R LCLVLDLD+TL +H + L K E
Sbjct: 145 NEASMSISATFVRQMEESNLHSLLIKRLLCLVLDLDNTL-----------IHAKTLDKNE 193
Query: 953 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
D + ++ ++ +LRPG+ FL+ SK ++++L+TMG +AT +LDP
Sbjct: 194 VLDSNDDFKAIYFGGRCNLY-RLRPGVSEFLDAMSKYYQLYLFTMGTSEHATAALSLLDP 252
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 1063
+G LF+ R+ SR D + R S+ G+ V ++DD W
Sbjct: 253 QGKLFSNRIFSRSD-----SQNSRKTLSRIFPNYQGI---VCVVDDCEHAW 295
>gi|50294127|ref|XP_449475.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528789|emb|CAG62451.1| unnamed protein product [Candida glabrata]
Length = 758
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 96/197 (48%), Gaps = 36/197 (18%)
Query: 901 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKE 952
+ K+ T RL+ +KK L LV+DLD T+++ K +P ++ + K
Sbjct: 154 LDKQITTRLKNEKK------LVLVVDLDQTVIHCGVDPTIGEWKADPSNPNYETLKDVKC 207
Query: 953 EQDREKPHRHLFRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
E+P + +MG + K+RPG+ F E+ + L+EMH+YTM + YA
Sbjct: 208 FSLEEEP---ILPLIYMGPKPPVRTCWYYVKIRPGLKEFFEKIAPLYEMHIYTMATRAYA 264
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRV 1062
E+AK++DP LF R++SR ++G K L + +S VV+IDD V
Sbjct: 265 LEIAKIIDPDKSLFGDRILSRDENGS--------LTQKSLTRLFPTDQSMVVVIDDRGDV 316
Query: 1063 WPHNKLNLIVVERYTYF 1079
W NLI V Y +F
Sbjct: 317 WNWCP-NLIKVVPYNFF 332
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 65/144 (45%), Gaps = 10/144 (6%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFF----SHQSLDDVDVRNILAAEQRKILAGCRIV 1150
HD+R +D L + +HK F+ ++ D+ +L ++K+ C V
Sbjct: 439 HDQRLLIDDDDELFYLKDTLAYIHKEFYHLLEDASDQNEADITMLLPKLKKKVFKECHFV 498
Query: 1151 FSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNW--ALSTGRF 1208
FS + P+G + T+ FGA T ID+ THVV + GT K A +
Sbjct: 499 FSGLIPLGTDIKRADIVIWTS-LFGAESTSDIDEHTTHVVTKTPGTYKARLAKAFNPDIK 557
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
VVHP W+ ++R +E+ + +
Sbjct: 558 VVHPDWIFECLASWKRIDEKPYLL 581
>gi|383859141|ref|XP_003705055.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like isoform 2 [Megachile rotundata]
Length = 759
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 88/171 (51%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+++++ + RKL L++DLD T++++ + P ++ H L+
Sbjct: 142 EDEQRLLNDRKLALLVDLDQTIVHTTN-DNIPPNMKDVY-----------HYQLYGPNSP 189
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
T+LRP FL S+L+E+H+ T G + YA +A +LD G+LF+ R++SR + D
Sbjct: 190 WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVASLLDKDGILFSNRILSRDECFD 249
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P K+ +L+ + + V IIDD VW NL+ V+ Y +F
Sbjct: 250 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 293
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 71/156 (45%), Gaps = 19/156 (12%)
Query: 1095 EIDHDERSEDGTLASSLGVI---------------ERLHKIFFSHQSLDD--VDVRNILA 1137
EI D +SED TL+ +I R+H F++ + + +R+I+
Sbjct: 410 EITQDTKSEDKTLSGENNIIDEDDDDYLLYLEDILRRIHAEFYATKEKESGRSSLRDIIP 469
Query: 1138 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 1197
+ ++L G I FS + P + H ++ A FGA ++ + D+ TH+VA GT
Sbjct: 470 RVRAQVLKGVHITFSGLIPTHQK-IHQSRAYKVARAFGAEVSQELTDKTTHLVAIRPGTA 528
Query: 1198 KVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
K N A +V+P W+ A + +E+ F +
Sbjct: 529 KANAAKKNLNIKIVNPDWLWTCAERWEHVDERLFPL 564
>gi|350579777|ref|XP_003122350.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like, partial [Sus scrofa]
Length = 284
Score = 80.5 bits (197), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 76/131 (58%), Gaps = 13/131 (9%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
Q E+ R E+Q+++ RKL L++DLD TL+++ + H + I H
Sbjct: 162 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 209
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
L R M + T+LRP FLE+ ++L+E+H++T G++LYA +A LDP+ LF+ R+
Sbjct: 210 QLGRGEPM-LHTRLRPHCKEFLEKIAQLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 268
Query: 1022 ISRGDDGDPFD 1032
+SR + DPF
Sbjct: 269 LSRDECIDPFS 279
>gi|378756636|gb|EHY66660.1| hypothetical protein NERG_00300 [Nematocida sp. 1 ERTm2]
Length = 507
Score = 80.5 bits (197), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 62/110 (56%), Gaps = 10/110 (9%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
+ KLR + FL+ A K EMH+YTMGNK YAT + K+LDP G LF R+I+R D+
Sbjct: 203 YYVKLRDRLEWFLKEAEKYCEMHIYTMGNKAYATAIVKILDPTGKLFGSRIITRDDNFGC 262
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
FD KD++ + S V+I+DD VW NL ++ Y +F
Sbjct: 263 FD--------KDIKRLFPTNSKHVIILDDRPDVWGFVD-NLYPIKPYYFF 303
>gi|344301528|gb|EGW31840.1| hypothetical protein SPAPADRAFT_140004 [Spathaspora passalidarum NRRL
Y-27907]
Length = 770
Score = 80.5 bits (197), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 95/202 (47%), Gaps = 38/202 (18%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
+ I++ T RL E+KK L LV+DLD T++++ VDP
Sbjct: 155 EATKIEQSTTDRLTEEKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 204
Query: 943 --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
V D EE P+ + W K+RPG+ FLE+ S +EMH+YTM
Sbjct: 205 PAVKDVKSFCLEEDPITPPNWTGPKLAPTKCWYYVKVRPGLAEFLEQVSNKYEMHIYTMA 264
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 1057
+ YA +A ++DP+G F R++SR + G K+L+ + + +S VVIID
Sbjct: 265 TRNYALAIANIIDPEGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 316
Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
D VW NLI V Y +F
Sbjct: 317 DRGDVWQWES-NLIKVVPYDFF 337
Score = 40.8 bits (94), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 32/139 (23%), Positives = 61/139 (43%), Gaps = 11/139 (7%)
Query: 1103 EDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG-EAN 1161
+D L S V+ +H ++ + D+ I+ + + K L G ++FS + P+G +
Sbjct: 481 DDNELNSLNKVLANIHDAYYKQANSSKPDLTEIIPSLKSKCLEGITLLFSGIIPLGVPLD 540
Query: 1162 PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG-----TDKVNWA---LSTGRFVVHPG 1213
+W A+QFG + +VTHV+ + T K A +V+P
Sbjct: 541 SADIVIW--AKQFGVKVVNEVYPEVTHVICRDITPESGPTFKARAAHHLYPDTIKIVNPD 598
Query: 1214 WVEASALLYRRANEQDFAI 1232
W+ A + R +E ++ +
Sbjct: 599 WLFACLSTWSRVDETEYLL 617
>gi|255732778|ref|XP_002551312.1| hypothetical protein CTRG_05610 [Candida tropicalis MYA-3404]
gi|240131053|gb|EER30614.1| hypothetical protein CTRG_05610 [Candida tropicalis MYA-3404]
Length = 818
Score = 80.5 bits (197), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 97/207 (46%), Gaps = 48/207 (23%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 956
+ A I+ T RL ++KK L LV+DLD T++++ VDP E + D
Sbjct: 148 EAAKIEHSTTDRLIDEKK------LILVVDLDQTVIHAT----VDPTVGEW-----QSDP 192
Query: 957 EKPHRHLFR------------------FPHMG-----MWTKLRPGIWTFLERASKLFEMH 993
P+ + P + + KLRPG+ FLER S+ +EMH
Sbjct: 193 SNPNYRAVKDVRSFCLEEQPIVPPGWTGPKLAPTKCTYYVKLRPGLSEFLERMSEKYEMH 252
Query: 994 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESA 1052
+YTM + YA +AK++DP+G F R++SR + G K+L+ + + +S
Sbjct: 253 IYTMATRNYALAIAKIIDPEGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSM 304
Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
V IIDD VW NLI V Y +F
Sbjct: 305 VAIIDDRGDVWQWES-NLIKVVPYDFF 330
Score = 41.2 bits (95), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 31/141 (21%), Positives = 63/141 (44%), Gaps = 10/141 (7%)
Query: 1101 RSEDGTLASSLGVIERLHKIFFSHQSLDDV-DVRNILAAEQRKILAGCRIVFSRVFPVGE 1159
R +D L + V+ +H+ ++ ++ D+ I+ + K+L G ++FS + P+G
Sbjct: 500 RDDDNELVALDKVLVNIHEEYYKRYDKENKPDLTEIIPTMKSKVLDGITVLFSGIIPLG- 558
Query: 1160 ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD-------KVNWALSTGRF-VVH 1211
N + +QFG + +THVV +G D +V L +V+
Sbjct: 559 INLDSADIVIWCKQFGVKVVNEVYPDITHVVCRDIGEDIGPTFKARVARKLYPDTVKIVN 618
Query: 1212 PGWVEASALLYRRANEQDFAI 1232
P W+ A + +E+++ +
Sbjct: 619 PDWLFACLSNWTIVDEKEYLV 639
>gi|196002231|ref|XP_002110983.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
gi|190586934|gb|EDV26987.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
Length = 766
Score = 80.5 bits (197), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 57/174 (32%), Positives = 93/174 (53%), Gaps = 19/174 (10%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
+ ++ S++KL L++DLD TL+ H D L E+ + H+F P
Sbjct: 218 DMNRLLSSKKLVLIVDLDLTLI-----HTRMASPDIKLSNLTEEKQIYYTCHMF--PGYN 270
Query: 971 MW----TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
++ TKLRP + FL+ AS LFE+H+ TMG++ YA ++ +LDP G LF R++SR
Sbjct: 271 VYHQYLTKLRPHVEEFLKVASTLFELHVVTMGSRSYAQDIVGILDPTGSLFYNRILSRD- 329
Query: 1027 DGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ ++ KS +L + + ++ V IIDD +W + + I V Y+YF
Sbjct: 330 -----ELKSQLLKSTNLNQLFPLGDNLVCIIDDRPEMWAFHP-SCIPVPPYSYF 377
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 62/135 (45%), Gaps = 16/135 (11%)
Query: 1116 RLHKIFFSH-----QSLDDV-DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQ 1169
R+H F+ + LD + D+++I+ +R +L +IVFS + P G +P W
Sbjct: 516 RIHNRFYQQYDQLKEQLDQLPDLKSIIPEIRRNVLKDVKIVFSAIIPSGHPSPEKTYEWI 575
Query: 1170 TAEQFGAVCTKHI----DDQVTHVVANSLG-----TDKVNWALST-GRFVVHPGWVEASA 1219
AE GA T + THVV + T KV+ A+ T G FVV W+
Sbjct: 576 LAESLGAKVTHKFHTSPSRKTTHVVTKRVAFQSGYTQKVHLAMKTAGVFVVDIDWLYKCN 635
Query: 1220 LLYRRANEQDFAIKP 1234
+++ E+ + + P
Sbjct: 636 EFWKKIEEEPYLLAP 650
>gi|388853856|emb|CCF52577.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Ustilago hordei]
Length = 471
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 92/198 (46%), Gaps = 41/198 (20%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPH 968
E + S RKL LV+DLD T++++A VDP E + + + E FR
Sbjct: 20 ETTTHLLSQRKLALVVDLDQTIIHTA----VDPTVGEWMEDESNPNYEALKSVAKFRLGI 75
Query: 969 MG--------------------------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
G + KLRPG+ L++ S+ +++H+YTMG + Y
Sbjct: 76 GGEEIKDDDDPPAPKDSAAALKASRACWYYVKLRPGVPEILKKLSEKYQLHVYTMGTRSY 135
Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVR 1061
A + K++DP +F R++SR ++G K L+ + M+ S VVIIDD
Sbjct: 136 ANLVCKLIDPDASIFGNRIVSRNENGSLV--------RKSLDKLFPMDHSMVVIIDDRED 187
Query: 1062 VWPHNKLNLIVVERYTYF 1079
VW + NL+ V Y +F
Sbjct: 188 VWSKSP-NLLQVVPYEFF 204
Score = 43.1 bits (100), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 45/101 (44%), Gaps = 14/101 (13%)
Query: 1132 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAV-CTKHIDDQVTHVV 1190
V + ++A + K+LAGC IVF P LW A +FGA K THV+
Sbjct: 362 VVDFISARKTKVLAGCTIVF---------YPETSELWALAREFGATPAFKEEAGVTTHVI 412
Query: 1191 A---NSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 1227
+ + +GT K G +V P W + S + R +E
Sbjct: 413 SALQDDIGTVKATRLARAGMVELVWPSWFDISTSRWERQDE 453
>gi|405122085|gb|AFR96852.1| hypothetical protein CNAG_04120 [Cryptococcus neoformans var. grubii
H99]
Length = 921
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 48/115 (41%), Positives = 67/115 (58%), Gaps = 17/115 (14%)
Query: 970 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
G W TK RPG+ FL+ S+L+EMH+YTMG + YA + KV+DP G +F GR++SR +
Sbjct: 283 GRWYFTKPRPGLQKFLDEMSQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDES 342
Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 1081
G F SK+L+ + + S VV+IDD VW + NL+ V PC
Sbjct: 343 GS-F-------SSKNLKRLFPTDTSMVVVIDDRSDVW-GDCPNLVKV-----VPC 383
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 67/140 (47%), Gaps = 15/140 (10%)
Query: 1104 DGTLASSLGVIERLHKIFF-SHQSLD----------DVDVRNILAAEQRKILAGCRIVFS 1152
D L +++ +H F+ + +LD DV I+ + ++L GC +VFS
Sbjct: 564 DYELVRVANILQEIHSRFYKAFDALDGWNPKKALPMSCDVEFIIPEMKAEVLDGCSLVFS 623
Query: 1153 RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRFVV 1210
+ P E+NP +WQTAE FGA+ T + + TH+V L T+K W G VV
Sbjct: 624 GMIPR-ESNPSTTTIWQTAESFGALITPSLTPRTTHLVTALLNTEKT-WRAGKMEGVKVV 681
Query: 1211 HPGWVEASALLYRRANEQDF 1230
W S L+ R NE+ +
Sbjct: 682 WAEWFWDSVALWERQNEEKY 701
Score = 40.4 bits (93), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 2/71 (2%)
Query: 890 FEGYDDQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEI 947
FE D + K +RLE + + S R+L L++DLD T++++ V DEI
Sbjct: 135 FEIAHDAMGVTVSKNEAQRLENLTRDALLSTRRLSLIVDLDQTIIHTTVDPTVAEWMDEI 194
Query: 948 LRKKEEQDREK 958
R++ E D+EK
Sbjct: 195 HREESEDDQEK 205
>gi|390333352|ref|XP_791406.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Strongylocentrotus purpuratus]
Length = 673
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 91/170 (53%), Gaps = 19/170 (11%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
++ + RKL L++DLD TL+++ +D V ++ Q R+ P +F + H
Sbjct: 22 DEDSLIKHRKLVLLVDLDQTLIHTT----LDEVPADMPGVHHFQLRKGP---MFPWYH-- 72
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
T++R FL+ S+ +++H++TMG +LYA +A+++DP+G F+ R++SR + DP
Sbjct: 73 --TRIRDNYQQFLDLISQFYQLHIFTMGVRLYAHTVAEIIDPEGKFFSHRILSRDECVDP 130
Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K +L + + V IIDD VW + NLI V Y YF
Sbjct: 131 HS------KKANLRSIFPRGDKMVCIIDDRDDVW-NFAPNLIQVPPYRYF 173
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 70/149 (46%), Gaps = 19/149 (12%)
Query: 1100 ERSEDGTLASSLGVIERLHKIFFSH-----QSLDDVDVRNILAAEQRKILAGCRIVFSRV 1154
E +D L + R+H+ FF Q L D+++I+ + +L GC IVFS V
Sbjct: 325 EEDKDEYLLYLADTLARIHRTFFKQVEANPQEL--PDLKSIVPGMRMSVLKGCNIVFSSV 382
Query: 1155 FPVGEANPHLHPLWQTAEQFGA-----VCTKHIDDQV-----THVVANSLGTDKVNWA-L 1203
FP P +W+ A GA + TK ++Q TH+VA +GT KV+ A
Sbjct: 383 FPTN-MPPEQSRVWKVALALGAKVSPQIVTKSKEEQAKGRASTHLVAAKVGTSKVHAARR 441
Query: 1204 STGRFVVHPGWVEASALLYRRANEQDFAI 1232
S F+V P W+ + R +E+ F +
Sbjct: 442 SKSIFIVEPDWLWCCWERWERVDERLFIL 470
>gi|307212079|gb|EFN87962.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Harpegnathos saltator]
Length = 734
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 86/173 (49%), Gaps = 26/173 (15%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFH---EVDPVHDEILRKKEEQDREKPHRHLFRFP 967
+++++ RKL L++DLD T++++ H + VH H L+
Sbjct: 144 DEQRLLKDRKLVLLVDLDQTIVHTTNDHIPPNLKDVH---------------HFQLYGPN 188
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
T+LRP FL S L+E+H+ + G ++YA +A +LD GVLF+ R++SR +
Sbjct: 189 SPWYHTRLRPNTRHFLSEMSHLYELHICSFGARIYAHTIASLLDKDGVLFSHRILSRDEC 248
Query: 1028 GDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
DP K+ +L+ + + V IIDD VW NL+ V+ Y +F
Sbjct: 249 FDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294
Score = 49.7 bits (117), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 63/125 (50%), Gaps = 8/125 (6%)
Query: 1113 VIERLHKIFFSHQSLDDVD----VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLW 1168
++ R+H F+ +LD + +R+I+ + ++L G + FS + P + H +
Sbjct: 472 ILRRIHTEFYC--TLDKGNGRRSLRDIIPRVRSQVLKGLYLTFSGLIPTHQ-KLHQSRAY 528
Query: 1169 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA-LSTGRFVVHPGWVEASALLYRRANE 1227
+ A FGA T+ + ++ TH+VA GT K N A T +V+P W+ A + +E
Sbjct: 529 KVARAFGAEVTQDLTEKTTHLVAIRKGTAKANAAKKDTNIKIVNPEWLWTCAERWEHVDE 588
Query: 1228 QDFAI 1232
+ F +
Sbjct: 589 RLFPV 593
>gi|195170374|ref|XP_002025988.1| GL10108 [Drosophila persimilis]
gi|194110852|gb|EDW32895.1| GL10108 [Drosophila persimilis]
Length = 757
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 86/163 (52%), Gaps = 24/163 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRP 977
KL L++DLD T++++ D V + I K H + W T+LRP
Sbjct: 88 KLVLLVDLDQTVIHTTN----DTVPENI----------KGIYHFQLYGPQSPWYHTRLRP 133
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1037
G FLER S+L+E+H+ T G + YA +A++LDP G F+ R++SR + F+
Sbjct: 134 GTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR---DECFNA---T 187
Query: 1038 PKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 188 SKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 229
Score = 40.4 bits (93), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 70/149 (46%), Gaps = 10/149 (6%)
Query: 1093 LLEIDHDE------RSEDGTLASSLGVIERLHKIFFS--HQSLDDVDVRNILAAEQRKIL 1144
+++++HDE D L ++ +HK F++ ++++ D++ I+ + ++L
Sbjct: 412 IVKVNHDEGKLIEIEDPDDYLLYLEVILRNIHKRFYAIYDETMEIPDLKIIVPKIRCEVL 471
Query: 1145 AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 1204
G +VFS + P + + A+ GA +I + TH+VA + GT KVN A
Sbjct: 472 RGKNLVFSGLVPT-QMRLEQSRAYFIAKSLGAEVQPNISKESTHLVAVNAGTYKVNAAKK 530
Query: 1205 TGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VV+ W+ A + E+ F +
Sbjct: 531 EVNIRVVNANWLWTCAERWEHVEEKLFPL 559
>gi|91087589|ref|XP_971974.1| PREDICTED: similar to RNA polymerase II subunit A C-terminal domain
phosphatase [Tribolium castaneum]
gi|270010700|gb|EFA07148.1| hypothetical protein TcasGA2_TC010139 [Tribolium castaneum]
Length = 760
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 81/172 (47%), Gaps = 30/172 (17%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--- 970
++ RKL L++DLD TL+++ H + D ++RF G
Sbjct: 136 RLIRDRKLVLLVDLDQTLIHTTNDHIQPNIKD-----------------IYRFQLYGPNS 178
Query: 971 --MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+T+LRPG FL +E+H+ T G + YA +A VLD F+ R++SR +
Sbjct: 179 PWYFTRLRPGTHQFLNNIYPFYELHICTFGARNYAHMIAAVLDRDQKFFSNRILSRDECF 238
Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
DP K +L+ + ++ V IIDD VW N NLI V+ Y +F
Sbjct: 239 DP------TSKKANLKALFPCGDNMVCIIDDREDVWS-NAANLIHVKPYHFF 283
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 66/134 (49%), Gaps = 6/134 (4%)
Query: 1104 DGTLASSLGVIERLHKIFFS----HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGE 1159
D L+ +++R+H+ F+ +S + D++ ++ + ++L G ++VFS + P
Sbjct: 444 DDYLSYLEDILKRIHQAFYDAYDKMESGEVPDLKKVIPGVRSQVLQGYKLVFSGLVPT-H 502
Query: 1160 ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEAS 1218
+Q A+ GA T+ ++D TH+VA GT KVN +V P W+
Sbjct: 503 IKLEQSKAYQIAKSLGAEVTQELEDDTTHLVAVRPGTAKVNAGRRKKNLKIVTPDWLWCC 562
Query: 1219 ALLYRRANEQDFAI 1232
A + +E+ FA+
Sbjct: 563 AERWEHVDEKIFAL 576
>gi|443896478|dbj|GAC73822.1| TFIIF-interacting CTD phosphatases [Pseudozyma antarctica T-34]
Length = 751
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 96/209 (45%), Gaps = 43/209 (20%)
Query: 901 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK-------- 950
+ E +RL E + S RKL L++DLD T++++ VDP E +R
Sbjct: 46 VSAEEAQRLDSESTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMRDDTNPNYDA 101
Query: 951 ----------------KEEQDREKPH---RHLFRFPHMGMWTKLRPGIWTFLERASKLFE 991
K++ D P L + K RPG+ T L++ S+ ++
Sbjct: 102 LKSVGKFRLGIDGEEIKDDDDPTAPKDAAAALRASRACWYYVKPRPGVPTILKQLSQKYQ 161
Query: 992 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
+H+YTMG + YA + K++DP +F R++SR ++G K L + ++
Sbjct: 162 LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLSRLFPVDH 213
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
S VVIIDD VW N NL+ V Y +F
Sbjct: 214 SMVVIIDDREDVW-SNSPNLLPVLPYEFF 241
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 59/112 (52%), Gaps = 12/112 (10%)
Query: 1132 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVV 1190
V +++ A++ ++L GC IVFS + PVG + LW TA FGA I+ V THVV
Sbjct: 503 VVDLMGAKKAQVLKGCVIVFSSMIPVGH-DAAKSELWATARAFGATPAAEIEPGVTTHVV 561
Query: 1191 ANSLGTDKVNWAL----------STGRFVVHPGWVEASALLYRRANEQDFAI 1232
+ +GT KV+ A+ S+ +V P W E S + R +E+ + +
Sbjct: 562 SARMGTAKVHQAMKLVKQQRERGSSQLKLVWPSWFEVSTSRWARQDEELYRL 613
>gi|349580569|dbj|GAA25729.1| K7_Fcp1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 732
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 101/246 (41%), Gaps = 57/246 (23%)
Query: 864 TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCL 923
T G A P GDV+ + + KE RR +KL L
Sbjct: 127 TQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---------DKKLIL 177
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------FPHMGM 971
V+DLD T+++ VDP E + D P+ R P M M
Sbjct: 178 VVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVLPLMYM 228
Query: 972 -----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
+ K+RPG+ F + + LFEMH+YTM + YA ++AK++DP G
Sbjct: 229 NDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTG 288
Query: 1015 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 1073
LF R++SR ++G +K L + +S VV+IDD VW NLI V
Sbjct: 289 ELFGDRILSRDENGS--------LTTKSLTKLFPTDQSMVVVIDDRGDVWNWCP-NLIKV 339
Query: 1074 ERYTYF 1079
Y +F
Sbjct: 340 VPYNFF 345
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/144 (23%), Positives = 70/144 (48%), Gaps = 11/144 (7%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 1151
HD++ +D L +G + +HK ++ S Q+ + ++ I+ + ++K+ C VF
Sbjct: 452 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 511
Query: 1152 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 1208
S + P+G + +W + FGA T ID TH++ + T K A +
Sbjct: 512 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 569
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
+VHP W+ + +++ +E+ + +
Sbjct: 570 IVHPDWIFECLVNWKKVDEKPYTL 593
>gi|71004098|ref|XP_756715.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
gi|46095984|gb|EAK81217.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
Length = 779
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 96/209 (45%), Gaps = 43/209 (20%)
Query: 901 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK-------- 950
+ E +RL E + S RKL L++DLD T++++ VDP E +R
Sbjct: 46 VSAEEAQRLDSETTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMRDESNPNYEA 101
Query: 951 ----------------KEEQDREKPH---RHLFRFPHMGMWTKLRPGIWTFLERASKLFE 991
K+E+D +P L + K RPG+ L+ S+ +E
Sbjct: 102 LQSVGKFRLGIDGEEIKDEEDGSEPKDPAAALKASRACWYYVKPRPGVPQVLKHLSEKYE 161
Query: 992 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
+H+YTMG + YA + K++DP +F R++SR ++G K L + ++
Sbjct: 162 LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLSRLFPVDH 213
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
S VVIIDD VW + NL+ V Y +F
Sbjct: 214 SMVVIIDDREDVWSRSP-NLLPVLPYEFF 241
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 56/104 (53%), Gaps = 4/104 (3%)
Query: 1132 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVV 1190
V +++ ++ ++L GC IVFS + P G N LW A +FGA I+ V THVV
Sbjct: 508 VVDLIGMKKAQVLKGCTIVFSSMIPFGH-NVEKSDLWAMAREFGATPASEIEVGVTTHVV 566
Query: 1191 ANSLGTDKVNWALS-TGRF-VVHPGWVEASALLYRRANEQDFAI 1232
A GT KV+ AL +G+ VV P W SA + R +E + I
Sbjct: 567 AARPGTAKVHQALRLSGQLEVVWPSWFHVSASRWSRQDEALYRI 610
>gi|328772741|gb|EGF82779.1| hypothetical protein BATDEDRAFT_22917 [Batrachochytrium dendrobatidis
JAM81]
Length = 868
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 66/123 (53%), Gaps = 16/123 (13%)
Query: 908 RLEEQK--KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
RLE++ ++ RKL LVLDLD T++++ VDP E + D P+
Sbjct: 154 RLEKETADRLLDERKLSLVLDLDQTVIHAT----VDPTVGEWM-----ADPNNPN----- 199
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
FP + +W PG FL + +EMH+YTMG + YA ++K+LDP F R++SR
Sbjct: 200 FPALTVWATHEPGTREFLRELNAKYEMHIYTMGTRNYAKAVSKILDPDKRYFKDRILSRD 259
Query: 1026 DDG 1028
D G
Sbjct: 260 DSG 262
Score = 70.1 bits (170), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 5/115 (4%)
Query: 1117 LHKIFFSHQSLDDV---DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQ 1173
+H+ F+ D+ DVR I+ +R IL G I+F+ + P+G P H W A
Sbjct: 504 IHQTFYERSDAGDIKTSDVRVIVPEMKRSILEGVHILFTSIIPLG-LEPQKHEHWIAATS 562
Query: 1174 FGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 1227
+GAVC +D +VTHV+A GT KVN A ++ W+ + ++RA+E
Sbjct: 563 YGAVCHVDLDPEVTHVIAGKTGTAKVNAARKRPNVAILKIDWLIDTIRTWQRADE 617
>gi|323307594|gb|EGA60861.1| Fcp1p [Saccharomyces cerevisiae FostersO]
Length = 732
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 101/246 (41%), Gaps = 57/246 (23%)
Query: 864 TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCL 923
T G A P GDV+ + + KE RR +KL L
Sbjct: 127 TQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---------DKKLIL 177
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------FPHMGM 971
V+DLD T+++ VDP E + D P+ R P M M
Sbjct: 178 VVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVLPLMYM 228
Query: 972 -----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
+ K+RPG+ F + + LFEMH+YTM + YA ++AK++DP G
Sbjct: 229 NDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTG 288
Query: 1015 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 1073
LF R++SR ++G +K L + +S VV+IDD VW NLI V
Sbjct: 289 ELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP-NLIKV 339
Query: 1074 ERYTYF 1079
Y +F
Sbjct: 340 VPYNFF 345
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/144 (23%), Positives = 70/144 (48%), Gaps = 11/144 (7%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 1151
HD++ +D L +G + +HK ++ S Q+ + ++ I+ + ++K+ C VF
Sbjct: 452 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 511
Query: 1152 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 1208
S + P+G + +W + FGA T ID TH++ + T K A +
Sbjct: 512 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 569
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
+VHP W+ + +++ +E+ + +
Sbjct: 570 IVHPDWIFECLVNWKKVDEKPYTL 593
>gi|190408503|gb|EDV11768.1| TFIIF interacting component of CTD phosphatase [Saccharomyces
cerevisiae RM11-1a]
Length = 732
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 101/246 (41%), Gaps = 57/246 (23%)
Query: 864 TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCL 923
T G A P GDV+ + + KE RR +KL L
Sbjct: 127 TQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---------DKKLIL 177
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------FPHMGM 971
V+DLD T+++ VDP E + D P+ R P M M
Sbjct: 178 VVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVLPLMYM 228
Query: 972 -----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
+ K+RPG+ F + + LFEMH+YTM + YA ++AK++DP G
Sbjct: 229 NDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTG 288
Query: 1015 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 1073
LF R++SR ++G +K L + +S VV+IDD VW NLI V
Sbjct: 289 ELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP-NLIKV 339
Query: 1074 ERYTYF 1079
Y +F
Sbjct: 340 VPYNFF 345
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/144 (23%), Positives = 70/144 (48%), Gaps = 11/144 (7%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 1151
HD++ +D L +G + +HK ++ S Q+ + ++ I+ + ++K+ C VF
Sbjct: 452 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 511
Query: 1152 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 1208
S + P+G + +W + FGA T ID TH++ + T K A +
Sbjct: 512 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 569
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
+VHP W+ + +++ +E+ + +
Sbjct: 570 IVHPDWIFECLVNWKKVDEKPYTL 593
>gi|6323933|ref|NP_014004.1| Fcp1p [Saccharomyces cerevisiae S288c]
gi|2497216|sp|Q03254.1|FCP1_YEAST RecName: Full=RNA polymerase II subunit A C-terminal domain
phosphatase; AltName: Full=CTD phosphatase FCP1
gi|825543|emb|CAA89775.1| unknown [Saccharomyces cerevisiae]
gi|151945985|gb|EDN64217.1| protein phosphatase [Saccharomyces cerevisiae YJM789]
gi|256270710|gb|EEU05873.1| Fcp1p [Saccharomyces cerevisiae JAY291]
gi|259148865|emb|CAY82110.1| Fcp1p [Saccharomyces cerevisiae EC1118]
gi|285814283|tpg|DAA10178.1| TPA: Fcp1p [Saccharomyces cerevisiae S288c]
gi|323346974|gb|EGA81251.1| Fcp1p [Saccharomyces cerevisiae Lalvin QA23]
gi|323353207|gb|EGA85507.1| Fcp1p [Saccharomyces cerevisiae VL3]
gi|392297449|gb|EIW08549.1| Fcp1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 732
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 101/246 (41%), Gaps = 57/246 (23%)
Query: 864 TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCL 923
T G A P GDV+ + + KE RR +KL L
Sbjct: 127 TQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---------DKKLIL 177
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------FPHMGM 971
V+DLD T+++ VDP E + D P+ R P M M
Sbjct: 178 VVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVLPLMYM 228
Query: 972 -----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
+ K+RPG+ F + + LFEMH+YTM + YA ++AK++DP G
Sbjct: 229 NDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTG 288
Query: 1015 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 1073
LF R++SR ++G +K L + +S VV+IDD VW NLI V
Sbjct: 289 ELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP-NLIKV 339
Query: 1074 ERYTYF 1079
Y +F
Sbjct: 340 VPYNFF 345
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/144 (23%), Positives = 70/144 (48%), Gaps = 11/144 (7%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 1151
HD++ +D L +G + +HK ++ S Q+ + ++ I+ + ++K+ C VF
Sbjct: 452 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 511
Query: 1152 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 1208
S + P+G + +W + FGA T ID TH++ + T K A +
Sbjct: 512 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 569
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
+VHP W+ + +++ +E+ + +
Sbjct: 570 IVHPDWIFECLVNWKKVDEKPYTL 593
>gi|391332118|ref|XP_003740485.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Metaseiulus occidentalis]
Length = 646
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 134/332 (40%), Gaps = 68/332 (20%)
Query: 962 HLFRFP-HMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
H FR P W T++RPG FL + S+LFE+H+ T G + YA + +LDP F
Sbjct: 168 HHFRLPGSSNAWYHTRIRPGTEDFLRKISQLFELHIVTFGARPYANHIVSLLDPGKKYFQ 227
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
R+++R + P K+ +L+ + + V IIDD VW NL+ V+ Y
Sbjct: 228 YRILTRDECFHP------QSKTANLKSLFPCGDQMVCIIDDREDVWNFAS-NLVAVKPYV 280
Query: 1078 YF-------------------------PCS-----------RRQFGLLGPSLLEIDHDER 1101
+F CS R +L I+H
Sbjct: 281 FFRGAGDINAPAGLLADCHALPASEGGTCSSVLSHKNPEALRADREVLACLQSLIEHTCG 340
Query: 1102 SEDGTLASSLG---------VIERLHKIFFS----------HQSLDDVDVRNILAAEQRK 1142
+ DG + G + +H+ +F +++ D++ ++ ++K
Sbjct: 341 ATDGFIDYEDGDDYLFYLEETLRTIHRAYFELYEQMQLKKHSEAVSIPDLKTVIPYVRQK 400
Query: 1143 ILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA 1202
+L IVF+ FP+ + ++ A GA K + +VTH+VA GT KV A
Sbjct: 401 VLKDVVIVFTGCFPINQRQESA-KIYLVAIALGAKVQKELSKEVTHLVAARPGTAKVQQA 459
Query: 1203 LSTGRF-VVHPGWVEASALLYRRANEQDFAIK 1233
VV W+ + A + ++ E F +K
Sbjct: 460 RKFRSIKVVSAQWLWSCAERWEKSPETLFPLK 491
>gi|134056779|emb|CAK37687.1| unnamed protein product [Aspergillus niger]
Length = 788
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 66/113 (58%), Gaps = 12/113 (10%)
Query: 970 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
G W KLRPG+ +FL+ S+++E+H+YTMG + YA +A ++DP LF R++SR +
Sbjct: 175 GCWYYVKLRPGLESFLQNVSEMYELHIYTMGTRSYAQHIASIIDPDRKLFGDRILSRDES 234
Query: 1028 GDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
G +K+L + +++ VVIIDD VW N NLI V Y +F
Sbjct: 235 GSLV--------AKNLHRLFPVDTKMVVIIDDRGDVWRWNP-NLIKVSPYDFF 278
Score = 45.1 bits (105), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 4/104 (3%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 1189
DV+ I+ + +IL G +VFS V P+G + LW A+ FGAV I+ TH+
Sbjct: 466 DVKIIMPNMKHRILGGVTLVFSGVLPLGTDTQNADIXLW--AKSFGAVIASKINMXTTHL 523
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
VA T KV A + +V W+ ++R E+ + +
Sbjct: 524 VAGRNRTXKVREATRYPKVKIVTTQWLLDCLTQWKRLAEEPYLL 567
>gi|300701489|ref|XP_002994977.1| hypothetical protein NCER_102325 [Nosema ceranae BRL01]
gi|239603396|gb|EEQ81306.1| hypothetical protein NCER_102325 [Nosema ceranae BRL01]
Length = 200
Score = 79.3 bits (194), Expect = 1e-11, Method: Composition-based stats.
Identities = 56/177 (31%), Positives = 87/177 (49%), Gaps = 32/177 (18%)
Query: 903 KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH 962
K++ RL + KK L LVLDLD T+L H I ++ E
Sbjct: 48 KKKLERLHKNKK------LVLVLDLDQTIL-----------HTTITKEYMEGYSN----- 85
Query: 963 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
F + K RP + LE K +E+H+YTMGNK+YA ++ K++DP R++
Sbjct: 86 -FIINDISYCVKFRPYLNYMLECLYKKYEIHVYTMGNKVYANKIVKLIDPTRKYIGNRIL 144
Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+R ++G F KDL + + S VVI+DD +W ++ NLI+V+ Y ++
Sbjct: 145 TRDENGIGF--------KKDLNRLFSIHSNVVILDDRDDIWDYSD-NLILVKPYFFW 192
>gi|241953831|ref|XP_002419637.1| RNA polymerase II subunit a c-terminal domain phosphatase, putative
[Candida dubliniensis CD36]
gi|223642977|emb|CAX43233.1| RNA polymerase II subunit a c-terminal domain phosphatase, putative
[Candida dubliniensis CD36]
Length = 771
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 97/207 (46%), Gaps = 48/207 (23%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 956
+ A I+ T RL ++ RKL LV+DLD T++++ VDP E + D
Sbjct: 148 EAAKIEHNTTDRLIDE------RKLILVVDLDQTVIHAT----VDPTVGEW-----QSDP 192
Query: 957 EKPHR------HLFRFPHMGM----WT-------------KLRPGIWTFLERASKLFEMH 993
P+ F + WT KLRPG+ FLE+ ++ +EMH
Sbjct: 193 ANPNYAAVKDVKTFCLEEEAIVPPGWTGPKLAPTKCTYYVKLRPGLSEFLEKMAEKYEMH 252
Query: 994 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESA 1052
+YTM + YA +AK++DP G F R++SR + G K+L+ + + +S
Sbjct: 253 IYTMATRNYALSIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSM 304
Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
VVIIDD VW NLI V Y +F
Sbjct: 305 VVIIDDRGDVWQWES-NLIKVVPYDFF 330
>gi|323332189|gb|EGA73600.1| Fcp1p [Saccharomyces cerevisiae AWRI796]
Length = 646
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 106/257 (41%), Gaps = 61/257 (23%)
Query: 857 TNHDDKQTG--TGSGPE--AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
NHD G T G E A P GDV+ + + KE RR
Sbjct: 116 CNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---- 171
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------- 965
+KL LV+DLD T+++ VDP E + D P+ R
Sbjct: 172 -----DKKLILVVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTL 217
Query: 966 -----FPHMGM-----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
P M M + K+RPG+ F + + LFEMH+YTM + YA
Sbjct: 218 DEELVLPLMYMNDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYA 277
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRV 1062
++AK++DP G LF R++SR ++G +K L + +S VV+IDD V
Sbjct: 278 LQIAKIVDPTGELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDV 329
Query: 1063 WPHNKLNLIVVERYTYF 1079
W NLI V Y +F
Sbjct: 330 WNWCP-NLIKVVPYNFF 345
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/144 (23%), Positives = 69/144 (47%), Gaps = 11/144 (7%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 1151
HD++ +D L +G + +HK ++ S Q+ + ++ I+ + ++K+ C VF
Sbjct: 452 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 511
Query: 1152 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 1208
S + P+G + +W + FGA T ID TH++ + T K A +
Sbjct: 512 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 569
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
+VHP W+ + +++ +E+ +
Sbjct: 570 IVHPDWIFECLVNWKKVDEKTLHV 593
>gi|149241937|ref|XP_001526384.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450507|gb|EDK44763.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 883
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/227 (29%), Positives = 105/227 (46%), Gaps = 45/227 (19%)
Query: 885 DVEHLFEGYDDQQKAAI-----------QKERTRRLEEQ--KKMFSARKLCLVLDLDHTL 931
D E + GYD +++A+I + ++E ++ RKL LV+DLD T+
Sbjct: 117 DDEKDYSGYDYEERASIAMSHDNTELRISYDEAAKIEHNTTDRLNQERKLILVVDLDQTV 176
Query: 932 LNSAKFHEVDP----------------VHDEILRKKEEQDREKPHRHLFRFPHMGMW--T 973
+++ VDP V D EE P + + W
Sbjct: 177 IHAT----VDPTVGEWQLDPENPNYPAVKDVRTFCLEEDPVAPPGWNGPKLAPTKCWYYV 232
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
K+RPG+ FL++ + +EMH+YTM + YA +AK++DP+G F R++SR + G
Sbjct: 233 KVRPGLAEFLKKMDEKYEMHIYTMATRNYALSIAKIIDPEGKYFGDRILSRDESGS---- 288
Query: 1034 DERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K+L+ + + +S VVIIDD VW NLI V Y +F
Sbjct: 289 ----LTHKNLKRLFPVDQSMVVIIDDRGDVWQWEN-NLIKVVPYDFF 330
>gi|68472089|ref|XP_719840.1| potential RNA Pol II CTD phosphatase component [Candida albicans
SC5314]
gi|68472324|ref|XP_719723.1| potential RNA Pol II CTD phosphatase component [Candida albicans
SC5314]
gi|46441553|gb|EAL00849.1| potential RNA Pol II CTD phosphatase component [Candida albicans
SC5314]
gi|46441679|gb|EAL00974.1| potential RNA Pol II CTD phosphatase component [Candida albicans
SC5314]
Length = 768
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 95/207 (45%), Gaps = 48/207 (23%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 956
+ A I+ T RL RKL LV+DLD T++++ VDP E + D
Sbjct: 148 EAAKIEHNTTDRL------IDERKLILVVDLDQTVIHAT----VDPTVGEW-----QSDP 192
Query: 957 EKPHR------HLFRFPHMGM----WT-------------KLRPGIWTFLERASKLFEMH 993
P+ F + WT KLRPG+ FLE+ ++ +EMH
Sbjct: 193 ANPNYAAVKDVKTFCLEEEAIVPPGWTGPKLAPTKCTYYVKLRPGLSEFLEKMAEKYEMH 252
Query: 994 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESA 1052
+YTM + YA +AK++DP G F R++SR + G K+L+ + + +S
Sbjct: 253 IYTMATRNYALSIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSM 304
Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
VVIIDD VW NLI V Y +F
Sbjct: 305 VVIIDDRGDVWQWES-NLIKVVPYDFF 330
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/142 (23%), Positives = 61/142 (42%), Gaps = 9/142 (6%)
Query: 1099 DERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG 1158
D+ +E L LG I + + F ++ D+ I+ + + K L G ++FS + P+G
Sbjct: 476 DDDNELAALDKVLGNIHQGYYNLFDKDKINKPDLTEIIPSMKSKTLEGITVLFSGIIPLG 535
Query: 1159 EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG-------TDKVNWALSTGRF-VV 1210
N + QFG + +VTHVV + +V L +V
Sbjct: 536 -INLDSADIVIWCRQFGVKVVNEVYPEVTHVVCRDVSEGAGPTFKTRVARKLYPDTIKIV 594
Query: 1211 HPGWVEASALLYRRANEQDFAI 1232
+P W+ A + + +E+D+ I
Sbjct: 595 NPDWLFACLSNWTKVDEKDYLI 616
>gi|350413080|ref|XP_003489872.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Bombus impatiens]
Length = 751
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+++++ + RKL L++DLD T++++ D + I ++ H L+
Sbjct: 143 EDEQRLLNDRKLALLVDLDQTIVHTTN----DNIPSNI--------KDVYHYQLYGPNSP 190
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
T+LRP FL S+L+E+H+ T G + YA +A +LD G LF+ R++SR + D
Sbjct: 191 WYHTRLRPNTKHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P K+ +L+ + + V IIDD VW NL+ V+ Y +F
Sbjct: 251 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294
Score = 48.1 bits (113), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 59/123 (47%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ R+H F++ + +R+I+ + ++L G + FS + P + H ++
Sbjct: 444 ILRRIHTEFYTTIEKEGGRKSLRDIIPRVRAQVLKGVYLTFSGLIPTHQK-LHQSRAYKV 502
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A FGA + + D+ TH+VA GT K N A +V+P W+ A + +E+
Sbjct: 503 ARAFGAEVAQDLSDKTTHLVAIRPGTAKANAAKKNRNIKIVNPDWLWTCAERWEHVDERL 562
Query: 1230 FAI 1232
F +
Sbjct: 563 FPL 565
>gi|328792425|ref|XP_623605.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Apis mellifera]
Length = 745
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 87/171 (50%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+++++ + RKL L++DLD T++++ V P ++ H L+
Sbjct: 143 EDEQRLLNDRKLALLVDLDQTIVHTTN-DNVPPNMKDVY-----------HYQLYGPNSP 190
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
T+LRP FL S+L+E+H+ T G + YA +A +LD G LF+ R++SR + D
Sbjct: 191 WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P K+ +L+ + + V IIDD VW NL+ V+ Y +F
Sbjct: 251 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294
Score = 46.2 bits (108), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/123 (24%), Positives = 58/123 (47%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFSHQSLDD--VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ R+H F++ + +R+I+ + ++L G + FS + P + H ++
Sbjct: 438 ILRRIHTEFYATMEKEGGRKSLRDIIPRVRAQVLKGVHLTFSGLIPTHQK-LHQSRAYKV 496
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A FGA + + + TH+VA GT K N A +V+P W+ A + +E+
Sbjct: 497 ARAFGAEVAQDLSKKTTHLVAIRPGTAKANTAKKNPNIKIVNPDWLWTCAERWEHVDERL 556
Query: 1230 FAI 1232
F +
Sbjct: 557 FPL 559
>gi|238881126|gb|EEQ44764.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 525
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/205 (32%), Positives = 98/205 (47%), Gaps = 44/205 (21%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
+ A I+ T RL ++ RKL LV+DLD T++++ VDP
Sbjct: 63 EAAKIEHNTTDRLIDE------RKLILVVDLDQTVIHAT----VDPTVGEWQSDPANPNY 112
Query: 943 --VHDEILRKKEEQDREKPHRHLFRFPHMG-----MWTKLRPGIWTFLERASKLFEMHLY 995
V D EE+ P + P + + KLRPG+ FLE+ ++ +EMH+Y
Sbjct: 113 AAVKDVKTFCLEEEAIVPPG---WTGPKLAPTKCTYYVKLRPGLSEFLEKMAEKYEMHIY 169
Query: 996 TMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVV 1054
TM + YA +AK++DP G F R++SR + G K+L+ + + +S VV
Sbjct: 170 TMATRNYALSIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVV 221
Query: 1055 IIDDSVRVWPHNKLNLIVVERYTYF 1079
IIDD VW NLI V Y +F
Sbjct: 222 IIDDRGDVWQWES-NLIKVVPYDFF 245
>gi|380022133|ref|XP_003694908.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase-like [Apis florea]
Length = 749
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 87/171 (50%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+++++ + RKL L++DLD T++++ V P ++ H L+
Sbjct: 143 EDEQRLLNDRKLALLVDLDQTIVHTTN-DNVPPNMKDVY-----------HYQLYGPNSP 190
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
T+LRP FL S+L+E+H+ T G + YA +A +LD G LF+ R++SR + D
Sbjct: 191 WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P K+ +L+ + + V IIDD VW NL+ V+ Y +F
Sbjct: 251 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294
Score = 48.1 bits (113), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 30/124 (24%), Positives = 60/124 (48%), Gaps = 5/124 (4%)
Query: 1113 VIERLHKIFFS---HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQ 1169
++ R+H F++ + + +R+I+ + ++L G + FS + P + H ++
Sbjct: 440 ILRRIHTEFYATIEKEGGNRKSLRDIIPRVRAQVLKGVHLTFSGLIPTHQK-LHQSRAYK 498
Query: 1170 TAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQ 1228
A FGA + + ++ TH+VA GT K N A +V+P W+ A + +E+
Sbjct: 499 VARAFGAEVAQDLSEKTTHLVAIRPGTAKANTAKKNSNIKIVNPDWLWTCAERWEHVDER 558
Query: 1229 DFAI 1232
F +
Sbjct: 559 LFPL 562
>gi|322785368|gb|EFZ12041.1| hypothetical protein SINV_00693 [Solenopsis invicta]
Length = 759
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 89/177 (50%), Gaps = 32/177 (18%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH-RHLFRFPH 968
E+++++ RKL L++DLD T++++ D P+ + +F F
Sbjct: 149 EDEQRLLRDRKLVLLVDLDQTIVHTT------------------NDNIPPNLKDVFHFQL 190
Query: 969 MGM---W--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
G W T+LRP FL + S L+E+H+ T G ++YA +A +LD VLF+ R++S
Sbjct: 191 YGPNSPWYHTRLRPNTRRFLSKMSSLYELHICTFGARIYAHTVASLLDKDKVLFSHRILS 250
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
R + DP K+ +L+ + + V IIDD VW NL+ V+ Y +F
Sbjct: 251 RDECFDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 300
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 65/125 (52%), Gaps = 8/125 (6%)
Query: 1113 VIERLHKIFFSHQSLDDVD----VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLW 1168
++ER+H+ F+ ++LD + +R+I+ + ++L G + FS + P + H +
Sbjct: 450 ILERIHREFY--KTLDQENTRKSLRDIIPRVRSQVLKGVCLTFSGLIPTHQ-KLHQSRAY 506
Query: 1169 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 1227
+ A FGA T+ + ++ TH+VA GT K N A G+ +V+ W+ A + R E
Sbjct: 507 KVARAFGAEVTQELTEKTTHLVAIRKGTAKANAAKKHGKIKIVNSDWLWTCAERWERVEE 566
Query: 1228 QDFAI 1232
F +
Sbjct: 567 DLFQL 571
>gi|307168754|gb|EFN61749.1| RNA polymerase II subunit A C-terminal domain phosphatase [Camponotus
floridanus]
Length = 721
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 87/177 (49%), Gaps = 32/177 (18%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH-RHLFRFPH 968
E+++++ RKL L++DLD T++++ D P+ + +F F
Sbjct: 146 EDEQRLLKDRKLVLLVDLDQTIVHTT------------------NDNIPPNLKDVFHFQL 187
Query: 969 MG-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
G T+ RP FL S L+E+H+ T G ++YA +A +LD G+LF+ R++S
Sbjct: 188 YGPNSPWYHTRFRPNTRHFLSEMSHLYELHICTFGARIYAHTVASLLDKDGILFSHRILS 247
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
R + DP K+ +L+ + + V IIDD VW NL+ V+ Y +F
Sbjct: 248 RDECFDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 297
Score = 53.1 bits (126), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 62/123 (50%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ R+H F++ Q D +R+I+ + ++L G + FS + P + H +++
Sbjct: 448 ILRRIHTEFYAILDQGNDRKSLRDIIPRVRSQVLKGLCLTFSGLIPTHQ-KLHQSRVYKV 506
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A FGA T+ + ++ TH+VA GT K N A +V+P W+ A + +E+
Sbjct: 507 ARAFGAEITQDLTEKTTHLVAIRKGTAKANAARKDANIKIVNPEWLWTCAERWEHVDERL 566
Query: 1230 FAI 1232
F +
Sbjct: 567 FPL 569
>gi|297819962|ref|XP_002877864.1| hypothetical protein ARALYDRAFT_906616 [Arabidopsis lyrata subsp.
lyrata]
gi|297323702|gb|EFH54123.1| hypothetical protein ARALYDRAFT_906616 [Arabidopsis lyrata subsp.
lyrata]
Length = 284
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 110/242 (45%), Gaps = 32/242 (13%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG-MW 972
K ++L LVL L TL +S ++ + + E + R R FP+ G +
Sbjct: 54 KSLKEKRLTLVLGLHGTLYDSRLVSQLSDGENYL--TGEVKSRFDLRRSKKFFPNQGEVL 111
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
KLRP + FL A+KLF+M ++ + + E+ LDP G F R+I+ D
Sbjct: 112 FKLRPFVHEFLREANKLFQMTVFELCSPEQGEEVISFLDPHGTYFEKRIITNRD------ 165
Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDS-VRVWPHNKLNLIVVERYTYFPCSRRQFGL--- 1088
+ K+L+ VL E +VI+DD V WP + NL+ + Y +F + +
Sbjct: 166 -----SEMKNLDLVLADERGIVILDDKHVYWWPDDTTNLLQIAPYHFFKRNNNNTWITKL 220
Query: 1089 --LGPSLLEID------------HDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRN 1134
L ID DE +EDG L ++L +++ +HK FF + D DVR
Sbjct: 221 VNFFKKTLSIDDESDPKSYAEERRDEDAEDGGLENALELLKEVHKNFFDEEDEDSRDVRA 280
Query: 1135 IL 1136
+L
Sbjct: 281 LL 282
>gi|340709144|ref|XP_003393173.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase-like [Bombus terrestris]
Length = 751
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 20/171 (11%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+++++ + RKL L++DLD T++++ D + I ++ H L+
Sbjct: 143 EDEQRLLNDRKLALLVDLDQTIVHTTN----DNIPSNI--------KDVYHYQLYGPNSP 190
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
T+LRP FL S+L+E+H+ T G + YA +A +LD G LF+ R++SR + D
Sbjct: 191 WYHTRLRPNTKHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250
Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
P K+ +L+ + + V IIDD VW NL+ V+ Y +F
Sbjct: 251 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294
Score = 48.1 bits (113), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 59/123 (47%), Gaps = 4/123 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ R+H F++ + +R+I+ + ++L G + FS + P + H ++
Sbjct: 444 ILRRIHTEFYATIEKEGGRKSLRDIIPRVRAQVLKGVYLTFSGLIPTHQK-LHQSRAYKV 502
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A FGA + + D+ TH+VA GT K N A +V+P W+ A + +E+
Sbjct: 503 ARAFGAEVAQDLSDKTTHLVAIRPGTAKANAAKKNRNIKIVNPDWLWTCAERWEHVDERL 562
Query: 1230 FAI 1232
F +
Sbjct: 563 FPL 565
>gi|336387157|gb|EGO28302.1| hypothetical protein SERLADRAFT_354339 [Serpula lacrymans var.
lacrymans S7.9]
Length = 874
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/263 (26%), Positives = 107/263 (40%), Gaps = 82/263 (31%)
Query: 890 FEGYDDQQKAAIQKER-----TRRLEEQKK--------MFSARKLCLVLDLDHTLLNSA- 935
+ G+ D +A+IQ T LEE +K + ++RKL L++DLD T++++
Sbjct: 117 YTGFSDASRASIQMTHSAFGPTVSLEEAQKIEKETADHLLNSRKLSLIVDLDQTIVHATV 176
Query: 936 --------------------------KFHEVDPVHDEILRKKEEQDREKPH----RHLFR 965
+ E + V DE+ E D P+ + + +
Sbjct: 177 DPTVGEWIAEGEAWEGKRAMKMKPPQRSKEDEDVSDEVATDSESDDECNPNWEALKDVRK 236
Query: 966 F----PHMGM------------------------WTKLRPGIWTFLERASKLFEMHLYTM 997
F GM + K RPG FL + +EMH+YTM
Sbjct: 237 FQLGPESFGMPSSPRASRKVKGKQKFIENEGCMYYIKPRPGWQHFLHSIANKYEMHVYTM 296
Query: 998 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVII 1056
G + YA E+ +DP G +F GR++SR + G K L+ + + S VVII
Sbjct: 297 GTRAYAEEVCAAIDPDGTIFGGRILSRDESGS--------LTQKSLQRLFPCDTSMVVII 348
Query: 1057 DDSVRVWPHNKLNLIVVERYTYF 1079
DD VW + NL+ V Y +F
Sbjct: 349 DDRADVWEWSP-NLVKVIPYDFF 370
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 2/103 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DV I+ + +++ L G I+FS V P+ + P +W+ AE FGA C + ++THVV
Sbjct: 557 DVTKIIPSMRKETLDGIHILFSSVIPL-DTKPETTEIWKVAEMFGAQCCTELSSRITHVV 615
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
A GT KV+ A G +V W S L+ +E + +
Sbjct: 616 AAKHGTVKVDAARKRGGIKIVWLSWFTDSIALWSHQDETPYLL 658
>gi|328859642|gb|EGG08750.1| hypothetical protein MELLADRAFT_115868 [Melampsora larici-populina
98AG31]
Length = 736
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 83/176 (47%), Gaps = 44/176 (25%)
Query: 907 RRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
RRLE + ++ KL L++DLD T++++ VDP
Sbjct: 257 RRLESETRSRLLKDTKLSLIVDLDQTIVHAT----VDPT--------------------- 291
Query: 965 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
+G W PG+ FL ++ +EMH+YTMG + YA + +++DP LF RV+SR
Sbjct: 292 ----VGEWI---PGLSEFLRTLAEKYEMHVYTMGTRAYADAVCRIIDPTSELFGSRVLSR 344
Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ G K L + ++ S VVIIDD VW ++ NL+ V Y +F
Sbjct: 345 DESGS--------MTQKSLTRLFPVDTSMVVIIDDRGDVWEYSP-NLVSVVPYNFF 391
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 45/81 (55%), Gaps = 6/81 (7%)
Query: 1154 VFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHP 1212
V+P+ +A P W+ AEQFGA C + +VTH+VA +N AL+ + VV P
Sbjct: 525 VWPM-DAIPDQQDAWKLAEQFGAQCYTRLTPRVTHLVA----AKAINIALTRRKVSVVKP 579
Query: 1213 GWVEASALLYRRANEQDFAIK 1233
W+ + L+R NE+++ K
Sbjct: 580 KWLYDAVTLWRHPNEEEYLWK 600
>gi|294898997|ref|XP_002776453.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
gi|294900793|ref|XP_002777118.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
gi|239883444|gb|EER08269.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
gi|239884575|gb|EER08934.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
Length = 370
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 73/300 (24%), Positives = 126/300 (42%), Gaps = 52/300 (17%)
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP--KGVLFAGRVISR 1024
PH + KLRPG+ FLE ++E +++T ++Y + + LDP KG V SR
Sbjct: 29 PH---FVKLRPGVHQFLEALQPMYEFYIHTKATRVYLEYVMEALDPHKKGFFRNDNVFSR 85
Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSR 1083
DD + +KD+ V V+I+DD ++W + N+I Y Y
Sbjct: 86 CDDMKHGSNE-----NKDIRAVCSRPREEVIILDDKDKIWLDFQPNVIKCPPYKYMDQKL 140
Query: 1084 RQF------------------GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ 1125
Q G P E+D ++ DG L + + V +H + +
Sbjct: 141 LQVVRALKQTSDWIKEGGPESGYPKP---ELDDASKNFDGYLPAMVRVFTEIHHRYC--K 195
Query: 1126 SLDD-------VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVC 1178
+L D VDV+N++ +++ CRI+ + F EA+ + E +G
Sbjct: 196 ALVDAAGDGFVVDVKNVIDDTRKQTFKNCRIMLTG-FNQNEASERA----EMIEMYGGTV 250
Query: 1179 TKHIDDQVTHVVANSLGTDKVNWALSTGR------FVVHPGWVEASALLYRRANEQDFAI 1232
+++D+ TH+V GT K + + +VHP W++ ++ +E F I
Sbjct: 251 INNVEDEPTHLVCAKGGTAKCHLIAKAMKEDSLKCKIVHPCWLDHGFATWKHPSEFIFEI 310
>gi|323508124|emb|CBQ67995.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Sporisorium reilianum SRZ2]
Length = 773
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 95/209 (45%), Gaps = 43/209 (20%)
Query: 901 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK-------- 950
+ E +RL E + S RKL L++DLD T++++ VDP E +R
Sbjct: 46 VSAEEAQRLDSESTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMRDESNPNYDA 101
Query: 951 ----------------KEEQDREKPH---RHLFRFPHMGMWTKLRPGIWTFLERASKLFE 991
K++ D P L + K RPG+ L++ S+ ++
Sbjct: 102 LQSVGKFRLGIDGEEIKDDDDESAPRDSAAALRASRACWYYVKPRPGVPKVLKQLSEKYQ 161
Query: 992 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
+H+YTMG + YA + K++DP +F R++SR ++G K L + ++
Sbjct: 162 LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLSRLFPVDH 213
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
S VVIIDD VW + NL+ V Y +F
Sbjct: 214 SMVVIIDDREDVWSRSP-NLLPVLPYEFF 241
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 57/107 (53%), Gaps = 7/107 (6%)
Query: 1132 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVV 1190
V ++++A + +L GC IVFS + P G +P LW A +FGA + I+ V THVV
Sbjct: 505 VVDLISARKAHVLQGCTIVFSSMIPFGH-DPEKSDLWAMAREFGATPSSEIEAGVTTHVV 563
Query: 1191 ANSLGTDKVNWAL-----STGRFVVHPGWVEASALLYRRANEQDFAI 1232
A GT KV+ AL S G VV P W SA + R +E + I
Sbjct: 564 AARPGTAKVHQALRLAQKSAGLEVVWPSWFHVSASRWARQDEGLYRI 610
>gi|326437795|gb|EGD83365.1| hypothetical protein PTSG_03974 [Salpingoeca sp. ATCC 50818]
Length = 864
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 67/130 (51%), Gaps = 17/130 (13%)
Query: 1104 DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPH 1163
DG S+ ++ RLH+ F + IL G RIVF+ V P G+ + +
Sbjct: 617 DGVAPSTKAILPRLHRKFPHYPPW---------------ILEGVRIVFTGVIPRGQ-SAY 660
Query: 1164 LHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLY 1222
HP W+ A GAV +D++VTHVVA GTDKV A G VV+ W+EA A +
Sbjct: 661 THPAWRMAVNMGAVVVDQVDERVTHVVARVDGTDKVRQARKMGGVHVVYLKWLEACASQH 720
Query: 1223 RRANEQDFAI 1232
RR +E FA+
Sbjct: 721 RRVDEALFAV 730
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 61/113 (53%), Gaps = 14/113 (12%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---- 1026
+TK+RPG+ FLE ++E+H+YTMG + YA E+ ++DP F+ R++++ +
Sbjct: 31 YYTKIRPGVKEFLEAVKDMYELHVYTMGTRAYAKEICNIIDPGAHYFSTRILTQDESARI 90
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
D + + P+ D+ VVI+DD+ +W + NLI Y YF
Sbjct: 91 DTKSINLNHLFPRGDDM---------VVILDDTAAMWDF-RPNLIPAAPYDYF 133
>gi|207342073|gb|EDZ69950.1| YMR277Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 544
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 62/109 (56%), Gaps = 10/109 (9%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ K+RPG+ F + + LFEMH+YTM + YA ++AK++DP G LF R++SR ++G
Sbjct: 58 YVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTGELFGDRILSRDENGS-- 115
Query: 1032 DGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+K L + +S VV+IDD VW NLI V Y +F
Sbjct: 116 ------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP-NLIKVVPYNFF 157
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/144 (23%), Positives = 70/144 (48%), Gaps = 11/144 (7%)
Query: 1098 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 1151
HD++ +D L +G + +HK ++ S Q+ + ++ I+ + ++K+ C VF
Sbjct: 264 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 323
Query: 1152 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 1208
S + P+G + +W + FGA T ID TH++ + T K A +
Sbjct: 324 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 381
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
+VHP W+ + +++ +E+ + +
Sbjct: 382 IVHPDWIFECLVNWKKVDEKPYTL 405
>gi|401827003|ref|XP_003887594.1| TFIIF-interacting CTD phosphatase [Encephalitozoon hellem ATCC 50504]
gi|392998600|gb|AFM98613.1| TFIIF-interacting CTD phosphatase [Encephalitozoon hellem ATCC 50504]
Length = 408
Score = 77.0 bits (188), Expect = 5e-11, Method: Composition-based stats.
Identities = 98/379 (25%), Positives = 158/379 (41%), Gaps = 104/379 (27%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +I KE+ + LE Q K L LVLDLD T+L++ + D
Sbjct: 42 KEAVSIYKEKVKALEMQMK------LILVLDLDQTVLHTT-YGTSDC------------- 81
Query: 956 REKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+ + +F G KLRP + L R SKL+E+H+YTMG + YA + ++DP
Sbjct: 82 -----KGIVKFTMDGCKYSVKLRPHLNRMLRRVSKLYEIHVYTMGTRPYAERIIGIIDPA 136
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 1073
G F R+I+R D ++ V + +VI+DD VW +N+ NL++V
Sbjct: 137 GKYFHDRIITR-------DENQGVLVKRLSRLFPYNHKNIVILDDRADVWDYNE-NLVLV 188
Query: 1074 ERYTYF---------PCSRRQFGL---------------------------------LGP 1091
+ + YF RR G+ LG
Sbjct: 189 KPFWYFNRVDINDPSKLERRAEGVSDGHGDLGEFVGKRKKVEEGEDARIISRPDGTGLGG 248
Query: 1092 SL----LEIDHD---------ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAA 1138
S E+D D E S+D L + ++++HK +FS + +V+ IL
Sbjct: 249 SSGKPETEVDSDRMGEKKVREEPSDDCELLRMVKFLKKVHKKYFSSKHR---NVKKILRR 305
Query: 1139 EQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDK 1198
+++I G R F VG N L + E G + ++ ++ +V V+++ G D+
Sbjct: 306 IRKRIFDG-----DRFFIVGPTNRAW--LVKVIEMHGGMVSR-LESEVDFVISS--GLDE 355
Query: 1199 V-NWALSTGRFVVHPGWVE 1216
V A VV P W+E
Sbjct: 356 VRELAQKLECLVVSPKWIE 374
>gi|354545519|emb|CCE42247.1| hypothetical protein CPAR2_807960 [Candida parapsilosis]
Length = 786
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 93/202 (46%), Gaps = 38/202 (18%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
+ A I+ T RL ++KK L LV+DLD T++++ VDP
Sbjct: 148 EAAKIEHSTTDRLNDEKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 197
Query: 943 --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
V D EE P + W K+RPG+ FLE+ +EMH+YTM
Sbjct: 198 PAVKDVKTFCLEEDPIVPPGWTGPKLAPTKCWYYVKVRPGLSEFLEKMDTKYEMHIYTMA 257
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 1057
+ YA +AK++DP G F R++SR + G K+L+ + + +S VVIID
Sbjct: 258 TRNYALAIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 309
Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
D VW NLI V Y +F
Sbjct: 310 DRGDVWQWEN-NLIKVVPYDFF 330
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/147 (23%), Positives = 66/147 (44%), Gaps = 20/147 (13%)
Query: 1103 EDGTLASSLGVIERLHKIFFSHQSLDDV---------DVRNILAAEQRKILAGCRIVFSR 1153
+D L S + V++ +H+ +++ DD D+ I+ + + L G I+FS
Sbjct: 470 DDNELTSLMKVLDNVHEEYYN--LYDDTKAKLHSIKPDLTKIIPRMKSQCLQGVTILFSG 527
Query: 1154 VFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD-------KVNWALSTG 1206
+ P+G N + +QFG + +VTHVV + + +V L
Sbjct: 528 IIPLG-VNIDSADIVIWCKQFGVKVVNEVYPEVTHVVCRDVSPEAGPTFKARVAHKLYPD 586
Query: 1207 RF-VVHPGWVEASALLYRRANEQDFAI 1232
+V+P W+ + L+ + +E+D+ I
Sbjct: 587 SIKIVNPDWLFSCLSLWTKVDEEDYLI 613
>gi|242015474|ref|XP_002428378.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
corporis]
gi|212512990|gb|EEB15640.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
corporis]
Length = 781
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/172 (30%), Positives = 87/172 (50%), Gaps = 24/172 (13%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
++ ++ + RKL L++DLD TL+++ + P ++ H + M
Sbjct: 133 DENRLLNDRKLVLLVDLDQTLIHTTN-DNIPPNLKDVY-------------HFRLYGQMS 178
Query: 971 MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
W T++RP FLE SK +E+H+ T G + YA +A LDP G F+ R++SR
Sbjct: 179 PWYHTRIRPRTHKFLEEISKYYELHICTFGARNYAHMIAMFLDPDGKYFSHRILSR---D 235
Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F+ + K+ +L+ + ++ V IIDD VW NLI V+ Y +F
Sbjct: 236 ECFNAN---SKTANLKALFPCGDNMVCIIDDREDVWNF-AANLIHVKPYHFF 283
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 32/129 (24%), Positives = 59/129 (45%), Gaps = 18/129 (13%)
Query: 1113 VIERLHKIFFSHQSLDDV--------DVRNILAAEQRKILAGCRIVFSRV----FPVGEA 1160
+++ +HK ++ D++ ++ ++ L GC +VFS + P+ E+
Sbjct: 482 ILKNIHKAYYEEYDFQKKINSNCVIPDLKTVIPNFKKNTLKGCHLVFSGLVPSHIPLQES 541
Query: 1161 NPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS-TGRFVVHPGWVEASA 1219
+L A GA+ + I TH+VA GT KVN + G F+V P W+ A
Sbjct: 542 RAYL-----VAISLGAIVSADISSNCTHLVAARPGTAKVNSSRRHKGIFIVTPLWLWHCA 596
Query: 1220 LLYRRANEQ 1228
+ + +E+
Sbjct: 597 ERWEKVDEK 605
>gi|448520991|ref|XP_003868400.1| Fcp1 protein [Candida orthopsilosis Co 90-125]
gi|380352740|emb|CCG25496.1| Fcp1 protein [Candida orthopsilosis]
Length = 788
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 93/202 (46%), Gaps = 38/202 (18%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
+ A I+ T RL E++K L LV+DLD T++++ VDP
Sbjct: 148 EAAKIEHSTTDRLNEEEK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 197
Query: 943 --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
V D EE P + W K+RPG+ FL++ +EMH+YTM
Sbjct: 198 PAVKDVKTFCLEEDPIVPPGWTGPKLAPTKCWYYVKVRPGLSEFLQKMDTKYEMHIYTMA 257
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 1057
+ YA +AK++DP G F R++SR + G K+L+ + + +S VVIID
Sbjct: 258 TRNYALAIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 309
Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
D VW NLI V Y +F
Sbjct: 310 DRGDVWQWEN-NLIKVVPYDFF 330
>gi|313234471|emb|CBY24671.1| unnamed protein product [Oikopleura dioica]
Length = 614
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 90/181 (49%), Gaps = 31/181 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
EE +++ RKL L++DLD T++++ + + I + QD
Sbjct: 62 EEIQRLHDNRKLVLLVDLDQTVIHTTQNRPKKLTKNTISFQLTRQD-------------P 108
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV----------LDPKGVLFAG 1019
+WT+LRP F+ S+ +E+H+ T G++ YA ++A++ LD F+
Sbjct: 109 WLWTRLRPFCAKFIHEMSEKYELHIVTFGSRQYAHKIAEILEDQTRRQLNLDSNKSFFSH 168
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
R++SR + DPF KS +LE + +S IIDD VW ++ N I+V++Y +
Sbjct: 169 RILSRDECVDPF------HKSGNLEHLFPCGDSMCAIIDDRGDVWRYSP-NCILVKKYHF 221
Query: 1079 F 1079
F
Sbjct: 222 F 222
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 52/103 (50%), Gaps = 2/103 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DVRN+ ++ IL GC++VFS V P G H + A GAV + I TH++
Sbjct: 368 DVRNVCVGIRKNILKGCQLVFSGVVPNG-CRMEEHRAVKNARAMGAVIHERIQKNTTHLI 426
Query: 1191 ANSLGTDKVNWALSTGR-FVVHPGWVEASALLYRRANEQDFAI 1232
GT K N A FVV+P W+ + +RR +E DF I
Sbjct: 427 CARPGTAKHNEAKRKANVFVVNPAWLWVTYHEWRRQDESDFEI 469
>gi|170084539|ref|XP_001873493.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164651045|gb|EDR15285.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 845
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 60/109 (55%), Gaps = 10/109 (9%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ K RPG FL+ AS +EMH+YTMG + YA ++ +DP G LF GRV+SR + G
Sbjct: 261 YIKPRPGWKEFLQEASTKYEMHVYTMGTRAYAEQVCAAIDPDGKLFGGRVLSRDESGS-- 318
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L+ + + S VVIIDD VW + NL+ V Y +F
Sbjct: 319 ------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLLKVVPYDFF 360
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 2/103 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DV I+ + ++L G I+FS V P+ + P +W+ A FGA C+ + +THVV
Sbjct: 532 DVTRIIPRLRSEVLEGVHILFSSVIPL-DTKPETTEIWRMAHMFGARCSTELTSDITHVV 590
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
A GT KV+ A G +V W ++R +E+ + +
Sbjct: 591 AAKRGTVKVDMARKRGGIKIVWLAWFTDCIASWQRQDEKPYLL 633
>gi|156087501|ref|XP_001611157.1| protein phosphatase family protein [Babesia bovis]
gi|154798411|gb|EDO07589.1| protein phosphatase family protein [Babesia bovis]
Length = 806
Score = 75.9 bits (185), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 63/109 (57%), Gaps = 11/109 (10%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ KLRPG++ FL R+++L+E++L+TMG + +A K+LDP G F RV SR + + F
Sbjct: 339 YYKLRPGVYDFLRRSAELYELYLFTMGTRAHANAALKILDPDGKYFGARVFSRSETNNCF 398
Query: 1032 DGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
R+ PK ++ ++I+DDS +W + LI V Y +F
Sbjct: 399 KSLCRIFPKYRN---------HLLILDDSENIWL-DAPGLIKVYPYYFF 437
>gi|299756470|ref|XP_002912206.1| RNA polymerase II subunit A domain phosphatase [Coprinopsis cinerea
okayama7#130]
gi|298411691|gb|EFI28712.1| RNA polymerase II subunit A domain phosphatase [Coprinopsis cinerea
okayama7#130]
Length = 801
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 60/109 (55%), Gaps = 10/109 (9%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ K RPG FLE A+K +EMH+YTMG + YA E+ +DP G LF R++SR + G
Sbjct: 270 YIKPRPGWKEFLENAAKKYEMHVYTMGTRAYAQEVCAAIDPDGKLFGSRLLSRDESGS-- 327
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L+ + + S VVIIDD VW + NL+ V Y +F
Sbjct: 328 ------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLLKVIPYDFF 369
>gi|402582824|gb|EJW76769.1| hypothetical protein WUBG_12323 [Wuchereria bancrofti]
Length = 334
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 98/193 (50%), Gaps = 16/193 (8%)
Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYFP------CSRRQFG--LLGPSLLEIDHDERSED 1104
+V+IDD VW ++ LI V+ Y +F R + G +L S E D E +D
Sbjct: 18 IVMIDDRPDVWQYSDA-LIQVKPYRFFKEIGDINAPRYEKGEPILSSSYAEQDM-ESEDD 75
Query: 1105 GTLASSLGVIERLHKIFFSHQSLDDV----DVRNILAAEQRKILAGCRIVFSRVFPVGEA 1160
TL V+ ++H F+ V D++ I++ ++++L C IV S + PVG
Sbjct: 76 ETLEYVALVLTKVHSAFYELFDSAKVNRFPDLKGIISYLRKQVLRECSIVLSGIVPVG-V 134
Query: 1161 NPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR-FVVHPGWVEASA 1219
+ +++ QFGAV T ++++ THV+A GT KV+ A ++V+P W+
Sbjct: 135 DIKKTEVYRLCVQFGAVVTDNVNNSTTHVIAARWGTTKVHEAEKLRNIYIVNPKWLFTCV 194
Query: 1220 LLYRRANEQDFAI 1232
+ +A+E++F +
Sbjct: 195 ERWEKADEKEFEL 207
>gi|392597598|gb|EIW86920.1| hypothetical protein CONPUDRAFT_95946 [Coniophora puteana RWD-64-598
SS2]
Length = 830
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/259 (24%), Positives = 104/259 (40%), Gaps = 74/259 (28%)
Query: 886 VEHLFEGYDDQQKAAIQK-----------ERTRRLEEQ--KKMFSARKLCLVLDLDHTL- 931
+H + G+ + +A+IQ E +R+E + + + +RKL L++DLD T+
Sbjct: 113 TDHDYTGFSNASRASIQMTHSAFGPTVSLEEAQRIERETAEHLLKSRKLSLIVDLDQTIV 172
Query: 932 -----------LNSAKFHEVDPVHDEILRKKEE--------------------------- 953
+N K E + + R + +
Sbjct: 173 HATVDPTVGEWINEGKQWEQKHIQKQKARDERKDGSDSDGTASSDEDDCNPNWDALKDVK 232
Query: 954 ------------QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKL 1001
Q +++ + L + K RPG F + SK +EMH+YTMG +
Sbjct: 233 SFRLGPESFVMPQSQKRGKQKLIENDGCLYYVKPRPGWKEFFQELSKKYEMHVYTMGTRA 292
Query: 1002 YATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSV 1060
YA E+ +DP +F GR++SR + G K L+ + + S VVIIDD
Sbjct: 293 YAEEVCAAIDPDSKIFGGRILSRDESGS--------LTQKSLQRLFPCDTSMVVIIDDRA 344
Query: 1061 RVWPHNKLNLIVVERYTYF 1079
VW + NLI V Y +F
Sbjct: 345 DVWEWSP-NLIKVIPYDFF 362
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 70/151 (46%), Gaps = 19/151 (12%)
Query: 1101 RSEDGTLASSLGVIERLHKIFFS-----------------HQSLDDVDVRNILAAEQRKI 1143
+++D L G++E +H FFS +S DV +I+ + K+
Sbjct: 488 KNDDTELKRVGGLLEDIHARFFSLYDSRSTDVSSKKRRHGSKSSKSHDVTHIIPEIRAKV 547
Query: 1144 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 1203
G I+FS V P+ + P +W+ A FGA C + +THVVA LGT KV+ A
Sbjct: 548 FDGVHILFSSVIPL-DTPPETTEIWKVAHMFGAKCYTELSSSITHVVAARLGTVKVDAAR 606
Query: 1204 STGRF-VVHPGWVEASALLYRRANEQDFAIK 1233
G VV W S L+RR +E + ++
Sbjct: 607 RRGGIKVVWVAWFTDSVALWRRMDESHYYLE 637
>gi|209879341|ref|XP_002141111.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
gi|209556717|gb|EEA06762.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
Length = 590
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 148/365 (40%), Gaps = 62/365 (16%)
Query: 913 KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-- 967
K S KL +LDLD+TLL NS K + D I E P + F P
Sbjct: 161 KDHLSQNKLVAILDLDNTLLHAYNSTKVGCNINLEDFIGANGE------PEMYKFVLPQD 214
Query: 968 -HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
+ + KLRPG+ FL + + M + T + YA + VLDPK F R+++R
Sbjct: 215 MNTPYYLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPKRDKFGDRIVAR-- 272
Query: 1027 DGDPFDGDERVPKSKDLEGV-LGMES-AVVIIDDSVRVWPHN-KLNLIVVERYTYFPCSR 1083
+ DG + KD + + +G+++ A+V++DD VW + ++ ++ + Y YF R
Sbjct: 273 --ENVDGRD---TQKDFKKICIGIDTRAIVLLDDRSDVWDSSLEIQVVKAQTYEYFEHRR 327
Query: 1084 RQF----------------------GLLGPSLLEIDHDE-----RSEDGTLASSLGVIER 1116
+L +L + + D L + V +
Sbjct: 328 DALRAHYPPLTSNSASMGAAGSAPGDILSAALSSLSNASGGNTIADYDRHLDYLIKVFKE 387
Query: 1117 LHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGE--ANPHLHPLWQTAE-- 1172
LH FF S + V NIL + +ILA C I F+ E L W ++
Sbjct: 388 LHVRFFC--SPETASVGNILQTMKSEILANCVICFTGFLKADEKPTTKSLPSTWGDSQAE 445
Query: 1173 ------QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRA 1225
+ GA ++++ TH+V T K + A + +VH W+ A ++R
Sbjct: 446 AEAASIRLGAKVEENLESMATHLVVQKTNTAKFHQAKNNPDIKIVHTLWLWACDGQWQRV 505
Query: 1226 NEQDF 1230
E F
Sbjct: 506 PESLF 510
>gi|156381374|ref|XP_001632240.1| predicted protein [Nematostella vectensis]
gi|156219293|gb|EDO40177.1| predicted protein [Nematostella vectensis]
Length = 122
Score = 75.1 bits (183), Expect = 2e-10, Method: Composition-based stats.
Identities = 44/112 (39%), Positives = 64/112 (57%), Gaps = 10/112 (8%)
Query: 971 MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
MW TK RP FL++ +K +E+H++TMG ++YA +A++LDP LF R+ SR D
Sbjct: 1 MWYHTKFRPWAHKFLQKIAKFYELHIFTMGTRMYAHTIARMLDPDLSLFGYRIRSRDDCF 60
Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F K DL + +S V IIDD VW +N +LI V+ Y +F
Sbjct: 61 NAFS------KFNDLRSLFPCGDSMVCIIDDRADVW-NNAPSLIKVKPYQFF 105
>gi|390604450|gb|EIN13841.1| hypothetical protein PUNSTDRAFT_95201 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1229
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 60/109 (55%), Gaps = 10/109 (9%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ K RPG FL S+ +EMH+YTMG + YA E+ K +DP+G +F R++SR + G
Sbjct: 618 YIKPRPGWHEFLHTLSEKYEMHVYTMGTRAYAEEVCKAIDPEGQIFGNRILSRDESGS-- 675
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L+ + + S VVIIDD VW + NLI V Y +F
Sbjct: 676 ------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 717
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 76/160 (47%), Gaps = 28/160 (17%)
Query: 1101 RSEDGTLASSLGVIERLHKIFF-----SHQSLDDV---------------DVRNILAAEQ 1140
R++D L ++E++H F+ S + D++ DV+ I+ +
Sbjct: 855 RNDDVELQRVRRLLEQVHSTFYADYDASKKKTDNLSVPKRKQRTQETLPYDVKLIIPRLR 914
Query: 1141 RKILAGCRIVFSRVFPVGEAN----PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG- 1195
+ G +VFS V P+ N P +W+TA +FGA C K +++ VTHVVA G
Sbjct: 915 SNVFDGVHLVFSGVIPLESNNMPIDPEATEIWRTAIRFGARCYKEMNEAVTHVVATPEGV 974
Query: 1196 ---TDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
T K+N A + G VV W S L++R +E+ ++
Sbjct: 975 ERRTSKINTARARGIPVVSISWFHDSVALWQRLDERPYSF 1014
>gi|389751366|gb|EIM92439.1| hypothetical protein STEHIDRAFT_136328 [Stereum hirsutum FP-91666
SS1]
Length = 1075
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 58/110 (52%), Gaps = 10/110 (9%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
+ K RPG FL ++ +EMH+YTMG + YA E+ +DP G F GR++SR + G
Sbjct: 306 YYVKPRPGTREFLSSVAEKYEMHVYTMGTRAYAEEVCAAIDPDGKFFGGRILSRDESGS- 364
Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L + ++ S VVIIDD VW + NLI V Y +F
Sbjct: 365 -------MTQKSLRRLFPVDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 406
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 2/101 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
D I+ + L G I+FS V P+ + P +W+ A FGA C + ++THVV
Sbjct: 659 DATAIIPQLRFNTLFGVHILFSSVIPL-DTRPETTEVWRLAHAFGAKCYTELSSKITHVV 717
Query: 1191 ANSLGTDKVNWALSTGR-FVVHPGWVEASALLYRRANEQDF 1230
A GT KV+ A G +V P W S + R +E +
Sbjct: 718 AAKRGTVKVDQARKRGNILIVWPAWFTDSIAKWERQDETPY 758
>gi|170036997|ref|XP_001846347.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
quinquefasciatus]
gi|167879975|gb|EDS43358.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
quinquefasciatus]
Length = 764
Score = 73.6 bits (179), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 87/170 (51%), Gaps = 20/170 (11%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
+ +++ RKL L++DLD TL+++ D V + + ++ H L+
Sbjct: 135 DTERLLRDRKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGPNSPW 182
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
T+LRPG FL + +E+H+ T G + YA +A+ LD KG F+ R++SR +
Sbjct: 183 YHTRLRPGALQFLAKMDPFYELHICTFGARNYAHMIAQFLDEKGRYFSHRILSR---DEC 239
Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 240 FNA---TSKTDNLKALFPCGDSMVCIIDDREDVW-NMAANLIQVKPYHFF 285
>gi|118784887|ref|XP_314000.3| AGAP005119-PA [Anopheles gambiae str. PEST]
gi|116128258|gb|EAA09414.3| AGAP005119-PA [Anopheles gambiae str. PEST]
Length = 822
Score = 73.6 bits (179), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 87/170 (51%), Gaps = 20/170 (11%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
+ +++ S RKL L++DLD TL+++ D V + + ++ H L+
Sbjct: 137 DTERLLSDRKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGPNSPW 184
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
T+LRPG FL + +E+H+ T G + YA +A+ LD G F+ R++SR +
Sbjct: 185 YHTRLRPGALEFLAKMHPYYELHICTFGARNYAHMIAQFLDKDGNFFSHRILSR---DEC 241
Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
F+ K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 242 FNA---TSKTDNLKALFPCGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 287
Score = 48.1 bits (113), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 29/124 (23%), Positives = 61/124 (49%), Gaps = 4/124 (3%)
Query: 1113 VIERLHKIFFSH--QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ ++H+ F+ +S D++ ++ + ++L G ++ FS + P +
Sbjct: 496 ILLKIHETFYEEYDKSKTISDLKQLIPQVKARVLVGAKLCFSGLIP-NNVKLEQSKAYLI 554
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A GA T++++ TH+VA ++GT KVN A + +V P W+ + A + E
Sbjct: 555 ARSLGAAVTQNLEPTTTHLVAVTIGTSKVNNARKNPKIKIVTPEWLWSCAERWEHVEELL 614
Query: 1230 FAIK 1233
+ +K
Sbjct: 615 YPLK 618
>gi|323453463|gb|EGB09334.1| putative formate/nitrite transporter [Aureococcus anophagefferens]
Length = 1144
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 120/268 (44%), Gaps = 53/268 (19%)
Query: 897 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 956
+ + + +R+R+LE + R+L LVLDLDHTLL + DP + + R
Sbjct: 331 EASVLAAQRSRQLEGK------RQLQLVLDLDHTLLECS----TDPRAAALAAAPGSRVR 380
Query: 957 E------KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 1010
+PH W +LRP + F + L+E+ +YT G++ YA + L
Sbjct: 381 ALGAVAGRPH-----------WVRLRPRLEEFFAAVAPLYELAIYTHGSRQYAEAVRAAL 429
Query: 1011 DPK--GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNK 1067
+ + G+ F GRV+SR D P D R KS LE + G + +I+DD + VW +
Sbjct: 430 EAEVPGLSFGGRVVSR--DCCP---DLRGEKS--LERLFPGGAARALILDDRLDVWTRGE 482
Query: 1068 ---LNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH 1124
++VV+ YTYF G +L + H + D L+ S + H F++
Sbjct: 483 DQTPRVLVVQPYTYF----------GKALADPAHAD--GDSQLSHSARALVAAHAAFYAG 530
Query: 1125 QSLDDVDVRNILAAEQRKILAGCRIVFS 1152
D L A + + AGC FS
Sbjct: 531 GGASG-DAVACLDAARGAVFAGCVFSFS 557
>gi|403416935|emb|CCM03635.1| predicted protein [Fibroporia radiculosa]
Length = 580
Score = 73.2 bits (178), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 57/105 (54%), Gaps = 2/105 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DVR I+ + LAG IVFS V P+ + P +W+TA FGA C + ++VTHVV
Sbjct: 268 DVRIIIPRMRMDTLAGVHIVFSSVIPL-DTRPEATEIWRTAHAFGAKCYTELSNRVTHVV 326
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 1234
A GT KV+ A +G +V W S L++R +E + + P
Sbjct: 327 AAKRGTQKVDAARRSGGIKIVWLSWFTDSVALWKRQDETPYLMDP 371
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 10/84 (11%)
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVI 1055
MG + YA E+ +DP+G F GR++SR + G K L+ + +S VVI
Sbjct: 1 MGTRAYAEEVCAAIDPEGKFFGGRLLSRDESGS--------LTQKSLQRLFPTDQSMVVI 52
Query: 1056 IDDSVRVWPHNKLNLIVVERYTYF 1079
IDD VW + NL+ V Y +F
Sbjct: 53 IDDRADVWEWSP-NLVKVIPYDFF 75
>gi|449551315|gb|EMD42279.1| hypothetical protein CERSUDRAFT_148004 [Ceriporiopsis subvermispora
B]
Length = 875
Score = 73.2 bits (178), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 59/109 (54%), Gaps = 10/109 (9%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ K RPG FL+ + +EMH+YTMG + YA E+ +DP G +F GR++SR + G
Sbjct: 264 YIKPRPGWQDFLQDMATKYEMHVYTMGTRAYAEEVCATIDPDGKIFGGRLLSRDESGS-- 321
Query: 1032 DGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L+ + +S VVIIDD VW + NL+ V Y +F
Sbjct: 322 ------LTQKSLQRLFPCDQSMVVIIDDRADVWEWSP-NLVKVIPYDFF 363
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 57/105 (54%), Gaps = 2/105 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DVR I+ + + K L G I+FS V P+ + P + +W+TA FGA C + ++THVV
Sbjct: 555 DVRTIIPSIRMKALEGVHILFSSVIPL-DTRPEVTEVWRTAHAFGAQCHTELSSRITHVV 613
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 1234
A GT KV+ A G +V W S L++R +E + + P
Sbjct: 614 AAKRGTVKVDAARKQGGIKIVWLSWFTDSIALWQRQDETPYLMDP 658
>gi|294868642|ref|XP_002765622.1| hypothetical protein Pmar_PMAR013688 [Perkinsus marinus ATCC 50983]
gi|239865701|gb|EEQ98339.1| hypothetical protein Pmar_PMAR013688 [Perkinsus marinus ATCC 50983]
Length = 956
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 84/186 (45%), Gaps = 22/186 (11%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAK------FHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
+ + S+++L VLD+DHT+L+ F +V + R D EK ++
Sbjct: 488 RTLASSKRLVAVLDIDHTILHVTNKRIDLLFPDVTCYNLAPNRDTGRLDEEKVYQFFIGT 547
Query: 967 ---PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG--RV 1021
+ KLRPG +TFLE L+E++LYT G + YA + K LDP F R+
Sbjct: 548 SPTTTACCYLKLRPGFYTFLEEILPLYELYLYTHGTREYAIRLLKALDPSARYFGSPPRL 607
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVW--PHNKLNLIVVERYTY 1078
I+R + K L + VI+DD VW N+ +LI V Y +
Sbjct: 608 IAR--------PTQSALTCKTLSRIFPSNHRLAVIVDDRDDVWEAKDNEHSLIKVTPYVF 659
Query: 1079 FPCSRR 1084
FP S R
Sbjct: 660 FPDSER 665
>gi|221508436|gb|EEE34023.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii VEG]
Length = 1228
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 11/107 (10%)
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
KLRPG FL R S+ FE+++YTMG L+A ++LDPK F RV SR D +
Sbjct: 626 KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAVNGLKA 685
Query: 1034 DERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
ER+ P + + V+++DD +W ++ I V+ Y YF
Sbjct: 686 IERIFPHDQKM---------VLVVDDLECMWSYSPC-CIKVQGYHYF 722
>gi|221486680|gb|EEE24941.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii GT1]
Length = 1234
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 102/232 (43%), Gaps = 25/232 (10%)
Query: 860 DDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMF--- 916
DD G EA PV + A V + D K A + ++ RR +E++ +
Sbjct: 510 DDAGLGASRDEEAKPVEREAREALQTVGDMDAERDVDGKEADRTKQDRRGKEEETEYPHY 569
Query: 917 --SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------LFRFPH 968
A+ + L+ +++ E PV R+ E R RH L H
Sbjct: 570 REDAQGEAIRHTLEKSVIGVRVAVEASPVG--ASRRAREGWRRPASRHEAGALPLPNEQH 627
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ + KLRPG FL R S+ FE+++YTMG L+A ++LDPK F RV SR D
Sbjct: 628 ITFF-KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAV 686
Query: 1029 DPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ ER+ P + + V+++DD +W ++ I V+ Y YF
Sbjct: 687 NGLKAIERIFPHDQKM---------VLVVDDLECMWSYSPC-CIKVQGYHYF 728
>gi|312373985|gb|EFR21645.1| hypothetical protein AND_16677 [Anopheles darlingi]
Length = 857
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/168 (30%), Positives = 86/168 (51%), Gaps = 20/168 (11%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
+++ + RKL L++DLD TL+++ D V + + ++ H L+
Sbjct: 150 ERLLNDRKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGPNSPWYH 197
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
T+LRPG FL + +E+H+ T G + YA +A+ LD G F+ R++SR + F+
Sbjct: 198 TRLRPGALEFLAKMHPYYELHICTFGARNYAHMIAQFLDKDGRFFSHRILSR---DECFN 254
Query: 1033 GDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K+ +L+ + +S V IIDD VW + NLI V+ Y +F
Sbjct: 255 A---TSKTDNLKALFPCGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 298
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 62/124 (50%), Gaps = 4/124 (3%)
Query: 1113 VIERLHKIFFS-HQSLDDV-DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ ++H++F+ + S + D++ ++ + K+L R+ FS + P +Q
Sbjct: 538 ILLKIHEMFYKKYDSTKTISDLKQLIPKVKAKVLVNARLCFSGLIP-NNVKLEQSKAYQI 596
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A GA+ T++++ TH+VA ++GT KV A + +V P W+ A + E
Sbjct: 597 ARSLGAIVTQNLEPTTTHLVAVTIGTSKVINARKNPKIKIVTPQWLWTCAERWEHVEELL 656
Query: 1230 FAIK 1233
F +K
Sbjct: 657 FPLK 660
>gi|440493707|gb|ELQ76143.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
[Trachipleistophora hominis]
Length = 466
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 63/128 (49%), Gaps = 12/128 (9%)
Query: 950 KKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 1009
++ D P + M T LRP + FL ASKLF MH+YTMG Y ++ V
Sbjct: 161 QRTSTDNSFPSSFTYTLSSTTMHTTLRPHLHQFLTEASKLFHMHIYTMGTAEYVHQITNV 220
Query: 1010 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKL 1068
+D G+ F R+++R D+ + K LE + G + VVI+DD VW +
Sbjct: 221 IDKDGMFFGDRIVTRDDE----------MQVKRLERLFGDKVDMVVIVDDRGDVWEYCG- 269
Query: 1069 NLIVVERY 1076
NL++V +
Sbjct: 270 NLVMVRPF 277
>gi|393218252|gb|EJD03740.1| hypothetical protein FOMMEDRAFT_105888 [Fomitiporia mediterranea
MF3/22]
Length = 921
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 57/109 (52%), Gaps = 10/109 (9%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ K RPG FL + +EMH+YTMG + YA ++ +DP G LF GR++SR + G
Sbjct: 273 YVKPRPGWKEFLSSVASRYEMHVYTMGTRAYAEKVCAAIDPDGRLFGGRILSRDESGS-- 330
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L + + S VVIIDD VW + NLI V Y +F
Sbjct: 331 ------LTQKSLRRLFPCDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 372
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 52/104 (50%), Gaps = 2/104 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPH-LHPLWQTAEQFGAVCTKHIDDQVTHV 1189
DV+ I+ +++ I+FS V P H +W+ A FGA C + +D +VTHV
Sbjct: 567 DVKFIIPNIRKETFKDVHILFSGVIPTNIRMDHEATEIWRMARAFGATCHRDLDKEVTHV 626
Query: 1190 VANSLGTDKVNWALST-GRFVVHPGWVEASALLYRRANEQDFAI 1232
V + GT KV A S FVV W S +RR +E+ + +
Sbjct: 627 VTSKRGTQKVEKARSQPNIFVVWLQWFTDSVAQWRRQDERRYLL 670
>gi|237834315|ref|XP_002366455.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
gi|211964119|gb|EEA99314.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
Length = 1225
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 11/107 (10%)
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
KLRPG FL R S+ FE+++YTMG L+A ++LDPK F RV SR D +
Sbjct: 623 KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAVNGLKA 682
Query: 1034 DERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
ER+ P + + V+++DD +W ++ I V+ Y YF
Sbjct: 683 IERIFPHDQKM---------VLVVDDLECMWRYSPC-CIKVQGYHYF 719
>gi|392570766|gb|EIW63938.1| hypothetical protein TRAVEDRAFT_111329 [Trametes versicolor FP-101664
SS1]
Length = 900
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 59/110 (53%), Gaps = 10/110 (9%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
+ K RPG+ FLE + +EMH+YTMG + YA E+ +DP G +F R++SR + G
Sbjct: 262 YYIKPRPGLPEFLETMATKYEMHVYTMGTRAYAEEVCAAIDPGGKIFGNRILSRDESGS- 320
Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L+ + +S VVIIDD VW + NL+ V Y +F
Sbjct: 321 -------LTQKSLQRLFPCDQSMVVIIDDRADVWEWSP-NLVKVIPYDFF 362
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 73/156 (46%), Gaps = 23/156 (14%)
Query: 1101 RSEDGTLASSLGVIERLHKIFFS----HQSLDD-----------------VDVRNILAAE 1139
+++D L ++E +H F+S ++S D+ DVR I+
Sbjct: 519 KNDDAELTRVKQILELVHDRFYSAWERYKSEDEGKHGKRKHGKRTEAQREYDVRTIIPRI 578
Query: 1140 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 1199
+ + L GC I+FS V P+ + P +W+TA FGA C + ++THVVA GT KV
Sbjct: 579 RTETLDGCHILFSSVIPL-DTRPEATEIWKTAHAFGAKCYTELSPRITHVVAAKRGTQKV 637
Query: 1200 NWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 1234
+ A G +V W S L+ R +E + + P
Sbjct: 638 DAARRRGGIKIVWLAWFTDSVALWHRQDEGPYLLDP 673
>gi|388858248|emb|CCF48177.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Ustilago hordei]
Length = 774
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 95/209 (45%), Gaps = 43/209 (20%)
Query: 901 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+ E +RL E + S RKL L++DLD T++++ VDP E ++ + + E
Sbjct: 47 VSAEEAQRLDSETTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMKDESNPNYEA 102
Query: 959 PHR-HLFRFPHMG--------------------------MWTKLRPGIWTFLERASKLFE 991
FR G + K RPG+ +++ S+ ++
Sbjct: 103 LKSVGKFRLGIDGEEIKDDDDDSAPKDSAAALKASRACWYYVKPRPGVPEIVKKLSEKYQ 162
Query: 992 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
+H+YTMG + YA + K++DP +F R++SR ++G K L + ++
Sbjct: 163 LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLNRLFPVDH 214
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
S VVIIDD VW + NL+ V Y +F
Sbjct: 215 SMVVIIDDREDVWSRSP-NLLPVVPYEFF 242
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 54/101 (53%), Gaps = 6/101 (5%)
Query: 1132 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVV 1190
V ++++A++ K+LAGC IVFS + P G NP LW A +FGA +++ V THVV
Sbjct: 497 VVDLISAQKTKVLAGCTIVFSSMIPTGH-NPETSDLWALAREFGATPAFEVEEGVTTHVV 555
Query: 1191 ANSLGTDKVNWALSTGRF----VVHPGWVEASALLYRRANE 1227
A GT KV A R +V P W S + R +E
Sbjct: 556 AARQGTLKVVNATRLARAGKVELVWPSWFHVSTSRWERQDE 596
>gi|294935258|ref|XP_002781353.1| hypothetical protein Pmar_PMAR020737 [Perkinsus marinus ATCC 50983]
gi|239891934|gb|EER13148.1| hypothetical protein Pmar_PMAR020737 [Perkinsus marinus ATCC 50983]
Length = 979
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 84/186 (45%), Gaps = 22/186 (11%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAK------FHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
+ + ++++L VLD+DHT+L+ F +V + R D EK ++
Sbjct: 511 RTLAASKRLVAVLDIDHTILHVTNKRIDLLFPDVTCYNLAPNRDTGRLDEEKVYQFFIGT 570
Query: 967 ---PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG--RV 1021
+ KLRPG +TFLE L+E++LYT G + YA + K LDP F R+
Sbjct: 571 SPTTTACCYLKLRPGFYTFLEEILPLYELYLYTHGTREYAIRLLKALDPSARYFGSPPRL 630
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVW--PHNKLNLIVVERYTY 1078
I+R + K L + VI+DD VW N+ +LI V Y +
Sbjct: 631 IAR--------PTQSALTCKTLSRIFPSNHRLAVIVDDRDDVWEAKDNEHSLIKVTPYVF 682
Query: 1079 FPCSRR 1084
FP S R
Sbjct: 683 FPDSER 688
>gi|401409326|ref|XP_003884111.1| hypothetical protein NCLIV_045130 [Neospora caninum Liverpool]
gi|325118529|emb|CBZ54080.1| hypothetical protein NCLIV_045130 [Neospora caninum Liverpool]
Length = 1185
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 56/107 (52%), Gaps = 11/107 (10%)
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
KLRPG FL R S+ FE+++YTMG L+A ++LDP F RV SR D +
Sbjct: 649 KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPGRRFFGRRVFSRQDAVNGLKA 708
Query: 1034 DERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
ER+ P + + V+++DD +W +N I V+ Y YF
Sbjct: 709 IERIFPHDRKM---------VLVVDDLDCMWSYNPC-CIKVQGYHYF 745
>gi|395334832|gb|EJF67208.1| hypothetical protein DICSQDRAFT_142769 [Dichomitus squalens LYAD-421
SS1]
Length = 953
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 59/109 (54%), Gaps = 10/109 (9%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ K RPG+ FL+ + +EMH+YTMG + YA E+ +DP G +F R++SR + G
Sbjct: 287 YIKPRPGLLDFLQTMATKYEMHVYTMGTRAYAEEVCAAIDPGGKIFGNRILSRDESGS-- 344
Query: 1032 DGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L+ + +S VVIIDD VW + NL+ V Y +F
Sbjct: 345 ------LTQKSLQRLFPCDQSMVVIIDDRADVWEWSP-NLVKVIPYDFF 386
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 57/112 (50%), Gaps = 2/112 (1%)
Query: 1124 HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID 1183
H++ + DVR I+ + LAG I+F+ V P+ + P +W+TA FGA C +
Sbjct: 598 HEAPVEHDVRTIIPRIRMDTLAGVHILFTGVIPLNQ-RPETAEIWKTATAFGAQCHTDLG 656
Query: 1184 DQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 1234
+THVV N T KV+ A +V W+ S L++R +E + + P
Sbjct: 657 KHITHVVTNKDNTQKVDAARRYADVRIVWLNWLTDSLALWQRQDETPYLLHP 708
>gi|402467220|gb|EJW02558.1| FCP1-like phosphatase, phosphatase domain-containing protein
[Edhazardia aedis USNM 41457]
Length = 905
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 78/166 (46%), Gaps = 28/166 (16%)
Query: 934 SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKL---F 990
S KF + D V D ++ K + F + LRP FLE+ L +
Sbjct: 209 SDKFTKNDIVTD--------KNENKTKIYTFMLNKHKYYIALRP----FLEKLLSLDEKY 256
Query: 991 EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 1050
EMH+YTMGN YA ++ K++DP G +F R+I+R ++ K L+
Sbjct: 257 EMHIYTMGNNQYAQKVKKIIDPTGTIFGNRIITRDENNQEL--------FKSLDRFSTNH 308
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEI 1096
+V+IDD + VW + +N++ V + +F R + PS+L I
Sbjct: 309 DNIVVIDDRIDVWNFS-VNVVGVRPFWFF----RDGDINDPSVLRI 349
>gi|303389951|ref|XP_003073207.1| Fcp1-like phosphatase [Encephalitozoon intestinalis ATCC 50506]
gi|303302352|gb|ADM11847.1| Fcp1-like phosphatase [Encephalitozoon intestinalis ATCC 50506]
Length = 407
Score = 72.0 bits (175), Expect = 2e-09, Method: Composition-based stats.
Identities = 56/186 (30%), Positives = 89/186 (47%), Gaps = 35/186 (18%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +I KE+ + LE Q K L LVLDLD T+L++ + +H +
Sbjct: 42 KEAVSIYKEKMKTLETQMK------LILVLDLDQTILHTT--YGESRIHGTV-------- 85
Query: 956 REKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
RF G KLRP + L + S+L+E+H+YTMG + YA + ++DP
Sbjct: 86 ---------RFIMDGSKYCVKLRPNLDHMLRKISRLYEIHVYTMGTRAYAERIVGIVDPS 136
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 1073
G F R+I+R D +E V + +VI+DD VW +++ NL++V
Sbjct: 137 GKYFQDRIITR-------DENEGVLVKRLSRLFPHNHKNIVILDDRPDVWDYSE-NLLLV 188
Query: 1074 ERYTYF 1079
+ YF
Sbjct: 189 RPFWYF 194
>gi|393225696|gb|EJD33619.1| HAD-like protein, partial [Auricularia delicata TFB-10046 SS5]
Length = 155
Score = 71.6 bits (174), Expect = 2e-09, Method: Composition-based stats.
Identities = 52/171 (30%), Positives = 86/171 (50%), Gaps = 25/171 (14%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ-DREKPHRHLFRFPHM 969
E++++ RKL LV+DLD+T+ VH ++R +E+ R + H H
Sbjct: 3 ERERLLGCRKLSLVVDLDNTI-----------VHTIVVRTDDERMARMQDHNH----GST 47
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD-DG 1028
RPG+ FL+ S+ +E +YTMG + YA ++ +D +F GR+ SR + +G
Sbjct: 48 TFTGSCRPGLRAFLQTISEKYEPTVYTMGTRGYAEKVCAAVDGDERVFGGRIFSRDENEG 107
Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ R+ D +S IIDDS +VW +K N++ V+ Y +F
Sbjct: 108 NSTKSLSRLFPPCD-------KSMTAIIDDSRKVW-EDKKNIVSVQPYVFF 150
>gi|396081720|gb|AFN83335.1| Fcp1-like phosphatase [Encephalitozoon romaleae SJ-2008]
Length = 408
Score = 71.6 bits (174), Expect = 2e-09, Method: Composition-based stats.
Identities = 53/186 (28%), Positives = 88/186 (47%), Gaps = 35/186 (18%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +I KE+ + LE Q K L LVLDLD T+L++A
Sbjct: 42 KEAVSIYKEKIKTLEMQMK------LILVLDLDQTVLHTAY------------------- 76
Query: 956 REKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+ + RF G KLRP + L + S+L+E+H+YTMG + YA + +++DP
Sbjct: 77 GASSEKGIVRFTMDGCKYSVKLRPNLKRMLRKVSRLYEIHVYTMGTRPYAERIVRIIDPT 136
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 1073
F R+I+R D ++ V + +VI+DD VW + + NL+++
Sbjct: 137 RKYFHDRIITR-------DENQGVLVKRLSRLFPYNHKNIVILDDRADVWDYCE-NLVLI 188
Query: 1074 ERYTYF 1079
+ + YF
Sbjct: 189 KPFWYF 194
>gi|409051930|gb|EKM61406.1| hypothetical protein PHACADRAFT_204575 [Phanerochaete carnosa
HHB-10118-sp]
Length = 863
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 60/110 (54%), Gaps = 10/110 (9%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
+ K RPG FLE ++ +EMH+YTMG + YA E+ +DP G +F GR++SR + G
Sbjct: 257 YYIKPRPGWNEFLEDMAEKYEMHVYTMGTRAYAEEVCAAIDPDGKIFGGRLLSRDESGS- 315
Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L+ + +S VV+IDD VW + NL+ V + +F
Sbjct: 316 -------LTQKSLQRLFPCDQSMVVVIDDRADVWEWSP-NLVKVIPFEFF 357
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 2/105 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DVR I+ ++ LAGC +VFS V P+ + W+ A FGA C ++ ++TH++
Sbjct: 551 DVRLIIPLMRQNALAGCHVVFSSVIPL-DTRAETSETWRIAVMFGAKCYTELNPRITHLI 609
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 1234
A GT KV+ A G +V W S +RR +E + I P
Sbjct: 610 AAKRGTAKVDAARRQGGVKIVWVNWFLDSINQWRRQDETPYLIDP 654
>gi|19074511|ref|NP_586017.1| similarity to HYPOTHETICAL TRANSMEMBRANE PROTEINS YHG4_yeast
[Encephalitozoon cuniculi GB-M1]
gi|51701436|sp|Q8SV03.1|FCP1_ENCCU RecName: Full=RNA polymerase II subunit A C-terminal domain
phosphatase; AltName: Full=CTD phosphatase FCP1
gi|19069153|emb|CAD25621.1| similarity to HYPOTHETICAL TRANSMEMBRANE PROTEINS YHG4_yeast
[Encephalitozoon cuniculi GB-M1]
gi|449329538|gb|AGE95809.1| hypothetical protein ECU07_0890 [Encephalitozoon cuniculi]
Length = 411
Score = 71.2 bits (173), Expect = 3e-09, Method: Composition-based stats.
Identities = 56/184 (30%), Positives = 88/184 (47%), Gaps = 31/184 (16%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI KE+ LE Q K L LVLDLD T+L++ + + + K D
Sbjct: 42 EEAVAIHKEKMEALEMQMK------LILVLDLDQTVLHTT--YGTSSLEGTV---KFVID 90
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
R + KLRP + L R SKL+E+H+YTMG + YA + +++DP G
Sbjct: 91 RCR------------YCVKLRPNLDYMLRRISKLYEIHVYTMGTRAYAERIVEIIDPSGK 138
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVER 1075
F R+I+R D ++ V + +VI+DD VW + + NL+++
Sbjct: 139 YFDDRIITR-------DENQGVLVKRLSRLFPHDHRNIVILDDRPDVWDYCE-NLVLIRP 190
Query: 1076 YTYF 1079
+ YF
Sbjct: 191 FWYF 194
>gi|402584910|gb|EJW78851.1| hypothetical protein WUBG_10241, partial [Wuchereria bancrofti]
Length = 278
Score = 70.9 bits (172), Expect = 5e-09, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 89/172 (51%), Gaps = 29/172 (16%)
Query: 901 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+ E R++ +++ + ARKL L++DLD TL+++ H K E+D +
Sbjct: 125 VSDELARKIGSRDRELLLKARKLVLLVDLDQTLIHTTN-HTF----------KLEKDTDV 173
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
H ++ +TK+RP FL R + L+EMH+ + G + YA +A+ LDP+ + F
Sbjct: 174 LH---YKLKGTDFYTKIRPHAREFLRRMAGLYEMHIISYGERQYAHRIAEFLDPEKIYFG 230
Query: 1019 GRVISRGDDGDPFDGDE---RVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHN 1066
R++SR DE + K+++++ + + +V+IDD VW ++
Sbjct: 231 HRILSR---------DELFCAMYKTRNMQALFPCGDHMIVMIDDRPDVWQYS 273
>gi|302698337|ref|XP_003038847.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
gi|300112544|gb|EFJ03945.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
Length = 1207
Score = 70.5 bits (171), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 2/105 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
DV+ I+ + + + GC I FS V P+ + P H W+ A FGA C + +VTHVV
Sbjct: 880 DVKRIIPSIRASVFQGCHICFSSVIPL-DIQPESHECWRIANMFGARCHATLAPEVTHVV 938
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 1234
A GT KV+ A G VV P W + S +++R E + + P
Sbjct: 939 AGKQGTAKVDEARRRGNIKVVTPMWFKDSVNMWQRQPESRYLLDP 983
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 59/124 (47%), Gaps = 13/124 (10%)
Query: 961 RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
RH+ + K RPG F+ S +EMH+YTMG + YA + VLDP G LF R
Sbjct: 608 RHIALDEGCVYYIKPRPGWQEFMNNMSAKYEMHVYTMGTRAYAMAVCNVLDPDGRLFGER 667
Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHN----KLNLIVVER 1075
++SR + G K L+ + +S VVIIDD VW NLI V
Sbjct: 668 ILSRDESGS--------LTQKSLDRLFPTDQSMVVIIDDRADVWSGGLQFWSPNLIKVVP 719
Query: 1076 YTYF 1079
Y +F
Sbjct: 720 YDFF 723
>gi|157109625|ref|XP_001650754.1| RNA polymerase ii ctd phosphatase [Aedes aegypti]
gi|108868428|gb|EAT32653.1| AAEL015142-PA, partial [Aedes aegypti]
Length = 569
Score = 70.5 bits (171), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 86/170 (50%), Gaps = 20/170 (11%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
+ +++ +KL L++DLD TL+++ D V + + ++ H L+
Sbjct: 136 DTERLLRDKKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGSNSPW 183
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
T+LRPG FL + +E+H+ T G + YA +A+ LD G LF+ R++SR +
Sbjct: 184 YHTRLRPGALEFLAKMHPYYELHICTFGARNYAHMIAQFLDRDGKLFSHRILSR---DEC 240
Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
F+ K+ +L + +S V IIDD VW + NLI V+ Y +F
Sbjct: 241 FNA---TSKTDNLRALFPCGDSMVCIIDDREDVW-NMAANLIQVKPYHFF 286
Score = 45.8 bits (107), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/124 (24%), Positives = 59/124 (47%), Gaps = 4/124 (3%)
Query: 1113 VIERLHKIFFSH--QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ ++H+ F+ ++ D++ ++ + ++L G +VFS + P +Q
Sbjct: 429 ILLKIHQTFYEEYEKTKQISDLKRLIPQVKSQVLVGFNLVFSGLVP-NSMKLEESKAYQV 487
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A GA T+ TH+VA + GT KV+ A + +V P W+ A A + E+
Sbjct: 488 ARSLGATVTQDFTPDTTHLVAVTFGTSKVHNARKNPKIKMVTPEWLWACAERWEHVEERL 547
Query: 1230 FAIK 1233
+ +K
Sbjct: 548 YPLK 551
>gi|321460734|gb|EFX71774.1| hypothetical protein DAPPUDRAFT_308742 [Daphnia pulex]
Length = 798
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 85/175 (48%), Gaps = 30/175 (17%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
+++++ RKL L++DLD TL+++ +DEI E+ +F F G
Sbjct: 148 DEERLLKDRKLVLLVDLDQTLIHT--------TNDEIPANIED---------VFHFQLHG 190
Query: 971 -----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
T+LRP L S L+E+H+ T G++ YA +A LD KG F+ R++SR
Sbjct: 191 PNSPWYHTRLRPFTKELLCSMSSLYELHICTFGSRTYAHMIANFLDEKGRYFSHRILSR- 249
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ F K+ +L+ + + VVIIDD VW NLI V Y +F
Sbjct: 250 --DECFSAH---SKTANLKALFPCGDQMVVIIDDREDVWNFAP-NLIHVRPYHFF 298
>gi|406695220|gb|EKC98531.1| protein phosphatase [Trichosporon asahii var. asahii CBS 8904]
Length = 917
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 70/152 (46%), Gaps = 33/152 (21%)
Query: 929 HTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASK 988
L + AKF D V + R+ + + P +TK RPG+ FLE SK
Sbjct: 277 EALKDVAKFQLADDVPPGVSRRHQPE------------PVRWYYTKPRPGLNKFLEDMSK 324
Query: 989 LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 1048
L+EMH+YTMG + YA + K++DP+G FA +K L +
Sbjct: 325 LYEMHVYTMGTRSYADAICKIVDPEGKYFAM-------------------SAKSLVRLFP 365
Query: 1049 M-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+S VVIIDD VW + NL+ V Y +F
Sbjct: 366 HDQSMVVIIDDRSDVW-GDSPNLVKVVPYDFF 396
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 56/102 (54%), Gaps = 2/102 (1%)
Query: 1130 VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 1189
DV ++ + ++L+GC IVF+ V + + P +WQ AE FGA C +D++VTH
Sbjct: 591 CDVPLLIGEIKDQVLSGCVIVFTGVIAINQ-KPQDSEIWQQAEAFGAQCQVELDERVTHC 649
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 1230
V S+GT+K+ A VV W++ S +RR E+ F
Sbjct: 650 VIGSIGTEKMRRASRMPHVQVVWLAWLQTSIAFWRREPEEPF 691
>gi|401886990|gb|EJT50998.1| protein phosphatase [Trichosporon asahii var. asahii CBS 2479]
Length = 922
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 70/152 (46%), Gaps = 33/152 (21%)
Query: 929 HTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASK 988
L + AKF D V + R+ + + P +TK RPG+ FLE SK
Sbjct: 277 EALKDVAKFQLADDVPPGVSRRHQPE------------PVRWYYTKPRPGLNKFLEDMSK 324
Query: 989 LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 1048
L+EMH+YTMG + YA + K++DP+G FA +K L +
Sbjct: 325 LYEMHVYTMGTRSYADAICKIVDPEGKYFAM-------------------SAKSLVRLFP 365
Query: 1049 M-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+S VVIIDD VW + NL+ V Y +F
Sbjct: 366 HDQSMVVIIDDRSDVW-GDSPNLVKVVPYDFF 396
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 56/102 (54%), Gaps = 2/102 (1%)
Query: 1130 VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 1189
DV ++ + ++L+GC IVF+ V + + P +WQ AE FGA C +D++VTH
Sbjct: 596 CDVPLLIGEIKDQMLSGCVIVFTGVIAINQ-KPQDSEIWQQAEAFGAQCQVELDERVTHC 654
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 1230
V S+GT+K+ A VV W++ S +RR E+ F
Sbjct: 655 VIGSIGTEKMRRASRMPHVQVVWLAWLQTSIAFWRREPEEPF 696
>gi|300122627|emb|CBK23195.2| unnamed protein product [Blastocystis hominis]
Length = 598
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 142/359 (39%), Gaps = 56/359 (15%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE-------KPHRH---L 963
K +KL L++DLD TL+++ E + L E + E K H L
Sbjct: 146 KFLGGKKLILIIDLDMTLVHAIHEEESIGLFLNWLHGASESNEEDEWKKTLKDQVHSIEL 205
Query: 964 FRFPHMG------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
F G + K+RPG+ L+ + +EM +YT G YA ++ +++DP LF
Sbjct: 206 FYVDDNGSARMSKLLIKIRPGVRAMLQMLANSYEMIVYTQGENQYAEKVMQIVDPDNTLF 265
Query: 1018 AGRVISRGDDGDP----------------------FDGDERVPKSKDLEGVLGMESAVVI 1055
R I+RG+ + +D P+S + M ++I
Sbjct: 266 KKRFIARGETRNEPQKKLLSKIVDCWNQYVRKQNVYDPANPTPESLPELTLEEMCRRLLI 325
Query: 1056 IDDSVRVW-PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGV- 1113
+DD VW H + +I+ + C F L + L++ GV
Sbjct: 326 LDDKDEVWGMHEESGMILNPTSSLIKCFPYVFFDTKSDLYNFE--------KLSAYEGVE 377
Query: 1114 ---IERLHKIFFS-HQSL---DDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHP 1166
I RL +IF HQ+ + DVR L + + G + F+ + + E +
Sbjct: 378 QQYILRLSEIFRDIHQTFTLENAEDVRKTLRERKHSVFHGLHMAFTSIMEITE-KVETNV 436
Query: 1167 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRA 1225
L+++ FG V + ++ ++ +L T KVN A VV W+E ++ A
Sbjct: 437 LYRSLVDFGGVYMSEVTEECDLLICRTLRTAKVNKAQLLRIPVVSVRWLEECVKFWKLA 495
>gi|125584005|gb|EAZ24936.1| hypothetical protein OsJ_08716 [Oryza sativa Japonica Group]
Length = 364
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/131 (41%), Positives = 68/131 (51%), Gaps = 22/131 (16%)
Query: 998 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG-------ME 1050
G + YA +AK+LDP GV F R+ISR D P P K L+ V G
Sbjct: 99 GTEDYAAAVAKLLDPDGVYFGERIISR--DESP------QPDRKSLDVVFGSAPASAAER 150
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDH--DERSEDGT-L 1107
+AVVI+DD+ VW N NLI +ERY YF S R FG S E H ER D +
Sbjct: 151 AAVVILDDTAEVWEGNSDNLIEMERYHYFASSCRDFG----SPWECTHSLSERGVDESER 206
Query: 1108 ASSLGVIERLH 1118
A++L V+ R+H
Sbjct: 207 AAALRVLRRVH 217
>gi|297830094|ref|XP_002882929.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297328769|gb|EFH59188.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 270
Score = 68.9 bits (167), Expect = 2e-08, Method: Composition-based stats.
Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 31/210 (14%)
Query: 930 TLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKL 989
TL++S K + ++++++ R+ ++ R K RP + FL+ A+KL
Sbjct: 80 TLIHSMKTLNLSNAEKYLIKEEKSGSRKDLRKYNDRL------VKFRPFVEEFLKEANKL 133
Query: 990 FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 1049
F M YT G YA + ++LDP + F R+I+R + P K L+ VL
Sbjct: 134 FTMTAYTRGGSTYAKAVVRMLDPNKIYFGDRIITRKES----------PDLKTLDLVLAD 183
Query: 1050 ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 1109
E +VI NL+ + Y YF R S E DE L
Sbjct: 184 ERGIVI------------RNLLEITSYFYFKNDHRNIMRSRLSYAERKTDESRTKRALVK 231
Query: 1110 SLGVIERLHKIFFS---HQSLDDVDVRNIL 1136
L ++ +H FF+ + LD DVR ++
Sbjct: 232 LLKFLKEVHNGFFTCGLEEQLDIKDVRYLI 261
>gi|449018404|dbj|BAM81806.1| similar to TFIIF interacting component of CTD phosphatase Fcp1p
[Cyanidioschyzon merolae strain 10D]
Length = 1640
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 11/104 (10%)
Query: 1132 VRNILAAEQRKILAGCRIVFSRVFP--VGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 1189
+++ILA +R +L GC + F+ VF G A P H LW+ A +FGAVC + + QV+H+
Sbjct: 1389 IKDILAELRRSVLTGCELCFTGVFAKHAGMA-PEDHELWRLAVRFGAVCHREVLPQVSHL 1447
Query: 1190 VAN-SLGTDKVNWALSTGR-----FVVHPGWVEASALLYRRANE 1227
+A+ G D + T R F+V P W+ SA RRANE
Sbjct: 1448 IADPQRGRDTLK--TQTARAMNSVFIVKPTWLVCSAEDMRRANE 1489
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 43/118 (36%), Positives = 64/118 (54%), Gaps = 15/118 (12%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
+ KLRPG+ FL + FE+H+YTMG++ YA +A ++D LF GR+ SR D
Sbjct: 521 YYIKLRPGLHEFLRTIADRFELHIYTMGSRPYADTVASIIDSDERLFQGRITSRDD---- 576
Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWP------HNKL--NLIVVERYTYF 1079
F+ D R+ K+L+ V +S V+++DD VW H + NLI Y +F
Sbjct: 577 FE-DGRL-NQKNLKHVFPCDDSMVLVVDDREDVWVAQDQSLHGRHFPNLIRARPYYFF 632
>gi|357451355|ref|XP_003595954.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
truncatula]
gi|355485002|gb|AES66205.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
truncatula]
Length = 239
Score = 68.6 bits (166), Expect = 2e-08, Method: Composition-based stats.
Identities = 65/231 (28%), Positives = 98/231 (42%), Gaps = 59/231 (25%)
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
M KLRP + TFL+ AS++FEM++YTMG + Y+ EMAK+LDP+ F +V + D
Sbjct: 65 MQRMNKLRPFVRTFLKEASEVFEMYIYTMGIRQYSLEMAKLLDPQVEYFKDKVWQKHKDN 124
Query: 1029 -DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
F + ++ L + ESA D ++ + + + R YF
Sbjct: 125 LTTFS----IKVARKLTSISNSESAKT---DQLKFFD------VYLIRLRYFD------- 164
Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGC 1147
VI L I H +L++ + ++L+GC
Sbjct: 165 ------------------------NVISLLQSIHVFH----------VLSSLRGEVLSGC 190
Query: 1148 RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDK 1198
IVFS F L L + AE+ GA + VTHVVAN+ T +
Sbjct: 191 VIVFSCAF----HGHDLRKLRKIAERLGATHLTELRPTVTHVVANAFVTKE 237
>gi|429964988|gb|ELA46985.1| FCP1-like phosphatase, phosphatase domain-containing protein, partial
[Vavraia culicis 'floridensis']
Length = 231
Score = 68.2 bits (165), Expect = 3e-08, Method: Composition-based stats.
Identities = 54/187 (28%), Positives = 86/187 (45%), Gaps = 29/187 (15%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE-------------ILRKKEEQDREK 958
+ ++ +K+ LV+DLD T+L+S + + V D I K+ Q R +
Sbjct: 53 RDELIQKKKMILVVDLDQTILHSIEV-KGGRVGDNGSRNRNGECGGRGITNKQLLQARPR 111
Query: 959 ---PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
P + M T LRP + TFL +++F MH+YTMG Y ++ V+D
Sbjct: 112 QPLPSSFTYTLASTTMKTTLRPHLHTFLTELNEMFHMHIYTMGTSEYVHQITNVIDRDRS 171
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG-MESAVVIIDDSVRVWPHNKLNLIVVE 1074
LF R+++R D+ K LE + G E VV+IDD VW + NL+++
Sbjct: 172 LFGDRIVTRDDE----------VLVKRLERLFGDREDMVVVIDDRGDVWEYCG-NLVMIR 220
Query: 1075 RYTYFPC 1081
+ C
Sbjct: 221 PFFGVDC 227
>gi|409083591|gb|EKM83948.1| hypothetical protein AGABI1DRAFT_124274 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 853
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 57/109 (52%), Gaps = 10/109 (9%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ K RPG FL + ++MH+YTMG + YA E+ +DP G +F R++SR + G
Sbjct: 269 YIKPRPGWKEFLMDMATKYDMHVYTMGTRAYAEEVCAAIDPDGSVFKSRILSRDESGS-- 326
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
K L+ + + S VVIIDD VW + NLI V Y +F
Sbjct: 327 ------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 368
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 48/102 (47%), Gaps = 2/102 (1%)
Query: 1130 VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 1189
+V I+ + ++L G +VFS V P+ + P W+ A FGA C + VTHV
Sbjct: 539 FNVTTIIPRLRSEVLGGLHLVFSGVIPL-DTPPETTEFWRLARMFGAKCHTDLTPDVTHV 597
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 1230
+ GT KV A G +V P W S L+ R +E +
Sbjct: 598 ITAKRGTKKVETARQRGGIKIVRPEWFNDSIALWHRQDETPY 639
>gi|156083399|ref|XP_001609183.1| hypothetical protein [Babesia bovis T2Bo]
gi|154796434|gb|EDO05615.1| hypothetical protein BBOV_IV000150 [Babesia bovis]
Length = 692
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/262 (27%), Positives = 110/262 (41%), Gaps = 53/262 (20%)
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
F ++ + KLRPG+ FL+ S +EM +YT K YA + +LDP LF R+++R
Sbjct: 307 FANIRYYMKLRPGLRGFLQVLSLYYEMSIYTNATKEYADVVVSILDPDRSLFMDRIVART 366
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGM------ESAVVIIDDSVRVW---PHNKLNLIVVERY 1076
G+ +DL+ VV DD VW PHN+ ++ E Y
Sbjct: 367 SAGE-----------RDLQKTAARLYPNLDPRFVVAFDDRADVWADVPHNQ--VVKAEHY 413
Query: 1077 TYFPCSRRQ----FGLLGPS---LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD 1129
+F + +G++ S L ID D L + V LHK FF+ +
Sbjct: 414 DFFDSHIAELSDLYGIVNSSTENTLYIDSDRH-----LDHMVKVFLELHKRFFNDPFKSN 468
Query: 1130 VDVRNILAAEQRKILAGCRIVFS--RVFPVGEANPHLHPLWQT---------AEQFGA-V 1177
V ++ Q +L I+ + R G+ + QT A + GA V
Sbjct: 469 VGT--LVQEIQSNVLKDTGILLTGYRKNSKGQGQ-----VLQTDCEQRQREIAVELGATV 521
Query: 1178 CTKHIDDQVTHVVANSLGTDKV 1199
+K D ++THVVA TD V
Sbjct: 522 VSKLSDKRLTHVVAGKNCTDNV 543
>gi|432861327|ref|XP_004069613.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oryzias
latipes]
Length = 473
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 66/245 (26%), Positives = 114/245 (46%), Gaps = 28/245 (11%)
Query: 835 VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYD 894
++ ++ I+ G+I + DM + GS P + P A P++++ D +F+ Y
Sbjct: 211 ITSDTSIEEGEIITETDMPPLTA---PGCMSVGSYPHSIP-SAPPETSYEDEWEVFDPYY 266
Query: 895 DQQKAA--IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
+ +++ TR+ K S + LVLDLD TL++ + E
Sbjct: 267 FIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NE 312
Query: 953 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
+D LF+ ++ +LRP FLER S+L+E+ L+T K+YA ++ +LDP
Sbjct: 313 LEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQLYEIILFTASKKVYADKLLNILDP 372
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 1071
K L R+ + G+ KDL +LG + S VIID+S + + + N I
Sbjct: 373 KKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTVIIDNSPQAFAYQLSNGI 425
Query: 1072 VVERY 1076
+E +
Sbjct: 426 PIESW 430
>gi|221055253|ref|XP_002258765.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193808835|emb|CAQ39537.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 1474
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 58/117 (49%), Gaps = 14/117 (11%)
Query: 968 HMGMWT---KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
H G +T KLRPG+ FL++ +K +E++LYTMG +A +LDP F RV SR
Sbjct: 548 HKGSYTIYYKLRPGVIQFLQKMNKKYEIYLYTMGTLEHAKSCLLLLDPLKKFFGNRVFSR 607
Query: 1025 GDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
D V K L +L S + IDDS +W + + I V Y YFP
Sbjct: 608 KD---------SVNGLKHLNRILPTYRSVSLCIDDSDYMWKESS-SCIKVHGYNYFP 654
>gi|297830090|ref|XP_002882927.1| hypothetical protein ARALYDRAFT_897807 [Arabidopsis lyrata subsp.
lyrata]
gi|297328767|gb|EFH59186.1| hypothetical protein ARALYDRAFT_897807 [Arabidopsis lyrata subsp.
lyrata]
Length = 287
Score = 66.6 bits (161), Expect = 8e-08, Method: Composition-based stats.
Identities = 60/225 (26%), Positives = 88/225 (39%), Gaps = 55/225 (24%)
Query: 916 FSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 975
+ RKL LV+DL H LL+S G+ KL
Sbjct: 96 YGQRKLHLVVDLQHVLLDSN----------------------------------GVLVKL 121
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
RP FL A++LF ++ YT + A K+LDP + F R I+ E
Sbjct: 122 RPFAREFLREANELFTIYAYTKSDPKQARSFIKLLDPLKIFFPSRFITIA---------E 172
Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPH-NKLNLIVVERYTYFPCSRRQFGLLGPSLL 1094
K K LE VL E VVI+D W ++ NL++++ Y YF Q G + +
Sbjct: 173 EKRKKKSLEFVLAEERGVVILDCKSETWEKDDERNLLLIKSYDYFKGMEYQQGFITKFIN 232
Query: 1095 EIDHDER-----------SEDGTLASSLGVIERLHKIFFSHQSLD 1128
+ +DG L +L ++ +H+ FF Q D
Sbjct: 233 FFNKSSSEEKRNEKEEEDDDDGVLVDALNSLKTIHQRFFHGQCKD 277
>gi|156096809|ref|XP_001614438.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803312|gb|EDL44711.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1467
Score = 66.6 bits (161), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 58/117 (49%), Gaps = 14/117 (11%)
Query: 968 HMGMWT---KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
H G +T KLRPG+ FL++ +K +E++LYTMG +A +LDP F RV SR
Sbjct: 532 HKGSYTIYYKLRPGVIQFLQKMNKKYEIYLYTMGTLEHAKSCLLLLDPLKNFFGNRVFSR 591
Query: 1025 GDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
D V K L +L S + IDDS +W + + I V Y YFP
Sbjct: 592 KD---------SVNGLKHLNRILPTYRSVSLCIDDSDYMWKESS-SCIKVHGYNYFP 638
>gi|330796177|ref|XP_003286145.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
gi|325083890|gb|EGC37331.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
Length = 793
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 35/182 (19%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--MWTK 974
S K+ L++D+DHTL++S K DP + K + H FP + K
Sbjct: 413 STPKMHLIVDIDHTLIHSTK----DPNGESYFLKDKT-------VHKISFPETNETFYVK 461
Query: 975 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR---------- 1024
RP FL S+ F +++Y+ K Y +A +LDP +F+ +VIS+
Sbjct: 462 ERPNAIEFLRTLSQQFYIYVYSFHPKYYVERVASILDPHSNIFS-KVISKEIIESIENIK 520
Query: 1025 -----GDDGDPF--DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
+ PF ++ VPK E + + ++I+DD VW + + NLI+++ +
Sbjct: 521 ICRENNNSQKPFIVFNEQNVPKIFKFESI----NQLIILDDREDVWRNFQDNLILLDTFK 576
Query: 1078 YF 1079
YF
Sbjct: 577 YF 578
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 5/105 (4%)
Query: 1132 VRNILAAEQRKILAGCRIVFSRVFPVGEANPH--LHP-LWQTAEQFGAVCTKHIDDQVTH 1188
+++++ + +L C IVFS +FP + +P H + + E FGA ++ ID TH
Sbjct: 680 IKSVIQEIRSSVLMDCNIVFSGIFP-KQIDPSKLCHTRVSKITESFGASISQEIDSNTTH 738
Query: 1189 VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 1232
V+ GT KV AL VVH W+ + R +E ++++
Sbjct: 739 VIFIKEGTSKVLQALKNPNIKVVHFAWLRDCIHRWERIDELNYSV 783
>gi|124802229|ref|XP_001347409.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
gi|23494988|gb|AAN35322.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
Length = 1438
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ KLRPG+ FL S+ +E++LYTMG +A +LDP F RV SR D
Sbjct: 572 IYYKLRPGVIEFLRTMSEKYEIYLYTMGTLEHAKSCLFLLDPLRKFFGNRVFSRKD---- 627
Query: 1031 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+ K L +L S + IDDS +W N + I V Y YFP
Sbjct: 628 -----CLNSLKHLNKILPTYRSVSICIDDSDYIWKENS-SCIKVHGYNYFP 672
>gi|224004348|ref|XP_002295825.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209585857|gb|ACI64542.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1073
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/234 (28%), Positives = 107/234 (45%), Gaps = 39/234 (16%)
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLN------ 1069
LF R++SR D GD + + + GV+ I+DD VW + K N
Sbjct: 771 LFGSRIVSRTDVGDLGTDVKSLKRVFPCGGVMA-----AIVDDREDVWANAKNNDTGRPG 825
Query: 1070 -----LIVVERYTYFPCS--RRQFGLLGPSLLEID---------HDERSEDGTLASSLGV 1113
L++V+ Y + P S R G L + D H ++ +D L + +
Sbjct: 826 EPPDNLLLVQPYHWKPFSGYRDVNNASGQDLSKSDDDKSSSEKEHADKEQDVQLLWTADI 885
Query: 1114 IERLHKIFFSHQ-SLDDVD---VRNILAAEQRKILA---GCRIVFSRVFPVGEANPHLH- 1165
++RLH+ ++S SL+D D V ++L + +++ L IVFS V P+ +
Sbjct: 886 LKRLHERYYSPSISLEDRDKLSVPSLLRSMRKETLLRFPQANIVFSGVIPIDKQKEERKV 945
Query: 1166 --PLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRFVVHPGWV 1215
PL + AE+ GA + VTHVVA G+DK++ A G ++V P W+
Sbjct: 946 RVPLVRYAEELGARVLPDVTKGVTHVVARRDGSDKIHRARKEVPGCYIVQPSWL 999
>gi|422292668|gb|EKU19970.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis gaditana
CCMP526]
Length = 419
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 82/167 (49%), Gaps = 18/167 (10%)
Query: 932 LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM----WTKLRPGIWTFLERAS 987
LN ++ +H I + E D +K H F + G W LRP + TFL +A
Sbjct: 207 LNLILDIDLTLLHATIDPRAERLDHQKLEVHAFDIFNQGRILRHWCCLRPHLRTFLSQAH 266
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
L+ + +YT G + YA ++A++LDP LF R++SR D D + K L+ +
Sbjct: 267 ALYVLTIYTHGRRDYAHQVARLLDPDRTLFEDRIVSRDDCPD-------LHGQKSLQRLF 319
Query: 1048 --GMESAVVIIDDSVRVW----PHNKLNLIVVERYTYFPCSRRQFGL 1088
G+E A +I+DDS +VW + L ++ + YT F R GL
Sbjct: 320 PGGIEMA-LILDDSPQVWQGEQSRHLLPVLPFKFYTEFEEVNRVAGL 365
>gi|348509633|ref|XP_003442352.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oreochromis
niloticus]
Length = 476
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 151/359 (42%), Gaps = 59/359 (16%)
Query: 735 MDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSK------ENLNFQK 788
++E V + PR L G P F P+ + GS E K
Sbjct: 117 LEETTAVEVTTSPPRTTLLGTIF------SPVFNFFSPAKNASSGSDSPDQALEAEEIVK 170
Query: 789 QLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMS-VSQ-PLTSEPMVSQNSPI-QPGQ 845
QL + Q+V P T T+ L ++ S VSQ P P + + SP + G+
Sbjct: 171 QLD-------IEQAVETPTSTATSTQELCVTTNYYSSVSQLPPLRPPHILEASPTTEEGE 223
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAG-----PVGAHPQSAWGDVEHLFEGYDDQQKA- 899
+ + AD+ + T G+ P+ P P++++ + +F+ Y +
Sbjct: 224 LHTDADLPPL--------TAPGTSPDMAHVDTLPATVPPEASYEEDWEVFDPYFFIKHVP 275
Query: 900 -AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+++ TR+ K S + LVLDLD TL++ + E +D
Sbjct: 276 PLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAAL 321
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
LF+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L
Sbjct: 322 TFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVR 381
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
R+ + G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 382 HRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 433
>gi|427782099|gb|JAA56501.1| Putative rna polymerase ii ctd phosphatase [Rhipicephalus pulchellus]
Length = 360
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 56/105 (53%), Gaps = 2/105 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
D+++++ +RK+L G IVFS V P+ + WQ A+ GA ++ + VTH+V
Sbjct: 11 DLKHVVPYVRRKVLKGVHIVFSGVVPMNQPAEKSQ-AWQVAKSLGATVSRDLCPGVTHLV 69
Query: 1191 ANSLGTDKVNWALST-GRFVVHPGWVEASALLYRRANEQDFAIKP 1234
A LGT KVN A G VVHP W+ A + E F ++P
Sbjct: 70 AARLGTAKVNKARRMPGVHVVHPSWLWCCAERWEHVAEALFPLRP 114
>gi|387196292|gb|AFJ68751.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis gaditana
CCMP526]
Length = 414
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 82/167 (49%), Gaps = 18/167 (10%)
Query: 932 LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM----WTKLRPGIWTFLERAS 987
LN ++ +H I + E D +K H F + G W LRP + TFL +A
Sbjct: 202 LNLILDIDLTLLHATIDPRAERLDHQKLEVHAFDIFNQGRILRHWCCLRPHLRTFLSQAH 261
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
L+ + +YT G + YA ++A++LDP LF R++SR D D + K L+ +
Sbjct: 262 ALYVLTIYTHGRRDYAHQVARLLDPDRTLFEDRIVSRDDCPD-------LHGQKSLQRLF 314
Query: 1048 --GMESAVVIIDDSVRVW----PHNKLNLIVVERYTYFPCSRRQFGL 1088
G+E A +I+DDS +VW + L ++ + YT F R GL
Sbjct: 315 PGGIEMA-LILDDSPQVWQGEQSRHLLPVLPFKFYTEFEEVNRVAGL 360
>gi|71026568|ref|XP_762950.1| hypothetical protein [Theileria parva strain Muguga]
gi|68349902|gb|EAN30667.1| hypothetical protein TP03_0826 [Theileria parva]
Length = 823
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 89/217 (41%), Gaps = 59/217 (27%)
Query: 920 KLCLVLDLDHTLL------------------NSAKF------HEVDPVHDEILRKKEEQD 955
KLCLVLDLD+TLL ++A + + D V E+ +K E
Sbjct: 301 KLCLVLDLDNTLLHATSQPPPPDIAIPILNYDTADYLNQYVQYGTDSVSLELQQKLENSV 360
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
+ + + + KLRPGI+ F + S F + L+TMG K +A +V+DP+G+
Sbjct: 361 IKTIVYNETETSYCVSYFKLRPGIFQFFHKISDKFRLFLFTMGTKQHAASALQVIDPQGI 420
Query: 1016 LFAGRVISRGDDGD--------------------------------PFDGDERVPKSKDL 1043
F R+ SR + D R K L
Sbjct: 421 YFGNRIFSRYNTNSHNSTNSINSVNSVNSVNSVNSMNSVSNVVGVKKLRNDLRYCM-KSL 479
Query: 1044 EGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
+ + ++ V+++DD+ VW +N L LI V Y +F
Sbjct: 480 DRIFPNYKNLVLVMDDTEHVWTNN-LGLIKVHPYYFF 515
>gi|47224149|emb|CAG13069.1| unnamed protein product [Tetraodon nigroviridis]
Length = 159
Score = 65.1 bits (157), Expect = 2e-07, Method: Composition-based stats.
Identities = 51/170 (30%), Positives = 80/170 (47%), Gaps = 25/170 (14%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG- 970
Q K+ +RKL L++DLD+TL+++ + P + P +++F+ G
Sbjct: 11 QDKLHQSRKLVLMVDLDNTLIHTTEI----PC------------QLSPKKNVFKMKLEGS 54
Query: 971 --MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ +LRP FLE+ S+LFE++++T + YA +A LDP FA R+ISR +
Sbjct: 55 PTYYVRLRPYYKEFLEKISELFELNIFTFACQSYAKTVAGFLDPDNTFFAQRIISRDNCF 114
Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
P V ES +IDD VW L+ V+ Y Y
Sbjct: 115 YPATKMANVRFFSPCG-----ESMTCMIDDREDVWNFAP-GLVAVKPYMY 158
>gi|66363226|ref|XP_628579.1| RNA pol II carboxy terminal domain phosphatase of the HAD superfamily
with a BRCT domain at the C-terminus [Cryptosporidium
parvum Iowa II]
gi|46229587|gb|EAK90405.1| RNA pol II carboxy terminal domain phosphatase of the HAD superfamily
with a BRCT domain at the C-terminus [Cryptosporidium
parvum Iowa II]
gi|323509333|dbj|BAJ77559.1| cgd7_4250 [Cryptosporidium parvum]
gi|323509917|dbj|BAJ77851.1| cgd7_4250 [Cryptosporidium parvum]
Length = 595
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 90/366 (24%), Positives = 149/366 (40%), Gaps = 64/366 (17%)
Query: 913 KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-- 967
K + KL +LDLD+TLL NS K + D I + P + F P
Sbjct: 166 KDYLAQNKLVAILDLDNTLLHAYNSTKIGCNINLEDFISSSGD------PEMYKFVLPQD 219
Query: 968 -HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
+ + KLRPG+ FL + + M + T + YA + VLDP+ F R+++R
Sbjct: 220 LNTPYYLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPQRDKFGDRIVAR-- 277
Query: 1027 DGDPFDGDERVPKSKDLEGV-LGMES-AVVIIDDSVRVWPHNKLNLIV-VERYTYFPCSR 1083
+ DG + KD + + +E+ A+V++DD VW + + +V + Y YF +
Sbjct: 278 --ESVDGRD---TQKDFRKICVDVETRAIVLLDDRSDVWDSSLESQVVKAQTYEYFEQRK 332
Query: 1084 RQFGLLGPSLL----EIDHDERSEDGTLASSLG-----------------------VIER 1116
PSL I + + L+++L V +
Sbjct: 333 DALKSHYPSLSSGANSISANSSAPGDILSAALSSLSNASGGNSIADYDRHLDYLIRVFKE 392
Query: 1117 LHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPL---W----- 1168
LH FF Q+ + V +IL + +IL C + F+ E P + L W
Sbjct: 393 LHTRFF--QNPETACVGDILKKMRSEILENCIVCFTGFLKADE-KPIVKGLPSNWGDSQA 449
Query: 1169 ---QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRR 1224
A + GA ++++ TH+V T K + A + +VH W+ A ++R
Sbjct: 450 DAESAAIRLGAGIEENLEPLATHLVVQKTNTAKFHQAKNNPNVKIVHTLWLWACDGQWQR 509
Query: 1225 ANEQDF 1230
+E F
Sbjct: 510 VSEDLF 515
>gi|222632581|gb|EEE64713.1| hypothetical protein OsJ_19569 [Oryza sativa Japonica Group]
Length = 485
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 83/171 (48%), Gaps = 20/171 (11%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S+ VD ++ D P H + ++ K RP
Sbjct: 308 KNITLVLDLDETLIHSSA---VD---------RDGADFSFPMYHGLK--EHTVYVKKRPH 353
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
+ TFL++ S++F++ ++T YA + +LDPK + F R D P DG
Sbjct: 354 VDTFLQKVSEMFKVVIFTASLSSYANRLLDMLDPKNIFFTKRYFR--DSCLPVDGSYL-- 409
Query: 1039 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
KDL ++ + VVIID+S V+ + N I +E +T P + L+
Sbjct: 410 --KDLTVIVADLAKVVIIDNSPEVFRLQEENGIPIESWTSDPADKSLVELI 458
>gi|218197280|gb|EEC79707.1| hypothetical protein OsI_21008 [Oryza sativa Indica Group]
Length = 485
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 83/171 (48%), Gaps = 20/171 (11%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S+ VD ++ D P H + ++ K RP
Sbjct: 308 KNITLVLDLDETLIHSSA---VD---------RDGADFSFPMYHGLK--EHTVYVKKRPH 353
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
+ TFL++ S++F++ ++T YA + +LDPK + F R D P DG
Sbjct: 354 VDTFLQKVSEMFKVVIFTASLSSYANRLLDMLDPKNIFFTKRYFR--DSCLPVDGSYL-- 409
Query: 1039 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
KDL ++ + VVIID+S V+ + N I +E +T P + L+
Sbjct: 410 --KDLTVIVADLAKVVIIDNSPEVFRLQEENGIPIESWTSDPADKSLVELI 458
>gi|68525545|ref|XP_723632.1| NLI interacting factor [Plasmodium yoelii yoelii 17XNL]
gi|23477988|gb|EAA15197.1| NLI interacting factor, putative [Plasmodium yoelii yoelii]
Length = 1251
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 11/111 (9%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ KLRPG+ FL++ ++ +E++LYTMG +A +LDP F R+ SR D +
Sbjct: 428 IYYKLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNG 487
Query: 1031 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
K L +L S + +DDS +W + I V Y YFP
Sbjct: 488 M---------KHLNRILPTYRSISICVDDSEYIWKETN-SCIKVHAYNYFP 528
>gi|225463384|ref|XP_002271705.1| PREDICTED: uncharacterized protein LOC100258847 [Vitis vinifera]
Length = 484
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 28/149 (18%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S L ++ D P F ++ K RP
Sbjct: 310 KSITLVLDLDETLVHST------------LEHCDDADFTFPV--FFNMKDHTVYVKQRPY 355
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
+ TFLER +++FE+ ++T +YA ++ +LDP G F+ R DG
Sbjct: 356 LHTFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHRAYRESCIFSDG------- 408
Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V IID+S +V+
Sbjct: 409 --SYTKDL-TVLGIDLAKVAIIDNSPQVF 434
>gi|297740632|emb|CBI30814.3| unnamed protein product [Vitis vinifera]
Length = 479
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 28/149 (18%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S L ++ D P F ++ K RP
Sbjct: 305 KSITLVLDLDETLVHST------------LEHCDDADFTFPV--FFNMKDHTVYVKQRPY 350
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
+ TFLER +++FE+ ++T +YA ++ +LDP G F+ R DG
Sbjct: 351 LHTFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHRAYRESCIFSDG------- 403
Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V IID+S +V+
Sbjct: 404 --SYTKDL-TVLGIDLAKVAIIDNSPQVF 429
>gi|356530555|ref|XP_003533846.1| PREDICTED: uncharacterized protein LOC100786602 [Glycine max]
Length = 470
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 87/189 (46%), Gaps = 38/189 (20%)
Query: 879 PQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFH 938
PQS ++ L E + Q I K+ RR + + LVLDLD TL++S
Sbjct: 266 PQSFIKNLPELSEIEVNGQPTLIPKQSPRR----------KSITLVLDLDETLVHST--- 312
Query: 939 EVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMG 998
++P D + F ++ K RP + FLER S++FE+ ++T
Sbjct: 313 -LEPC----------DDADFTFTVFFNLKEYTVYVKQRPYLHAFLERVSEMFEVVIFTAS 361
Query: 999 NKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDERVPKSKDLEGVLGMESA-VV 1054
+YA ++ +LDP G + R+ DG+ +KDL +LG++ A V
Sbjct: 362 QSIYAKQLLDILDPDGRFISRRMYRESCLFSDGN---------YTKDL-TILGVDLAKVA 411
Query: 1055 IIDDSVRVW 1063
IID+S +V+
Sbjct: 412 IIDNSPQVF 420
>gi|326499061|dbj|BAK06021.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 415
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 83/181 (45%), Gaps = 28/181 (15%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K+ S + LVLDLD TL++S L E+ D P R F ++
Sbjct: 215 KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVR--FNLRDHTIY 260
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 1029
+ RP + FLER + +FE+ ++T +YA ++ VLDPK LF RV +G+
Sbjct: 261 VRCRPYLKDFLERVASMFEIIIFTASQSIYAEQLLNVLDPKRRLFRHRVYRESCVYVEGN 320
Query: 1030 PFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
KDL VLG + S VVI+D+S + + N I +E + P + L
Sbjct: 321 YL---------KDL-SVLGRDLSRVVIVDNSPQAFGFQLDNGIPIESWFDDPNDKELLAL 370
Query: 1089 L 1089
L
Sbjct: 371 L 371
>gi|15239800|ref|NP_196747.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|30683828|ref|NP_850809.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|42573341|ref|NP_974767.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|145334381|ref|NP_001078572.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|7573353|emb|CAB87659.1| putative protein [Arabidopsis thaliana]
gi|21553575|gb|AAM62668.1| unknown [Arabidopsis thaliana]
gi|56550687|gb|AAV97797.1| At5g11860 [Arabidopsis thaliana]
gi|332004345|gb|AED91728.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|332004346|gb|AED91729.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|332004347|gb|AED91730.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|332004348|gb|AED91731.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
Length = 305
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 87/185 (47%), Gaps = 36/185 (19%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP----- 967
K+ S + LVLDLD TL++S L E D F FP
Sbjct: 104 KQTRSCPPISLVLDLDETLVHST------------LEPCGEVD--------FTFPVNFNE 143
Query: 968 --HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
HM ++ + RP + F+ER S+LFE+ ++T +YA ++ VLDPK LF RV
Sbjct: 144 EEHM-VYVRCRPHLKEFMERVSRLFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR-- 200
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1084
D FDG+ KDL VLG + S V+I+D+S + + N + +E + P +
Sbjct: 201 DSCVFFDGN----YLKDL-SVLGRDLSRVIIVDNSPQAFGFQVENGVPIESWFNDPSDKE 255
Query: 1085 QFGLL 1089
LL
Sbjct: 256 LLHLL 260
>gi|67624539|ref|XP_668552.1| NLI interacting factor [Cryptosporidium hominis TU502]
gi|54659751|gb|EAL38315.1| NLI interacting factor [Cryptosporidium hominis]
Length = 595
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 91/367 (24%), Positives = 149/367 (40%), Gaps = 66/367 (17%)
Query: 913 KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-- 967
K + KL +LDLD+TLL NS K + D I + P + F P
Sbjct: 166 KDYLAQNKLVAILDLDNTLLHAYNSTKIGCNINLEDFISSSGD------PEMYKFVLPQD 219
Query: 968 -HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
+ + KLRPG+ FL + + M + T + YA + VLDP+ F R+++R
Sbjct: 220 LNTPYYLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPQRDKFGDRIVAR-- 277
Query: 1027 DGDPFDGDERVPKSKDLEGV-LGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1084
+ DG + KD + + +E+ A+V++DD VW + L VV+ TY +R
Sbjct: 278 --ESVDGRD---TQKDFRKICVDVETRAIVLLDDRSDVW-DSSLESQVVKAQTYEYFEQR 331
Query: 1085 QFGL------LGPSLLEIDHDERSEDGTLASSLG-----------------------VIE 1115
+ L L I + + L+++L V +
Sbjct: 332 KDALKSHYPPLSSGANSISANSSAPGDILSAALSSLSNASGGNSIADYDRHLDYLIRVFK 391
Query: 1116 RLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPL---W---- 1168
LH FF Q+ + V +IL + +IL C + F+ E P + L W
Sbjct: 392 ELHTRFF--QNPETACVGDILKKMRSEILENCIVCFTGFLKADE-KPIVKGLPSNWGDSQ 448
Query: 1169 ----QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYR 1223
A + GA ++++ TH+V T K + A + +VH W+ A ++
Sbjct: 449 ADAESAAIRLGAGIEENLEPLATHLVVQKTNTAKFHQAKNNPNVKIVHTLWLWACDGQWQ 508
Query: 1224 RANEQDF 1230
R +E F
Sbjct: 509 RVSEDLF 515
>gi|440290285|gb|ELP83711.1| RNA polymerase II ctd phosphatase, putative [Entamoeba invadens IP1]
Length = 434
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 107/248 (43%), Gaps = 57/248 (22%)
Query: 959 PHR-HLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD--PK 1013
P R F+ P ++ R GI F+ + L+E+H+ T+G K YA + + L+ P
Sbjct: 94 PERGFFFQIPEYSKKVFVYFRDGIVAFMTKLITLYEIHVVTLGQKDYAMAIVEALNKLPG 153
Query: 1014 GVLFAGRVISR--------GDDGDPFDGDERVPKSKDLEGV--------LGMESAVVIID 1057
G G+++ DDGD F D + +++D E +G E +++D
Sbjct: 154 GPFINGKIVCSEDCISEILKDDGD-FQNDGLIERNEDTERRAVKRTVPGMGSEEVQIVVD 212
Query: 1058 DSVRVWP-HNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIER 1116
D + VW HN L + ++F L P + E +D L +++
Sbjct: 213 DRIDVWDNHNVLQI-------------QEFVL--P------NSENVKDNELERISEILKY 251
Query: 1117 LHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG---EANPHLHPLW----Q 1169
+H FF +S +VR IL+ + IL G +I FS+ P G + N +W +
Sbjct: 252 VHTEFFGGKS----NVRVILSEMREHILKGKKIYFSK--PKGGETKMNKMTEFIWNAIKR 305
Query: 1170 TAEQFGAV 1177
E GAV
Sbjct: 306 NCEMMGAV 313
>gi|70952066|ref|XP_745226.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56525483|emb|CAH77992.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 1224
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 11/111 (9%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ KLRPG+ FL++ ++ +E++LYTMG +A +LDP F R+ SR D +
Sbjct: 429 IYYKLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNG 488
Query: 1031 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
K L +L S + +DDS +W + I V Y YFP
Sbjct: 489 M---------KHLNRILPTYRSISICVDDSEYIWKEAN-SCIKVHAYNYFP 529
>gi|297811303|ref|XP_002873535.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297319372|gb|EFH49794.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 305
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 87/185 (47%), Gaps = 36/185 (19%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP----- 967
K+ S + LVLDLD TL++S L E D F FP
Sbjct: 104 KQTRSCPPISLVLDLDETLVHST------------LEPCGEVD--------FTFPVNFNE 143
Query: 968 --HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
HM ++ + RP + F+ER S+LFE+ ++T +YA ++ VLDPK LF RV
Sbjct: 144 EEHM-VYVRCRPHLKEFMERVSRLFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR-- 200
Query: 1026 DDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1084
D FDG+ KDL VLG + S V+I+D+S + + N + +E + P +
Sbjct: 201 DSCVFFDGN----YLKDL-SVLGRDLSRVIIVDNSPQAFGFQVENGVPIESWFNDPSDKE 255
Query: 1085 QFGLL 1089
LL
Sbjct: 256 LLHLL 260
>gi|147839779|emb|CAN65912.1| hypothetical protein VITISV_035567 [Vitis vinifera]
Length = 482
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 28/149 (18%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S L ++ D P F ++ K RP
Sbjct: 308 KSITLVLDLDETLVHST------------LEHCDDADFTFPV--FFNMKDHTVYVKQRPY 353
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
+ TFLER +++FE+ ++T +YA ++ +LDP G F+ R DG
Sbjct: 354 LHTFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHRAYRESCIFSDG------- 406
Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V IID+S +V+
Sbjct: 407 --SYTKDL-TVLGIDLAKVAIIDNSPQVF 432
>gi|356556521|ref|XP_003546573.1| PREDICTED: uncharacterized protein LOC100799803 [Glycine max]
Length = 471
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 72/149 (48%), Gaps = 28/149 (18%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S H D + F ++ K RP
Sbjct: 297 KSITLVLDLDETLVHSTLEHC--------------DDADFTFTVFFNLKEYIVYVKQRPY 342
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
+ TFLER S++FE+ ++T +YA ++ +LDP G + R+ DG+
Sbjct: 343 LHTFLERVSEMFEVVIFTASQSIYAKQLLDILDPDGRFISRRMYRESCLFSDGN------ 396
Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A V IID+S +V+
Sbjct: 397 ---YTKDL-TILGVDLAKVAIIDNSPQVF 421
>gi|255557435|ref|XP_002519748.1| conserved hypothetical protein [Ricinus communis]
gi|223541165|gb|EEF42721.1| conserved hypothetical protein [Ricinus communis]
Length = 474
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 74/155 (47%), Gaps = 28/155 (18%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K+ + + LVLDLD TL++S H D + F ++
Sbjct: 294 KESLMKKSVTLVLDLDETLVHSTLEHC--------------DDADFTFTVFFNLKEHTVY 339
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGD 1029
K RP + TFLER ++LFE+ ++T +YA ++ +LDP+ L + RV DG
Sbjct: 340 VKRRPHLHTFLERVAELFEVVIFTASQSIYAAQLLDILDPEKKLISRRVYRESCIFTDG- 398
Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V IID+S +V+
Sbjct: 399 --------SYTKDL-TVLGVDLAKVAIIDNSPQVF 424
>gi|68074755|ref|XP_679294.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56500009|emb|CAH99961.1| conserved hypothetical protein [Plasmodium berghei]
Length = 983
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 11/111 (9%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ KLRPG+ FL++ ++ +E++LYTMG +A +LDP F R+ SR D +
Sbjct: 233 IYYKLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNG 292
Query: 1031 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
K L +L S + +DDS +W + I V Y YFP
Sbjct: 293 M---------KHLNRILPTYRSISICVDDSEYIWKEAN-SCIKVHAYNYFP 333
>gi|301115156|ref|XP_002905307.1| nuclear LIM factor interactor-interacting protein hyphal form,
putative [Phytophthora infestans T30-4]
gi|262110096|gb|EEY68148.1| nuclear LIM factor interactor-interacting protein hyphal form,
putative [Phytophthora infestans T30-4]
Length = 422
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 81/164 (49%), Gaps = 28/164 (17%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWT 973
+A K+CLVLDLD TL++ + VD V + PH F +
Sbjct: 236 NAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQFPVTFNGVEYIVNV 278
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
K RP + FL+R SKLFE+ ++T +K+YA ++ +LDP L R+ R D D F
Sbjct: 279 KKRPHMEYFLKRVSKLFEIVVFTASHKVYAEKLTNMLDPHRNLIKYRLY-RDDCLDVFGN 337
Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + S VV++D+S + + N I +E +
Sbjct: 338 -----YLKDL-NVLGRDLSKVVLVDNSPHAFGYQVNNGIPIETW 375
>gi|428672173|gb|EKX73087.1| conserved hypothetical protein [Babesia equi]
Length = 937
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 80/323 (24%), Positives = 128/323 (39%), Gaps = 68/323 (21%)
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
FP++ + KLRP I FL+ S +EM +YT K YA + +LDP LF R+++R
Sbjct: 504 FPNITYYMKLRPCIREFLQVLSLYYEMSIYTNATKEYADVVISILDPDRTLFMDRIVARN 563
Query: 1026 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYFP 1080
D+ D R+ DL V+ DD VW PH + ++ E Y +F
Sbjct: 564 SVDEKDLLKSAARL--YPDLN-----RRFVLAFDDRKDVWADIPHRQ--VVRAEHYDFFE 614
Query: 1081 CSRRQ----FGLL----------GPSL-------------------------LEIDHD-E 1100
+ +G + GP + + D D
Sbjct: 615 SYSMELANNYGFVAQDAFSPDVPGPQMENTSKSSNNLYISQNFDGNAENTEGITDDKDVV 674
Query: 1101 RSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFS--RVFPVG 1158
R D L + + +HK FF+ + +V IL Q + L R++ + R G
Sbjct: 675 RDYDRHLRHMITLFLEIHKRFFNDPF--NANVGTILENLQSETLKDTRVLLTGYRKNAKG 732
Query: 1159 EANPHLHP-----LWQTAEQFGAVCTKHIDDQ-VTHVVANSLGTDKVNWAL-STGRFV-- 1209
+N LH + + G++ I D +TH+VA TD + + ST +
Sbjct: 733 ISN-VLHSDCEQRQREVVSELGSIVLNKITDPGLTHIVAGKNCTDNIAKSRNSTYSHIHK 791
Query: 1210 VHPGWVEASALLYRRANEQDFAI 1232
VH W+ A + R +E+ F +
Sbjct: 792 VHTLWLYACKATFSRVDEKVFDM 814
>gi|397787628|gb|AFO66533.1| putative NLI interacting factor family protein [Brassica napus]
Length = 477
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 77/150 (51%), Gaps = 26/150 (17%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
K+ S + LVLDLD TL++S+ EVD ++E HM
Sbjct: 106 KQTRSCPPISLVLDLDETLVHSSLEPCGEVDFTFTVHFNEEE---------------HM- 149
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ + RP + F+ER S+LFE+ ++T +YA ++ VLDPK LF RV D
Sbjct: 150 VYVRCRPHLKEFMERVSRLFEVIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVF 207
Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
FDG+ KDL VLG + S V+I+D+S
Sbjct: 208 FDGN----YLKDL-SVLGRDLSRVIIVDNS 232
>gi|397787605|gb|AFO66511.1| putative small phosphatase-like protein 2-B [Brassica napus]
Length = 262
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 78/148 (52%), Gaps = 22/148 (14%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K+ S + LVLDLD TL++S+ ++P + + E+ HM ++
Sbjct: 61 KQTRSCPPISLVLDLDETLVHSS----LEPCGEVDFTFTVHFNEEE---------HM-VY 106
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
+ RP + F+ER S+LFE+ ++T +YA ++ VLDPK LF RV D FD
Sbjct: 107 VRCRPHLKEFMERVSRLFEVIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVFFD 164
Query: 1033 GDERVPKSKDLEGVLGME-SAVVIIDDS 1059
G+ KDL VLG + S V+I+D+S
Sbjct: 165 GN----YLKDL-SVLGRDLSRVIIVDNS 187
>gi|351710351|gb|EHB13270.1| CTD small phosphatase-like protein 2 [Heterocephalus glaber]
Length = 466
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 62/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D PE+G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPHLTAPVTPESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|147798518|emb|CAN65472.1| hypothetical protein VITISV_037605 [Vitis vinifera]
Length = 506
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 86/184 (46%), Gaps = 32/184 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
EE K+ +++ LVLDLD TL++S L + D P F
Sbjct: 327 EESKR----KRITLVLDLDETLVHST------------LEPCDHADFTFPV--FFNMKEH 368
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 1026
++ + RP + FLER +++FE+ ++T +YA ++ +LDP LF+GR
Sbjct: 369 TIYVRQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGRAYRESCIFS 428
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
DG +KDL VLG++ A V IID+S +V+ N I ++ + P R
Sbjct: 429 DGS---------YTKDL-TVLGIDLAKVAIIDNSPQVFRLQVDNGIPIKSWFDDPSDRAL 478
Query: 1086 FGLL 1089
LL
Sbjct: 479 ISLL 482
>gi|359487040|ref|XP_002265614.2| PREDICTED: uncharacterized protein LOC100267967 [Vitis vinifera]
Length = 522
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 86/184 (46%), Gaps = 32/184 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
EE K+ +++ LVLDLD TL++S L + D P F
Sbjct: 343 EESKR----KRITLVLDLDETLVHST------------LEPCDHADFTFPV--FFNMKEH 384
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 1026
++ + RP + FLER +++FE+ ++T +YA ++ +LDP LF+GR
Sbjct: 385 TIYVRQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGRAYRESCIFS 444
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
DG +KDL VLG++ A V IID+S +V+ N I ++ + P R
Sbjct: 445 DGS---------YTKDL-TVLGIDLAKVAIIDNSPQVFRLQVDNGIPIKSWFDDPSDRAL 494
Query: 1086 FGLL 1089
LL
Sbjct: 495 ISLL 498
>gi|66805733|ref|XP_636588.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
gi|60464974|gb|EAL63085.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
Length = 985
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/183 (25%), Positives = 84/183 (45%), Gaps = 28/183 (15%)
Query: 911 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
E K + K+ L++D+DHTLL+S K DP + K ++ +
Sbjct: 565 ESKSSNGSPKMYLIVDIDHTLLHSTK----DPNAESYYLKDNSINK-----FTITETNET 615
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR------ 1024
+ K RP FL S F+++LY+ K Y ++A +LDP +F +VI++
Sbjct: 616 FYVKQRPNAIEFLSSLSSQFKIYLYSFHPKYYVEQLALILDPNRSIFT-KVITKEVIEPV 674
Query: 1025 ------GDDGDPFDGDERVPKSKDLEGVLGMESA--VVIIDDSVRVWPHNKLNLIVVERY 1076
G P+ V +++ + E+ ++I+DD VW + + NLI+++ +
Sbjct: 675 EPLPPINSIGKPY----IVFNNQNFSKIFNFEAINQMIILDDREDVWRNFQDNLILLDTF 730
Query: 1077 TYF 1079
+F
Sbjct: 731 KFF 733
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 60/125 (48%), Gaps = 5/125 (4%)
Query: 1112 GVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPH--LHP-LW 1168
G++ + F S ++ + V++I+ + +L C IVFS +FP + +P H +
Sbjct: 851 GLLPAIQGEFLSRGDINTIHVKSIIDEIRSSVLMDCNIVFSGIFP-KQIDPSKLCHTRVS 909
Query: 1169 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 1227
+ E FGA + ID TH++ GT K AL VVH W+ + + +E
Sbjct: 910 KITESFGAKISLEIDSTTTHLIFIKEGTSKAQQALKNPNIKVVHFAWLRDCVHRWEKMDE 969
Query: 1228 QDFAI 1232
Q++ +
Sbjct: 970 QNYTV 974
>gi|357156635|ref|XP_003577523.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 1
[Brachypodium distachyon]
Length = 411
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 82/181 (45%), Gaps = 28/181 (15%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K+ S + LVLDLD TL++S L E+ D P F ++
Sbjct: 219 KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 264
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 1029
+ RP + FLER + +FE+ ++T +YA ++ VLDPK LF RV +G+
Sbjct: 265 VRCRPYLKEFLERVASMFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYRESCVYVEGN 324
Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
KDL VLG + A VVI+D+S + + N I +E + P + L
Sbjct: 325 YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLENGIPIESWFDDPNDKELLAL 374
Query: 1089 L 1089
L
Sbjct: 375 L 375
>gi|296090552|emb|CBI40902.3| unnamed protein product [Vitis vinifera]
Length = 570
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 86/184 (46%), Gaps = 32/184 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
EE K+ +++ LVLDLD TL++S L + D P F
Sbjct: 391 EESKR----KRITLVLDLDETLVHST------------LEPCDHADFTFPV--FFNMKEH 432
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 1026
++ + RP + FLER +++FE+ ++T +YA ++ +LDP LF+GR
Sbjct: 433 TIYVRQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGRAYRESCIFS 492
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
DG +KDL VLG++ A V IID+S +V+ N I ++ + P R
Sbjct: 493 DGS---------YTKDL-TVLGIDLAKVAIIDNSPQVFRLQVDNGIPIKSWFDDPSDRAL 542
Query: 1086 FGLL 1089
LL
Sbjct: 543 ISLL 546
>gi|56605878|ref|NP_001008438.1| CTD small phosphatase-like protein 2 [Xenopus (Silurana) tropicalis]
gi|82181540|sp|Q66KM5.1|CTSL2_XENTR RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
2
gi|51512946|gb|AAH80328.1| MGC79498 protein [Xenopus (Silurana) tropicalis]
Length = 466
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
PE+G AH ++A+ + +F+ Y + +++ R+ K S + LVLD
Sbjct: 234 SPESGYSSAHAEAAYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL VH + E +D LF+ ++ +LRP FLER
Sbjct: 294 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|117606236|ref|NP_001071012.1| CTD small phosphatase-like protein 2-A [Danio rerio]
gi|123884286|sp|Q08BB5.1|CTL2A_DANRE RecName: Full=CTD small phosphatase-like protein 2-A;
Short=CTDSP-like 2-A
gi|115528634|gb|AAI24795.1| Zgc:154017 [Danio rerio]
Length = 469
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 111/254 (43%), Gaps = 36/254 (14%)
Query: 829 LTSEPMVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEH 888
L P+ ++ ++ G+I + ADM + T GS V P A G E
Sbjct: 203 LNPRPLPHIDTTVEEGEIVTEADMPPL--------TAVGSNSNYPDVPPSP-PAEGTYEE 253
Query: 889 LFEGYD-----DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 943
+E +D +++ TR+ K S + LVLDLD TL++ +
Sbjct: 254 DWEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL------- 306
Query: 944 HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
E +D LF+ ++ +LRP FLER S+++E+ L+T K+YA
Sbjct: 307 -------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYA 359
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRV 1062
++ +LDPK L R+ + G+ KDL +LG + S VIID+S +
Sbjct: 360 DKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTVIIDNSPQA 412
Query: 1063 WPHNKLNLIVVERY 1076
+ + N I +E +
Sbjct: 413 FAYQLSNGIPIESW 426
>gi|351699228|gb|EHB02147.1| CTD small phosphatase-like protein 2 [Heterocephalus glaber]
Length = 465
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 102/234 (43%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D PE+ H + + + +F+ Y+ + ++
Sbjct: 210 INNGLEEAEATVNRDIPHLTAPVTPESDYSSIHAEVTYEEDWEIFDPYNFIKHVPPLTEQ 269
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K + K LVLDLD TL++ + D H + L
Sbjct: 270 QLNRKPALPLKTRAKTKFSLVLDLDETLVHCSLNELEDAAHTFPV--------------L 315
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER SK++E+ ++T K+YA ++ +LDPK L R+
Sbjct: 316 FQGVIYQVYVRLRPFFREFLERMSKMYEIIVFTAAKKVYAEKLLNILDPKKQLVRHRLFQ 375
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 376 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422
>gi|229892336|ref|NP_001080602.1| CTD small phosphatase-like protein 2-A [Xenopus laevis]
gi|82176945|sp|Q801R4.1|CTL2A_XENLA RecName: Full=CTD small phosphatase-like protein 2-A;
Short=CTDSP-like 2-A
gi|28838482|gb|AAH47962.1| Ctdspl2a protein [Xenopus laevis]
gi|120538080|gb|AAI29525.1| Ctdspl2a protein [Xenopus laevis]
Length = 466
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
PE+G AH ++A+ + +F+ Y + +++ R+ K S + LVLD
Sbjct: 234 SPESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL VH + E +D LF+ ++ +LRP FLER
Sbjct: 294 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|147907092|ref|NP_001089935.1| CTD small phosphatase-like protein 2-B [Xenopus laevis]
gi|83405117|gb|AAI10767.1| Ctdspl2b protein [Xenopus laevis]
Length = 466
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
PE+G AH ++A+ + +F+ Y + +++ R+ K S + LVLD
Sbjct: 234 SPESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL VH + E +D LF+ ++ +LRP FLER
Sbjct: 294 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|327288817|ref|XP_003229121.1| PREDICTED: CTD small phosphatase-like protein 2-like [Anolis
carolinensis]
Length = 466
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 67/265 (25%), Positives = 118/265 (44%), Gaps = 30/265 (11%)
Query: 821 DFMSVSQPLTSEPMVSQNSPI--QPGQIKS----GADMKAVVTNHDDKQTGTGSGPEAGP 874
D V + +TS + +P Q Q++S G + VT+ D P++G
Sbjct: 180 DMEQVDEIITSTAASANGTPYASQVAQVRSTINNGLEEAEDVTDRDLPPLTAPVSPDSGY 239
Query: 875 VGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLL 932
AH ++ + + +F+ Y + +++ R+ K S + LVLDLD TL+
Sbjct: 240 SSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDLDETLV 299
Query: 933 NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEM 992
+ + E +D LF+ ++ +LRP FLER S+++E+
Sbjct: 300 HCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEI 345
Query: 993 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-S 1051
L+T K+YA ++ +LDPK L R+ + G+ KDL +LG + S
Sbjct: 346 ILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLS 398
Query: 1052 AVVIIDDSVRVWPHNKLNLIVVERY 1076
+IID+S + + + N I +E +
Sbjct: 399 KTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|123900520|sp|Q3KQB6.1|CTL2B_XENLA RecName: Full=CTD small phosphatase-like protein 2-B;
Short=CTDSP-like 2-B
gi|76779483|gb|AAI06291.1| Ctdspl2b protein [Xenopus laevis]
Length = 466
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
PE+G AH ++A+ + +F+ Y + +++ R+ K S + LVLD
Sbjct: 234 SPESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL VH + E +D LF+ ++ +LRP FLER
Sbjct: 294 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|348512759|ref|XP_003443910.1| PREDICTED: CTD small phosphatase-like protein 2-A-like isoform 1
[Oreochromis niloticus]
Length = 474
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/266 (25%), Positives = 117/266 (43%), Gaps = 50/266 (18%)
Query: 828 PLTSEPMVSQ------NSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQS 881
P + PM+ Q ++ I+ G+I + DM + + P P G++P S
Sbjct: 199 PCHTAPMLPQRLHITSDTTIEEGEIVTETDMPPL------------TAPGCMPDGSYPHS 246
Query: 882 --------AWGDVEHLFEGYDDQQKAA--IQKERTRRLEEQKKMFSARKLCLVLDLDHTL 931
++ + +F+ Y + +++ TR+ K S + LVLDLD TL
Sbjct: 247 LPSVPAEPSYDEDWEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETL 306
Query: 932 LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFE 991
++ + E +D LF+ ++ +LRP FLER S+L+E
Sbjct: 307 VHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQLYE 352
Query: 992 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
+ L+T K+YA ++ +LDPK L R+ + G+ KDL +LG +
Sbjct: 353 IILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDL 405
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERY 1076
S VIID+S + + + N I +E +
Sbjct: 406 SKTVIIDNSPQAFAYQLSNGIPIESW 431
>gi|126281910|ref|XP_001363358.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 [Monodelphis
domestica]
Length = 466
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 105/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + TN D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEVEEATNRDIPPLTAPVSPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL++ + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|387594493|gb|EIJ89517.1| hypothetical protein NEQG_00287 [Nematocida parisii ERTm3]
gi|387596665|gb|EIJ94286.1| hypothetical protein NEPG_00953 [Nematocida parisii ERTm1]
Length = 310
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 111/272 (40%), Gaps = 59/272 (21%)
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVI 1055
MGNK YA +A +LDP G LF R+ISR D+ FD KD++ + S VVI
Sbjct: 1 MGNKSYACSIAGLLDPTGKLFGSRIISRDDNFGCFD--------KDIKRLFPTNSKHVVI 52
Query: 1056 IDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE-----IDHDERSE------- 1103
+DD VW NL + Y YF + P L+ + D R+
Sbjct: 53 LDDRPDVWGFVD-NLYPIRPYYYFQTD----DINSPEALQGMKSALSEDVRNSPVGEVFR 107
Query: 1104 ------------------DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILA 1145
D L L ++ +H FFS Q D +IL ++++I
Sbjct: 108 NKNDLIELIDRECILTYFDNELEKVLSGLKEVHTEFFSTQQ----DTASIL-KKKKEIFK 162
Query: 1146 GCRI-VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 1204
GC +FS V E +L+ L++ G + ++VTH++ G + W
Sbjct: 163 GCTAEIFS---SVNEYTLYLNLLFK--HHGGQTFSHGSVNKVTHLITTGNGRIRSVWKNK 217
Query: 1205 TGRFV----VHPGWVEASALLYRRANEQDFAI 1232
+ V W+ S ++R +E++F +
Sbjct: 218 KDPVIDPVCVQVEWMHESIYAFKRLDEKEFIL 249
>gi|62078827|ref|NP_001014070.1| CTD small phosphatase-like protein 2 [Rattus norvegicus]
gi|81883796|sp|Q5XIK8.1|CTSL2_RAT RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
2
gi|53734232|gb|AAH83672.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2 [Rattus norvegicus]
gi|149023119|gb|EDL80013.1| similar to hypothetical protein HSPC129 [Rattus norvegicus]
Length = 465
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
PE+G AH ++ + + +F+ Y + +++ R+ K S + LVLD
Sbjct: 233 APESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 292
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL VH + E +D LF+ ++ +LRP FLER
Sbjct: 293 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 338
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 339 SQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 391
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 392 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422
>gi|357156637|ref|XP_003577524.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 2
[Brachypodium distachyon]
Length = 443
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 82/181 (45%), Gaps = 28/181 (15%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K+ S + LVLDLD TL++S L E+ D P F ++
Sbjct: 251 KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 296
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 1029
+ RP + FLER + +FE+ ++T +YA ++ VLDPK LF RV +G+
Sbjct: 297 VRCRPYLKEFLERVASMFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYRESCVYVEGN 356
Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
KDL VLG + A VVI+D+S + + N I +E + P + L
Sbjct: 357 YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLENGIPIESWFDDPNDKELLAL 406
Query: 1089 L 1089
L
Sbjct: 407 L 407
>gi|426233772|ref|XP_004010888.1| PREDICTED: CTD small phosphatase-like protein 2 [Ovis aries]
Length = 466
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 62/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D PE+G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|348512761|ref|XP_003443911.1| PREDICTED: CTD small phosphatase-like protein 2-A-like isoform 2
[Oreochromis niloticus]
Length = 471
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 68/266 (25%), Positives = 117/266 (43%), Gaps = 50/266 (18%)
Query: 828 PLTSEPMVSQ------NSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQS 881
P + PM+ Q ++ I+ G+I + DM + + P P G++P S
Sbjct: 196 PCHTAPMLPQRLHITSDTTIEEGEIVTETDMPPL------------TAPGCMPDGSYPHS 243
Query: 882 --------AWGDVEHLFEGYDDQQKAA--IQKERTRRLEEQKKMFSARKLCLVLDLDHTL 931
++ + +F+ Y + +++ TR+ K S + LVLDLD TL
Sbjct: 244 LPSVPAEPSYDEDWEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETL 303
Query: 932 LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFE 991
++ + E +D LF+ ++ +LRP FLER S+L+E
Sbjct: 304 VHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQLYE 349
Query: 992 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
+ L+T K+YA ++ +LDPK L R+ + G+ KDL +LG +
Sbjct: 350 IILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDL 402
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERY 1076
S VIID+S + + + N I +E +
Sbjct: 403 SKTVIIDNSPQAFAYQLSNGIPIESW 428
>gi|74190363|dbj|BAE37265.1| unnamed protein product [Mus musculus]
Length = 465
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
PE+G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 234 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 294 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 340 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 393 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422
>gi|30851260|gb|AAH52660.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2 [Mus musculus]
Length = 465
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
PE+G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 234 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 294 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 340 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 393 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422
>gi|71031738|ref|XP_765511.1| hypothetical protein [Theileria parva strain Muguga]
gi|68352467|gb|EAN33228.1| hypothetical protein TP02_0943 [Theileria parva]
Length = 769
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 56/119 (47%), Gaps = 14/119 (11%)
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
FP++ + KLRP I FL+ S +EM +YT K YA + +LDP LF R+++R
Sbjct: 339 FPNVNYYMKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARN 398
Query: 1026 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF 1079
D+ D R+ D +L DD VW PH + ++ E Y +F
Sbjct: 399 SVDEKDLLKSASRLYPDLDTRFILAF-------DDRRDVWSDIPHKQ--VVRAEHYDFF 448
>gi|26343511|dbj|BAC35412.1| unnamed protein product [Mus musculus]
Length = 464
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
PE+G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 233 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 292
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 293 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 338
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 339 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 391
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 392 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 421
>gi|47059059|ref|NP_997615.1| CTD small phosphatase-like protein 2 [Mus musculus]
gi|81873659|sp|Q8BG15.1|CTSL2_MOUSE RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
2
gi|26326063|dbj|BAC26775.1| unnamed protein product [Mus musculus]
gi|26329037|dbj|BAC28257.1| unnamed protein product [Mus musculus]
gi|26340192|dbj|BAC33759.1| unnamed protein product [Mus musculus]
gi|26349835|dbj|BAC38557.1| unnamed protein product [Mus musculus]
gi|148696133|gb|EDL28080.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2, isoform CRA_b [Mus musculus]
Length = 465
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
PE+G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 234 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 294 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 340 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 393 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422
>gi|148696132|gb|EDL28079.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2, isoform CRA_a [Mus musculus]
Length = 465
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
PE+G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 234 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 294 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 340 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 393 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422
>gi|354471693|ref|XP_003498075.1| PREDICTED: CTD small phosphatase-like protein 2 [Cricetulus griseus]
Length = 465
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
PE+G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 234 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 294 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 340 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 393 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422
>gi|114108339|gb|AAI23380.1| Ctdspl2a protein [Xenopus laevis]
Length = 536
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 99/210 (47%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
PE+G AH ++A+ + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 305 PESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 364
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 365 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 410
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 411 QIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NIL 463
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 464 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 493
>gi|26390099|dbj|BAC25842.1| unnamed protein product [Mus musculus]
Length = 351
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
PE+G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 120 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 179
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL++ + E +D LF+ ++ +LRP FLER S
Sbjct: 180 DETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 225
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 226 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 278
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 279 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 308
>gi|348685327|gb|EGZ25142.1| hypothetical protein PHYSODRAFT_311755 [Phytophthora sojae]
Length = 257
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 80/164 (48%), Gaps = 28/164 (17%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWT 973
+A K+CLVLDLD TL++ + VD V + PH F +
Sbjct: 72 NAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQFPVTFNGVEYTVNV 114
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
K RP + FL+R SKLFE+ ++T +K+YA ++ +LDP R + R D D F
Sbjct: 115 KKRPHLEYFLKRVSKLFEIVVFTASHKVYAEKLMNMLDPNRNFIKYR-LYREDCLDVFGN 173
Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + S VV++D+S + + N I +E +
Sbjct: 174 -----YLKDL-NVLGRDLSKVVLVDNSPHAFGYQVNNGIPIETW 211
>gi|254728746|gb|ACT79548.1| CTD phosphatase-like protein [Oryza sativa Japonica Group]
gi|254728748|gb|ACT79549.1| CTD phosphatase-like protein [Oryza sativa Indica Group]
gi|254728750|gb|ACT79550.1| CTD phosphatase-like protein [Oryza rufipogon]
gi|254728752|gb|ACT79551.1| CTD phosphatase-like protein [Oryza nivara]
Length = 462
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 75/155 (48%), Gaps = 22/155 (14%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL++S L E+ D P F F ++ + RP + F
Sbjct: 278 LVLDLDETLVHST------------LEPCEDADFAFPV--YFNFREHTIYVRCRPYLKEF 323
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
LER + LFE ++T +YA ++ VLDPK LF RV D +G+ KD
Sbjct: 324 LERVANLFETIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVYVEGNYL----KD 377
Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
L VLG + + ++I+D+S + + N I +E +
Sbjct: 378 L-TVLGRDLTRIMIVDNSPQAFGFQLDNGIPIESW 411
>gi|254728754|gb|ACT79552.1| CTD phosphatase-like protein [Oryza glaberrima]
Length = 462
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 75/155 (48%), Gaps = 22/155 (14%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL++S L E+ D P F F ++ + RP + F
Sbjct: 278 LVLDLDETLVHST------------LEPCEDADFAFPV--YFNFREHTIYVRCRPYLKEF 323
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
LER + LFE ++T +YA ++ VLDPK LF RV D +G+ KD
Sbjct: 324 LERVANLFETIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVYVEGNYL----KD 377
Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
L VLG + + ++I+D+S + + N I +E +
Sbjct: 378 L-TVLGRDLTRIMIVDNSPQAFGFQLDNGIPIESW 411
>gi|242072230|ref|XP_002446051.1| hypothetical protein SORBIDRAFT_06g001010 [Sorghum bicolor]
gi|241937234|gb|EES10379.1| hypothetical protein SORBIDRAFT_06g001010 [Sorghum bicolor]
Length = 447
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 52/114 (45%), Gaps = 24/114 (21%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFP 967
K+ S LVLDLD TL++S H D PVH F F
Sbjct: 253 KQTRSCPTTTLVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFQ 293
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
++ + RP + FLER + +FE ++T +YA ++ VLDPK LF RV
Sbjct: 294 EHTIYVRCRPYLKEFLERVASMFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 347
>gi|118369793|ref|XP_001018099.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89299866|gb|EAR97854.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 874
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 98/207 (47%), Gaps = 35/207 (16%)
Query: 889 LFEGYDDQQKAAI----QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 944
++ G D + K+ + E +++L Q+ + S +KL LVLDLD+T+L H V +
Sbjct: 247 VYAGLDQKDKSVLIGKEYAEYSKKLAHQQ-LHSNQKLILVLDLDNTIL-----HAVPAIK 300
Query: 945 DEILRKKE--EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
+ + + +QD K F + K RP + FL+ +E++++TM Y
Sbjct: 301 NALFDNADGIQQDSFKE----FHNRYSKYVIKFRPYMKEFLQTVLPHYEIYIFTMAMLDY 356
Query: 1003 ATEMAK--------VLDPKGVLF-AGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESA 1052
A + +LD + F R+ISR + F + +KDL+ +L E
Sbjct: 357 AKCVCDYLKQTYKDILDDYPMTFNYDRIISR----EQFSSN-----NKDLQQILPNSEKI 407
Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
++I+DD VW NK+NL+ Y Y+
Sbjct: 408 MLILDDRDDVWAKNKMNLVTTLPYIYW 434
>gi|294905859|ref|XP_002777694.1| hypothetical protein Pmar_PMAR016441 [Perkinsus marinus ATCC 50983]
gi|239885585|gb|EER09510.1| hypothetical protein Pmar_PMAR016441 [Perkinsus marinus ATCC 50983]
Length = 523
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 87/211 (41%), Gaps = 55/211 (26%)
Query: 920 KLCLVLDLDHTLLNSAKFHE--------VDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
+L +VLDLD T++NS + + V P+ EI R +E P +L + +
Sbjct: 63 RLDVVLDLDRTMVNSFEIRKAGRSESENVTPILQEIYRDEEGL----PELYLCVISDVKV 118
Query: 972 WTKLRPGIWTFLER--ASKLFE--MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR--- 1024
TK+RP F+ AS + + +YT G++ Y + ++LDP G L GR++SR
Sbjct: 119 LTKIRPHARAFIRELVASTDYGVVISIYTKGSRRYMEVVKQMLDPSGELIKGRLVSRDDE 178
Query: 1025 --------------------------GDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDD 1058
G DG +GD+ +S+ M V++DD
Sbjct: 179 PSNMTPVEKDPDLIINASIESGAQVDGSDGRLCNGDKETKESE-------MRRWFVVLDD 231
Query: 1059 SVRVWP---HNKLNLIVVERYTYFPCSRRQF 1086
S WP N++ Y + + RQ
Sbjct: 232 SPEAWPEELREAGNVVTANMYDFAEVNHRQL 262
>gi|221057654|ref|XP_002261335.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|194247340|emb|CAQ40740.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 1389
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 80/177 (45%), Gaps = 26/177 (14%)
Query: 958 KPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
+P + F P+ + K RP + FL+ S +E+ +YT + YA + +LDP
Sbjct: 948 EPELYKFFLPYYNFFYYLKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRT 1007
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNKLNL 1070
LFA R+++R + D ++K+ + ++S VI DD VW PH+ N+
Sbjct: 1008 LFADRIVARCNSADR-------EENKNFSKIYPNVDSKYVIAFDDRKDVWTDIPHS--NI 1058
Query: 1071 IVVERYTYFPCSRRQF--GLLGPSLLE---IDHDERSEDGTLASSLGVIERLHKIFF 1122
+ E Y +F S+ PS + +D D L V+ +LHK FF
Sbjct: 1059 LKAEHYNFFELSKYDIISHFKEPSTCKKRFVDMDMH-----LHFMTKVLLKLHKHFF 1110
>gi|399215917|emb|CCF72605.1| unnamed protein product [Babesia microti strain RI]
Length = 664
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 118/297 (39%), Gaps = 68/297 (22%)
Query: 959 PHRHLFRFPH---MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
P + F P + + KLRP + FL S +EM +YT + YA + +LDP
Sbjct: 212 PELYTFTLPSYADVSYYLKLRPRLREFLHILSFYYEMSIYTNATREYADVVIAILDPDRS 271
Query: 1016 LFAGRVISRGDDGDPFDGDER--VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV- 1072
LF R+I+RG G++R ++ L L + VV DD VW N ++
Sbjct: 272 LFMDRIIARG------GGNDRGLTKSARRLYPKLS-QRFVVSFDDRRDVWTDIDPNQVLK 324
Query: 1073 VERYTYF------------------PCSRRQFGLLGPSLLEIDHDERSEDGTL--ASSLG 1112
Y+YF P S + L G S + + S+ G + +SS+G
Sbjct: 325 AHHYSYFMENLPQNIGKFKTDIPAMPNSTSKTSLDGKSQINGLTLDLSQSGAINDSSSVG 384
Query: 1113 ---------------------VIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVF 1151
V LHK F+S+ +V ILA Q++ L+G I F
Sbjct: 385 CNSSVNNPLQDFDMHLMYMTKVFLELHKRFYSNPL--KANVGTILAEMQKETLSGVGIFF 442
Query: 1152 SRVFPVGEANPHLHPLWQTAE--------QFGA-VCTKHIDDQVTHVVANSLGTDKV 1199
+ F N W E +FGA V K +D + THVVA TD +
Sbjct: 443 TG-FRKNVKNA--VSGWTDCEVRQKEMALEFGAHVVDKLVDKKTTHVVAAKNCTDNL 496
>gi|125557643|gb|EAZ03179.1| hypothetical protein OsI_25332 [Oryza sativa Indica Group]
gi|125599502|gb|EAZ39078.1| hypothetical protein OsJ_23510 [Oryza sativa Japonica Group]
Length = 461
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 75/155 (48%), Gaps = 22/155 (14%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL++S L E+ D P F F ++ + RP + F
Sbjct: 278 LVLDLDETLVHST------------LEPCEDADFAFPV--YFNFREHTIYVRCRPYLKEF 323
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
LER + LFE ++T +YA ++ VLDPK LF RV D +G+ KD
Sbjct: 324 LERVANLFETIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVYVEGNYL----KD 377
Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
L VLG + + ++I+D+S + + N I +E +
Sbjct: 378 L-TVLGRDLTRIMIVDNSPQAFGFQLDNGIPIESW 411
>gi|82541597|ref|XP_725029.1| NLI interacting factor [Plasmodium yoelii yoelii 17XNL]
gi|23479881|gb|EAA16594.1| NLI interacting factor, putative [Plasmodium yoelii yoelii]
Length = 1177
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 112/265 (42%), Gaps = 37/265 (13%)
Query: 955 DREKPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
+ ++P + F P+ + K RP + FLE S +E+ +YT + YA + +LDP
Sbjct: 759 ENDEPELYKFFLPYYNFFYYLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDP 818
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNK 1067
+FA R+++R D DE +K E + ++ VI DD VW PH+
Sbjct: 819 DRTIFADRIVAR---CSSVDRDE----NKHFEKIYPNVDPKYVIAFDDRKDVWFDIPHS- 870
Query: 1068 LNLIVVERYTYFPCSRRQF--GLLGPSLLE---IDHDERSEDGTLASSLGVIERLHKIFF 1122
+++ E Y +F S+ PS + +D D L + + ++HK FF
Sbjct: 871 -HILRAEHYNFFELSKYDIISHFKEPSTCKKRFVDMDMH-----LHYMIKIFLKIHKQFF 924
Query: 1123 SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAE-------QFG 1175
+ +VDV I+ L+ + F+ F N L E + G
Sbjct: 925 ENPL--NVDVGKIIDNIMLSTLSNVGLYFTG-FRKNSKNLQNVILSDCEERQKEIALELG 981
Query: 1176 AVCTKHIDD-QVTHVVANSLGTDKV 1199
A + DD VTH++A TD +
Sbjct: 982 AKIYSNYDDPGVTHIIAAKNCTDNL 1006
>gi|410912504|ref|XP_003969729.1| PREDICTED: CTD small phosphatase-like protein 2-like [Takifugu
rubripes]
Length = 474
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 147/358 (41%), Gaps = 59/358 (16%)
Query: 735 MDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSK------ENLNFQK 788
++E V + PR L G P F P+ + GS E K
Sbjct: 117 LEETTAVEVPSSAPRTTLLGTIF------SPVFNFFSPAKNASSGSDSPDQAMEAEEIVK 170
Query: 789 QLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPL--TSEPMVSQNSPIQPGQI 846
QL + Q+V P T +L ++F S PL P V + SP+ +
Sbjct: 171 QLD-------MEQAVEMPTSTAMSPHDLCVASNFHSSVSPLPPLRPPHVPEASPLAVEE- 222
Query: 847 KSGADMKAVVTNHDDKQTGTGSGP-----EAGPVGAHPQSAWGDVEHLFEGYDDQQKAA- 900
+ AD+ + T GS P EA P++++ + +F+ Y +
Sbjct: 223 ELDADLPPL--------TAPGSSPDMTYVEAPLAAVPPEASYEEDWEVFDPYFFIKHVPP 274
Query: 901 -IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP 959
+++ TR+ K S + LVLDLD TL++ + E +D
Sbjct: 275 LTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALT 320
Query: 960 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
LF+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L
Sbjct: 321 FPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRH 380
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
R+ + G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 381 RLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 431
>gi|47220514|emb|CAG05540.1| unnamed protein product [Tetraodon nigroviridis]
Length = 473
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 110/249 (44%), Gaps = 36/249 (14%)
Query: 835 VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAG-PVGAHPQSAWGDVEHLFEGY 893
++ +S I+ G+I + DM + T G P++G P P A E +E +
Sbjct: 211 INADSSIEEGEIVAETDMPPL--------TAPGCMPDSGYPHTLPPAPAETSYEEDWEVF 262
Query: 894 D-----DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL 948
D +++ TR+ K S + LVLDLD TL VH +
Sbjct: 263 DPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL- 310
Query: 949 RKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
E +D LF+ ++ +LRP FLER S+ +E+ L+T K+YA ++
Sbjct: 311 --NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQKYEIILFTASKKVYADKLLN 368
Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNK 1067
+LDP+ L R+ + G+ KDL +LG + S +IID+S + + +
Sbjct: 369 ILDPRKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQL 421
Query: 1068 LNLIVVERY 1076
N I +E +
Sbjct: 422 SNGIPIESW 430
>gi|330864811|ref|NP_001178334.1| CTD small phosphatase-like protein 2 [Bos taurus]
gi|296482877|tpg|DAA24992.1| TPA: CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2 [Bos taurus]
gi|440911957|gb|ELR61572.1| CTD small phosphatase-like protein 2 [Bos grunniens mutus]
Length = 466
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|224116454|ref|XP_002317305.1| predicted protein [Populus trichocarpa]
gi|222860370|gb|EEE97917.1| predicted protein [Populus trichocarpa]
Length = 377
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 71/149 (47%), Gaps = 28/149 (18%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S H D + F ++ K RP
Sbjct: 203 KSITLVLDLDETLVHSTLEHC--------------DDADFTFTVFFNMKEHTVYVKQRPH 248
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
+ TFLER +++FE+ ++T +YA ++ +LDP L + R+ DG
Sbjct: 249 VHTFLERVAEMFEVVIFTASQSIYAAQLLDMLDPDRKLISRRIYRESCIFSDG------- 301
Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V IID+S +V+
Sbjct: 302 --SYTKDL-TVLGVDLAKVAIIDNSPQVF 327
>gi|167384602|ref|XP_001737021.1| RNA polymerase II ctd phosphatase [Entamoeba dispar SAW760]
gi|165900378|gb|EDR26711.1| RNA polymerase II ctd phosphatase, putative [Entamoeba dispar SAW760]
Length = 429
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/301 (26%), Positives = 126/301 (41%), Gaps = 62/301 (20%)
Query: 880 QSAWGDVEHLFEGYDDQQKAAIQKERTRRL-EEQKKMFSAR-----KLCLVLDLDHTLLN 933
Q+ D L E DD + + T+ EEQK+ S R KL L+LDLD T++
Sbjct: 15 QNYCVDCYQLIEDVDDYIRTSGGYGITKSYAEEQKRSVSERLLKEKKLSLILDLDGTIVF 74
Query: 934 SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFE 991
+ V P+ +E E+ F P + K R GI TF+E+ SKL++
Sbjct: 75 TNPELCV-PLENE------EEPITPEQGFYFEIPEQNAKVLIKFRDGIVTFMEKVSKLYD 127
Query: 992 MHLYTMGNKLYATEMAKVLDP-------KGVLFAGRVISR---GDDGDPFDG-------D 1034
+H+ T+G K YA + ++ G L S D+ D DG +
Sbjct: 128 IHVVTLGQKEYAFAIVNAINKLRDTPFITGDLVTAEDCSSVIVCDEKDTNDGLIDREETN 187
Query: 1035 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 1094
ER + + +G E VI+DD + VW + N++ + Y PS
Sbjct: 188 ERRSVKRSI-PTMGKEEMQVIVDDRIDVWDNK--NVVQICEYV-------------PSTN 231
Query: 1095 EIDHDERSEDGTLASSLGVIERLHKIF--FSHQSLDDVDVRNILAAEQRKILAGCRIVFS 1152
++D L V E L I+ F + ++ DV+ IL + ++KIL + F+
Sbjct: 232 QVD----------TELLRVTEVLQNIYNKFYDEHIE--DVKEILHSFRKKILENKNLYFN 279
Query: 1153 R 1153
R
Sbjct: 280 R 280
>gi|332235387|ref|XP_003266885.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Nomascus
leucogenys]
gi|332235389|ref|XP_003266886.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Nomascus
leucogenys]
Length = 466
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|149692003|ref|XP_001502897.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 2 [Equus
caballus]
gi|149692005|ref|XP_001502892.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 1 [Equus
caballus]
Length = 466
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|356566193|ref|XP_003551319.1| PREDICTED: CTD small phosphatase-like protein 2-like [Glycine max]
Length = 403
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 80/174 (45%), Gaps = 28/174 (16%)
Query: 907 RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
RRL K+ S LVLDLD TL++S L E+ D P F
Sbjct: 198 RRLLLPKQTRSCPSTTLVLDLDETLVHST------------LEHCEDVDFTFPVN--FNS 243
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
++ + RP + FLER S LFE+ ++T +YA ++ VLDPK +F RV
Sbjct: 244 EEHIVYVRCRPHLKDFLERVSGLFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVYRESC 303
Query: 1027 ---DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL VLG + A V+IID+S + + N I +E +
Sbjct: 304 VYVEGNYL---------KDLT-VLGRDLAHVIIIDNSPQAFGFQVDNGIPIESW 347
>gi|417401418|gb|JAA47595.1| Putative ctd carboxy-terminal domain rna polymer [Desmodus rotundus]
Length = 466
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|410961377|ref|XP_003987259.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Felis
catus]
gi|410961379|ref|XP_003987260.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Felis
catus]
Length = 466
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|403274413|ref|XP_003928971.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403274415|ref|XP_003928972.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 466
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 104/236 (44%), Gaps = 24/236 (10%)
Query: 844 GQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AI 901
I +G + N D P++G AH ++ + + +F+ Y +
Sbjct: 209 ASINNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLT 268
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
+++ R+ K S + LVLDLD TL VH + E +D
Sbjct: 269 EEQLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFP 314
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
LF+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 315 VLFQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 375 FR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|388453109|ref|NP_001253738.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2 [Macaca mulatta]
gi|114656732|ref|XP_001161756.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 3 [Pan
troglodytes]
gi|114656734|ref|XP_001161793.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 4 [Pan
troglodytes]
gi|297696523|ref|XP_002825440.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 1 [Pongo
abelii]
gi|395746659|ref|XP_003778487.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 2 [Pongo
abelii]
gi|380813572|gb|AFE78660.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
gi|383419005|gb|AFH32716.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
gi|384947558|gb|AFI37384.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
gi|410206686|gb|JAA00562.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2 [Pan troglodytes]
gi|410253512|gb|JAA14723.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2 [Pan troglodytes]
gi|410302524|gb|JAA29862.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2 [Pan troglodytes]
gi|410341327|gb|JAA39610.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2 [Pan troglodytes]
Length = 466
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|397480304|ref|XP_003811426.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Pan
paniscus]
gi|397480306|ref|XP_003811427.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Pan
paniscus]
Length = 466
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|57108473|ref|XP_544655.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 1 [Canis
lupus familiaris]
gi|73999941|ref|XP_860654.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 4 [Canis
lupus familiaris]
Length = 466
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|402874166|ref|XP_003900915.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Papio
anubis]
gi|402874168|ref|XP_003900916.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Papio
anubis]
Length = 466
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|6841354|gb|AAF29030.1|AF161543_1 HSPC058 [Homo sapiens]
Length = 352
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
+ +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 97 LNNGLEEAEETVNRDIPTLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 156
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL++ + E +D L
Sbjct: 157 QLNRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALTFPVL 202
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 203 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 262
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 263 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 309
>gi|296213856|ref|XP_002753450.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 3 [Callithrix
jacchus]
Length = 466
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 104/236 (44%), Gaps = 24/236 (10%)
Query: 844 GQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AI 901
I +G + N D P++G AH ++ + + +F+ Y +
Sbjct: 209 ASINNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLT 268
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
+++ R+ K S + LVLDLD TL VH + E +D
Sbjct: 269 EEQLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFP 314
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
LF+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 315 VLFQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 375 FR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|209156270|gb|ACI34367.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Salmo salar]
Length = 367
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 80/156 (51%), Gaps = 22/156 (14%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL+ S+ ++ + +D E R F+ ++ LRP + F
Sbjct: 191 LVLDLDETLMYSS----LNVI----------EDAEYTFRTCFQDNPYKVYVILRPYVKEF 236
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
LE +K FEM +YT K YA ++ +LDPK LF R+ + D G KD
Sbjct: 237 LEAMTKHFEMFVYTSAKKEYAEKILDILDPKRRLFRHRLYQQ--DCACVLGH----YVKD 290
Query: 1043 LEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYT 1077
L GVL + A V++D++ +P++ +N++ ++ ++
Sbjct: 291 L-GVLERDLAKTVVLDNAPHTYPYHLMNVLPIKSWS 325
>gi|100815975|ref|NP_057480.2| CTD small phosphatase-like protein 2 [Homo sapiens]
gi|187471086|sp|Q05D32.2|CTSL2_HUMAN RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
2
gi|23273027|gb|AAH35744.1| CTDSPL2 protein [Homo sapiens]
gi|71835542|gb|AAZ42188.1| unknown [Homo sapiens]
gi|119597671|gb|EAW77265.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2, isoform CRA_a [Homo sapiens]
gi|119597672|gb|EAW77266.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2, isoform CRA_a [Homo sapiens]
gi|123994825|gb|ABM85014.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2 [synthetic construct]
gi|157928777|gb|ABW03674.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase like 2 [synthetic construct]
gi|158255896|dbj|BAF83919.1| unnamed protein product [Homo sapiens]
gi|168278020|dbj|BAG10988.1| CTD small phosphatase like 2 [synthetic construct]
Length = 466
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 235 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 295 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 340
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 341 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 394 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|68068525|ref|XP_676173.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56495746|emb|CAI00611.1| conserved hypothetical protein [Plasmodium berghei]
Length = 953
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 115/267 (43%), Gaps = 41/267 (15%)
Query: 955 DREKPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
+ ++P + F P+ + K RP + FLE S +E+ +YT + YA + +LDP
Sbjct: 598 ENDEPELYKFFLPYYNFFYYLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDP 657
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNK 1067
+FA R+++R D DE +K E + ++ VI DD VW PH+
Sbjct: 658 DRTIFADRIVAR---CSSVDRDE----NKHFEKIYPNVDPKYVIAFDDRKDVWFDIPHS- 709
Query: 1068 LNLIVVERYTYFPCSRRQF--GLLGPSLLE---IDHDERSEDGTLASSLGVIERLHKIFF 1122
+++ E Y +F S+ PS + +D D L + + ++HK FF
Sbjct: 710 -HILRAEHYNFFELSKYDIISHFKEPSTCKKRFVDMDMH-----LHYMIKIFLKIHKQFF 763
Query: 1123 SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT---------AEQ 1173
+ +VDV I+ L+ + F+ + + +L + T A +
Sbjct: 764 ENPL--NVDVGKIIDNIMLSTLSNVGLYFT---GFRKNSKNLQNVISTDCEERQKEIALE 818
Query: 1174 FGAVCTKHIDD-QVTHVVANSLGTDKV 1199
GA + DD VTH++A TD +
Sbjct: 819 LGAKIYSNYDDPGVTHIIAAKNCTDNL 845
>gi|426201370|gb|EKV51293.1| hypothetical protein AGABI2DRAFT_114027 [Agaricus bisporus var.
bisporus H97]
Length = 814
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 37/60 (61%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ K RPG FL + ++MH+YTMG + YA E+ +DP G +F R++SR + G+ F
Sbjct: 269 YIKPRPGWKEFLMDMATKYDMHVYTMGTRAYAEEVCAAIDPDGSVFKSRILSRDESGNDF 328
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 48/102 (47%), Gaps = 2/102 (1%)
Query: 1130 VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 1189
+V I+ + ++L G +VFS V P+ + P W+ A FGA C + VTHV
Sbjct: 500 FNVTTIIPRLRSEVLGGLHLVFSGVIPL-DTPPETTEFWRLARMFGAKCHTDLTPDVTHV 558
Query: 1190 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 1230
+ GT KV A G +V P W S L+ R +E +
Sbjct: 559 ITAKRGTKKVETARQRGGIKIVRPEWFNDSIALWHRQDETPY 600
>gi|291403116|ref|XP_002717973.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 [Oryctolagus
cuniculus]
Length = 286
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
PE+G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 55 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 114
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL++ + E +D LF+ ++ +LRP FLER S
Sbjct: 115 DETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 160
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 161 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 213
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 214 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 243
>gi|55740281|gb|AAV63942.1| putative nuclear LIM factor interactor-interacting protein hyphal
form [Phytophthora infestans]
Length = 211
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 74/147 (50%), Gaps = 28/147 (19%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWT 973
+A K+CLVLDLD TL++ + VD V + PH F +
Sbjct: 25 NAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQFPVTFNGVEYIVNV 67
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
K RP + FL+R SKLFE+ ++T +K+YA ++ +LDP L R + R D D F
Sbjct: 68 KKRPHMEYFLKRVSKLFEIVVFTASHKVYAEKLTNMLDPHRNLIKYR-LYRDDCLDVFGN 126
Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL VLG + S VV++D+S
Sbjct: 127 -----YLKDL-NVLGRDLSKVVLVDNS 147
>gi|344297040|ref|XP_003420208.1| PREDICTED: CTD small phosphatase-like protein 2 [Loxodonta africana]
Length = 466
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEGTVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|221488107|gb|EEE26321.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii GT1]
gi|221508626|gb|EEE34195.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii VEG]
Length = 1139
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/335 (23%), Positives = 124/335 (37%), Gaps = 67/335 (20%)
Query: 958 KPHRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
+P + F P + KLRP + TFL++ +EM +YT + YA + +LD
Sbjct: 664 EPELYRFELPCNRKTYYMKLRPHLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDGNRQ 723
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV-VE 1074
LF R+++R D G + E + EG+ + +V DD +W L +V +
Sbjct: 724 LFQDRIVAR-DSGFRGEASENKAVRRLYEGM--DKRCIVAFDDRQNIWTDLPLTHVVKAQ 780
Query: 1075 RYTYF------------PCSRRQFGLLGPSLLE--------------------IDHDERS 1102
Y +F P S G+ P +L + D +S
Sbjct: 781 HYDFFDSHKTELNAYYPPLSNGIEGMTAPDVLGNRQNEDEGVMDPVGCGSFPPVGQDTQS 840
Query: 1103 E---------------------DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 1141
D L L + LH FF + + ++ IL Q+
Sbjct: 841 TNTPVQQHLSRAAADGKKPCDWDRHLECMLKLFLHLHAEFF--KDPVNANIGAILCNFQQ 898
Query: 1142 KILAGCRIVFS---RVFPVGEANPHLHPLW-QTAEQFGAVCTKHIDDQ-VTHVVANSLGT 1196
K+L+G I F+ + F G A + A++ GA K D++ VTHVVA T
Sbjct: 899 KVLSGVGIFFTGFRKTFSPGAAVADCEERQAELAQRLGAKVYKRFDEEGVTHVVAGKNNT 958
Query: 1197 DKVNWALSTGRFV-VHPGWVEASALLYRRANEQDF 1230
+ + + VH W+ R E F
Sbjct: 959 NNMLACKENTKLARVHTLWLYCCEAALARVPESAF 993
>gi|237832707|ref|XP_002365651.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
gi|211963315|gb|EEA98510.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
Length = 1139
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/335 (23%), Positives = 124/335 (37%), Gaps = 67/335 (20%)
Query: 958 KPHRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
+P + F P + KLRP + TFL++ +EM +YT + YA + +LD
Sbjct: 664 EPELYRFELPCNRKTYYMKLRPHLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDGNRQ 723
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV-VE 1074
LF R+++R D G + E + EG+ + +V DD +W L +V +
Sbjct: 724 LFQDRIVAR-DSGFRGEASENKAVRRLYEGM--DKRCIVAFDDRQNIWTDLPLTHVVKAQ 780
Query: 1075 RYTYF------------PCSRRQFGLLGPSLLE--------------------IDHDERS 1102
Y +F P S G+ P +L + D +S
Sbjct: 781 HYDFFDSHKTELNAYYPPLSNGIEGMTAPDVLGNRQNEDEGVMDPVGCGSFPPVGQDTQS 840
Query: 1103 E---------------------DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 1141
D L L + LH FF + + ++ IL Q+
Sbjct: 841 TNTPVQQHLSRAAADGKKPCDWDRHLECMLKLFLHLHAEFF--KDPVNANIGAILCNFQQ 898
Query: 1142 KILAGCRIVFS---RVFPVGEANPHLHPLW-QTAEQFGAVCTKHIDDQ-VTHVVANSLGT 1196
K+L+G I F+ + F G A + A++ GA K D++ VTHVVA T
Sbjct: 899 KVLSGVGIFFTGFRKTFSPGAAVADCEERQAELAQRLGAKVYKRFDEEGVTHVVAGKNNT 958
Query: 1197 DKVNWALSTGRFV-VHPGWVEASALLYRRANEQDF 1230
+ + + VH W+ R E F
Sbjct: 959 NNMLACKENTKLARVHTLWLYCCEAALARVPESAF 993
>gi|34596232|gb|AAQ76796.1| hypothetical protein [Homo sapiens]
Length = 466
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 235 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 295 DETL-----------VHCSL---NELEDAALTFPVLFQDVVYQVYVRLRPFFREFLERMS 340
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 341 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 394 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|413917758|gb|AFW57690.1| hypothetical protein ZEAMMB73_437679 [Zea mays]
Length = 449
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 24/114 (21%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFP 967
K+ S + LVLDLD TL++S H D PVH F F
Sbjct: 255 KQTRSCPTMTLVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFR 295
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
++ + RP + FL+R + +FE ++T +YA ++ VLDPK LF RV
Sbjct: 296 EHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 349
>gi|413917759|gb|AFW57691.1| hypothetical protein ZEAMMB73_437679 [Zea mays]
Length = 451
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 24/114 (21%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFP 967
K+ S + LVLDLD TL++S H D PVH F F
Sbjct: 255 KQTRSCPTMTLVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFR 295
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
++ + RP + FL+R + +FE ++T +YA ++ VLDPK LF RV
Sbjct: 296 EHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 349
>gi|410908573|ref|XP_003967765.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Takifugu
rubripes]
Length = 474
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 64/249 (25%), Positives = 108/249 (43%), Gaps = 36/249 (14%)
Query: 835 VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAG-PVGAHPQSAWGDVEHLFEGY 893
+S +S ++ G+I + DM + T G P+ G P P A E +E +
Sbjct: 212 ISSDSTVEEGEIVTETDMPPL--------TAPGCMPDGGYPHMLPPAPAETSYEEDWEVF 263
Query: 894 D-----DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL 948
D +++ TR+ K S + LVLDLD TL++ +
Sbjct: 264 DPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL------------ 311
Query: 949 RKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
E +D LF+ ++ +LRP FLER + +E+ L+T K+YA ++
Sbjct: 312 --NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMCQKYEIILFTASKKVYADKLLN 369
Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNK 1067
+LDP+ L R+ + G+ KDL +LG + S +IID+S + + +
Sbjct: 370 ILDPRKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQL 422
Query: 1068 LNLIVVERY 1076
N I +E +
Sbjct: 423 SNGIPIESW 431
>gi|209882178|ref|XP_002142526.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
gi|209558132|gb|EEA08177.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
Length = 710
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 74/155 (47%), Gaps = 20/155 (12%)
Query: 938 HEVDPVHDEILRKKEEQDREKPHRHLFRFPH-------MGMWT----KLRPGIWTFLERA 986
H +D D I + E+ R+ + +F P+ + W+ KLRPG+ L R
Sbjct: 251 HYIDE-EDNIFGLEAEKYRQLIEKLIFCIPYPNSSNNGIDNWSQGFYKLRPGVLNMLRRL 309
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
FE+++YTMG +L+A +++DP+ F + + ++G + SK L +
Sbjct: 310 KDKFELYMYTMGTELHAYSALRIIDPEFRFFHPKRLFYRNNG------FKDCNSKSLSTL 363
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+ +++IDD + W N +LI V Y +FP
Sbjct: 364 FPYDHRTLIVIDDIEQAWS-NSNSLIKVYPYNFFP 397
>gi|6841480|gb|AAF29093.1|AF161478_1 HSPC129 [Homo sapiens]
Length = 466
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 235 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 295 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 340
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 341 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 394 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|326926934|ref|XP_003209651.1| PREDICTED: CTD small phosphatase-like protein 2-like [Meleagris
gallopavo]
Length = 468
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLD
Sbjct: 236 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 295
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL VH + E +D LF+ ++ +LRP FLER
Sbjct: 296 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 341
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 342 SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 394
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 395 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 425
>gi|401408967|ref|XP_003883932.1| hypothetical protein NCLIV_036820 [Neospora caninum Liverpool]
gi|325118349|emb|CBZ53900.1| hypothetical protein NCLIV_036820 [Neospora caninum Liverpool]
Length = 1149
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/338 (23%), Positives = 128/338 (37%), Gaps = 73/338 (21%)
Query: 958 KPHRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
+P + F P + KLRP + TFL++ +EM +YT + YA + +LD
Sbjct: 674 EPELYRFELPCNRKTYYMKLRPYLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDDNRQ 733
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV-VE 1074
LF R+++R D G + E + EG+ + +V DD +W L +V +
Sbjct: 734 LFQDRIVAR-DSGFRGEASENKAVRRLYEGM--DKRCIVAFDDRQNIWTDLPLTHVVKAQ 790
Query: 1075 RYTYFPCSRRQF----------------------------GLLGP----SLLEIDHDERS 1102
Y +F + + G++ P SL + D++S
Sbjct: 791 HYDFFDSHKAELNAYYPPLSNGIGGMATADIMGNRQNEDEGMMDPVGCSSLSQAGQDQQS 850
Query: 1103 E---------------------DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 1141
D L L + LH FF + + ++ IL Q+
Sbjct: 851 TNTPVQQHLSRAAAEGKKPCDWDRHLECMLKLFLHLHTEFF--KDPINANIGAILCNFQQ 908
Query: 1142 KILAGCRIVFS---RVFPVGEANPHLHPLW-QTAEQFGAVCTKHIDDQ-VTHVVANSLGT 1196
K+L+G I F+ + F G A + A++ GA K D++ VTHVVA G
Sbjct: 909 KVLSGVGIFFTGFRKTFSPGAAVADCEERQAELAQRLGAKVYKRYDEEGVTHVVA---GK 965
Query: 1197 DKVNWALS----TGRFVVHPGWVEASALLYRRANEQDF 1230
+ N L+ T VH W+ R E F
Sbjct: 966 NNTNNMLACKENTNLARVHTLWLYCCEAALARVPESAF 1003
>gi|294915786|ref|XP_002778342.1| hypothetical protein Pmar_PMAR015889 [Perkinsus marinus ATCC 50983]
gi|239886620|gb|EER10137.1| hypothetical protein Pmar_PMAR015889 [Perkinsus marinus ATCC 50983]
Length = 278
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 84/204 (41%), Gaps = 41/204 (20%)
Query: 920 KLCLVLDLDHTLLNSAKFHE--------VDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
+L +VLDLD T++NS + + V P+ EI R ++ P +L + +
Sbjct: 45 RLDVVLDLDRTMVNSFEIRKAGRSESENVTPILQEIYRDEQGL----PELYLCVISDVKV 100
Query: 972 WTKLRPGIWTFLERASKLFE----MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
TK+RP F+ E + +YT G++ Y + ++LDP G L GR++SR D+
Sbjct: 101 LTKIRPHARAFIRELVASTEYGVVISIYTKGSRRYMEVVKQMLDPSGELIKGRLVSRDDE 160
Query: 1028 ---------------------GDPFDG-DERVPKSKDLEGVLGMESAVVIIDDSVRVWP- 1064
G FDG D R+ M V++DDS WP
Sbjct: 161 PSNMTPVEKDPDLIINASVESGAQFDGSDGRLCNGDKETKESEMRRWFVVLDDSPEAWPE 220
Query: 1065 --HNKLNLIVVERYTYFPCSRRQF 1086
N++ Y + + RQ
Sbjct: 221 ELREAGNVVTANMYDFAEVNHRQL 244
>gi|357450577|ref|XP_003595565.1| CTD small phosphatase-like protein [Medicago truncatula]
gi|355484613|gb|AES65816.1| CTD small phosphatase-like protein [Medicago truncatula]
Length = 460
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 70/149 (46%), Gaps = 28/149 (18%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S H D + F ++ K RP
Sbjct: 286 KSVTLVLDLDETLVHSTLEHC--------------DDADFTFNIFFNMKDYIVYVKQRPF 331
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
+ FLER S +FE+ ++T +YA ++ +LDP + R+ DG+
Sbjct: 332 LHKFLERVSDMFEVVIFTASQSIYANQLLDILDPDEKFISRRLYRESCMFSDGN------ 385
Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A VVIID+S +V+
Sbjct: 386 ---YTKDL-TILGIDLAKVVIIDNSPQVF 410
>gi|61098234|ref|NP_001012790.1| CTD small phosphatase-like protein 2 [Gallus gallus]
gi|60098613|emb|CAH65137.1| hypothetical protein RCJMB04_4a24 [Gallus gallus]
Length = 468
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLD
Sbjct: 236 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 295
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL VH + E +D LF+ ++ +LRP FLER
Sbjct: 296 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 341
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 342 SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 394
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 395 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 425
>gi|187471087|sp|Q5F3Z7.2|CTSL2_CHICK RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
2
Length = 466
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLD
Sbjct: 234 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL VH + E +D LF+ ++ +LRP FLER
Sbjct: 294 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 392
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|403332687|gb|EJY65381.1| hypothetical protein OXYTRI_14465 [Oxytricha trifallax]
Length = 927
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 84/168 (50%), Gaps = 21/168 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
++Q+K+++ L+LD+D TL+ + P + +I++ H +
Sbjct: 468 KQQQKLYT-----LILDMDETLIYCRQ--NPYPGYQDIIQATSS-------AHNTYSCQV 513
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
++T RP + FLE+ S++FE+ ++T K YA + +DP+ F+ R+ D
Sbjct: 514 QIFTSYRPNLRKFLEQVSQIFEVVIFTASEKSYADLILDKIDPRNEFFSKRLYR--DSCL 571
Query: 1030 PFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
P G + V KDL +LG + S +I+D+S+ + +N N I + Y
Sbjct: 572 PTPGGQYV---KDL-TILGRDLSRTIIVDNSIMAFAYNISNGIPIPSY 615
>gi|357450579|ref|XP_003595566.1| CTD small phosphatase-like protein [Medicago truncatula]
gi|355484614|gb|AES65817.1| CTD small phosphatase-like protein [Medicago truncatula]
Length = 469
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 70/149 (46%), Gaps = 28/149 (18%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S H D + F ++ K RP
Sbjct: 295 KSVTLVLDLDETLVHSTLEHC--------------DDADFTFNIFFNMKDYIVYVKQRPF 340
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
+ FLER S +FE+ ++T +YA ++ +LDP + R+ DG+
Sbjct: 341 LHKFLERVSDMFEVVIFTASQSIYANQLLDILDPDEKFISRRLYRESCMFSDGN------ 394
Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A VVIID+S +V+
Sbjct: 395 ---YTKDL-TILGIDLAKVVIIDNSPQVF 419
>gi|449270631|gb|EMC81290.1| CTD small phosphatase-like protein 2 [Columba livia]
Length = 468
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLD
Sbjct: 236 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 295
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL VH + E +D LF+ ++ +LRP FLER
Sbjct: 296 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 341
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 342 SQIYEIILFTASKKVYADKLLNILDPKKKLVRHRLFR--EHCVCVQGN----YIKDL-NI 394
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 395 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 425
>gi|403222586|dbj|BAM40718.1| CTD-like phosphatase [Theileria orientalis strain Shintoku]
Length = 763
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 56/119 (47%), Gaps = 14/119 (11%)
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
FP++ + KLRP I FL+ S +EM +YT K YA + +LDP LF R+++R
Sbjct: 336 FPNITYYMKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARN 395
Query: 1026 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF 1079
D+ D R+ D +L DD VW PH + ++ E Y +F
Sbjct: 396 SVDEKDLLKSASRLYPDLDPRFILA-------FDDRRDVWSDIPHKQ--VVRAEHYDFF 445
>gi|334186662|ref|NP_001190760.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
gi|332658603|gb|AEE84003.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
Length = 442
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 22/146 (15%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S E+ R D + R F ++ K RP
Sbjct: 267 KAVTLVLDLDETLVHSTL---------EVCR-----DTDFSFRVTFNMQENTVYVKQRPY 312
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
++ FLER +LF + ++T + +YA+++ +LDP G + R D DG
Sbjct: 313 LYRFLERVVELFHVVIFTASHSIYASQLLDILDPDGKFVSQRFYR--DSCILSDG----I 366
Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V I+D+ +V+
Sbjct: 367 YTKDL-TVLGLDLAKVAIVDNCPQVY 391
>gi|238480828|ref|NP_001031661.2| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
gi|240255993|ref|NP_193548.7| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
gi|332658601|gb|AEE84001.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
gi|332658602|gb|AEE84002.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
Length = 446
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 22/146 (15%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S E+ R D + R F ++ K RP
Sbjct: 267 KAVTLVLDLDETLVHSTL---------EVCR-----DTDFSFRVTFNMQENTVYVKQRPY 312
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
++ FLER +LF + ++T + +YA+++ +LDP G + R D DG
Sbjct: 313 LYRFLERVVELFHVVIFTASHSIYASQLLDILDPDGKFVSQRFYR--DSCILSDG----I 366
Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V I+D+ +V+
Sbjct: 367 YTKDL-TVLGLDLAKVAIVDNCPQVY 391
>gi|224062995|ref|XP_002187586.1| PREDICTED: CTD small phosphatase-like protein 2 [Taeniopygia guttata]
Length = 467
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLD
Sbjct: 235 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 294
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL VH + E +D LF+ ++ +LRP FLER
Sbjct: 295 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 340
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 341 SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 393
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 394 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 424
>gi|356540144|ref|XP_003538550.1| PREDICTED: CTD small phosphatase-like protein 2-like [Glycine max]
Length = 462
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 83/181 (45%), Gaps = 42/181 (23%)
Query: 907 RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
RRL K+ S LVLDLD TL++S L E+ D F F
Sbjct: 259 RRLLLPKQTRSCPSTTLVLDLDETLVHST------------LEPCEDVD--------FTF 298
Query: 967 P-------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
P H+ ++ + RP + FLER S LFE+ ++T +YA ++ VLDPK +F
Sbjct: 299 PVNFNSEEHI-VYVRCRPHLKDFLERVSGLFEIIIFTASQSIYAEQLLNVLDPKRKIFRH 357
Query: 1020 RVISRGD---DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVER 1075
RV +G+ KDL VLG + A V+IID+S + + N I +E
Sbjct: 358 RVYRESCVYVEGNYL---------KDL-TVLGRDLAHVMIIDNSPQAFGFQVDNGIPIES 407
Query: 1076 Y 1076
+
Sbjct: 408 W 408
>gi|350578733|ref|XP_003480441.1| PREDICTED: CTD small phosphatase-like protein 2-like [Sus scrofa]
Length = 355
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 124 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 183
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL++ + E +D LF+ ++ +LRP FLER S
Sbjct: 184 DETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 229
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 230 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 282
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 283 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 312
>gi|50949928|emb|CAH10508.1| hypothetical protein [Homo sapiens]
Length = 394
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 163 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 222
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 223 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 268
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +L
Sbjct: 269 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 321
Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G + S +IID+S + + + N I +E +
Sbjct: 322 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 351
>gi|297794619|ref|XP_002865194.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297311029|gb|EFH41453.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 447
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 71/146 (48%), Gaps = 22/146 (15%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S L D R F ++ K RP
Sbjct: 278 KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVKQRPH 323
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
++ FLER +LF + ++T + +YA+++ +LDP+G + R D DG
Sbjct: 324 LYRFLERVGELFHVVIFTASHNIYASQLLDILDPEGKFISQRFYR--DSCILLDG----I 377
Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V IID+ +V+
Sbjct: 378 YTKDL-TVLGLDLAKVAIIDNCPQVY 402
>gi|281338163|gb|EFB13747.1| hypothetical protein PANDA_001000 [Ailuropoda melanoleuca]
Length = 445
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G H ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|395837830|ref|XP_003791832.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Otolemur
garnettii]
gi|395837832|ref|XP_003791833.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Otolemur
garnettii]
Length = 466
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G H ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|255547724|ref|XP_002514919.1| conserved hypothetical protein [Ricinus communis]
gi|223545970|gb|EEF47473.1| conserved hypothetical protein [Ricinus communis]
Length = 455
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 75/158 (47%), Gaps = 28/158 (17%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL++S ++P D D P F ++ + RP + F
Sbjct: 265 LVLDLDETLVHST----LEPCGD--------ADFTFPVN--FNLQEHTVYVRCRPFLKDF 310
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFDGDERVPK 1039
+ER S LFE+ ++T +YA ++ VLDPK +F RV +G+
Sbjct: 311 MERVSSLFEIIIFTASQSIYAEQLLNVLDPKRKVFRHRVFRESCVYVEGNYL-------- 362
Query: 1040 SKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + A V+IID+S + + N I +E +
Sbjct: 363 -KDL-SVLGRDLARVIIIDNSPQAFGFQVDNGIPIESW 398
>gi|355681384|gb|AER96789.1| CTD small phosphatase like 2 [Mustela putorius furo]
Length = 465
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G H ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|301754747|ref|XP_002913218.1| PREDICTED: CTD small phosphatase-like protein 2-like [Ailuropoda
melanoleuca]
Length = 466
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G H ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|7022613|dbj|BAA91664.1| unnamed protein product [Homo sapiens]
Length = 286
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 104/234 (44%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
+ +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 31 LNNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 90
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL++ + E +D L
Sbjct: 91 QLNRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALTFPVL 136
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 137 FQDVVYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 196
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 197 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 243
>gi|357463015|ref|XP_003601789.1| CTD small phosphatase-like protein [Medicago truncatula]
gi|355490837|gb|AES72040.1| CTD small phosphatase-like protein [Medicago truncatula]
Length = 885
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 86/171 (50%), Gaps = 22/171 (12%)
Query: 907 RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
RR+ K+ S + LVLDLD TL++S+ ++P D + E+ H+
Sbjct: 265 RRMLLPKQTRSCPPITLVLDLDETLVHSS----LEPCEDVDFTFTVNFNSEE---HI--- 314
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
++ + RP + FLER S LFE+ ++T +YA ++ VLDPK +F RV +
Sbjct: 315 ----VYVRCRPHLKEFLERVSGLFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVFR--E 368
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL VLG + A V+IID+S + + N I +E +
Sbjct: 369 SCVYVEGNYL----KDL-TVLGRDLAHVMIIDNSPQAFGFQVDNGIPIESW 414
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 83/181 (45%), Gaps = 36/181 (19%)
Query: 890 FEGY------DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 943
F GY DD KA + E R S + LVLDLD TL++S+
Sbjct: 665 FVGYLALIPIDDISKAFKKGESWTR--------SCPPITLVLDLDETLVHSSL------- 709
Query: 944 HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
K +D + F+ ++ + RP + FLER S LFE+ ++T +YA
Sbjct: 710 -------KPSEDVDFTFTVNFKSEEYIVYVRCRPHLKEFLERVSGLFEIIIFTASQSIYA 762
Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 1062
++ +LDPK +F RV + +G+ KDL VLG + A V+IID+S R
Sbjct: 763 EQLLNLLDPKRKIFRHRVFR--ESCVKVEGNYL----KDL-TVLGCDLAHVMIIDNSRRA 815
Query: 1063 W 1063
+
Sbjct: 816 F 816
>gi|225716618|gb|ACO14155.1| RNA polymerase II subunit A C-terminal domain phosphatase [Esox
lucius]
Length = 266
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 56/98 (57%), Gaps = 12/98 (12%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+Q+++ +KL L++DLD TL+++ + H + I H L R M
Sbjct: 167 EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCQRMSNKGIF-----------HFQLGRGEPM 215
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA 1007
+ T+LRP FLE+ +KL+E+H++T G++LYA +A
Sbjct: 216 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIA 252
>gi|84994102|ref|XP_951773.1| CTD-like phosphatase [Theileria annulata strain Ankara]
gi|65301934|emb|CAI74041.1| CTD-like phosphatase, putative [Theileria annulata]
Length = 767
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 55/119 (46%), Gaps = 14/119 (11%)
Query: 966 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
F ++ + KLRP I FL+ S +EM +YT K YA + +LDP LF R+++R
Sbjct: 338 FANVNYYMKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARN 397
Query: 1026 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF 1079
D+ D R+ D +L DD VW PH + ++ E Y +F
Sbjct: 398 SVDEKDLLKSASRLYPDLDTRFILAF-------DDRRDVWSDIPHKQ--VVRAEHYDFF 447
>gi|297597322|ref|NP_001043795.2| Os01g0665300 [Oryza sativa Japonica Group]
gi|55773815|dbj|BAD72353.1| Chain A, Three-Dimensional Structure Of A Rna-Polymerase Ii Binding
Protein With Associated Ligand-like [Oryza sativa
Japonica Group]
gi|125571492|gb|EAZ13007.1| hypothetical protein OsJ_02926 [Oryza sativa Japonica Group]
gi|255673527|dbj|BAF05709.2| Os01g0665300 [Oryza sativa Japonica Group]
Length = 439
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 81/167 (48%), Gaps = 36/167 (21%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMG 970
ARK+ LVLDLD TL++S E+ + F FP HM
Sbjct: 257 ARKVTLVLDLDETLVHSTT--------------------EQCDDYDFTFPVFFDMKEHM- 295
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ + RP + FL++ +++FE+ ++T +YA ++ +LDP+ LF+ R
Sbjct: 296 VYVRKRPHLHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYFRESCVF-- 353
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
+KDL V+G++ A VVIID++ +V+ N I +E +
Sbjct: 354 ----TNTSYTKDL-TVVGVDLAKVVIIDNTPQVFQLQVNNGIPIESW 395
>gi|125527169|gb|EAY75283.1| hypothetical protein OsI_03170 [Oryza sativa Indica Group]
Length = 507
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 81/167 (48%), Gaps = 36/167 (21%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMG 970
ARK+ LVLDLD TL++S E+ + F FP HM
Sbjct: 325 ARKVTLVLDLDETLVHSTT--------------------EQCDDYDFTFPVFFDLKEHM- 363
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ + RP + FL++ +++FE+ ++T +YA ++ +LDP+ LF+ R
Sbjct: 364 VYVRKRPHLHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYFRESCVF-- 421
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
+KDL V+G++ A VVIID++ +V+ N I +E +
Sbjct: 422 ----TNTSYTKDL-TVVGVDLAKVVIIDNTPQVFQLQVNNGIPIESW 463
>gi|359473746|ref|XP_002271611.2| PREDICTED: CTD small phosphatase-like protein 2-like [Vitis vinifera]
gi|297738449|emb|CBI27650.3| unnamed protein product [Vitis vinifera]
Length = 503
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 76/165 (46%), Gaps = 42/165 (25%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMWTKL 975
LVLDLD TL++S ++P D F FP HM ++ +
Sbjct: 313 LVLDLDETLVHST----LEPCDDAD----------------FTFPVNFNLKEHM-VYVRC 351
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFD 1032
RP + F+ER + LFE+ ++T +YA ++ VLDPK F RV +G+
Sbjct: 352 RPHLKDFMERVASLFEIIIFTASQSIYAEQLLNVLDPKRRFFRHRVYRESCVFVEGNYL- 410
Query: 1033 GDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + A V+IID+S + + N I +E +
Sbjct: 411 --------KDL-SVLGRDLAHVIIIDNSPQAFGFQVDNGIPIESW 446
>gi|294877772|ref|XP_002768119.1| hypothetical protein Pmar_PMAR002906 [Perkinsus marinus ATCC 50983]
gi|239870316|gb|EER00837.1| hypothetical protein Pmar_PMAR002906 [Perkinsus marinus ATCC 50983]
Length = 161
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 73/153 (47%), Gaps = 23/153 (15%)
Query: 931 LLNSAKFHEVDPVHDE----ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFL-ER 985
++NS + + DP E IL++ + + P +L + + TK+RP F+ E
Sbjct: 1 MVNSYEIGKADPSQSESVTPILQEVYKDEEGLPELYLCVISDVKVLTKIRPHARAFIREL 60
Query: 986 ASKL---FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
SK + +YT G++ Y + K+LDP G L GR++SR D+ P KD
Sbjct: 61 VSKTGCGVVLSIYTKGSRRYMEVIKKMLDPSGELIKGRLVSREDEPSNM-----TPLEKD 115
Query: 1043 LEGVLGMESAV----------VIIDDSVRVWPH 1065
+ ++ +SAV V++DDS VWP
Sbjct: 116 PDFIINADSAVGTEELRRRWFVVLDDSPEVWPE 148
>gi|55740289|gb|AAV63947.1| putative nuclear LIM interactor-interacting protein [Phytophthora
sojae]
Length = 261
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 75/153 (49%), Gaps = 32/153 (20%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWT 973
+A K+CLVLDLD TL++ + VD V + PH F +
Sbjct: 75 NAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQFPVTFNGVEYTVNV 117
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
K RP + FL+R SKLFE+ ++T +K+YA ++ +LDP R + R D D F
Sbjct: 118 KKRPHLEYFLKRVSKLFEIVVFTASHKVYAEKLMNMLDPNRNFIKYR-LYREDCLDVFGN 176
Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPH 1065
KDL VLG + S VV++D+S PH
Sbjct: 177 -----YLKDL-NVLGRDLSKVVLVDNS----PH 199
>gi|340380578|ref|XP_003388799.1| PREDICTED: hypothetical protein LOC100637093 [Amphimedon
queenslandica]
Length = 532
Score = 57.0 bits (136), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 83/176 (47%), Gaps = 37/176 (21%)
Query: 905 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHR 961
RTRR E CLVLDLD TL++ + L K E + + +
Sbjct: 348 RTRRTPE---------FCLVLDLDETLVHCS------------LSKLELANFTFKVEYSN 386
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
LF ++ +LRP FLER SK FE+ L+T K+YA ++ ++DP L R+
Sbjct: 387 QLF-----DVYVRLRPYFHEFLERVSKQFEVILFTASTKVYADKLLDLIDPSRRLVKHRL 441
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
D DG+ K+L G+LG + A +I+D+S + + + N + +E +
Sbjct: 442 FR--DHCVCVDGN----FIKEL-GILGRDLAKTIIVDNSPQAFGYQLSNGVPIESW 490
>gi|395503570|ref|XP_003756137.1| PREDICTED: CTD small phosphatase-like protein 2 [Sarcophilus
harrisii]
Length = 395
Score = 57.0 bits (136), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 64/118 (54%), Gaps = 8/118 (6%)
Query: 960 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
H++L ++ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L
Sbjct: 242 HKNLKKYIDSNVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRH 301
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
R+ + G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 302 RLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 352
>gi|355692677|gb|EHH27280.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
Length = 466
Score = 57.0 bits (136), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 102/234 (43%), Gaps = 24/234 (10%)
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
I +G + N D P++G AH ++ + + +F+ Y + ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270
Query: 904 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
+ R+ K S + LVLDLD TL VH + E +D L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHHFFC 376
Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|432851772|ref|XP_004067077.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oryzias
latipes]
Length = 474
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 149/359 (41%), Gaps = 61/359 (16%)
Query: 735 MDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSK------ENLNFQK 788
++E V + PR L G P F P+ + GS E K
Sbjct: 117 LEEASAVEVTASPPRTTLLGTIF------SPVFNFFSPAKNASSGSDSPDQALEAEEIVK 170
Query: 789 QLGAPEAKPVLSQSVLQPDI--TQQFTKNLKHIADFMSVSQPLTSEPMVSQNSP-IQPGQ 845
QL E + S +V Q D+ T F ++ H+ P P + + SP I +
Sbjct: 171 QLDMEEVVEMPSSTVTQ-DVCATTHFYSSVSHL--------PSLRPPHMLEASPTIDEAE 221
Query: 846 IKSGADMKAVVTNHDDKQTGTGSGPEAG-----PVGAHPQSAWGDVEHLFEGYDDQQKA- 899
+++ AD+ + T G+ PE P P+ ++ + +F+ Y +
Sbjct: 222 LEADADLPPL--------TAPGASPEMTYVDVPPPAVPPEVSYEEDWEVFDPYFFIKHVP 273
Query: 900 -AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+++ TR+ K S + LVLDLD TL++ + E +D
Sbjct: 274 PLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAAL 319
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
LF+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L
Sbjct: 320 TFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVR 379
Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
R+ + G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 380 HRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 431
>gi|358335312|dbj|GAA53844.1| CTD small phosphatase-like protein 2 [Clonorchis sinensis]
Length = 498
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 91/189 (48%), Gaps = 31/189 (16%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM- 971
K+ SA + CLVLDLD TL++ + + P+ D + +F+ G+
Sbjct: 299 KRTRSAPEFCLVLDLDETLVHCS----LTPLPDA--------------QFIFQVVFQGVV 340
Query: 972 ---WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ ++RP ++ FL R S+ FE+ L+T K+YA + ++DPK R+ +
Sbjct: 341 YMVYVRIRPHLYEFLSRVSERFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLFR--EHC 398
Query: 1029 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
+G+ KDL VLG + VI+D+S + + + N + +E + + + R+
Sbjct: 399 VCVNGN----YVKDLR-VLGRDLRKTVIVDNSPQAFGYQLDNGVPIESW-FVDSNDRELL 452
Query: 1088 LLGPSLLEI 1096
L P L E+
Sbjct: 453 NLLPFLFEV 461
>gi|241249809|ref|XP_002403164.1| RNA polymerase II ctd phosphatase, putative [Ixodes scapularis]
gi|215496447|gb|EEC06087.1| RNA polymerase II ctd phosphatase, putative [Ixodes scapularis]
Length = 185
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 2/104 (1%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
D+++++ +RK+L G +VFS V P + P WQTA GA + + VTH+V
Sbjct: 11 DLKHVVPYVRRKVLKGSHLVFSGVVPTNQ-EPEKSRAWQTARALGARVSSDLCPGVTHLV 69
Query: 1191 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 1233
A GT KVN A T + VV P W+ A + +E F ++
Sbjct: 70 AARPGTAKVNRARRTRQLHVVSPAWLWCCAERWEHVHEALFPLE 113
>gi|148233948|ref|NP_001082795.1| CTD small phosphatase-like protein 2-B [Danio rerio]
gi|187471000|sp|A4QNX6.1|CTL2B_DANRE RecName: Full=CTD small phosphatase-like protein 2-B;
Short=CTDSP-like 2-B
gi|141796856|gb|AAI39561.1| Zgc:162265 protein [Danio rerio]
Length = 460
Score = 56.6 bits (135), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 70/261 (26%), Positives = 114/261 (43%), Gaps = 33/261 (12%)
Query: 825 VSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVTNHDD--KQTGTGSGPEAGPVGAHPQ-S 881
+S PL + M S +P + ++ T H D T GS G V A
Sbjct: 181 LSVPLYTSAMTSA---FRPPPTAHTPEQESENTPHPDLPPLTAPGSPATGGYVDASITVP 237
Query: 882 AWGDVEHLFEGYD-----DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAK 936
A G E +E +D +++ TR+ K S + LVLDLD TL++ +
Sbjct: 238 AEGSYEEEWEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCS- 296
Query: 937 FHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYT 996
+E+D D LF+ ++ +LRP FLER S+++E+ L+T
Sbjct: 297 LNELD-------------DAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFT 343
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVI 1055
K+YA ++ +LDP+ L R+ + G+ KDL +LG + S +I
Sbjct: 344 ASKKVYADKLLNILDPRKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTII 396
Query: 1056 IDDSVRVWPHNKLNLIVVERY 1076
ID+S + + + N I +E +
Sbjct: 397 IDNSPQAFAYQLSNGIPIESW 417
>gi|186529839|ref|NP_001119383.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
gi|332007998|gb|AED95381.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
Length = 456
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 22/146 (15%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S L D R F ++ + RP
Sbjct: 282 KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVRQRPH 327
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
++ FLER +LF + ++T + +YA+++ +LDP G + R D DG
Sbjct: 328 LYRFLERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQRFYR--DSCILLDG----I 381
Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V IID+ +V+
Sbjct: 382 YTKDL-TVLGLDLAKVAIIDNCPQVY 406
>gi|348523113|ref|XP_003449068.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oreochromis
niloticus]
Length = 378
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 77/156 (49%), Gaps = 22/156 (14%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LV+DL+ TL+ F ++ + +D E F+ ++ LRP + F
Sbjct: 202 LVVDLEETLM----FSSLNVI----------EDAEYTFHAAFQDHQYKVYMVLRPHVKEF 247
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
L+ +K++E+ +YT K YA ++ +LDP+ LF R+ DD G KD
Sbjct: 248 LQAMAKIYELFVYTCAKKEYAEKILDILDPQRKLFRHRLYQ--DDCACVLGH----YIKD 301
Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYT 1077
L +LG + V++D++ +P+N LN I ++ ++
Sbjct: 302 L-SILGRDLKKTVVLDNAPHTYPYNLLNTIPIKSWS 336
>gi|293331055|ref|NP_001170732.1| uncharacterized protein LOC100384823 [Zea mays]
gi|238007228|gb|ACR34649.1| unknown [Zea mays]
Length = 254
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 24/114 (21%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFP 967
K+ S + LVLDLD TL++S H D PVH F F
Sbjct: 60 KQTRSCPTMTLVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFR 100
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
++ + RP + FL+R + +FE ++T +YA ++ VLDPK LF RV
Sbjct: 101 EHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 154
>gi|326671582|ref|XP_700009.2| PREDICTED: CTD small phosphatase-like protein 2-like [Danio rerio]
Length = 358
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 75/155 (48%), Gaps = 22/155 (14%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHD-EILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
LVLDLD TL+ F ++ + D E QD H++ ++ LRP +
Sbjct: 182 LVLDLDETLV----FSSLNVIPDAEYTFNTRFQD----HKY-------KVYVILRPHVRE 226
Query: 982 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 1041
FL+ +K FEM +YT K YA ++ +LDP LF R+ DD G K
Sbjct: 227 FLQAMTKHFEMFVYTSAKKEYAEKIVDILDPNKKLFRHRLYQ--DDCACVLGH----YIK 280
Query: 1042 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
DL + S VI+D++ +P++ +N+I ++ +
Sbjct: 281 DLTILERDLSKTVILDNAPHTFPYHLMNMIPIKSW 315
>gi|22327621|ref|NP_199453.2| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
gi|18377616|gb|AAL66958.1| unknown protein [Arabidopsis thaliana]
gi|20465765|gb|AAM20371.1| unknown protein [Arabidopsis thaliana]
gi|332007997|gb|AED95380.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
Length = 453
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 22/146 (15%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S L D R F ++ + RP
Sbjct: 279 KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVRQRPH 324
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
++ FLER +LF + ++T + +YA+++ +LDP G + R D DG
Sbjct: 325 LYRFLERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQRFYR--DSCILLDG----I 378
Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V IID+ +V+
Sbjct: 379 YTKDL-TVLGLDLAKVAIIDNCPQVY 403
>gi|26449836|dbj|BAC42041.1| unknown protein [Arabidopsis thaliana]
Length = 453
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 22/146 (15%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S L D R F ++ + RP
Sbjct: 279 KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVRQRPH 324
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
++ FLER +LF + ++T + +YA+++ +LDP G + R D DG
Sbjct: 325 LYRFLERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQRFYR--DSCILLDG----I 378
Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A V IID+ +V+
Sbjct: 379 YTKDL-TVLGLDLAKVAIIDNCPQVY 403
>gi|389584495|dbj|GAB67227.1| hypothetical protein PCYB_112480 [Plasmodium cynomolgi strain B]
Length = 1447
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/261 (23%), Positives = 111/261 (42%), Gaps = 35/261 (13%)
Query: 958 KPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
+P + F P+ + K RP + FL+ S +E+ +YT + YA + +LDP
Sbjct: 1018 EPELYKFFLPYYNFFYYLKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRT 1077
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNKLNL 1070
LFA R+++R D ++K+ + ++S +I DD VW PH+ ++
Sbjct: 1078 LFADRIVARCSSADR-------EENKNFSKIYPNVDSKYIIAFDDRKDVWTDIPHS--HI 1128
Query: 1071 IVVERYTYFPCSRRQF--GLLGPSLLE---IDHDERSEDGTLASSLGVIERLHKIFFSHQ 1125
+ E Y +F S+ P+ + +D D L V+ +LHK FF
Sbjct: 1129 LKAEHYNFFELSKYDIISHFKEPTTCKKRFVDMDMH-----LHFMTKVLLKLHKHFFERP 1183
Query: 1126 SLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHL------HPLWQTAEQFGA-VC 1178
+VDV ++ L+ + F+ + + H+ + A + GA +
Sbjct: 1184 L--EVDVGTLIENIMLSTLSNVGVYFTGFRKNSKNSQHVLSSDCEDRQKEIALELGAKIY 1241
Query: 1179 TKHIDDQVTHVVANSLGTDKV 1199
T + VTH++A TD +
Sbjct: 1242 TNYDMPGVTHIIAAKNCTDNL 1262
>gi|115456605|ref|NP_001051903.1| Os03g0850100 [Oryza sativa Japonica Group]
gi|27573336|gb|AAO20054.1| putative NLI interacting factor [Oryza sativa Japonica Group]
gi|28269415|gb|AAO37958.1| putative NLI-interacting factor [Oryza sativa Japonica Group]
gi|108712119|gb|ABF99914.1| NLI interacting factor, putative, expressed [Oryza sativa Japonica
Group]
gi|113550374|dbj|BAF13817.1| Os03g0850100 [Oryza sativa Japonica Group]
gi|125588650|gb|EAZ29314.1| hypothetical protein OsJ_13375 [Oryza sativa Japonica Group]
Length = 444
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 77/168 (45%), Gaps = 28/168 (16%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K+ S + LVLDLD TL++S L E+ D P F ++
Sbjct: 252 KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 297
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 1029
+ RP + FLE + +FE+ ++T +YA ++ +LDPK LF RV +G+
Sbjct: 298 VRCRPYLKEFLETVASMFEIIIFTASQSIYAEQLLNILDPKRRLFRHRVYRESCLFVEGN 357
Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + A VVI+D+S + + N + +E +
Sbjct: 358 YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLDNGVPIESW 395
>gi|294875260|ref|XP_002767242.1| hypothetical protein Pmar_PMAR022745 [Perkinsus marinus ATCC 50983]
gi|239868797|gb|EEQ99959.1| hypothetical protein Pmar_PMAR022745 [Perkinsus marinus ATCC 50983]
Length = 215
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 73/153 (47%), Gaps = 23/153 (15%)
Query: 931 LLNSAKFHEVDPVHDE----ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFL-ER 985
++NS + + DP E IL++ + + P +L + + TK+RP F+ E
Sbjct: 1 MVNSYEIGKADPSQSESVTPILQEVYKDEEGLPELYLCVISDVKVLTKIRPHARAFIREL 60
Query: 986 ASKL---FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
SK + +YT G++ Y + K+LDP G L GR++SR D+ P KD
Sbjct: 61 VSKTGCGVVLSIYTKGSRRYMEVIKKMLDPSGELIKGRLVSREDEPSNM-----TPLEKD 115
Query: 1043 LEGVLGMESAV----------VIIDDSVRVWPH 1065
+ ++ +SAV V++DDS VWP
Sbjct: 116 PDFIINADSAVGTEELRRRWFVVLDDSPEVWPE 148
>gi|215695024|dbj|BAG90215.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 269
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 75/153 (49%), Gaps = 36/153 (23%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMG 970
ARK+ LVLDLD TL++S E+ + F FP HM
Sbjct: 143 ARKVTLVLDLDETLVHSTT--------------------EQCDDYDFTFPVFFDMKEHM- 181
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ + RP + FL++ +++FE+ ++T +YA ++ +LDP+ LF+ R
Sbjct: 182 VYVRKRPHLHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYFRESCVF-- 239
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 1062
+KDL V+G++ A VVIID++ +V
Sbjct: 240 ----TNTSYTKDL-TVVGVDLAKVVIIDNTPQV 267
>gi|218194116|gb|EEC76543.1| hypothetical protein OsI_14336 [Oryza sativa Indica Group]
Length = 444
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 77/168 (45%), Gaps = 28/168 (16%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K+ S + LVLDLD TL++S L E+ D P F ++
Sbjct: 252 KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 297
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 1029
+ RP + FLE + +FE+ ++T +YA ++ +LDPK LF RV +G+
Sbjct: 298 VRCRPYLKEFLETVASMFEIIIFTASQSIYAEQLLNILDPKRRLFRHRVYRESCLFVEGN 357
Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + A VVI+D+S + + N + +E +
Sbjct: 358 YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLDNGVPIESW 395
>gi|156101293|ref|XP_001616340.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805214|gb|EDL46613.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1544
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 64/261 (24%), Positives = 110/261 (42%), Gaps = 35/261 (13%)
Query: 958 KPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
+P + F P+ + K RP + FL+ S +E+ +YT + YA + +LDP
Sbjct: 1108 EPELYKFFLPYYNFFYYLKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRT 1167
Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNKLNL 1070
LFA R+++R D ++K+ + ++S VI DD VW PH+ ++
Sbjct: 1168 LFADRIVARCSSAD-------REENKNFSKIYPNVDSKYVIAFDDRKDVWTDIPHS--HI 1218
Query: 1071 IVVERYTYFPCSRRQF--GLLGPSLLE---IDHDERSEDGTLASSLGVIERLHKIFFSHQ 1125
+ E Y +F S+ PS + +D D L V+ +LHK FF
Sbjct: 1219 LKAEHYNFFELSKYDIISHFKEPSTCKKRFVDMDMH-----LHFMTKVLLKLHKQFFERP 1273
Query: 1126 SLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHL------HPLWQTAEQFGA-VC 1178
+VDV ++ L + F+ + + H+ + A + GA +
Sbjct: 1274 L--EVDVGRLIDDIMLSTLCNVGVYFTGFRKNSKNSQHVLSSDCEDRQKEIALELGAKIY 1331
Query: 1179 TKHIDDQVTHVVANSLGTDKV 1199
T + VTH++A TD +
Sbjct: 1332 TNYDMPGVTHIIAAKNCTDNL 1352
>gi|47230493|emb|CAF99686.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2418
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 84/176 (47%), Gaps = 22/176 (12%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
+++ TR+ K S + LVLDLD TL VH + E +D
Sbjct: 305 EEQLTRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFP 350
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
LF+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 351 VLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRL 410
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 411 FR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 459
>gi|387015310|gb|AFJ49774.1| CTD small phosphatase [Crotalus adamanteus]
Length = 466
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 97/211 (45%), Gaps = 24/211 (11%)
Query: 869 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLD
Sbjct: 234 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293
Query: 927 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
LD TL++ + E +D LF+ ++ +LRP FLE
Sbjct: 294 LDETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLECM 339
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
S+++E+ L+T K+YA ++ +LDPK L R+ + G+ KDL +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 392
Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
LG + S +IID+S + + + N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423
>gi|302806561|ref|XP_002985030.1| hypothetical protein SELMODRAFT_4374 [Selaginella moellendorffii]
gi|300147240|gb|EFJ13905.1| hypothetical protein SELMODRAFT_4374 [Selaginella moellendorffii]
Length = 177
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 69/138 (50%), Gaps = 21/138 (15%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
L+LDLD TL+ +++ + D ++ E E+P +W RPG+ F
Sbjct: 14 LILDLDGTLIATSRQASLHACFDFVVEFDSE---EQP-----------VWVSKRPGLEDF 59
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
L +AS+++E+ ++++G K Y +M + +DP G LF ++R D G + KD
Sbjct: 60 LRQASEIYEVVVFSLGRKSYVEKMREAIDPSG-LFVATWLAR----DSCSGSSEIKDYKD 114
Query: 1043 LEG-VLGME-SAVVIIDD 1058
L LG E VV +DD
Sbjct: 115 LNSPKLGRELRKVVWVDD 132
>gi|299472381|emb|CBN77569.1| putative nuclear LIM interactor-interacting protein [Ectocarpus
siliculosus]
Length = 602
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/169 (28%), Positives = 80/169 (47%), Gaps = 22/169 (13%)
Query: 909 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
L E++K ++L LVLDLD TL++ V+P H R + H F
Sbjct: 351 LPEKRKTRHGKELTLVLDLDETLVHCTVDPIVNPDH-----------RFEVH---FNGEE 396
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
++ + RP + FLE S+LFE+ ++T ++YA + ++DP+ R+ D
Sbjct: 397 FQVYVRKRPHLDAFLEAVSELFEVVVFTASQQVYAERLLNMIDPQKKFVKYRLYR--DAC 454
Query: 1029 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL VLG + S V I+D+S + N I +E +
Sbjct: 455 MALEGNYL----KDLN-VLGRDLSKVAIVDNSPYAYGFQIDNGIPIESW 498
>gi|353230275|emb|CCD76446.1| nuclear lim interactor-interacting factor-related [Schistosoma
mansoni]
Length = 429
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 81/166 (48%), Gaps = 30/166 (18%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM- 971
KK S+ + CLVLDLD TL++ + ++P+ D + +F+ G+
Sbjct: 285 KKTRSSPEFCLVLDLDETLVHCS----LNPLLDA--------------QFIFQVVFQGVV 326
Query: 972 ---WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ ++RP ++ FL S+ FE+ L+T K+YA + ++DPK R+ +
Sbjct: 327 YMVYVRIRPHLYEFLTNVSEHFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLFR--EHC 384
Query: 1029 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVV 1073
+G+ KDL VLG + VIID+S + + + L+++
Sbjct: 385 VCVNGN----YVKDLR-VLGRDLRKTVIIDNSPQAFGYQVFGLLLL 425
>gi|356510404|ref|XP_003523928.1| PREDICTED: uncharacterized protein LOC100810756 [Glycine max]
Length = 469
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 73/146 (50%), Gaps = 22/146 (15%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+K+ LVLDLD TL++S+ + D D K DRE + ++ + RP
Sbjct: 295 KKVTLVLDLDETLIHSS-MGQCDGAAD--FTFKMITDRE-----------LTVYVRKRPF 340
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
+ FL + S++FE+ ++T ++YA + VLDP F+ RV D R
Sbjct: 341 LQEFLVKVSEMFEIIIFTASKRMYAETLLDVLDPDKKFFSRRVYRESCTW----KDRRCV 396
Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
KDL VLG++ A V IID++ V+
Sbjct: 397 --KDL-TVLGIDLAKVCIIDNTPEVF 419
>gi|357130565|ref|XP_003566918.1| PREDICTED: uncharacterized protein LOC100830008 [Brachypodium
distachyon]
Length = 510
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 74/156 (47%), Gaps = 28/156 (17%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
QK + + LVLDLD TL++S H + D ++ F +
Sbjct: 326 QKSPVRTKHVTLVLDLDETLVHSTLDHC--DIADFTIQV------------FFNMKDHTV 371
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDG 1028
+ + RP + FLE+ +++FE+ ++T K+YA ++ LDP G L + R+ DG
Sbjct: 372 YVRQRPHLKMFLEKVAQMFELVIFTASQKIYAEQIIDRLDPDGKLISQRIYRESCIFSDG 431
Query: 1029 DPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG+ A V IID++ +V+
Sbjct: 432 S---------YTKDL-TILGVHLAKVAIIDNTPQVF 457
>gi|125526935|gb|EAY75049.1| hypothetical protein OsI_02945 [Oryza sativa Indica Group]
Length = 577
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 78/153 (50%), Gaps = 34/153 (22%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
++++ LVLDLD TL++S H D V D L+ F + ++ + RP
Sbjct: 399 SKQITLVLDLDETLVHSTLDH-CDNV-DFTLQV------------FFNMKNHTVYVRQRP 444
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR------VISRGDDGDPF 1031
+ FLE+ +++FE+ ++T ++YA ++ LDP G L + R + S G
Sbjct: 445 HLKMFLEKVAQMFELVIFTASQRIYAEQLIDRLDPDGRLISHRIYRESCIFSEG------ 498
Query: 1032 DGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A VVI+D++ +V+
Sbjct: 499 ------CYTKDLT-ILGVDLAKVVIVDNTPQVF 524
>gi|149490347|ref|XP_001511004.1| PREDICTED: CTD small phosphatase-like protein 2-like [Ornithorhynchus
anatinus]
Length = 374
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 71/149 (47%), Gaps = 16/149 (10%)
Query: 870 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKAA--IQKERTRRLEEQKKMFSARKLCLVLDL 927
P++G AH ++ + + +F+ Y + +++ R+ K S + LVLDL
Sbjct: 238 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 297
Query: 928 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
D TL VH + E +D LF+ ++ +LRP FLER S
Sbjct: 298 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 343
Query: 988 KLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+++E+ L+T K+YA ++ +LDPK L
Sbjct: 344 QIYEIILFTASKKVYADKLLNILDPKKQL 372
>gi|298705179|emb|CBJ28610.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 482
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 61/119 (51%), Gaps = 15/119 (12%)
Query: 898 KAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE 957
K + K+R L +Q F+ RK L+LDLD TL++S+ F V P D I+ +
Sbjct: 281 KGGLGKKRRSLLPKQLPEFAGRK-QLILDLDETLVHSS-FKPV-PGADFIMDIMVDGTFY 337
Query: 958 KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
K ++ RPG+ FLER +KL+E+ ++T YA + VLDPKG +
Sbjct: 338 K------------VFVLKRPGVDAFLERVAKLYEVIIFTASLPQYANPLLDVLDPKGTI 384
>gi|256083671|ref|XP_002578064.1| nuclear lim interactor-interacting factor-related [Schistosoma
mansoni]
Length = 441
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 80/165 (48%), Gaps = 32/165 (19%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM- 971
KK S+ + CLVLDLD TL++ + ++P+ D + +F+ G+
Sbjct: 285 KKTRSSPEFCLVLDLDETLVHCS----LNPLLDA--------------QFIFQVVFQGVV 326
Query: 972 ---WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
+ ++RP ++ FL S+ FE+ L+T K+YA + ++DPK R+ +
Sbjct: 327 YMVYVRIRPHLYEFLTNVSEHFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLFR--EHC 384
Query: 1029 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRV--WPHNKLNL 1070
+G+ KDL VLG + VIID+S + + HN+ L
Sbjct: 385 VCVNGN----YVKDLR-VLGRDLRKTVIIDNSPQAFGYQHNERKL 424
>gi|224072608|ref|XP_002303804.1| predicted protein [Populus trichocarpa]
gi|222841236|gb|EEE78783.1| predicted protein [Populus trichocarpa]
Length = 244
Score = 53.9 bits (128), Expect = 6e-04, Method: Composition-based stats.
Identities = 53/168 (31%), Positives = 80/168 (47%), Gaps = 28/168 (16%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K+ S LVLDLD TL++SA ++P +D D P F ++
Sbjct: 44 KQTRSCPPTTLVLDLDETLVHSA----LEPCND--------ADFTFPVN--FNLQEHTVF 89
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGD 1029
+ RP + F+ER S LFE+ ++T +YA ++ VLDPK +F RV +G+
Sbjct: 90 VRCRPYLRDFMERVSSLFEIIIFTASQSIYAEQLLNVLDPKRRIFRHRVFRESCVFVEGN 149
Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + A V+IID+S + + N I +E +
Sbjct: 150 YL---------KDLS-VLGRDLARVIIIDNSPQAFGFQVDNGIPIESW 187
>gi|326513088|dbj|BAK06784.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 79/164 (48%), Gaps = 30/164 (18%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
ARK+ LVLDLD TL++S L ++ D P F ++ + RP
Sbjct: 272 ARKVTLVLDLDETLVHST------------LEHCDDADFSFPVS--FGLKEHVVYVRKRP 317
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDG 1033
+ FL++ +++F++ ++T +YA ++ LDP+ LF+ R + G
Sbjct: 318 HLHMFLQKMAEMFDVVIFTASQSVYADQLLDRLDPENTLFSKRFFRESCVFTESG----- 372
Query: 1034 DERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
+KDL V+G++ A V IID++ +V+ N I +E +
Sbjct: 373 -----YTKDLT-VIGVDLAKVAIIDNTPQVFQLQVNNGIPIESW 410
>gi|297799336|ref|XP_002867552.1| hypothetical protein ARALYDRAFT_913891 [Arabidopsis lyrata subsp.
lyrata]
gi|297313388|gb|EFH43811.1| hypothetical protein ARALYDRAFT_913891 [Arabidopsis lyrata subsp.
lyrata]
Length = 113
Score = 53.9 bits (128), Expect = 6e-04, Method: Composition-based stats.
Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 3/84 (3%)
Query: 1050 ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 1109
E V+I+DD+V +WPH+K NL+ + +Y YF + S E+ DE +G+LA+
Sbjct: 21 ELRVIIVDDTVDIWPHDKRNLLQITKYIYFSVA-VSIDKRWRSYAEVKRDESLSNGSLAN 79
Query: 1110 SLGVIERLHKIFFSHQSLDDVDVR 1133
L + +HK + + LD D+R
Sbjct: 80 VLKFLVYVHKRY--EKKLDSKDLR 101
>gi|225681687|gb|EEH19971.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 869
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 84/201 (41%), Gaps = 60/201 (29%)
Query: 895 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
D + K R+EE K L V+DLD T++++ VDP E
Sbjct: 132 DNSSLTVSKSEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 182
Query: 953 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
+QDR+ P+ + P M G W KLRPG+ FL+ S L+E+H+YTMG
Sbjct: 183 QQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLQEFLQEISALYELHIYTMGT 242
Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
+ AG + +K+L+ + +++ VVIIDD
Sbjct: 243 R-----------------AGSLT-----------------AKNLQRLFPVDTKMVVIIDD 268
Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
VW + NLI V Y +F
Sbjct: 269 RGDVWKWSD-NLIKVSPYDFF 288
>gi|37538060|gb|AAQ92971.1| CTD-phosphatase-like protein [Hordeum vulgare subsp. vulgare]
gi|37538062|gb|AAQ92972.1| CTD-phosphatase-like protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 79/164 (48%), Gaps = 30/164 (18%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
ARK+ LVLDLD TL++S L ++ D P F ++ + RP
Sbjct: 272 ARKVTLVLDLDETLVHST------------LEHCDDADFSFPVS--FGLKEHVVYVRKRP 317
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDG 1033
+ FL++ +++F++ ++T +YA ++ LDP+ LF+ R + G
Sbjct: 318 HLHMFLQKMAEMFDVVIFTASQSVYADQLLDRLDPENTLFSKRFFRESCVFTESG----- 372
Query: 1034 DERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
+KDL V+G++ A V IID++ +V+ N I +E +
Sbjct: 373 -----YTKDLT-VIGVDLAKVAIIDNTPQVFQLQVNNGIPIESW 410
>gi|384246936|gb|EIE20424.1| hypothetical protein COCSUDRAFT_67358 [Coccomyxa subellipsoidea
C-169]
Length = 676
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 17/102 (16%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL--RPGIW 980
LVLDLDHTL+ S F+ P D R +F G T RP +
Sbjct: 108 LVLDLDHTLIRSTLFNPHKPAKDS--------------REVF-VTGDGARTAFERRPHLT 152
Query: 981 TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
FLE S LFE+ ++T G++ YA + +LDP+ LF R+
Sbjct: 153 HFLESVSTLFEIVVFTAGSQSYAGPLLDILDPERRLFEHRLF 194
>gi|449678335|ref|XP_002165480.2| PREDICTED: CTD small phosphatase-like protein 2-like [Hydra
magnipapillata]
Length = 421
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 14/103 (13%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
++ LVLDLD TL++ + L K E + +F ++ KLRP +
Sbjct: 243 QMTLVLDLDETLVHCS------------LSKLEAYN--MTFNVVFDNVTYQLFVKLRPHL 288
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
FLER SKL+E+ L+T ++YA ++ ++DP+ F R+
Sbjct: 289 LEFLERVSKLYEVILFTASRRVYADKLLNIIDPRRQFFRHRLF 331
>gi|325180168|emb|CCA14570.1| nuclear LIM factor interactorinteracting protein hyphal form putative
[Albugo laibachii Nc14]
Length = 418
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 78/161 (48%), Gaps = 28/161 (17%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWTKLR 976
K+CLVLDLD TL VH + ++ E P+ F + + LR
Sbjct: 235 KICLVLDLDETL-----------VHCSV------EEIENPNFQFDVFFNGTNYNVNVSLR 277
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
P + FL+R +K FE+ ++T ++YA ++ +LDP L R+ +D DG+
Sbjct: 278 PHMHHFLKRVTKQFELVVFTASQRVYAEKLLNLLDPNRDLIKYRLYR--EDCLEVDGN-- 333
Query: 1037 VPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + A V+++D+S + + N I +E +
Sbjct: 334 --FLKDL-NVLGRDLARVILVDNSPHAFGYQVNNGIPIESW 371
>gi|302847022|ref|XP_002955046.1| hypothetical protein VOLCADRAFT_121370 [Volvox carteri f.
nagariensis]
gi|300259574|gb|EFJ43800.1| hypothetical protein VOLCADRAFT_121370 [Volvox carteri f.
nagariensis]
Length = 1180
Score = 53.1 bits (126), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 100/230 (43%), Gaps = 53/230 (23%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+++ LVLDLD TL+ S PV + +E RF +W LRPG
Sbjct: 564 QRMTLVLDLDGTLIASEDEPHA-PVPFDYCVDEE------------RF----VW--LRPG 604
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
+ FL+ FE+ L+T + +AT + +DP GV+F R+ D + P
Sbjct: 605 LRRFLDSVRPHFEVVLFTAAGESWATSALQRIDPDGVIFDSRLYR-----DHTVSHDDWP 659
Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEID 1097
KDL LG + A VVI+DD+ ++ + N + V Y P L
Sbjct: 660 WVKDLS-RLGRDLARVVIVDDNPLMFMYQPDNALHVAAYD-------------PQL--TG 703
Query: 1098 HDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGC 1147
H+ D L +L V+ +HK+ ++ DVR +L + + I A C
Sbjct: 704 HN----DDVLEQALDVL--MHKVLIAN------DVREVLRSIKEPITASC 741
>gi|125571265|gb|EAZ12780.1| hypothetical protein OsJ_02697 [Oryza sativa Japonica Group]
Length = 576
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 78/153 (50%), Gaps = 34/153 (22%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
++++ LVLDLD TL++S H D V D L+ F + ++ + RP
Sbjct: 398 SKQITLVLDLDETLVHSTLDH-CDNV-DFTLQV------------FFNMKNHTVYVRQRP 443
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR------VISRGDDGDPF 1031
+ FLE+ +++F++ ++T ++YA ++ LDP G L + R + S G
Sbjct: 444 HLKMFLEKVAQMFDLVIFTASQRIYAEQLIDRLDPDGRLISHRIYRESCIFSEG------ 497
Query: 1032 DGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A VVI+D++ +V+
Sbjct: 498 ------CYTKDLT-ILGVDLAKVVIVDNTPQVF 523
>gi|224116766|ref|XP_002331872.1| predicted protein [Populus trichocarpa]
gi|222875390|gb|EEF12521.1| predicted protein [Populus trichocarpa]
Length = 502
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/173 (28%), Positives = 75/173 (43%), Gaps = 35/173 (20%)
Query: 903 KERTRRLEEQKKMFSARKLCLVLDLDHTL--------LNSAKFHEVDPVHDEILRKKEEQ 954
KE RR + + LVLDLD L A VH + +
Sbjct: 303 KESCRR----------KSVTLVLDLDELCPMYNTKVELQMAFLFSETLVHSTL---EHCD 349
Query: 955 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
D + F ++ K RP + TFLER +++FE+ ++T +YA ++ +LDP
Sbjct: 350 DADFTFTVFFNMKEHIVYVKQRPHLHTFLERVAEMFEVVIFTASQSIYAAQLLDILDPDR 409
Query: 1015 VLFAGRVISRG---DDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
L + R+ DG +KDL VLG++ A V IID+S +V+
Sbjct: 410 KLISQRLYRESCIFSDG---------SYTKDL-TVLGVDLAKVAIIDNSPQVF 452
>gi|242009525|ref|XP_002425534.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212509409|gb|EEB12796.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 834
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 74/155 (47%), Gaps = 22/155 (14%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL VH + +E QD LF+ ++ + RP F
Sbjct: 670 LVLDLDETL-----------VHCSL---QELQDASFTFPVLFQDCAYTVFVRTRPYFREF 715
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
LER S LFE+ L+T ++YA ++ +LDPK R+ + +G+ KD
Sbjct: 716 LERVSSLFEVILFTASKRVYADKLMNLLDPKKRWIKYRLFR--EHCVCVNGN----YIKD 769
Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
L +LG + S +IID+S + + + N I +E +
Sbjct: 770 L-TILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 803
>gi|123454430|ref|XP_001314970.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
gi|121897632|gb|EAY02747.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
Length = 218
Score = 53.1 bits (126), Expect = 0.001, Method: Composition-based stats.
Identities = 42/158 (26%), Positives = 74/158 (46%), Gaps = 25/158 (15%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL++++ F P H ++ +F + LRP + F
Sbjct: 44 LVLDLDETLVHTSTF----PPHSDV--------------EALKFDDTNEYVFLRPNVKKF 85
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
LER S+LFE+ ++T G ++YA + P+ ++ D F G++ KD
Sbjct: 86 LERVSELFEVFIFTAGTQIYAERILDSFCPQ----IDQMHRFYRDSCKFSGNK---CKKD 138
Query: 1043 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
L + VV++DD+ ++ + N I ++R++ P
Sbjct: 139 LNKFGRPLTKVVMVDDNYQMRSYYPQNTIYIDRWSGTP 176
>gi|297843870|ref|XP_002889816.1| hypothetical protein ARALYDRAFT_888325 [Arabidopsis lyrata subsp.
lyrata]
gi|297335658|gb|EFH66075.1| hypothetical protein ARALYDRAFT_888325 [Arabidopsis lyrata subsp.
lyrata]
Length = 100
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 59/119 (49%), Gaps = 29/119 (24%)
Query: 1008 KVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNK 1067
K+LDPKG F+ R+ISR DG R KS D V+G E AV+ +D+S VW
Sbjct: 7 KLLDPKGKYFSDRIISRD------DGTVRHKKSLD---VMGNEEAVLFVDESKIVWQ--- 54
Query: 1068 LNLIVVERY-TYFPCSRRQF---GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF 1122
++Y +F S +QF L P DE DG L++ L V+++ H I F
Sbjct: 55 ------KKYGEFFASSCKQFKEDSKLLP-------DESESDGALSTVLNVLKQTHGILF 100
>gi|148909957|gb|ABR18063.1| unknown [Picea sitchensis]
Length = 517
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 24/106 (22%)
Query: 921 LCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 975
+ LVLDLD TL++S H D PVH F ++ +
Sbjct: 320 ITLVLDLDETLVHSTLEHCDDADFTFPVH-------------------FNLKEHTVYVRC 360
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
RP + F++R + +FE+ ++T +YA ++ VLDPK L RV
Sbjct: 361 RPHLQLFMDRVADMFEIIVFTASQSVYAEQLLNVLDPKRKLIRHRV 406
>gi|55740279|gb|AAV63941.1| putative nuclear LIM factor interactor-interacting protein hyphal
form [Phytophthora infestans]
Length = 237
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 2/103 (1%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
++ LVLD+D L++S +EV+ E R ++ ++ + + K RPG+
Sbjct: 40 RIALVLDMDECLVHSKFQNEVEYRQSEY-RPEQLEEYSDSFEIVMDDGERAIVNK-RPGL 97
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
FLE A+K ++++++T G + Y + LDPKG LFAGR
Sbjct: 98 DRFLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFF 140
>gi|301118476|ref|XP_002906966.1| CTD small phosphatase-like protein, putative [Phytophthora infestans
T30-4]
gi|301126789|ref|XP_002909873.1| CTD small phosphatase-like protein, putative [Phytophthora infestans
T30-4]
gi|262101427|gb|EEY59479.1| CTD small phosphatase-like protein, putative [Phytophthora infestans
T30-4]
gi|262108315|gb|EEY66367.1| CTD small phosphatase-like protein, putative [Phytophthora infestans
T30-4]
Length = 237
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 2/103 (1%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
++ LVLD+D L++S +EV+ E R ++ ++ + + K RPG+
Sbjct: 40 RIALVLDMDECLVHSKFQNEVEYRQSEY-RPEQLEEYSDSFEIVMDDGERAIVNK-RPGL 97
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
FLE A+K ++++++T G + Y + LDPKG LFAGR
Sbjct: 98 DRFLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFF 140
>gi|302775067|ref|XP_002970950.1| hypothetical protein SELMODRAFT_4536 [Selaginella moellendorffii]
gi|300161661|gb|EFJ28276.1| hypothetical protein SELMODRAFT_4536 [Selaginella moellendorffii]
Length = 177
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 69/138 (50%), Gaps = 21/138 (15%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
L+LDLD TL+ +++ ++ D ++ E E+P +W RPG+ F
Sbjct: 14 LILDLDGTLIATSRQAKLHACFDFVVEFDSE---EQP-----------VWVSKRPGLDDF 59
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
L +AS+++E+ ++++G K Y +M + +DP G F ++R D G + KD
Sbjct: 60 LRQASEIYEVVVFSLGRKSYVEKMREAIDPSGS-FVATWLAR----DSCSGSSEIKDYKD 114
Query: 1043 LEG-VLGME-SAVVIIDD 1058
L LG E VV +DD
Sbjct: 115 LNSPKLGRELRKVVWVDD 132
>gi|302812229|ref|XP_002987802.1| hypothetical protein SELMODRAFT_126751 [Selaginella moellendorffii]
gi|302817447|ref|XP_002990399.1| hypothetical protein SELMODRAFT_131611 [Selaginella moellendorffii]
gi|300141784|gb|EFJ08492.1| hypothetical protein SELMODRAFT_131611 [Selaginella moellendorffii]
gi|300144421|gb|EFJ11105.1| hypothetical protein SELMODRAFT_126751 [Selaginella moellendorffii]
Length = 253
Score = 52.4 bits (124), Expect = 0.002, Method: Composition-based stats.
Identities = 47/160 (29%), Positives = 75/160 (46%), Gaps = 32/160 (20%)
Query: 923 LVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
LVLDLD TL++S H D PV+ F + ++ + RP
Sbjct: 59 LVLDLDETLVHSTLEHCADADFSFPVY-------------------FNYQEHTVYVRRRP 99
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1037
+ FLE+ ++LFE+ ++T +YA ++ +LDPK L R+ D DG+
Sbjct: 100 HLQVFLEKVAQLFEIIIFTASQSVYAEQLLNILDPKRKLIRHRIFR--DSCVYVDGNYL- 156
Query: 1038 PKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
KDL +LG + S V I+D+S + + N I +E +
Sbjct: 157 ---KDLS-ILGRDLSKVAIVDNSPQAFGFQVDNGIPIESW 192
>gi|410921774|ref|XP_003974358.1| PREDICTED: CTD small phosphatase-like protein 2-B-like [Takifugu
rubripes]
Length = 381
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 66/124 (53%), Gaps = 9/124 (7%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ KLRP + FL+ +K +E+ +YT + YA ++ +LDP+ +F R+ +D
Sbjct: 239 VYMKLRPHVKEFLQSVAKNYELFVYTCAKREYAEKILNILDPQRKVFRHRLYQ--EDCIC 296
Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
G KDL +LG + + V++D+ +P++ LN I ++ +T P R+ L
Sbjct: 297 VLGH----YIKDL-SILGRDLTKTVVLDNMPHTYPYHLLNTIPIKSWTGEP-EDRELQKL 350
Query: 1090 GPSL 1093
P+L
Sbjct: 351 VPTL 354
>gi|414881093|tpg|DAA58224.1| TPA: hypothetical protein ZEAMMB73_373456 [Zea mays]
gi|414881094|tpg|DAA58225.1| TPA: hypothetical protein ZEAMMB73_373456 [Zea mays]
Length = 442
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 74/164 (45%), Gaps = 30/164 (18%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
R + LVLDLD TL++S K D + + ++ K RP
Sbjct: 264 TRNVTLVLDLDETLVHSTM--------------KHCDDADFTFSMFYDMKEHVVYVKKRP 309
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDG 1033
+ FL+R ++FE+ ++T +YA ++ +LDP+ LF+ R D G
Sbjct: 310 HVHMFLQRMVEMFEVVIFTASQSVYADQLLDMLDPEKKLFSKRFFRESCLITDSG----- 364
Query: 1034 DERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
KDL V+G++ A V IID++ +V+ N I +E +
Sbjct: 365 -----YRKDL-TVVGVDLAKVAIIDNTPQVFELQVNNGIPIESW 402
>gi|432854554|ref|XP_004067958.1| PREDICTED: CTD small phosphatase-like protein 2-like [Oryzias
latipes]
Length = 381
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 59/107 (55%), Gaps = 8/107 (7%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ LRP + FL+ +K++E+ +YT K YA ++ ++ DP+ LF R+ DD
Sbjct: 240 YMILRPHVREFLQAMAKIYELFVYTCAKKEYAEKILEIFDPQKKLFRHRLYQ--DDCACV 297
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYT 1077
G KDL +LG + + V++D++ +P++ +N I ++ ++
Sbjct: 298 LGH----YIKDL-SILGRDLTKTVVLDNAPHTYPYHLMNTIPIKSWS 339
>gi|403338554|gb|EJY68521.1| Dullard-like phosphatase domain containing protein [Oxytricha
trifallax]
Length = 615
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 73/159 (45%), Gaps = 24/159 (15%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
++L +VLDLD+TL+ H V+ V +Q+ +++ + RP
Sbjct: 437 KRLIVVLDLDNTLI-----HSVNSVPTS-----SDQNYFAIRDNIYVYK--------RPH 478
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
+ FL +K +++++T K YA ++ V+DPK G+ R D DER
Sbjct: 479 MEYFLAEIAKFADIYIFTASMKDYADQIMDVIDPKKTF--GKCFYRTD----CKKDERRQ 532
Query: 1039 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
KDL V + +++IDD+ N LN ++ +T
Sbjct: 533 IYKDLSTVSDDLTQLIMIDDNEINCTKNPLNTFKIKHWT 571
>gi|67463585|ref|XP_648443.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56464600|gb|EAL43056.1| hypothetical protein EHI_121510 [Entamoeba histolytica HM-1:IMSS]
gi|449705880|gb|EMD45836.1| RNA polymerase II ctd phosphatase, putative [Entamoeba histolytica
KU27]
Length = 428
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/209 (23%), Positives = 89/209 (42%), Gaps = 45/209 (21%)
Query: 964 FRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLF-AG 1019
F P ++ + R GI TF+E+ SKL+++H+ T+G K YA + ++ + + F G
Sbjct: 98 FEIPEQNAKVFIRFRDGIVTFMEKVSKLYDIHVVTLGQKEYAFAIVNAINKLRNIPFITG 157
Query: 1020 RVISR--------GDDGDPFDG-------DERVPKSKDLEGVLGMESAVVIIDDSVRVWP 1064
+++ D+ D DG +ER + + +G E VI+DD + VW
Sbjct: 158 DLVTAEDCSSVIVCDEKDTNDGLIDREETNERRSVKRSI-PTMGKEEMQVIVDDRIDVWD 216
Query: 1065 HNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH 1124
+ VV+ Y P + + D L V++ ++ F+
Sbjct: 217 NKN----VVQICEYVPSTNQV------------------DTELVRVTEVLQNIYTKFYDE 254
Query: 1125 QSLDDVDVRNILAAEQRKILAGCRIVFSR 1153
DV+ IL ++KIL + F+R
Sbjct: 255 HI---EDVKEILHLFRKKILENKNLYFNR 280
>gi|55740293|gb|AAV63948.1| putative nuclear LIM interactor-interacting protein [Phytophthora
sojae]
gi|348665891|gb|EGZ05719.1| hypothetical protein PHYSODRAFT_551168 [Phytophthora sojae]
Length = 237
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 2/103 (1%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
++ LVLD+D L++S +EV+ E R ++ ++ + + K RPG+
Sbjct: 40 RIALVLDMDECLVHSKFQNEVEYRQSEY-RPEQLEEYGDSFEIVMDDGERAVVNK-RPGL 97
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
FLE A+K ++++++T G + Y + LDPKG LFAGR
Sbjct: 98 DRFLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFF 140
>gi|449505979|ref|XP_004162620.1| PREDICTED: uncharacterized protein LOC101226452 [Cucumis sativus]
Length = 470
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 73/152 (48%), Gaps = 42/152 (27%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMWTKL 975
LVLDLD TL++S L +++ D FRF H+ ++ K
Sbjct: 300 LVLDLDETLVHST------------LEPQDDAD--------FRFTVCLNMKEHI-VYVKR 338
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFD 1032
RP + FL+R +++FE+ ++T +YA ++ LDP + + R+ DG
Sbjct: 339 RPYLQIFLDRVAEMFEVAIFTASQSIYAEQVLNKLDPDNCIISRRLYRESCIFSDG---- 394
Query: 1033 GDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A VVI+D+ +V+
Sbjct: 395 -----CYTKDL-TVLGIDLAKVVIVDNYPQVF 420
>gi|156549638|ref|XP_001604265.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like, partial [Nasonia vitripennis]
Length = 512
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 10/91 (10%)
Query: 991 EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL--G 1048
E+H+ T G + YA +A +LD G LF+ R++SR + DP K+ +L+ + G
Sbjct: 1 ELHICTFGARQYAHRVAAILDNDGKLFSHRILSRDECFDP------QSKTANLKALFPCG 54
Query: 1049 MESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
++ V IIDD VW NL+ V+ Y +F
Sbjct: 55 VD-MVCIIDDRDDVW-QGCANLVQVKPYHFF 83
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 64/126 (50%), Gaps = 8/126 (6%)
Query: 1113 VIERLHKIFFSHQSLD--DVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHP--LW 1168
+++R+H F++ + ++ I+ + ++L G + FS + P N LH +
Sbjct: 267 ILKRIHTEFYAEGDSEPRSRTLKEIIPKVRSRVLKGLCLTFSGLVP---NNQKLHQSRAY 323
Query: 1169 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 1227
+ A FGA ++ + +Q TH+VA GT KV A G+ +V+P W+ A + +E
Sbjct: 324 KVARAFGAQASQDLTEQTTHLVAIQPGTVKVREAKRQGKVKIVNPDWLWTCAERWEHVDE 383
Query: 1228 QDFAIK 1233
+ + +K
Sbjct: 384 RLYPLK 389
>gi|449433585|ref|XP_004134578.1| PREDICTED: uncharacterized protein LOC101215257 [Cucumis sativus]
Length = 484
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 73/152 (48%), Gaps = 42/152 (27%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMWTKL 975
LVLDLD TL++S L +++ D FRF H+ ++ K
Sbjct: 314 LVLDLDETLVHST------------LEPQDDAD--------FRFTVCLNMKEHI-VYVKR 352
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFD 1032
RP + FL+R +++FE+ ++T +YA ++ LDP + + R+ DG
Sbjct: 353 RPYLQIFLDRVAEMFEVAIFTASQSIYAEQVLNKLDPDNCIISRRLYRESCIFSDG---- 408
Query: 1033 GDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL VLG++ A VVI+D+ +V+
Sbjct: 409 -----CYTKDL-TVLGIDLAKVVIVDNYPQVF 434
>gi|302814947|ref|XP_002989156.1| hypothetical protein SELMODRAFT_129286 [Selaginella moellendorffii]
gi|300143056|gb|EFJ09750.1| hypothetical protein SELMODRAFT_129286 [Selaginella moellendorffii]
Length = 245
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 65/138 (47%), Gaps = 22/138 (15%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL++S H + L F ++ + RP + F
Sbjct: 47 LVLDLDETLVHSTTDHCGNADFSFSLHAN--------------FQRQTVYVRRRPHLQMF 92
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
+ER ++LFE+ ++T YA ++ +LDPK +F R+ D DG+ KD
Sbjct: 93 MERVAQLFEIIVFTASQSTYAEKLLNILDPKRKVFRHRIFR--DSCVLVDGN----YLKD 146
Query: 1043 LEGVLGME-SAVVIIDDS 1059
L VLG + S VI+D+S
Sbjct: 147 L-SVLGRDLSKTVIVDNS 163
>gi|449494439|ref|XP_004159546.1| PREDICTED: CTD small phosphatase-like protein 2-like [Cucumis
sativus]
Length = 446
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 74/158 (46%), Gaps = 28/158 (17%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL++S ++P D D P F ++ + RP + F
Sbjct: 251 LVLDLDETLVHST----LEPCVD--------ADFTFPVN--FNLQEHTVYVRCRPYLRDF 296
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFDGDERVPK 1039
+E ++ FE+ ++T +YA ++ VLDPK +F RV DG+
Sbjct: 297 MEAVARHFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVFRESCVFVDGNYL-------- 348
Query: 1040 SKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + A V+I+D+S + + N I +E +
Sbjct: 349 -KDL-SVLGRDLARVIIVDNSPQAFGFQVDNGIPIESW 384
>gi|226506682|ref|NP_001149415.1| CTD-phosphatase-like protein [Zea mays]
gi|195627078|gb|ACG35369.1| CTD-phosphatase-like protein [Zea mays]
gi|414881341|tpg|DAA58472.1| TPA: CTD-phosphatase-like protein [Zea mays]
Length = 460
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 72/150 (48%), Gaps = 28/150 (18%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
+ + LVLDLD TL++S L + + D F + ++ K RP
Sbjct: 284 TKHVTLVLDLDETLVHST------------LDQCDSADFTL--EVFFNMKNHTVYVKKRP 329
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGD 1034
+ FLE+ +++FE+ ++T ++YA ++ LDP G + R+ DG
Sbjct: 330 YLKVFLEKVAQMFELVIFTASQRIYAEQLIDKLDPDGKYISRRIYRESCIFSDG------ 383
Query: 1035 ERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A V I+D++ +V+
Sbjct: 384 ---CYTKDL-TILGIDLAKVAIVDNTPQVF 409
>gi|449450582|ref|XP_004143041.1| PREDICTED: uncharacterized protein LOC101204959 [Cucumis sativus]
Length = 1024
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 74/158 (46%), Gaps = 28/158 (17%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL++S ++P D D P F ++ + RP + F
Sbjct: 244 LVLDLDETLVHST----LEPCVD--------ADFTFPVN--FNLQEHTVYVRCRPYLRDF 289
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFDGDERVPK 1039
+E ++ FE+ ++T +YA ++ VLDPK +F RV DG+
Sbjct: 290 MEAVARHFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVFRESCVFVDGNYL-------- 341
Query: 1040 SKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + A V+I+D+S + + N I +E +
Sbjct: 342 -KDL-SVLGRDLARVIIVDNSPQAFGFQVDNGIPIESW 377
>gi|367002193|ref|XP_003685831.1| hypothetical protein TPHA_0E03070 [Tetrapisispora phaffii CBS 4417]
gi|357524130|emb|CCE63397.1| hypothetical protein TPHA_0E03070 [Tetrapisispora phaffii CBS 4417]
Length = 494
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 53/105 (50%), Gaps = 14/105 (13%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
RK CL+LDLD TL++S+ F VD D ++ D + H ++ + RP
Sbjct: 321 GRKKCLILDLDETLVHSS-FKYVDSA-DFVIPVT--IDNQTHHVYVIK----------RP 366
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
G+ FL+R S+L+E+ ++T Y + +LDP + R+
Sbjct: 367 GVDEFLKRVSELYEVVVFTASVSRYGDPLLNILDPANTIIHHRLF 411
>gi|356515353|ref|XP_003526365.1| PREDICTED: uncharacterized protein LOC100813300 [Glycine max]
Length = 467
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 72/146 (49%), Gaps = 23/146 (15%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+K+ L LDLD TL++S+ + D K DRE+ ++ + RP
Sbjct: 294 KKVTLALDLDETLIHSS-MEQCDGAD---FTFKMITDRERT-----------VYVRKRPF 338
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
+ FL + S++FE+ ++T ++YA + VLDP F+ RV + +R
Sbjct: 339 LQEFLAKVSEMFEIIIFTASKRMYAETLLDVLDPDKKFFSRRVCR-----ESCTWKDRCC 393
Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
KDL VLG++ A V IID++ V+
Sbjct: 394 -VKDL-TVLGIDLAKVCIIDNTPEVF 417
>gi|407043726|gb|EKE42114.1| NLI interacting factor family phosphatase domain containing protein
[Entamoeba nuttalli P19]
Length = 428
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 88/209 (42%), Gaps = 45/209 (21%)
Query: 964 FRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLF-AG 1019
F P + + R GI TF+E+ SKL+++H+ T+G K YA + ++ + V F G
Sbjct: 98 FEIPEQNAKVLIRFRDGIVTFMEKVSKLYDIHVVTLGQKEYAFAIVNAINKLRDVPFITG 157
Query: 1020 RVISR--------GDDGDPFDG-------DERVPKSKDLEGVLGMESAVVIIDDSVRVWP 1064
+++ D+ D DG +ER + + +G E VI+DD + VW
Sbjct: 158 DLVTAEDCSSVIVCDEKDTNDGLIDREETNERRSVKRSI-PTMGKEEMQVIVDDRIDVWD 216
Query: 1065 HNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH 1124
+ VV+ Y P + + D L V++ ++ F+
Sbjct: 217 NKN----VVQICEYVPSTNQV------------------DTELVRVTEVLQNIYTKFYDE 254
Query: 1125 QSLDDVDVRNILAAEQRKILAGCRIVFSR 1153
DV+ IL ++KIL + F+R
Sbjct: 255 HI---EDVKEILHLFRKKILENKNLYFNR 280
>gi|302817700|ref|XP_002990525.1| hypothetical protein SELMODRAFT_131775 [Selaginella moellendorffii]
gi|302817706|ref|XP_002990528.1| hypothetical protein SELMODRAFT_131706 [Selaginella moellendorffii]
gi|300141693|gb|EFJ08402.1| hypothetical protein SELMODRAFT_131775 [Selaginella moellendorffii]
gi|300141696|gb|EFJ08405.1| hypothetical protein SELMODRAFT_131706 [Selaginella moellendorffii]
Length = 213
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 64/121 (52%), Gaps = 19/121 (15%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
L+LDLD TL+ +++ + D ++ E +E+P +W RPG+ F
Sbjct: 20 LILDLDGTLIATSRQAGLHAKLDFVV---EFDPQEQP-----------VWVCKRPGLDDF 65
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
L +AS+LFE+ ++++G + Y +M + +DP G L A +SR D G + + + KD
Sbjct: 66 LSKASQLFEVVVFSLGKRAYVEKMREKIDPSGSLVAF-WLSR----DSCSGSDAIKEYKD 120
Query: 1043 L 1043
L
Sbjct: 121 L 121
>gi|145502170|ref|XP_001437064.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404211|emb|CAK69667.1| unnamed protein product [Paramecium tetraurelia]
Length = 334
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 94/221 (42%), Gaps = 40/221 (18%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LV+DLD TL++S+ E V+D I+ + K ++ +RPG F
Sbjct: 48 LVIDLDETLVHSS--FEPMKVNDLIVEVTMNDQKYK------------IYVNIRPGAHDF 93
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS-- 1040
+E ASK FE+ ++T YA + LDP G++ D F + V K
Sbjct: 94 IEEASKYFELIIFTASISEYANSVIDFLDPHGLV----------DLRLFRENCTVYKDIL 143
Query: 1041 -KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 1099
KDL + +V++ID+SV + +N + + Y + + ++ LL P L
Sbjct: 144 VKDLSLLKRKLDSVILIDNSVNSFMFQPMNAVHILNY-FEDKTDQELTLLIPFL------ 196
Query: 1100 ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 1140
L S + +H+ F + D + +++ +Q
Sbjct: 197 ------KLLSQFQDVRPVHEWLFKYAHFDKFEYVDVMGIKQ 231
>gi|224035555|gb|ACN36853.1| unknown [Zea mays]
gi|414881338|tpg|DAA58469.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
gi|414881339|tpg|DAA58470.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
gi|414881340|tpg|DAA58471.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
Length = 397
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 72/150 (48%), Gaps = 28/150 (18%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
+ + LVLDLD TL++S L + + D F + ++ K RP
Sbjct: 221 TKHVTLVLDLDETLVHST------------LDQCDSADFTL--EVFFNMKNHTVYVKKRP 266
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGD 1034
+ FLE+ +++FE+ ++T ++YA ++ LDP G + R+ DG
Sbjct: 267 YLKVFLEKVAQMFELVIFTASQRIYAEQLIDKLDPDGKYISRRIYRESCIFSDG------ 320
Query: 1035 ERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A V I+D++ +V+
Sbjct: 321 ---CYTKDL-TILGIDLAKVAIVDNTPQVF 346
>gi|260789874|ref|XP_002589969.1| hypothetical protein BRAFLDRAFT_224775 [Branchiostoma floridae]
gi|229275156|gb|EEN45980.1| hypothetical protein BRAFLDRAFT_224775 [Branchiostoma floridae]
Length = 232
Score = 50.4 bits (119), Expect = 0.006, Method: Composition-based stats.
Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 14/120 (11%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
++ R R+ K S + LVLDLD TL++ + L + E+ + P
Sbjct: 35 EEMRQRQPALPLKTRSTPEFSLVLDLDETLVHCS------------LNELEDANLTFPV- 81
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
LF+ ++ + RP FLER SKL+E+ L+T K+YA ++ +LDPK L R+
Sbjct: 82 -LFQDVTYQVYVRTRPYYREFLERMSKLYEIILFTASKKVYADKLMNILDPKKELVRHRL 140
>gi|308485158|ref|XP_003104778.1| CRE-SCPL-3 protein [Caenorhabditis remanei]
gi|308257476|gb|EFP01429.1| CRE-SCPL-3 protein [Caenorhabditis remanei]
Length = 292
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 56/112 (50%), Gaps = 26/112 (23%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM------G 970
S+ + LVLDLD TL++ + + P+ + + FP M
Sbjct: 61 SSAEYTLVLDLDETLVHCS----LTPLDNATMI----------------FPVMFQDITYQ 100
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
++ +LRP + TFL R SK+FE+ ++T K+YA ++ ++DP+ + R+
Sbjct: 101 VYVRLRPHLRTFLRRMSKIFEIIIFTASKKVYANKLCDIIDPQKTMIRHRLF 152
>gi|224057698|ref|XP_002299297.1| predicted protein [Populus trichocarpa]
gi|222846555|gb|EEE84102.1| predicted protein [Populus trichocarpa]
Length = 256
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 14/91 (15%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL++S L ++ D P F ++ + RP + F
Sbjct: 81 LVLDLDETLVHST------------LEPCDDADFTFPVN--FNLQQHTVFVRCRPYLRDF 126
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+ER S LFE+ ++T +YA ++ VLDPK
Sbjct: 127 MERVSSLFEIIIFTASQSIYAEQLLNVLDPK 157
>gi|157125124|ref|XP_001660632.1| hypothetical protein AaeL_AAEL010078 [Aedes aegypti]
gi|108873763|gb|EAT37988.1| AAEL010078-PA [Aedes aegypti]
Length = 678
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 22/164 (13%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K S+ + LVLDLD TL VH + +E D LF+ ++
Sbjct: 493 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDASFKFPVLFQECKYTVFV 538
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
+ RP FLE+ S++FE+ L+T ++YA ++ +LDP+ L R+ + +G
Sbjct: 539 RTRPFFREFLEKVSQIFEVILFTASKRVYADKLLNLLDPERRLIKYRLFR--EHCVLVNG 596
Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 597 N----YIKDLT-ILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 635
>gi|426378923|ref|XP_004056157.1| PREDICTED: CTD small phosphatase-like protein 2 [Gorilla gorilla
gorilla]
Length = 398
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 8/104 (7%)
Query: 963 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
LF+ ++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+
Sbjct: 280 LFQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLF 339
Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPH 1065
+ G+ KDL +LG + S +IID+S + + +
Sbjct: 340 R--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAY 376
>gi|357135834|ref|XP_003569513.1| PREDICTED: uncharacterized protein LOC100822852 [Brachypodium
distachyon]
Length = 447
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 71/151 (47%), Gaps = 30/151 (19%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
+K+ LVLDLD TL++S H D + F ++ + RP
Sbjct: 269 TKKVTLVLDLDETLVHSTMEHC--------------SDADFTFPVFFDMKEHVVYVRKRP 314
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDG 1033
+ FL++ +++F++ ++T +YA ++ LDP+ LF R + G
Sbjct: 315 HLHIFLQKMAEMFDVVIFTASQSVYADQLLDRLDPEKTLFCKRFFRESCVFTESG----- 369
Query: 1034 DERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL V+G++ A VVIID++ +V+
Sbjct: 370 -----YTKDLT-VVGVDLAKVVIIDNTPQVF 394
>gi|195996503|ref|XP_002108120.1| hypothetical protein TRIADDRAFT_18774 [Trichoplax adhaerens]
gi|190588896|gb|EDV28918.1| hypothetical protein TRIADDRAFT_18774, partial [Trichoplax adhaerens]
Length = 208
Score = 50.1 bits (118), Expect = 0.007, Method: Composition-based stats.
Identities = 48/159 (30%), Positives = 76/159 (47%), Gaps = 24/159 (15%)
Query: 920 KLCLVLDLDHTLLN-SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ LV+DLD TL++ S E +H I K D ++ +LRP
Sbjct: 30 EFTLVIDLDETLVHCSLSLLEDANLHFPIYFKNNNYD---------------VYVRLRPY 74
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
FLER SK++E+ L+T K+YA ++ ++DP L R+ + F V
Sbjct: 75 YREFLERVSKIYEVILFTASKKVYANKLMDIIDPGRKLVKHRLFR---EHCVFVHGNYV- 130
Query: 1039 KSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
KDL G+LG + S VI+D+S + + + N I +E +
Sbjct: 131 --KDL-GILGRDLSKTVIVDNSPQAFGYQLSNGIPIESW 166
>gi|390356060|ref|XP_003728694.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 1
[Strongylocentrotus purpuratus]
Length = 514
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 83/176 (47%), Gaps = 24/176 (13%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
QK RT L + + + K LVLDLD TL++ + L + E P
Sbjct: 319 QKNRTPVLPLKTR--RSPKYSLVLDLDETLVHCS------------LAEMENCTMSFPV- 363
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
F+ ++ + RP FLER SK+FE+ L+T ++YA ++ +LDP+ L R+
Sbjct: 364 -YFQDNEYQVYVRTRPFFRDFLERMSKIFEIILFTASKRVYADKLLNLLDPEKKLVRHRL 422
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + + VIID+S + + + N I +E +
Sbjct: 423 FR--EHCICVQGN----YIKDL-NILGRDLTKTVIIDNSPQAFGYQLENGIPIESW 471
>gi|293332237|ref|NP_001167877.1| uncharacterized protein LOC100381584 [Zea mays]
gi|223944585|gb|ACN26376.1| unknown [Zea mays]
gi|413950698|gb|AFW83347.1| hypothetical protein ZEAMMB73_634755 [Zea mays]
Length = 419
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 71/149 (47%), Gaps = 28/149 (18%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S H + + F + ++ + RP
Sbjct: 242 KHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRPY 287
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
+ FLE+ +++FE+ ++T ++YA ++ LDP G + R+ DG
Sbjct: 288 LKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRIYRESCVFSDG------- 340
Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A V I+D++ +V+
Sbjct: 341 --CYTKDL-TILGIDLAKVAIVDNTPQVF 366
>gi|118354395|ref|XP_001010460.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89292227|gb|EAR90215.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 540
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 81/178 (45%), Gaps = 31/178 (17%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK--PHRHLFRFPHMG----- 970
++KL L+LDLD TL+N+ V +E K EE R K ++++ +
Sbjct: 201 SQKLNLILDLDETLVNTV------WVTNENQSKLEEIYRYKMPSNKNVITIQYSNNEGAQ 254
Query: 971 --MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
T LRP + F+ K F + +Y+ G K Y ++ +DP+ LF I +
Sbjct: 255 KEFITILRPHLKEFVTEMKKYFNILVYSHGRKDYVLKLLDKIDPRRDLFNRNNIFKN--- 311
Query: 1029 DPFDGDERVPKSKDLEGVL------GMESAV---VIIDDSVRVWPHNKL-NLIVVERY 1076
+G + KD++ ++ +E A+ +IIDD +W N++ ++R+
Sbjct: 312 ---EGQVNIKTQKDIKNIIECDSPSALEKALKSSIIIDDIFEIWLEETFPNVVPIKRF 366
>gi|219109563|ref|XP_002176536.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411071|gb|EEC50999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 809
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 101/253 (39%), Gaps = 33/253 (13%)
Query: 1008 KVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---- 1063
KVL+ + LF R++SR D D + + + G++ V++DD VW
Sbjct: 510 KVLELRQRLFGSRIVSRTDVRDLGQNVKSLKRIFPCGGIMA-----VVMDDREDVWANAA 564
Query: 1064 ---------PHNKLNLIVVERYTYFPCSRRQFGLLGPSLL-EIDHDERSEDGTLASSLGV 1113
P + L L+ ++ F G L E + + D L SL +
Sbjct: 565 DILTVRKGEPPDNLLLVRPYHWSSFLGFADVNNASGADLSGESEAGDVETDEQLLWSLDI 624
Query: 1114 IERLHKIFF----SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH---- 1165
++R+H+ F+ S V +I+ + + L G +VFS + P+ L
Sbjct: 625 LQRVHRRFYESDGSFLGALTQTVPDIVKQLRAETLHGAHLVFSGMVPLHRQQQQLESGDK 684
Query: 1166 -----PLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST-GRFVVHPGWVEASA 1219
+ + AE GA + +THVVA GTDK+ A G +V PGW+
Sbjct: 685 VVPRPTVIRYAETLGAKVWSKVTPVLTHVVAAKDGTDKILAARKLPGCRIVKPGWLMECV 744
Query: 1220 LLYRRANEQDFAI 1232
R +E + +
Sbjct: 745 WSLTRRDEGRYLL 757
Score = 45.1 bits (105), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 58/126 (46%), Gaps = 26/126 (20%)
Query: 901 IQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEI-------L 948
+ + +R+ +Q +++ +KL LVLDLDHTL+ N + + D++ L
Sbjct: 236 VSRAEGQRMAQQDAERLQKRKKLSLVLDLDHTLVHATNDTRAQQFCKSRDDVRTLILPML 295
Query: 949 RKKEEQDREKPHRHLFRFPHMGMWT----KLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
R E R P WT K+RP + FL A +E+ +YT G + YA
Sbjct: 296 RPNGEP----------RQPQHPEWTQHFVKMRPHVEVFLNEAQDQYEIGVYTAGTRDYAE 345
Query: 1005 EMAKVL 1010
++ +L
Sbjct: 346 QICILL 351
>gi|390356058|ref|XP_788296.3| PREDICTED: CTD small phosphatase-like protein 2-like isoform 2
[Strongylocentrotus purpuratus]
Length = 485
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 83/176 (47%), Gaps = 24/176 (13%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
QK RT L + + + K LVLDLD TL++ + L + E P
Sbjct: 290 QKNRTPVLPLKTR--RSPKYSLVLDLDETLVHCS------------LAEMENCTMSFPV- 334
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
F+ ++ + RP FLER SK+FE+ L+T ++YA ++ +LDP+ L R+
Sbjct: 335 -YFQDNEYQVYVRTRPFFRDFLERMSKIFEIILFTASKRVYADKLLNLLDPEKKLVRHRL 393
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + + VIID+S + + + N I +E +
Sbjct: 394 FR--EHCICVQGN----YIKDL-NILGRDLTKTVIIDNSPQAFGYQLENGIPIESW 442
>gi|290992214|ref|XP_002678729.1| predicted protein [Naegleria gruberi]
gi|284092343|gb|EFC45985.1| predicted protein [Naegleria gruberi]
Length = 181
Score = 50.1 bits (118), Expect = 0.009, Method: Composition-based stats.
Identities = 53/188 (28%), Positives = 88/188 (46%), Gaps = 31/188 (16%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG------MWT 973
K +VLDLD TL+ S F++ V+D + FPHMG ++
Sbjct: 9 KKTIVLDLDETLIKS--FYQEPEVYDFSID--------------IEFPHMGNLIQQHVYI 52
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
K RPG+ FL+ ++ FE+ ++T +YA + K +DP LF+ V+ R +G
Sbjct: 53 KKRPGLENFLQTLAEKFELIMFTAALPVYADAILKHIDPSAELFS-HVLYRHH----CNG 107
Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
P KDL +LG +++DD V + K N ++++ + R ++ P
Sbjct: 108 SGMFP-GKDLR-ILGRNLDHTLLVDDGVMNFLQPK-NGLLIKSFKGEEGDRILADIIAPF 164
Query: 1093 LLEIDHDE 1100
LL++D E
Sbjct: 165 LLQLDDPE 172
>gi|85001578|ref|XP_955502.1| ctd-like phosphatase [Theileria annulata strain Ankara]
gi|65303648|emb|CAI76026.1| ctd-like phosphatase, putative [Theileria annulata]
Length = 832
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 32/54 (59%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
+ KLRPGI+ F + F + L+T G K +A +++DP+ + F+ R+ SR
Sbjct: 301 YFKLRPGIFNFFHQIRDKFTLFLFTTGTKQHAESALQIIDPQLIYFSNRIFSRS 354
>gi|118390259|ref|XP_001028120.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89309890|gb|EAS07878.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 623
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 69/153 (45%), Gaps = 22/153 (14%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+K L+LDLD TL++ + +D D IL D + + + + +RP
Sbjct: 432 KKKTLILDLDETLIHCNE--SLDNSSDFIL------DIQADSKEV-----VQAGINVRPF 478
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG--DDGDPFDGDER 1036
FLE S L+E+ ++T +YA E+ LDP+ R+ + D R
Sbjct: 479 AKQFLEEMSHLYEIVIFTASRSVYANEVINKLDPQNKFIFKRLFRENCIYKNRIYIKDLR 538
Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLN 1069
+ K++D++ +VI+D+ + HN LN
Sbjct: 539 IFKNRDIKN-------LVIVDNCCLSFCHNILN 564
>gi|431896052|gb|ELK05470.1| CTD small phosphatase-like protein 2 [Pteropus alecto]
Length = 282
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 58/107 (54%), Gaps = 8/107 (7%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R+ +
Sbjct: 140 VYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVC 197
Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 198 VQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 239
>gi|413950699|gb|AFW83348.1| hypothetical protein ZEAMMB73_634755 [Zea mays]
Length = 400
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 71/149 (47%), Gaps = 28/149 (18%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S H + + F + ++ + RP
Sbjct: 223 KHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRPY 268
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
+ FLE+ +++FE+ ++T ++YA ++ LDP G + R+ DG
Sbjct: 269 LKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRIYRESCVFSDG------- 321
Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A V I+D++ +V+
Sbjct: 322 --CYTKDL-TILGIDLAKVAIVDNTPQVF 347
>gi|156404147|ref|XP_001640269.1| predicted protein [Nematostella vectensis]
gi|156227402|gb|EDO48206.1| predicted protein [Nematostella vectensis]
Length = 289
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 49/173 (28%), Positives = 81/173 (46%), Gaps = 31/173 (17%)
Query: 905 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
RTRR E LVLDLD TL++ + L K E+ P +
Sbjct: 97 RTRRTPE---------FSLVLDLDETLVHCS------------LNKLEDATLSFPVS--Y 133
Query: 965 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
+ ++ + RP + FLER SK+FE+ L+T ++YA ++ +LDP+ F R+
Sbjct: 134 QDITYQVFVRTRPHLKYFLERVSKVFEVILFTASKRVYADKLLNILDPEKKYFRHRLFR- 192
Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ G+ KDL +LG + S +I+D+S + + + N I +E +
Sbjct: 193 -EHCVCVQGN----YIKDL-SILGRDLSKTMIVDNSPQAFAYQIFNGIPIESW 239
>gi|302811311|ref|XP_002987345.1| hypothetical protein SELMODRAFT_125729 [Selaginella moellendorffii]
gi|300144980|gb|EFJ11660.1| hypothetical protein SELMODRAFT_125729 [Selaginella moellendorffii]
Length = 240
Score = 49.7 bits (117), Expect = 0.012, Method: Composition-based stats.
Identities = 46/157 (29%), Positives = 73/157 (46%), Gaps = 22/157 (14%)
Query: 921 LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIW 980
+ LVLDLD TL++S H + L F ++ + RP +
Sbjct: 45 VALVLDLDETLVHSTTDHCGNADFSFSLHAN--------------FQRQTVYVRRRPHLQ 90
Query: 981 TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS 1040
F+ER ++LFE+ ++T YA ++ +LDPK +F R+ D DG+
Sbjct: 91 MFMERVAQLFEIIVFTASQSTYAEKLLNILDPKRKVFRHRIFR--DSCVLVDGNYL---- 144
Query: 1041 KDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
KDL VLG + S VI+D+S + + N I +E +
Sbjct: 145 KDLS-VLGRDLSKTVIVDNSPQAFGFQVDNGIPIESW 180
>gi|357487783|ref|XP_003614179.1| CTD small phosphatase-like protein [Medicago truncatula]
gi|355515514|gb|AES97137.1| CTD small phosphatase-like protein [Medicago truncatula]
Length = 306
Score = 49.3 bits (116), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 53/184 (28%), Positives = 85/184 (46%), Gaps = 24/184 (13%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVL LD TL VH +++ KE+ D F ++ + RP + F
Sbjct: 127 LVLGLDGTL-----------VHSTLVKPKEDHDL--TFTVSFNSVKEDVYVRYRPHLKEF 173
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
L+ S +FE+ ++T G ++YA ++ LDP +F R+ + DE+ KD
Sbjct: 174 LDEVSGIFEIIVFTAGQRIYADKLLNKLDPSRKIFRHRLFRES----CVNVDEKY--VKD 227
Query: 1043 LEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG--PSLLEIDHD 1099
L +LG + A V +ID S + N I +E + P + L+ SL+E+D D
Sbjct: 228 L-SILGRDLARVTMIDSSPHSFGFQVENGIPIETWFADPSDNKLLSLIPFLESLVEVD-D 285
Query: 1100 ERSE 1103
R+E
Sbjct: 286 VRTE 289
>gi|340508046|gb|EGR33849.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 280
Score = 49.3 bits (116), Expect = 0.014, Method: Composition-based stats.
Identities = 32/118 (27%), Positives = 64/118 (54%), Gaps = 19/118 (16%)
Query: 901 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREK 958
++K+R ++L +QK+ +K L+LDLD TL++S+ + +E D + +++ Q
Sbjct: 11 VKKQRIKQLGQQKQSCIGKK-TLILDLDETLVHSSFQQINEYDFQFEIVVKNIPYQ---- 65
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
++ K RPGI FL++ S+ +E+ +YT YA ++ ++D + V+
Sbjct: 66 ------------IYVKKRPGIHIFLQKLSEKYEIVIYTASISEYANQVCNIIDQQDVI 111
>gi|391328122|ref|XP_003738541.1| PREDICTED: CTD small phosphatase-like protein 2-like [Metaseiulus
occidentalis]
Length = 236
Score = 49.3 bits (116), Expect = 0.014, Method: Composition-based stats.
Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 14/108 (12%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K SA + LVLDLD TL++ + L + E P LF+ ++
Sbjct: 51 KTRSAPEFSLVLDLDETLVHCS------------LMELEGATFTFPV--LFQGIEYKVYV 96
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+ RP FLER SK+FE+ L+T K+YA ++ +LDPK L R+
Sbjct: 97 RTRPFFREFLERVSKMFEVILFTASKKVYADKLLDLLDPKRHLIRYRL 144
>gi|145523063|ref|XP_001447370.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414881|emb|CAK79973.1| unnamed protein product [Paramecium tetraurelia]
Length = 336
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 98/219 (44%), Gaps = 34/219 (15%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LV+DLD TL++S+ E ++D I+ + + K ++ +RPG F
Sbjct: 48 LVIDLDETLVHSS--FEPMKINDLIVEVTMKDQKYK------------IYVNIRPGAQEF 93
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
++ SKLFE+ ++T YA + +DP G L R+ ++ ++G KD
Sbjct: 94 IKETSKLFELIIFTASISEYANSVIDFIDPHG-LVDLRLFR--ENCTVYNG----VLVKD 146
Query: 1043 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERS 1102
L + +V++ID+SV + +N I + Y + + ++ LL P L
Sbjct: 147 LSLLKRNLDSVILIDNSVNSFMFQPMNAIHILNY-FEDKTDQELILLIPFL--------- 196
Query: 1103 EDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 1141
L S + +H+ F + D + +I+ ++Q+
Sbjct: 197 ---KLISQFQDVRPVHEWLFKYAHFDKFEYVDIMGSKQQ 232
>gi|223943303|gb|ACN25735.1| unknown [Zea mays]
Length = 342
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 71/149 (47%), Gaps = 28/149 (18%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+ + LVLDLD TL++S H + + F + ++ + RP
Sbjct: 165 KHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRPY 210
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
+ FLE+ +++FE+ ++T ++YA ++ LDP G + R+ DG
Sbjct: 211 LKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRIYRESCVFSDG------- 263
Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A V I+D++ +V+
Sbjct: 264 --CYTKDL-TILGIDLAKVAIVDNTPQVF 289
>gi|302824588|ref|XP_002993936.1| hypothetical protein SELMODRAFT_137904 [Selaginella moellendorffii]
gi|300138208|gb|EFJ04983.1| hypothetical protein SELMODRAFT_137904 [Selaginella moellendorffii]
Length = 159
Score = 48.9 bits (115), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 36/174 (20%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL++ A HDE E R + + RPG+ F
Sbjct: 2 LVLDLDQTLVSVAD-------HDE----------ETLLRCVTK----------RPGLDRF 34
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
L+ S+++E+ +++ Y ++ LDP G +F+ + G D D G +RV +
Sbjct: 35 LKDMSQVYEIVIFSASGASYVKKIVSSLDPTGEIFSA--VFTGSDTDWLSG-QRVKDLRK 91
Query: 1043 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEI 1096
L +V IDD+ ++P+N N I V + P + G L P LLE+
Sbjct: 92 LN-----RDKIVWIDDNASLYPYNPKNGIQVPPFHGDP-NDSILGALTPLLLEV 139
>gi|385301990|gb|EIF46141.1| putative rna pol ii ctd phosphatase component [Dekkera bruxellensis
AWRI1499]
Length = 527
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 36/144 (25%), Positives = 65/144 (45%), Gaps = 11/144 (7%)
Query: 1098 HDERSEDGTLASSLG-VIERLHKIFFSHQSLDDV-----DVRNILAAEQRKILAGCRIVF 1151
H+ +D T +LG + ++H F+ + D DV++I+ + + GC +
Sbjct: 225 HNLLCDDDTELQTLGKALTKIHDKFYLEKZRDVASXEVPDVKDIMNTMKSVVFQGCVFLL 284
Query: 1152 SRVFPVGE-ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRF 1208
S + P+G + +W A +GA VTHV+ + T KV A S
Sbjct: 285 SGILPLGTPLDSADIVIW--ARSYGAQFVSEYSTSVTHVICKNPXTFKVRLAKSVDPNVK 342
Query: 1209 VVHPGWVEASALLYRRANEQDFAI 1232
+V+P W+ ++RR E+D+ +
Sbjct: 343 IVNPDWLFKCMSMWRRVPEEDYTV 366
>gi|345479753|ref|XP_001603378.2| PREDICTED: hypothetical protein LOC100119644 [Nasonia vitripennis]
Length = 563
Score = 48.9 bits (115), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
K S+ + LVLDLD TL VH + +E D FRFP
Sbjct: 378 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAS------FRFPVVFQNI 417
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
++ + RP FLE S L+E+ L+T ++YA ++ +LDP L R+ +
Sbjct: 418 TYTVFVRTRPFFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 475
Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL +LG + S VIID+S + + + N I +E +
Sbjct: 476 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 520
>gi|15929101|gb|AAH15010.1| Unknown (protein for IMAGE:4864552), partial [Homo sapiens]
Length = 565
Score = 48.9 bits (115), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)
Query: 1113 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ +++ D+R I+ + K+LA I+FS + FP+ +
Sbjct: 294 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 353
Query: 1162 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 1214
H H A GA V + D+ TH++A GT+KV A G VV+P W
Sbjct: 354 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 408
Query: 1215 VEASALLYRRANEQDFAIK 1233
+ + + + EQ F ++
Sbjct: 409 LWSCLERWDKVEEQLFPLR 427
>gi|225711928|gb|ACO11810.1| Probable C-terminal domain small phosphatase [Lepeophtheirus
salmonis]
Length = 265
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 79/164 (48%), Gaps = 22/164 (13%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K S+ + LVLDLD TL++ + E+D D +F+ ++
Sbjct: 79 KTRSSPRFSLVLDLDETLVHCS-LQELD-------------DASLSFPVVFQDTTYRVFV 124
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
+ RP I FLER SK FE+ L+T K+YA ++ +LDP+ R+ + +G
Sbjct: 125 RTRPRIREFLERVSKNFEVTLFTASKKVYADKLLNLLDPERKWIKYRLFR--EHCVCVNG 182
Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 183 N----YIKDL-NILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 221
>gi|291239709|ref|XP_002739764.1| PREDICTED: CTD small phosphatase-like protein 2-like [Saccoglossus
kowalevskii]
Length = 526
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 34/182 (18%)
Query: 902 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
+ E+ RR K S+ + LVLDLD TL++ + +E+D D
Sbjct: 329 EAEKNRRPVLPLKTRSSPEYSLVLDLDETLVHCS-LNELD-------------DANLTFP 374
Query: 962 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR- 1020
+F+ ++ + RP FLE S+ FE+ L+T K+YA ++ +LDP+ R
Sbjct: 375 VVFQDITYQVFVRTRPYFKEFLEAVSQQFEVILFTASKKVYADKLFNLLDPQKKYVKYRL 434
Query: 1021 -----VISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVE 1074
V +G+ KDL G+LG + S V+I+D+S + + + N I +E
Sbjct: 435 FREHCVCVQGN------------YIKDL-GILGRDLSRVIIVDNSPQAFGYQLSNGIPIE 481
Query: 1075 RY 1076
+
Sbjct: 482 SW 483
>gi|313226803|emb|CBY21948.1| unnamed protein product [Oikopleura dioica]
Length = 444
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 33/182 (18%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
+ +AAI +TR+ E LVLDLD TL VH + E Q
Sbjct: 237 KNRAAILPCKTRKTPE---------YTLVLDLDETL-----------VHCSLC---ELQM 273
Query: 956 REKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
R+ RF ++ ++ K RP + FLER + FE+ ++T K+YA ++ ++DP
Sbjct: 274 RDYEFTFPIRFQNVDYDVYVKTRPYLRDFLERMCEHFEIIIFTASKKVYADKLISIIDPN 333
Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIV 1072
L R+ + G+ KDL +LG + + +I+D+S + + ++ N I
Sbjct: 334 KKLVRHRLFR--EHCMLVQGN----YIKDL-TILGRDLTKTIIVDNSPQAFSYHMDNGIP 386
Query: 1073 VE 1074
+E
Sbjct: 387 IE 388
>gi|397609378|gb|EJK60336.1| hypothetical protein THAOC_19327, partial [Thalassiosira oceanica]
Length = 334
Score = 48.5 bits (114), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 50/101 (49%), Gaps = 17/101 (16%)
Query: 1113 VIERLHKIFFSHQSLDDVDVR---NILAAEQRKILAG----CRIVFSRVFPVGEAN---- 1161
V+ RLH F+S D R ++L + +R++L G C +VFS V P+ + N
Sbjct: 238 VLRRLHARFYSPSPGSDAGRRTVPDLLRSMRREVLGGHPRAC-LVFSGVVPLNQQNVALR 296
Query: 1162 --PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVN 1200
PHL + AE+ G + VTHVVA G++K N
Sbjct: 297 VRPHL---VRYAEELGGTVLPAVGPDVTHVVAKKDGSEKCN 334
>gi|70921595|ref|XP_734099.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56506520|emb|CAH86297.1| hypothetical protein PC301933.00.0 [Plasmodium chabaudi chabaudi]
Length = 212
Score = 48.5 bits (114), Expect = 0.023, Method: Composition-based stats.
Identities = 35/136 (25%), Positives = 62/136 (45%), Gaps = 14/136 (10%)
Query: 953 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
E D + ++ + + + K RP + FLE S +E+ +YT + YA + +LDP
Sbjct: 5 ENDELELYKFFLPYYNFFYYLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDP 64
Query: 1013 KGVLFAGRVISRGD--DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNK 1067
+FA R+++R D D E++ + D V+ DD VW P +
Sbjct: 65 DRTIFADRIVARCSSVDRDENKHFEKIYPNVD-------PKYVIAFDDRKDVWYDIPDS- 116
Query: 1068 LNLIVVERYTYFPCSR 1083
+++ E Y +F S+
Sbjct: 117 -HILRAEHYNFFELSK 131
>gi|335281247|ref|XP_001925490.3| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
[Sus scrofa]
Length = 666
Score = 48.5 bits (114), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 64/134 (47%), Gaps = 13/134 (9%)
Query: 1113 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 1161
++ R+H +++ ++ + D+R I+ + ++LA I+FS + FP+ +
Sbjct: 297 ILARVHSDYYAKYDRFLCGETQEAPDIRKIVPELRSRVLADVAIIFSGLHPTNFPIEKTR 356
Query: 1162 PHLHPLWQTAEQFGAVCTKHID-DQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASA 1219
H H A + D D+ TH++A GT+KV A G+ VV+P W+ +
Sbjct: 357 EHYHATALGARILTQLVLDPDDPDRPTHLIAARAGTEKVRQAQECGQLHVVNPDWLWSCL 416
Query: 1220 LLYRRANEQDFAIK 1233
+ + EQ F ++
Sbjct: 417 ERWDKVEEQLFPLQ 430
>gi|242218013|ref|XP_002474801.1| predicted protein [Postia placenta Mad-698-R]
gi|220726049|gb|EED80012.1| predicted protein [Postia placenta Mad-698-R]
Length = 925
Score = 48.5 bits (114), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 36/64 (56%)
Query: 1167 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 1226
L++ A + GAVC + D+VTH++A G+ K AL ++HP W+ S ++ R +
Sbjct: 77 LFKQALELGAVCLSDLTDRVTHLLAVEPGSAKYRCALENKITIMHPSWITESHEIWLRGD 136
Query: 1227 EQDF 1230
+ D
Sbjct: 137 DVDL 140
>gi|403353558|gb|EJY76317.1| NLI interacting factor-like phosphatase family protein [Oxytricha
trifallax]
Length = 1037
Score = 48.5 bits (114), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 72/151 (47%), Gaps = 31/151 (20%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH----MGMWT 973
++K L+ D+D TL+ H VD + + E+ D P FP +
Sbjct: 654 SKKKTLIFDMDETLI-----HCVDDI------ESEDPDVIIP----IDFPDEDEIVNAGI 698
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
+RP ++ LE A+KLF++ ++T +K YA + LDP+ F R+ R + +G
Sbjct: 699 NIRPYLYECLEEANKLFQVIVFTASHKAYADAILDYLDPENKYFQYRLY-RDNCVQTREG 757
Query: 1034 ----DERVPKSKDLEGVLGMESAVVIIDDSV 1060
D R+ ++DL+ ++IID+SV
Sbjct: 758 YYVKDLRIINNRDLKD-------LIIIDNSV 781
>gi|195125303|ref|XP_002007120.1| GI12759 [Drosophila mojavensis]
gi|193918729|gb|EDW17596.1| GI12759 [Drosophila mojavensis]
Length = 245
Score = 48.5 bits (114), Expect = 0.026, Method: Composition-based stats.
Identities = 32/119 (26%), Positives = 58/119 (48%), Gaps = 14/119 (11%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFH------EVDPVHDEILRKKEEQDREKPHRHLFR 965
+K++ R+ LVLD+D TL++S + E P + + K + P+ + F
Sbjct: 19 KKRLLMVRRKTLVLDMDETLISSVILYRVKSLLEAGPEDNRRYKAKSKIVHSTPYDYSFY 78
Query: 966 FP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
P ++ RP + FL+R SK + + ++T ++ YA+++ LD AGR I
Sbjct: 79 IPMSEASVYVYKRPYVDLFLDRVSKWYNLVVFTAASEAYASQVLDFLD------AGRNI 131
>gi|325180325|emb|CCA14728.1| putative nuclear LIM factor interactorinteracting protein hyphal form
[Albugo laibachii Nc14]
Length = 228
Score = 48.5 bits (114), Expect = 0.027, Method: Composition-based stats.
Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 13/104 (12%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE--EQDREKPHRHLFRFPHMGMWTKLR 976
R+L LVLD+D L++S H D ++ K+ E + R + R
Sbjct: 31 RRLALVLDMDECLIHSIFQH--DNIYQRYPSYKDSFEISTSEGERAI---------VNKR 79
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
PG+ FL A+K F+++++T G ++Y + LDPKG +F R
Sbjct: 80 PGLDAFLREAAKSFDLYVFTAGLRVYGEPILDALDPKGTIFKDR 123
>gi|242205780|ref|XP_002468747.1| predicted protein [Postia placenta Mad-698-R]
gi|220732132|gb|EED85970.1| predicted protein [Postia placenta Mad-698-R]
Length = 925
Score = 48.5 bits (114), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 36/64 (56%)
Query: 1167 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 1226
L++ A + GAVC + D+VTH++A G+ K AL ++HP W+ S ++ R +
Sbjct: 77 LFKQALELGAVCLSDLTDRVTHLLAVEPGSAKYRCALENKITIMHPSWITESHEIWLRGD 136
Query: 1227 EQDF 1230
+ D
Sbjct: 137 DVDL 140
>gi|412985397|emb|CCO18843.1| predicted protein [Bathycoccus prasinos]
Length = 601
Score = 48.1 bits (113), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 80/177 (45%), Gaps = 33/177 (18%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------HMGMWT 973
K LVLDLD TL VH + E++ P F FP +
Sbjct: 405 KNTLVLDLDETL-----------VHSNL-----EEEEGTPD---FTFPVQFNNETHAVNV 445
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
++RP + F++R SK FE+ ++T K+YA ++ LDP+ V F+ R+ D +G
Sbjct: 446 RIRPHLEEFMKRVSKKFEVVIFTASQKVYADKLLDHLDPEHVYFSHRLFR--DSCVLVEG 503
Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
+ KDL VLG + S +IID+S + + N + +E + P LL
Sbjct: 504 N----YLKDL-SVLGRDLSRTLIIDNSPQAFGFQVENGVPIESWYDDPTDDHLLRLL 555
>gi|123496394|ref|XP_001326959.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
gi|121909881|gb|EAY14736.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
Length = 397
Score = 48.1 bits (113), Expect = 0.028, Method: Composition-based stats.
Identities = 65/273 (23%), Positives = 110/273 (40%), Gaps = 42/273 (15%)
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
+LRPGI FL+ ++ F++ L + ++ E+ K LDP F R I R D D F
Sbjct: 102 RLRPGIDNFLKSITEHFKIVLISGLDQYIINEVVKRLDPMHTYFGNR-IYRDQDFD-FKN 159
Query: 1034 DER----VPKSKDLEGVLGMESAVVIIDDSVRVWPHNK----LNLIVVERYTYFPCSRRQ 1085
DE P S DL VI+D S W ++ + V Y YF S
Sbjct: 160 DENNYYLFPPSSDL---------TVILDTSRDHWKNDNGFTFSGFVFVCPYNYFQPS--- 207
Query: 1086 FGLLGPSLLEIDHD-----ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 1140
+++ + D D L +E++H +F+ + +D + + + Q
Sbjct: 208 ------TIINVSEDIALPLVSKLDSVLFGLSRYLEQIH-LFYYVKKVDSIVLS--IGFAQ 258
Query: 1141 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVN 1200
IL GC I + P N + +FG D TH+V +S + +
Sbjct: 259 HPILKGCYICIPDLSPEEAKNFDI-----LIARFGGKFMPSYDSAATHLVISSKDSPAIA 313
Query: 1201 WALS-TGRFVVHPGWVEASALLYRRANEQDFAI 1232
A G ++ GW+ + Y + +E+++ +
Sbjct: 314 QASQYNGVYITTIGWIVDTFTHYEKKDERNYPL 346
>gi|302833726|ref|XP_002948426.1| hypothetical protein VOLCADRAFT_58281 [Volvox carteri f. nagariensis]
gi|300266113|gb|EFJ50301.1| hypothetical protein VOLCADRAFT_58281 [Volvox carteri f. nagariensis]
Length = 215
Score = 48.1 bits (113), Expect = 0.028, Method: Composition-based stats.
Identities = 46/160 (28%), Positives = 77/160 (48%), Gaps = 22/160 (13%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
AR+ LVLDLD TL++S+ L + D P +F ++ + RP
Sbjct: 33 ARRKTLVLDLDETLVHSS------------LEAVDRSDFSFPV--IFNGTEHQVYVRQRP 78
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1037
+ F+ R + LFE+ ++T ++YA ++ +LDP+ L R+ D DG+
Sbjct: 79 YLREFMVRVAALFEVVVFTASQRIYAEKLLDILDPQQQLVRHRIYR--DSCVVVDGNYL- 135
Query: 1038 PKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
KDL +LG + A VI+D+S + + N I +E +
Sbjct: 136 ---KDLS-ILGRDLATTVIVDNSPQAFGFQVDNGIPIESW 171
>gi|405966502|gb|EKC31780.1| CTD small phosphatase-like protein 2 [Crassostrea gigas]
Length = 402
Score = 48.1 bits (113), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 90/192 (46%), Gaps = 25/192 (13%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K S+ + LVLDLD TL++ + E +D LF ++
Sbjct: 218 KTRSSPEFSLVLDLDETLVHCSL--------------TELEDAAFTFPVLFEDVTYKVFV 263
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
+ RP FLE S++FE+ L+T K+YA ++ +LDP+ L R+ + +G
Sbjct: 264 RTRPHFREFLETVSEMFEVILFTASKKVYADKLVNILDPQKQLIKHRLFR--EHCVCING 321
Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
+ KDL +LG + S +I+D+S + + + N I +E + + + R+ L P
Sbjct: 322 N----YIKDL-TILGRDLSRTIIVDNSPQAFGYQLDNGIPIESW-FVDKNDRELLNLVPF 375
Query: 1093 LLEIDHDERSED 1104
L + H R+ED
Sbjct: 376 LQSLVH--RNED 385
>gi|341876625|gb|EGT32560.1| hypothetical protein CAEBREN_01530 [Caenorhabditis brenneri]
Length = 286
Score = 48.1 bits (113), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 33/52 (63%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
++ +LRP + TFL R +K FE+ ++T K+YA ++ +LDP+ L R+
Sbjct: 101 VYVRLRPHLRTFLNRMAKTFEIIIFTASKKVYANKLCDILDPRKNLIRHRLF 152
>gi|268566879|ref|XP_002639837.1| C. briggsae CBR-SCPL-3 protein [Caenorhabditis briggsae]
Length = 294
Score = 48.1 bits (113), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 33/52 (63%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
++ ++RP + TFL R SK+FE+ ++T K YA ++ +LDP+ + R+
Sbjct: 101 VYVRIRPFLRTFLTRMSKVFEIIVFTASKKCYANKLCDILDPQKTIIKHRLF 152
>gi|290986065|ref|XP_002675745.1| NLI interacting factor domain-containing protein [Naegleria gruberi]
gi|284089343|gb|EFC43001.1| NLI interacting factor domain-containing protein [Naegleria gruberi]
Length = 510
Score = 48.1 bits (113), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 96/231 (41%), Gaps = 39/231 (16%)
Query: 884 GDVEHLF--EGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHE-- 939
GD+ H F E ++++A Q +T L Q+ +K LVLDLD TL++S H
Sbjct: 271 GDLIHQFYEEVKTNKKRAHKQAPQTALLPPQRPHVQGKK-TLVLDLDETLVHSVFVHTDQ 329
Query: 940 ---VDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYT 996
V P+ E D ++ + RPG+ +L + +E+ ++T
Sbjct: 330 ADFVIPI---------EMDGRTYSCYVLK----------RPGVDEYLRELGQYYEIIIFT 370
Query: 997 MGNKLYATEMAKVLDPKGVLFAGRVISRGDD--GDPFDGDERVPKSKDLEGVLGMESAVV 1054
LYA + +LD GV+ GR+ GD + KDL + +
Sbjct: 371 ASLSLYANPLLDILDKHGVI-EGRLFREHCTKVGDTY--------IKDLSRLGRDLDQTI 421
Query: 1055 IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDG 1105
I+D+S + N + + Y + R+ GLL L ++ ++ DG
Sbjct: 422 IVDNSPSCYAMQPQNALACTTW-YDDPNDRELGLLADCLKRLEREKAVYDG 471
>gi|124513824|ref|XP_001350268.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
gi|23615685|emb|CAD52677.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
Length = 1288
Score = 48.1 bits (113), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 12/125 (9%)
Query: 964 FRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
F P + K RP + FL+ S +E+ +YT + YA + +LDP +F+ R+
Sbjct: 888 FYLPQYNFFYYLKFRPYVRQFLQILSLYYELAIYTNATREYADVVIAILDPDRTIFSDRI 947
Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTY 1078
++R D DE S+ V V+ DD VW P + ++ E Y +
Sbjct: 948 VARCSSTDR---DENKYFSRIYPNV--DPKYVIAFDDRKDVWIDIPQSH--ILKAEHYNF 1000
Query: 1079 FPCSR 1083
F S+
Sbjct: 1001 FELSK 1005
>gi|332020757|gb|EGI61161.1| CTD small phosphatase-like protein 2 [Acromyrmex echinatior]
Length = 593
Score = 48.1 bits (113), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
K S+ + LVLDLD TL VH + +E D FRFP
Sbjct: 408 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 447
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
++ + RP FLE S L+E+ L+T ++YA ++ +LDP L R+ +
Sbjct: 448 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 505
Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL +LG + S VIID+S + + + N I +E +
Sbjct: 506 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 550
>gi|322779051|gb|EFZ09448.1| hypothetical protein SINV_03717 [Solenopsis invicta]
Length = 568
Score = 48.1 bits (113), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
K S+ + LVLDLD TL VH + +E D FRFP
Sbjct: 383 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 422
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
++ + RP FLE S L+E+ L+T ++YA ++ +LDP L R+ +
Sbjct: 423 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 480
Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL +LG + S VIID+S + + + N I +E +
Sbjct: 481 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 525
>gi|242053713|ref|XP_002456002.1| hypothetical protein SORBIDRAFT_03g028730 [Sorghum bicolor]
gi|241927977|gb|EES01122.1| hypothetical protein SORBIDRAFT_03g028730 [Sorghum bicolor]
Length = 400
Score = 47.8 bits (112), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 50/104 (48%), Gaps = 14/104 (13%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
+ + LVLDLD TL++S H + + F + ++ + RP
Sbjct: 222 TKHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRP 267
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+ FLE+ +++FE+ ++T ++YA ++ LDP G + R+
Sbjct: 268 YLKMFLEKVAQMFEVVIFTASQRIYAEQLIDKLDPDGKYISRRI 311
>gi|428182825|gb|EKX51685.1| hypothetical protein GUITHDRAFT_65993, partial [Guillardia theta
CCMP2712]
Length = 179
Score = 47.8 bits (112), Expect = 0.038, Method: Composition-based stats.
Identities = 32/99 (32%), Positives = 50/99 (50%), Gaps = 16/99 (16%)
Query: 924 VLDLDHTLLN-SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
VLDLD TL++ S +F E + ++ K EQD +W K+RP F
Sbjct: 1 VLDLDETLVHASLEFMEQSHLQFDVTFK--EQDYH-------------VWVKIRPHCLEF 45
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
LER ++ FE+ ++T +YA ++ ++DP L RV
Sbjct: 46 LERLAEKFEIIVFTASQSIYADKLLNLIDPDSRLIKHRV 84
>gi|307165882|gb|EFN60237.1| CTD small phosphatase-like protein 2 [Camponotus floridanus]
Length = 568
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
K S+ + LVLDLD TL VH + +E D FRFP
Sbjct: 383 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 422
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
++ + RP FLE S L+E+ L+T ++YA ++ +LDP L R+ +
Sbjct: 423 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 480
Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL +LG + S VIID+S + + + N I +E +
Sbjct: 481 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 525
>gi|307194093|gb|EFN76554.1| CTD small phosphatase-like protein 2 [Harpegnathos saltator]
Length = 546
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
K S+ + LVLDLD TL VH + +E D FRFP
Sbjct: 361 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 400
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
++ + RP FLE S L+E+ L+T ++YA ++ +LDP L R+ +
Sbjct: 401 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 458
Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL +LG + S VIID+S + + + N I +E +
Sbjct: 459 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 503
>gi|71649764|ref|XP_813595.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70878493|gb|EAN91744.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 446
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 79/173 (45%), Gaps = 24/173 (13%)
Query: 901 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP 959
I K L Q + + +K L+LDLD TL++S+ + P HD IL K E +
Sbjct: 239 IAKNHASLLPLQMRQYHGKK-TLILDLDETLVHSSL--TLQPKQHDLILSMKTEPEVTT- 294
Query: 960 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
++ RP + F++ + LFE+ ++T +Y + +DP+G+L +
Sbjct: 295 -----------IYVAYRPFLHEFIQAVAGLFEVVIFTASVSMYCNPVMDAVDPEGILGSL 343
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 1071
R+ + +G KDL +LG E S V I+D+S + + N I
Sbjct: 344 RLYR--EHCSILNG----AYVKDL-SLLGRELSQVAIVDNSPVTYLFQQRNAI 389
>gi|308801351|ref|XP_003077989.1| double-stranded RNA-binding domain (ISS) [Ostreococcus tauri]
gi|116056440|emb|CAL52729.1| double-stranded RNA-binding domain (ISS) [Ostreococcus tauri]
Length = 793
Score = 47.8 bits (112), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 54/122 (44%), Gaps = 26/122 (21%)
Query: 967 PHMGMWTKLRPG---IWTFL---ERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
P M +RPG + ++L +R SK E + TM N YA EM ++LDP G +F
Sbjct: 334 PATAMLVHIRPGWGELRSYLSGSDRGSKRAETFVCTMANIDYAREMCRLLDPHGTVF--- 390
Query: 1021 VISRGDDGDPFDGDERVPKSK--------DLEGVLGMESAVVIIDDSVRVW-PHNKLNLI 1071
DP D+R+ K D G+ VI+DD VW P + +++
Sbjct: 391 --------DPAQLDKRIKSVKPDELKSLSDTCGLHFPSELAVIVDDRTAVWEPSAQSHIL 442
Query: 1072 VV 1073
V
Sbjct: 443 AV 444
>gi|224002358|ref|XP_002290851.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974273|gb|EED92603.1| predicted protein, partial [Thalassiosira pseudonana CCMP1335]
Length = 196
Score = 47.8 bits (112), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 76/157 (48%), Gaps = 22/157 (14%)
Query: 921 LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIW 980
+CLVLDLD TL++ V+PV D + E F + + RP +
Sbjct: 19 ICLVLDLDETLVHCT----VEPVSDADMIFPVE----------FNGMEYTVHVRCRPFLT 64
Query: 981 TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS 1040
FLE+ S+ FE+ ++T ++YA ++ ++DP+G R+ D P +G+
Sbjct: 65 EFLEKVSEDFEVVVFTASQQVYADKLLDMIDPEGKFIKHRMFR--DSCLPVEGN----FL 118
Query: 1041 KDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
KDL +LG + V++D+S + + N I +E +
Sbjct: 119 KDL-TILGRDLRRAVLVDNSPHAFGYQVDNGIPIESW 154
>gi|170050634|ref|XP_001861399.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167872200|gb|EDS35583.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 627
Score = 47.4 bits (111), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 50/100 (50%), Gaps = 14/100 (14%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K S+ + LVLDLD TL++ + +E D LF+ ++
Sbjct: 482 KTRSSPEFSLVLDLDETLVHCSL--------------QELSDASFKFPVLFQECQYTVFV 527
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+ RP FLE+ S++FE+ L+T ++YA ++ +LDP+
Sbjct: 528 RTRPFFREFLEKVSQIFEVILFTASKRVYADKLLNLLDPE 567
>gi|395517551|ref|XP_003762939.1| PREDICTED: CTD small phosphatase-like protein-like [Sarcophilus
harrisii]
Length = 461
Score = 47.4 bits (111), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 80/189 (42%), Gaps = 33/189 (17%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + K+ K C+V+DLD TL++S+ P+ + E
Sbjct: 186 DQRQVIPIPSPSAKYLLPELKLSDYGKKCMVIDLDETLVHSS----FKPISNADFIVPVE 241
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 242 IDGTVHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 291
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNK 1067
GV A V RG+ KDL LG E S V+IID+S + +
Sbjct: 292 GVFRARLFRESCVFHRGN------------YVKDLSQ-LGRELSKVIIIDNSPASYIFHP 338
Query: 1068 LNLIVVERY 1076
N + V+ +
Sbjct: 339 ENAVPVQSW 347
>gi|225718796|gb|ACO15244.1| Probable C-terminal domain small phosphatase [Caligus clemensi]
Length = 314
Score = 47.4 bits (111), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 50/97 (51%), Gaps = 14/97 (14%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
S+ + LVLDLD TL++ + E+D D +F+ ++ + R
Sbjct: 131 SSPRFSLVLDLDETLVHCS-LQELD-------------DASLSFPVVFQDTTYRVFVRTR 176
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
P I FLER SK FE+ L+T K+YA ++ +LDP+
Sbjct: 177 PRIREFLERVSKNFEVTLFTASKKVYADKLLNLLDPE 213
>gi|389583329|dbj|GAB66064.1| hypothetical protein PCYB_082250 [Plasmodium cynomolgi strain B]
Length = 821
Score = 47.4 bits (111), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 44/95 (46%), Gaps = 11/95 (11%)
Query: 987 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
+K +E++LYTMG +A +LDP F RV SR D V K L +
Sbjct: 2 NKKYEIYLYTMGTLEHAKSCLLLLDPLKKFFGNRVFSRKD---------SVNGLKHLNRI 52
Query: 1047 L-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
L S + IDDS +W + + I V Y YFP
Sbjct: 53 LPTYRSVSLCIDDSDYMWKESS-SCIKVHGYNYFP 86
>gi|302824592|ref|XP_002993938.1| hypothetical protein SELMODRAFT_137927 [Selaginella moellendorffii]
gi|300138210|gb|EFJ04985.1| hypothetical protein SELMODRAFT_137927 [Selaginella moellendorffii]
Length = 163
Score = 47.4 bits (111), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 60/121 (49%), Gaps = 9/121 (7%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
RPG+ FL+ S+++E+ +++ Y ++ LDP G +F+ + G D D G +
Sbjct: 32 RPGLDRFLKDMSQVYEIVVFSASGASYVKKIVSSLDPTGEIFSA--VFTGSDTDWLSG-Q 88
Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 1095
RV + L +V IDD+ ++P+N N I V + P + G L P LLE
Sbjct: 89 RVKDLRKLN-----RDKIVWIDDNASLYPYNPKNGIQVPPFHGDP-NDSILGALTPLLLE 142
Query: 1096 I 1096
+
Sbjct: 143 V 143
>gi|407846470|gb|EKG02580.1| hypothetical protein TCSYLVIO_006391 [Trypanosoma cruzi]
Length = 447
Score = 47.4 bits (111), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 79/173 (45%), Gaps = 24/173 (13%)
Query: 901 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP 959
I K L Q + + +K L+LDLD TL++S+ + P HD IL K E +
Sbjct: 240 IAKNHASLLPLQMRQYHGKK-TLILDLDETLVHSSL--TLQPKQHDLILSMKTEPEVTT- 295
Query: 960 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
++ RP + F++ + LFE+ ++T +Y + +DP+G+L +
Sbjct: 296 -----------IYVAYRPFLHEFIQAVAGLFEVVIFTASVSMYCNPVMDAVDPEGILGSL 344
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 1071
R+ + +G KDL +LG E S V I+D+S + + N I
Sbjct: 345 RLYR--EHCSILNG----AYVKDL-SLLGRELSQVAIVDNSPVTYLFQQRNAI 390
>gi|149392655|gb|ABR26130.1| ctd-phosphatase-like protein [Oryza sativa Indica Group]
Length = 187
Score = 47.4 bits (111), Expect = 0.058, Method: Composition-based stats.
Identities = 43/154 (27%), Positives = 78/154 (50%), Gaps = 28/154 (18%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K ++++ LVLDLD TL++S H D V D L+ F + ++
Sbjct: 5 KSARSKQITLVLDLDETLVHSTLDH-CDNV-DFTLQV------------FFNMKNHTVYV 50
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDP 1030
+ RP + FLE+ +++FE+ ++T ++YA ++ LDP L + R+ +G
Sbjct: 51 RQRPHLKMFLEKVAQMFELVIFTASQRIYAEQLIDRLDPDERLISHRIYRESCIFSEG-- 108
Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+KDL +LG++ A VVI+D++ +V+
Sbjct: 109 -------CYTKDLT-ILGVDLAKVVIVDNTPQVF 134
>gi|401624712|gb|EJS42762.1| psr2p [Saccharomyces arboricola H-6]
Length = 391
Score = 47.4 bits (111), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 49/105 (46%), Gaps = 19/105 (18%)
Query: 914 KMFSARKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
+ F RK CLVLDLD TL++S+ H D + E D+ +
Sbjct: 216 QAFQQRK-CLVLDLDETLVHSSFKYMHTAD-----FVLPVEIDDQVH-----------NV 258
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+ RPG+ FL R S+L+E+ ++T YA + LDP G +
Sbjct: 259 YVIKRPGVDEFLHRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 303
>gi|157873633|ref|XP_001685322.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128394|emb|CAJ08450.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 231
Score = 47.0 bits (110), Expect = 0.061, Method: Composition-based stats.
Identities = 34/113 (30%), Positives = 58/113 (51%), Gaps = 7/113 (6%)
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
P + + RPG+ FL + ++ ++M ++T G LYA + + L P ++ R +R
Sbjct: 60 PSVTYYAFRRPGLNEFLYQCAEHYDMRIFTAGEDLYARTLLRWLLPDNLIDESRWYTR-- 117
Query: 1027 DGDPFDGDERVPKSKD-LEGVLGMESAVVIIDDSV--RVWPHNKLNLIVVERY 1076
D DG R+ K+ L+G E A +I+DDS V+PH N + + R+
Sbjct: 118 DACVGDGYGRLIKNLSMLDGFQFEERATLILDDSAPDNVYPHQ--NALAIPRF 168
>gi|357129571|ref|XP_003566435.1| PREDICTED: uncharacterized protein C2F7.02c-like [Brachypodium
distachyon]
Length = 302
Score = 47.0 bits (110), Expect = 0.062, Method: Composition-based stats.
Identities = 48/146 (32%), Positives = 69/146 (47%), Gaps = 30/146 (20%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF-PHMG-----MWT 973
K L LDLD TL++S + DPV P R+ F P +G +
Sbjct: 125 KKTLFLDLDETLIHS----QTDPV---------------PARYDFTVRPVIGGQAITFYV 165
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
RPG+ FL A++ FE+ ++T G + YA+ + LDP G + A R + RG D DG
Sbjct: 166 TKRPGVDEFLRAAAEAFEVVVFTAGLEQYASLVLDRLDPDGAVIAHR-LYRGACRD--DG 222
Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDS 1059
D R+ KDL +I+DD+
Sbjct: 223 DGRL--VKDLAATGRALDCAIIVDDN 246
>gi|328874828|gb|EGG23193.1| CTD small phosphatase-like protein 2 [Dictyostelium fasciculatum]
Length = 692
Score = 47.0 bits (110), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 70/151 (46%), Gaps = 31/151 (20%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K K+ LVLDLD TL++ + DP+ D L F ++
Sbjct: 508 KTLDTPKISLVLDLDETLVHCS----TDPIEDPDLT----------FLVTFNAIEYKVYA 553
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP----KGVLFAGRVISRGDDGD 1029
K RP FL +AS+LFE+ ++T ++YA ++ ++DP K LF +
Sbjct: 554 KKRPFFEEFLVKASELFEVIIFTASQEVYANKLLNMIDPNNHVKYRLFRDSCVY------ 607
Query: 1030 PFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
+G+ KDL +LG + S VVI+D+S
Sbjct: 608 -VEGNYL----KDL-SILGRDLSQVVIVDNS 632
>gi|159473212|ref|XP_001694733.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276545|gb|EDP02317.1| predicted protein [Chlamydomonas reinhardtii]
Length = 215
Score = 47.0 bits (110), Expect = 0.068, Method: Composition-based stats.
Identities = 50/166 (30%), Positives = 75/166 (45%), Gaps = 34/166 (20%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP--HMGM---- 971
AR+ LVLDLD TL++S+ E R F FP GM
Sbjct: 32 ARRKTLVLDLDETLVHSS--------------------LEAVDRSDFNFPVTFNGMDHTV 71
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ + RP + F+ R + LFE+ ++T ++YA + +LDP L R+ D
Sbjct: 72 YVRQRPHLHDFMARVAALFEVVVFTASQRIYAERLLDILDPGQALVRHRIYR--DSCVVV 129
Query: 1032 DGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
DG+ KDL VLG + A VI+D+S + + N I +E +
Sbjct: 130 DGNYL----KDLS-VLGRDLAHTVIVDNSPQAFGFQVDNGIPIESW 170
>gi|66803905|ref|XP_635771.1| CTD small phosphatase-like protein 2 [Dictyostelium discoideum AX4]
gi|74851880|sp|Q54GB2.1|CTSL2_DICDI RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
2
gi|60464148|gb|EAL62309.1| CTD small phosphatase-like protein 2 [Dictyostelium discoideum AX4]
Length = 567
Score = 47.0 bits (110), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 70/150 (46%), Gaps = 29/150 (19%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMG 970
K S+ K+ LVLDLD TL++ + +P+ E+PH F
Sbjct: 384 KEHSSPKISLVLDLDETLVHCS----TEPL-------------EQPHLTFPVFFNNTEYQ 426
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
++ K RP FL + S +FE+ ++T ++YA ++ ++DP + + D
Sbjct: 427 VFAKKRPFFEEFLHKVSDIFEVIIFTASQEVYANKLLNMIDPNNKI---KYRLYRDSCVY 483
Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
DG+ KDL VLG + VVIID+S
Sbjct: 484 VDGNYL----KDL-SVLGRDLKQVVIIDNS 508
>gi|413917756|gb|AFW57688.1| hypothetical protein ZEAMMB73_437679 [Zea mays]
Length = 186
Score = 47.0 bits (110), Expect = 0.071, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 33/58 (56%)
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
F F ++ + RP + FL+R + +FE ++T +YA ++ VLDPK LF RV
Sbjct: 33 FNFREHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 90
>gi|224031885|gb|ACN35018.1| unknown [Zea mays]
Length = 190
Score = 47.0 bits (110), Expect = 0.071, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 33/58 (56%)
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
F F ++ + RP + FL+R + +FE ++T +YA ++ VLDPK LF RV
Sbjct: 33 FNFREHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 90
>gi|219126682|ref|XP_002183580.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404817|gb|EEC44762.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 224
Score = 47.0 bits (110), Expect = 0.074, Method: Composition-based stats.
Identities = 34/108 (31%), Positives = 51/108 (47%), Gaps = 14/108 (12%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K A + LVLDLD TL++ V+PV + L + H+
Sbjct: 37 KTAGAPPITLVLDLDETLVHCT----VEPVENADLTFPVDFHNVTYQVHV---------- 82
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+LRP ++TFL R +E+ L+T K+YA E+ +DP G F R+
Sbjct: 83 RLRPHLFTFLSRIEGQYEIVLFTASQKVYANELLNRIDPDGKYFHHRL 130
>gi|321474368|gb|EFX85333.1| hypothetical protein DAPPUDRAFT_300415 [Daphnia pulex]
Length = 867
Score = 46.6 bits (109), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 3/65 (4%)
Query: 1169 QTAEQFGAVCT--KHIDDQVTHVVANSL-GTDKVNWALSTGRFVVHPGWVEASALLYRRA 1225
QT E G + KH D TH+V L DK+ +++ G+++VHPGW+ S Y
Sbjct: 662 QTIEVLGGIAIDGKHYDSTCTHLVVAKLECNDKLMTSIAAGKWIVHPGWIAKSEQTYHFV 721
Query: 1226 NEQDF 1230
+E+ F
Sbjct: 722 DERIF 726
>gi|351697455|gb|EHB00374.1| CTD small phosphatase-like protein [Heterocephalus glaber]
Length = 356
Score = 46.6 bits (109), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 78/186 (41%), Gaps = 31/186 (16%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +QK + L + + K C+V+DLD TL++S+ P+ + E D
Sbjct: 162 EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 217
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD GV
Sbjct: 218 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 267
Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNL 1070
A V RG+ KDL + S V+I+D+S + + N
Sbjct: 268 FRARLFRESCVFHRGN------------YVKDLSRLGRELSKVIIVDNSPASYIFHPENA 315
Query: 1071 IVVERY 1076
+ V+ +
Sbjct: 316 VPVQSW 321
>gi|407407114|gb|EKF31076.1| hypothetical protein MOQ_005093 [Trypanosoma cruzi marinkellei]
Length = 463
Score = 46.6 bits (109), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 79/173 (45%), Gaps = 24/173 (13%)
Query: 901 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP 959
I K L Q + + +K L+LDLD TL++S+ + P HD +L K E +
Sbjct: 256 IAKNHASLLPLQMRQYHGKK-TLILDLDETLVHSSL--TLQPKQHDLVLSMKTEPEITT- 311
Query: 960 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
++ RP + F++ + LFE+ ++T +Y + +DP+G+L +
Sbjct: 312 -----------IYVAYRPFLHEFIQAVAGLFEVVIFTASVSMYCNPVMDAVDPEGILGSL 360
Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 1071
R+ + +G KDL +LG E S V I+D+S + + N I
Sbjct: 361 RLYR--EHCSILNG----AYVKDL-SLLGRELSQVAIVDNSPVTYLFQQRNAI 406
>gi|349603764|gb|AEP99509.1| CTD small phosphatase-like protein 2-like protein, partial [Equus
caballus]
Length = 159
Score = 46.6 bits (109), Expect = 0.082, Method: Composition-based stats.
Identities = 35/113 (30%), Positives = 58/113 (51%), Gaps = 20/113 (17%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR------VISR 1024
++ +LRP FLER S+++E+ L+T K+YA ++ +LDPK L R V +
Sbjct: 17 VYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFREHCVCVQ 76
Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
G+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 77 GN------------YIKDLN-ILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 116
>gi|299470416|emb|CBN80177.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 613
Score = 46.6 bits (109), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 32/50 (64%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
+ +LRPG+ FLE+ + ++E+ ++T + YA + +LDP G +FA R
Sbjct: 446 YYVQLRPGLARFLEKVAAIYELVVWTASGRSYADAIIDLLDPAGDIFAER 495
>gi|443696004|gb|ELT96785.1| hypothetical protein CAPTEDRAFT_124156, partial [Capitella teleta]
Length = 209
Score = 46.6 bits (109), Expect = 0.088, Method: Composition-based stats.
Identities = 32/108 (29%), Positives = 52/108 (48%), Gaps = 14/108 (12%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K S + LVLDLD TL++ + E +D LF+ ++
Sbjct: 24 KTRSTPEFSLVLDLDETLVHCSL--------------NELEDAAFSFPVLFQDVTYQVFV 69
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+ RP FLER +K+FE+ ++T K+YA ++ +LDP+ L R+
Sbjct: 70 RTRPRFREFLERVAKIFEVTVFTASKKVYANKLLNLLDPEKKLIRHRL 117
>gi|350421968|ref|XP_003493015.1| PREDICTED: hypothetical protein LOC100746789 isoform 2 [Bombus
impatiens]
Length = 457
Score = 46.6 bits (109), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 77/170 (45%), Gaps = 34/170 (20%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
K S+ + LVLDLD TL++ + +E D FRFP
Sbjct: 272 KTRSSPEFSLVLDLDETLVHCSL--------------QELSDAA------FRFPVVFQDV 311
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
++ + RP FLE S L+E+ L+T ++YA ++ +LDP L R+ +
Sbjct: 312 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 369
Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL +LG + S VIID+S + + + N I +E +
Sbjct: 370 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 414
>gi|145508145|ref|XP_001440022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407228|emb|CAK72625.1| unnamed protein product [Paramecium tetraurelia]
Length = 506
Score = 46.6 bits (109), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 62/145 (42%), Gaps = 25/145 (17%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LV DLD TL++ + H ++ H+ P + +RP
Sbjct: 313 LVFDLDETLIHCNDINNNSTDHTTVI-------------HIPNEPETEIRFNIRPHCQQM 359
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDGDERVP 1038
L+ S+ +E+ L+T K YA ++ + +DPKG LF+ R ++G D RV
Sbjct: 360 LKALSQYYELILFTASYKEYADKILEYIDPKGNLFSYRFYRESCLELEEG-LLVKDLRVI 418
Query: 1039 KSKDLEGVLGMESAVVIIDDSVRVW 1063
+ + LE + IID+S +
Sbjct: 419 EGRKLEN-------MAIIDNSAYCY 436
>gi|70945368|ref|XP_742511.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56521536|emb|CAH80727.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 359
Score = 46.6 bits (109), Expect = 0.092, Method: Composition-based stats.
Identities = 33/117 (28%), Positives = 55/117 (47%), Gaps = 14/117 (11%)
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD--DGD 1029
+ K RP + FLE S +E+ +YT + YA + +LDP +FA R+++R D D
Sbjct: 3 YLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDPDRTIFADRIVARCSSVDRD 62
Query: 1030 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYFPCSR 1083
E++ + D V+ DD VW P + +++ E Y +F S+
Sbjct: 63 ENKHFEKIYPNVD-------PKYVIAFDDRKDVWYDIPDS--HILRAEHYNFFELSK 110
>gi|350421965|ref|XP_003493014.1| PREDICTED: hypothetical protein LOC100746789 isoform 1 [Bombus
impatiens]
Length = 558
Score = 46.6 bits (109), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
K S+ + LVLDLD TL VH + +E D FRFP
Sbjct: 373 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 412
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
++ + RP FLE S L+E+ L+T ++YA ++ +LDP L R+ +
Sbjct: 413 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 470
Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL +LG + S VIID+S + + + N I +E +
Sbjct: 471 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 515
>gi|32564286|ref|NP_871854.1| Protein SCPL-3, isoform b [Caenorhabditis elegans]
gi|351059572|emb|CCD67162.1| Protein SCPL-3, isoform b [Caenorhabditis elegans]
Length = 312
Score = 46.6 bits (109), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 30/43 (69%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
++ +LRP + TFL R +K FE+ ++T K+YA ++ +LDP+
Sbjct: 101 VYVRLRPHLRTFLSRMAKTFEIIIFTASKKVYANKLCDILDPR 143
>gi|240273650|gb|EER37170.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces capsulatus
H143]
Length = 592
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 46/88 (52%), Gaps = 22/88 (25%)
Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
V+DLD T++++ VDP E +QD++ P+ F+ P M G W
Sbjct: 276 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCW 326
Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMG 998
KLRPG+ FL S LFE+H+YTMG
Sbjct: 327 YYIKLRPGLEEFLRNISTLFELHIYTMG 354
>gi|66361684|ref|XP_627365.1| RNA pol II carboxy terminal domain phosphatase of the HAD superfamily
with a BRCT domain at the C-terminus [Cryptosporidium
parvum Iowa II]
gi|46228744|gb|EAK89614.1| RNA pol II carboxy terminal domain phosphatase of the HAD superfamily
with a BRCT domain at the C-terminus [Cryptosporidium
parvum Iowa II]
Length = 762
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 56/112 (50%), Gaps = 11/112 (9%)
Query: 972 WTKLRPGIWTFLERASK-LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
+ KLRPG+ L SK +E+++YTMG + +A ++LDP+ F + I ++G
Sbjct: 348 YYKLRPGVINMLRTLSKDKYEIYMYTMGTEYHAYTSLRILDPELRFFHSKRIFYRNNG-- 405
Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLN-LIVVERYTYFP 1080
+ K L + + +VI+DD + W +N L+ V Y +FP
Sbjct: 406 ----FKETSIKSLNTLFPYDHRTLVILDDIEQAWT--DINSLLKVYPYNFFP 451
>gi|440293350|gb|ELP86476.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase, putative [Entamoeba invadens IP1]
Length = 213
Score = 46.2 bits (108), Expect = 0.11, Method: Composition-based stats.
Identities = 39/160 (24%), Positives = 70/160 (43%), Gaps = 25/160 (15%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
+L ++ DLD TL+++ D H K Q++E T +RPG
Sbjct: 46 RLTVIFDLDETLIHTHSLLPEDSKHSRETCKVVVQNKEYT-------------TSIRPGA 92
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS---RGDDGDPFDGDER 1036
FL + SK E+ L+T ++YA ++ ++ G +F ++ + G + +
Sbjct: 93 IQFLRQLSKTCEVVLFTASKQVYADQIIDYMEKDGKIFEHKLYQQSCKNKFGRVYKDATK 152
Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ +D++ VVI DD VW + L+V +RY
Sbjct: 153 L--GRDIKN-------VVIFDDCELVWTMTQDKLVVCKRY 183
>gi|154342859|ref|XP_001567375.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134064707|emb|CAM42811.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 228
Score = 46.2 bits (108), Expect = 0.11, Method: Composition-based stats.
Identities = 37/134 (27%), Positives = 65/134 (48%), Gaps = 18/134 (13%)
Query: 957 EKP------HRHLFRF-----PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATE 1005
EKP + + F+F P + + RPG+ FL + ++ ++M ++T G LYA
Sbjct: 39 EKPTFFIDTNENFFQFTLEDDPSVTYYAFRRPGLSEFLHQCAEHYDMRIFTAGEDLYART 98
Query: 1006 MAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDL-EGVLGMESAVVIIDDSV--RV 1062
+ + L P ++ R +R D DG R+ K+ + +G E +I+DDS V
Sbjct: 99 LLRWLLPDNLIDESRWYTR--DACVGDGYGRLIKNLSMIDGFQFEERTALILDDSAPDNV 156
Query: 1063 WPHNKLNLIVVERY 1076
+PH N + + R+
Sbjct: 157 YPHQ--NALAIPRF 168
>gi|17509983|ref|NP_491348.1| Protein SCPL-3, isoform a [Caenorhabditis elegans]
gi|75023288|sp|Q9N4V4.1|SCPL3_CAEEL RecName: Full=CTD small phosphatase-like protein 3; Short=CTDSP-like
3
gi|351059571|emb|CCD67161.1| Protein SCPL-3, isoform a [Caenorhabditis elegans]
Length = 287
Score = 46.2 bits (108), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 30/43 (69%)
Query: 971 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
++ +LRP + TFL R +K FE+ ++T K+YA ++ +LDP+
Sbjct: 101 VYVRLRPHLRTFLSRMAKTFEIIIFTASKKVYANKLCDILDPR 143
>gi|91086797|ref|XP_973406.1| PREDICTED: similar to CTD (carboxy-terminal domain, RNA polymerase
II, polypeptide A) small phosphatase like 2 [Tribolium
castaneum]
gi|270009707|gb|EFA06155.1| hypothetical protein TcasGA2_TC009000 [Tribolium castaneum]
Length = 451
Score = 46.2 bits (108), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 79/168 (47%), Gaps = 36/168 (21%)
Query: 917 SARKLCLVLDLDHTL-------LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
S+ + LVLDLD TL L+ A FH PV LF+
Sbjct: 269 SSPEFSLVLDLDETLVHCSLQELSDASFHF--PV-------------------LFQDCSY 307
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
++ + RP F+E+ S++FE+ L+T ++YA ++ +LDP+ R+ +
Sbjct: 308 TVYVRTRPYFREFMEKVSQMFEVILFTASKRVYADKLLNLLDPERKWIKYRLFR--EHCV 365
Query: 1030 PFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL +LG + S +IID+S + + ++ N I +E +
Sbjct: 366 CVNGN----YIKDL-SILGRDLSKTIIIDNSPQAFGYHLNNGIPIESW 408
>gi|158293726|ref|XP_315066.4| AGAP004967-PA [Anopheles gambiae str. PEST]
gi|157016584|gb|EAA10342.4| AGAP004967-PA [Anopheles gambiae str. PEST]
Length = 226
Score = 46.2 bits (108), Expect = 0.12, Method: Composition-based stats.
Identities = 48/164 (29%), Positives = 78/164 (47%), Gaps = 22/164 (13%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K S+ + LVLDLD TL VH ++ E D LF+ ++
Sbjct: 41 KTRSSPEFSLVLDLDETL-----------VHCSLM---ELSDASFKFPVLFQECKYTVFV 86
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
+ RP FLER S++FE+ L+T ++YA ++ +LDP L R+ + +G
Sbjct: 87 RTRPYFREFLERVSQMFEVILFTASKRVYADKLLNLLDPDRRLIKYRLFR--EHCVLVNG 144
Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ KDL +LG + S +IID+S + + + N I +E +
Sbjct: 145 N----YIKDLT-ILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 183
>gi|328767798|gb|EGF77846.1| hypothetical protein BATDEDRAFT_13622 [Batrachochytrium dendrobatidis
JAM81]
Length = 192
Score = 46.2 bits (108), Expect = 0.13, Method: Composition-based stats.
Identities = 33/101 (32%), Positives = 51/101 (50%), Gaps = 16/101 (15%)
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPV-HDEILRKKEEQDREKPHRHLFRFPHMGM 971
KK S+ + LVLDLD TL++ + P+ H +I E F +
Sbjct: 23 KKTRSSPPITLVLDLDETLVHCS----TSPLDHCDITFPVE-----------FNNITYTV 67
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
+LRP TFLER S++FE+ ++T K+YA + ++DP
Sbjct: 68 SGRLRPHYKTFLERCSEIFEVVVFTASQKIYADRLLNIIDP 108
>gi|51013613|gb|AAT93100.1| YLR019W [Saccharomyces cerevisiae]
Length = 397
Score = 46.2 bits (108), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)
Query: 919 RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
+K CL+LDLD TL++S+ H D + E D+ ++ R
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
PG+ FL R S+L+E+ ++T YA + LDP G +
Sbjct: 270 PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309
>gi|6323047|ref|NP_013119.1| Psr2p [Saccharomyces cerevisiae S288c]
gi|55583862|sp|Q07949.1|PSR2_YEAST RecName: Full=Probable phosphatase PSR2; AltName: Full=Plasma
membrane sodium response protein 2
gi|1360322|emb|CAA97541.1| unnamed protein product [Saccharomyces cerevisiae]
gi|151941187|gb|EDN59565.1| protein phosphatase [Saccharomyces cerevisiae YJM789]
gi|207343198|gb|EDZ70734.1| YLR019Wp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|256269170|gb|EEU04502.1| Psr2p [Saccharomyces cerevisiae JAY291]
gi|285813441|tpg|DAA09337.1| TPA: Psr2p [Saccharomyces cerevisiae S288c]
gi|323332484|gb|EGA73892.1| Psr2p [Saccharomyces cerevisiae AWRI796]
gi|323353905|gb|EGA85758.1| Psr2p [Saccharomyces cerevisiae VL3]
Length = 397
Score = 45.8 bits (107), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)
Query: 919 RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
+K CL+LDLD TL++S+ H D + E D+ ++ R
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
PG+ FL R S+L+E+ ++T YA + LDP G +
Sbjct: 270 PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309
>gi|444322726|ref|XP_004182004.1| hypothetical protein TBLA_0H01990 [Tetrapisispora blattae CBS 6284]
gi|387515050|emb|CCH62485.1| hypothetical protein TBLA_0H01990 [Tetrapisispora blattae CBS 6284]
Length = 688
Score = 45.8 bits (107), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 29/107 (27%)
Query: 912 QKKMFSARKLCLVLDLDHTL-------LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
Q ++FS +K CL+LDLD TL L SA F V PV + E+ H
Sbjct: 511 QNQIFSGKK-CLILDLDETLVHSSFKYLTSADF--VIPVDID----------EQIH---- 553
Query: 965 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
++ RPG+ FLE SK+FE+ ++T Y + VLD
Sbjct: 554 -----NVYVIKRPGVDQFLETVSKIFEVVVFTASVSRYGDPLLDVLD 595
>gi|190406060|gb|EDV09327.1| hypothetical protein SCRG_05007 [Saccharomyces cerevisiae RM11-1a]
Length = 397
Score = 45.8 bits (107), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)
Query: 919 RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
+K CL+LDLD TL++S+ H D + E D+ ++ R
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
PG+ FL R S+L+E+ ++T YA + LDP G +
Sbjct: 270 PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309
>gi|401411253|ref|XP_003885074.1| hypothetical protein NCLIV_054710 [Neospora caninum Liverpool]
gi|325119493|emb|CBZ55046.1| hypothetical protein NCLIV_054710 [Neospora caninum Liverpool]
Length = 630
Score = 45.8 bits (107), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 16/98 (16%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGMWTKLRPG 978
+ LVLDLD TL++S+ F V I + E KPH+ H+ + RPG
Sbjct: 470 RTTLVLDLDETLVHSS-FRPVSVAAFVITVEVEG----KPHKIHVCK----------RPG 514
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+ FLE S L+E+ ++T + YA + +LDPKG+
Sbjct: 515 VDRFLEVVSSLYEVVIFTASLQTYADPLIDLLDPKGLC 552
>gi|323336572|gb|EGA77838.1| Psr2p [Saccharomyces cerevisiae Vin13]
Length = 398
Score = 45.8 bits (107), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)
Query: 919 RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
+K CL+LDLD TL++S+ H D + E D+ ++ R
Sbjct: 227 QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 270
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
PG+ FL R S+L+E+ ++T YA + LDP G +
Sbjct: 271 PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 310
>gi|349579745|dbj|GAA24906.1| K7_Psr2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 397
Score = 45.8 bits (107), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)
Query: 919 RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
+K CL+LDLD TL++S+ H D + E D+ ++ R
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
PG+ FL R S+L+E+ ++T YA + LDP G +
Sbjct: 270 PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309
>gi|392297996|gb|EIW09095.1| Psr2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 397
Score = 45.8 bits (107), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)
Query: 919 RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
+K CL+LDLD TL++S+ H D + E D+ ++ R
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
PG+ FL R S+L+E+ ++T YA + LDP G +
Sbjct: 270 PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309
>gi|170587764|ref|XP_001898644.1| NLI interacting factor-like phosphatase family protein [Brugia
malayi]
gi|158593914|gb|EDP32508.1| NLI interacting factor-like phosphatase family protein [Brugia
malayi]
Length = 314
Score = 45.8 bits (107), Expect = 0.15, Method: Composition-based stats.
Identities = 51/186 (27%), Positives = 86/186 (46%), Gaps = 33/186 (17%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
S + LVLDLD TL++ + D PVH F+ +
Sbjct: 126 STPEFSLVLDLDETLVHCSLTELPDASLTFPVH-------------------FQENTYQV 166
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ ++RP + FLER S+ FE+ L+T ++YA ++ +LDP L R+ + F
Sbjct: 167 YVRVRPHLQEFLERLSRSFEIILFTASKRVYADKLLNLLDPGKRLIRHRLFR---EHCVF 223
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
+ KDL +LG + S +IID+S++ + + N I +E + +F ++ L
Sbjct: 224 VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESW-FFQQDDQELLKLI 278
Query: 1091 PSLLEI 1096
P L +I
Sbjct: 279 PFLEQI 284
>gi|323308065|gb|EGA61318.1| Psr2p [Saccharomyces cerevisiae FostersO]
Length = 319
Score = 45.8 bits (107), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 47/105 (44%), Gaps = 28/105 (26%)
Query: 919 RKLCLVLDLDHTLLNSA--KFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
+K CL+LDLD TL++S+ H D + D++ +
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSADFVLPVEIDDQV---------------------HNV 264
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+ RPG+ FL R S+L+E+ ++T YA + LDP G +
Sbjct: 265 YVIKRPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309
>gi|300121382|emb|CBK21762.2| unnamed protein product [Blastocystis hominis]
Length = 399
Score = 45.8 bits (107), Expect = 0.15, Method: Composition-based stats.
Identities = 59/229 (25%), Positives = 100/229 (43%), Gaps = 31/229 (13%)
Query: 804 LQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNS-PIQPGQIKSGADMKA--VVTNHD 860
L+ IT + ++ ++ S S + PMV+ + P P I G D ++ +V++H
Sbjct: 98 LRNSITGKVPESSIRNSNIDSASTIIRQYPMVTLSCFPSSPKVIIRGKDKQSSDLVSSHH 157
Query: 861 DKQTGTGS------GPEAGPVGAHPQSAWGDVEHL--FEGYDDQQKAAIQKERTRRLEEQ 912
++ S PE A P+S + V + YD AA+Q+ R+
Sbjct: 158 KEEIEPASPNLITSAPETQQ-EAEPESTFNPVIIIKNLPPYD-SLPAALQE----RVLLP 211
Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
K +A K LVLDLD TL++ + E DP D + E R ++
Sbjct: 212 PKSPTAPKYTLVLDLDETLVHCSM--ERDPSADLAFSIRHEGQR------------FTIY 257
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+RP ++ L+R + +E+ +YT K YA + +LD + L R+
Sbjct: 258 ANVRPFLFYLLKRVAPYYEIVIYTASQKCYADRLLDILDSEQHLITHRL 306
>gi|259148008|emb|CAY81257.1| Psr2p [Saccharomyces cerevisiae EC1118]
Length = 397
Score = 45.8 bits (107), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)
Query: 919 RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
+K CL+LDLD TL++S+ H D + E D+ ++ R
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
PG+ FL R S+L+E+ ++T YA + LDP G +
Sbjct: 270 PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309
>gi|365985822|ref|XP_003669743.1| hypothetical protein NDAI_0D01860 [Naumovozyma dairenensis CBS 421]
gi|343768512|emb|CCD24500.1| hypothetical protein NDAI_0D01860 [Naumovozyma dairenensis CBS 421]
Length = 514
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 47/111 (42%), Gaps = 28/111 (25%)
Query: 919 RKLCLVLDLDHTLLNS-------AKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
K CLVLDLD TL++S A F+ + D+I +
Sbjct: 342 HKKCLVLDLDETLVHSSFKYLPNADFNLPVNIDDQI---------------------HNV 380
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
+ RPG+ FLE+ KLFE+ ++T Y + LDPKG R+
Sbjct: 381 YVIKRPGVDEFLEKVGKLFEVVIFTASVSRYGDPLLDRLDPKGKSIHHRLF 431
>gi|308811648|ref|XP_003083132.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
(involved in RNA polymerase II regulation) (ISS)
[Ostreococcus tauri]
gi|116055010|emb|CAL57087.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
(involved in RNA polymerase II regulation) (ISS)
[Ostreococcus tauri]
Length = 485
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 72/156 (46%), Gaps = 21/156 (13%)
Query: 922 CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
LVLDLD TL VH + + D P +F + + RP + T
Sbjct: 288 TLVLDLDETL-----------VHSNLENTGGKSDFSFPV--VFNGEIHQVNVRTRPHLQT 334
Query: 982 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 1041
F+E SK +E+ ++T ++YA ++ +LDPK A RV D +G+ K
Sbjct: 335 FMETVSKKYEIVVFTASQQIYADKLLDLLDPKREWIAHRVFR--DSCVQIEGN----YMK 388
Query: 1042 DLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
DL VLG + S +IID+S + + N I +E +
Sbjct: 389 DLR-VLGRDLSKTIIIDNSPQAFGLQVENGIPIESW 423
>gi|221487382|gb|EEE25614.1| protein phosphatase, putative [Toxoplasma gondii GT1]
Length = 621
Score = 45.4 bits (106), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 53/98 (54%), Gaps = 16/98 (16%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGMWTKLRPG 978
+ LVLDLD TL++S+ F V PV + + E KPH H+ + RPG
Sbjct: 443 RTTLVLDLDETLVHSS-FRPV-PVSAFAITVEVEG---KPHTIHVCK----------RPG 487
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+ FLE S+L+E+ ++T + YA + +LDPKG+
Sbjct: 488 VDRFLEVVSRLYEVVIFTASLQTYADPLIDLLDPKGLC 525
>gi|393909936|gb|EFO24836.2| SCP small domain phosphatase [Loa loa]
Length = 321
Score = 45.4 bits (106), Expect = 0.18, Method: Composition-based stats.
Identities = 51/186 (27%), Positives = 86/186 (46%), Gaps = 33/186 (17%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
S + LVLDLD TL++ + D PVH F+ +
Sbjct: 133 STPEFSLVLDLDETLVHCSLTELPDASLTFPVH-------------------FQENTYQV 173
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ ++RP + FLER S+ FE+ L+T ++YA ++ +LDP L R+ + F
Sbjct: 174 YVRVRPHLQEFLERLSRSFEIILFTASKRIYADKLLNLLDPGKRLIRHRLFR---EHCVF 230
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
+ KDL +LG + S +IID+S++ + + N I +E + +F ++ L
Sbjct: 231 VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESW-FFQQDDQELLKLI 285
Query: 1091 PSLLEI 1096
P L +I
Sbjct: 286 PFLEQI 291
>gi|195383304|ref|XP_002050366.1| GJ22116 [Drosophila virilis]
gi|194145163|gb|EDW61559.1| GJ22116 [Drosophila virilis]
Length = 703
Score = 45.4 bits (106), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 62/121 (51%), Gaps = 4/121 (3%)
Query: 1113 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 1170
++ +HK F++ ++ + D++ I+ + ++L G +VFS + P + +
Sbjct: 382 ILRNIHKRFYAIYDETTEIPDLKIIVPKIRCEVLRGQNLVFSGLVPT-QMKLEQSRAYFI 440
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
A+ GA +I+ +TH+VA + GT KVN A + VV+ W+ A A + +E+
Sbjct: 441 AKSLGAEVQSNINKDITHLVAVNAGTYKVNAAKKESKIKVVNANWLWACAERWEHVDERL 500
Query: 1230 F 1230
F
Sbjct: 501 F 501
>gi|341894763|gb|EGT50698.1| hypothetical protein CAEBREN_25349 [Caenorhabditis brenneri]
Length = 250
Score = 45.4 bits (106), Expect = 0.20, Method: Composition-based stats.
Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 14/108 (12%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K ++ + LVLDLD TL++ + + P+ + + +F+ ++
Sbjct: 22 KTRASAEYTLVLDLDETLVHCS----LTPLDNATM----------IFPVVFQNITYQVYV 67
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+LRP + TFL R +K FE+ ++T K+YA ++ +LDP+ L R+
Sbjct: 68 RLRPHLRTFLNRMAKTFEIIIFTASKKVYANKLCDILDPRKNLIRHRL 115
>gi|401426731|ref|XP_003877849.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494096|emb|CBZ29393.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 231
Score = 45.4 bits (106), Expect = 0.20, Method: Composition-based stats.
Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 7/113 (6%)
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
P + + RPG+ FL + ++ ++M ++T G LYA + + L P ++ R +R
Sbjct: 60 PSVTYYAFRRPGLNEFLYQCAEHYDMRIFTAGEDLYARTLLRWLLPANLIDESRWYTR-- 117
Query: 1027 DGDPFDGDERVPKSKD-LEGVLGMESAVVIIDDSV--RVWPHNKLNLIVVERY 1076
D DG R+ K+ L+G E +I+DDS V+PH N + + R+
Sbjct: 118 DACVGDGYGRLIKNLSMLDGFQFEERTALILDDSAPDNVYPHQ--NALAIPRF 168
>gi|146096062|ref|XP_001467692.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398020532|ref|XP_003863429.1| hypothetical protein, conserved [Leishmania donovani]
gi|134072058|emb|CAM70757.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322501662|emb|CBZ36743.1| hypothetical protein, conserved [Leishmania donovani]
Length = 231
Score = 45.4 bits (106), Expect = 0.20, Method: Composition-based stats.
Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 7/113 (6%)
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
P + + RPG+ FL + ++ ++M ++T G LYA + + L P ++ R +R
Sbjct: 60 PSVTYYAFRRPGLNEFLYQCAEHYDMRIFTAGEDLYARTLLRWLLPDNLIDESRWYTR-- 117
Query: 1027 DGDPFDGDERVPKSKD-LEGVLGMESAVVIIDDSV--RVWPHNKLNLIVVERY 1076
D DG R+ K+ L+G E +I+DDS V+PH N + + R+
Sbjct: 118 DACVGDGYGRLIKNLSMLDGFQFEERTALILDDSAPDNVYPHQ--NALAIPRF 168
>gi|237830029|ref|XP_002364312.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
gi|211961976|gb|EEA97171.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
gi|221507180|gb|EEE32784.1| dullard protein, putative [Toxoplasma gondii VEG]
Length = 621
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 53/98 (54%), Gaps = 16/98 (16%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGMWTKLRPG 978
+ LVLDLD TL++S+ F V PV + + E KPH H+ + RPG
Sbjct: 443 RTTLVLDLDETLVHSS-FRPV-PVSAFAITVEVEG---KPHTIHVCK----------RPG 487
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+ FLE S+L+E+ ++T + YA + +LDPKG+
Sbjct: 488 VDRFLEVVSRLYEVVIFTASLQTYADPLIDLLDPKGLC 525
>gi|254578106|ref|XP_002495039.1| ZYRO0B01958p [Zygosaccharomyces rouxii]
gi|238937929|emb|CAR26106.1| ZYRO0B01958p [Zygosaccharomyces rouxii]
Length = 336
Score = 45.4 bits (106), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 76/174 (43%), Gaps = 20/174 (11%)
Query: 907 RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
++L Q +F+ RK LV+DLD TL++SA R + + H RF
Sbjct: 140 KKLVPQSILFAERKKRLVVDLDETLIHSAT------------RSVSHSNSAQGHMVEVRF 187
Query: 967 PHMGM----WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
P + + RP FL + SK +++ ++T K YA + L+ F G+
Sbjct: 188 PPSSISTLYYVHKRPHCDLFLSKVSKWYDLIIFTASMKEYADPVIDWLESS---FTGKFC 244
Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
R + E V KDL V + VV+ID+S + N+ N I VE +
Sbjct: 245 KRLYRHNCV-VREGVGYIKDLSVVTEVLDEVVLIDNSPTSYARNEDNAIQVEGW 297
>gi|403368592|gb|EJY84135.1| Putative tfiif-interacting component of the c-terminal domain
phosphatase [Oxytricha trifallax]
Length = 525
Score = 45.1 bits (105), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 108/286 (37%), Gaps = 95/286 (33%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHE-------VDPVHDEILRKKEEQDREKPHRHLFRF 966
++ RKL LVLDLD+TLL++ E DP ++ P + ++
Sbjct: 3 QLLQDRKLVLVLDLDNTLLHTKSIEEREFQTKSRDPTFINLI---------DPLKSIYEI 53
Query: 967 PHM--GMWTKLRPGIWTFLERA--SKLFEMHLYTMGNK----------------LYATEM 1006
G TKLRP ++ FL++ + FE++ YT G K L+ E
Sbjct: 54 KLFRGGFHTKLRPFLFEFLKKVFDERKFEIYFYTAGTKDYGMLIIDIFKMEITRLFGKEY 113
Query: 1007 AKVLDPKGVLFAGRVISRGDD---------------------------------GDPFDG 1033
AK + + L ++ISR D GD
Sbjct: 114 AKQISEE--LSHRKLISRCDKERFANKNSSNEIDIDSMQQQLYQQIENQQMGQGGDATLN 171
Query: 1034 DERVPKS-KDLEGVLGMESAVVIIDDSVRVWPH-------NKL-----NLIVVERYTYFP 1080
+ KS L G G ES +IIDD VW NKL NLI++ Y Y+
Sbjct: 172 MKHFIKSLSSLAG--GDESIFIIIDDRSDVWTEEVKDQNGNKLRRVSDNLILIPEYFYWE 229
Query: 1081 CSRRQF----GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF 1122
S+ + ++ S ++ D D +L ++ ++H F+
Sbjct: 230 TSQNRLKSYHKIIQQSGVKYDF-----DLSLVYHYKLLTKIHNKFY 270
>gi|399218895|emb|CCF75782.1| unnamed protein product [Babesia microti strain RI]
Length = 460
Score = 45.1 bits (105), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 49/102 (48%), Gaps = 8/102 (7%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVH-DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
KL +VLD+D TL++ H VH D + E E P + H M+ LRPG
Sbjct: 199 KLLVVLDMDETLIH---MHTNPHVHYDYTINLLEHSKSEDP-MFISCNIHPTMYISLRPG 254
Query: 979 IWTFLERAS---KLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
+ FL S +E+ L+T G +LYA + + LDP +
Sbjct: 255 VKEFLRYLSVNSDFYEVALFTAGTQLYADAVLEGLDPNCTII 296
>gi|159483225|ref|XP_001699661.1| hypothetical protein CHLREDRAFT_111940 [Chlamydomonas reinhardtii]
gi|158281603|gb|EDP07357.1| predicted protein [Chlamydomonas reinhardtii]
Length = 202
Score = 45.1 bits (105), Expect = 0.26, Method: Composition-based stats.
Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 28/124 (22%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH-MG 970
Q++ A KLC++LDLD TL++S R P + H +G
Sbjct: 2 QQRPEHAGKLCVLLDLDGTLVSSY------------------TPRRAPRLPSYVRTHVVG 43
Query: 971 MWTKL---------RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
M +KL RPG+ FLE + E+ ++T G + YA + +DP LFA R+
Sbjct: 44 MGSKLNPAGVFVVERPGLTEFLEELATFAEVIIFTAGLEDYAKPIIDAIDPSNRLFAHRI 103
Query: 1022 ISRG 1025
G
Sbjct: 104 YREG 107
>gi|321470826|gb|EFX81801.1| hypothetical protein DAPPUDRAFT_49973 [Daphnia pulex]
Length = 237
Score = 45.1 bits (105), Expect = 0.27, Method: Composition-based stats.
Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 14/100 (14%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K S+ LVLDLD TL++ + +E +D F+ ++
Sbjct: 52 KTRSSPTFSLVLDLDETLVHCSL--------------EELEDAAFSFPVFFQDTTYQVFV 97
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+ RP FLER S++FE+ L+T K+YA ++ +LDP+
Sbjct: 98 RTRPHFREFLERVSQIFEVILFTASKKVYADKLLNLLDPQ 137
>gi|126644240|ref|XP_001388239.1| hypothetical protein [Cryptosporidium parvum Iowa II]
gi|126117312|gb|EAZ51412.1| hypothetical protein cgd2_3810 [Cryptosporidium parvum Iowa II]
Length = 475
Score = 45.1 bits (105), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 67/151 (44%), Gaps = 18/151 (11%)
Query: 863 QTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKE-RTRRLEEQKKMFSARKL 921
+ G G G +S GD E + YD+ ++ ++ + LE Q++ + RK
Sbjct: 285 EAGIYDKDNLGRNGFSLRSITGDRESII--YDEDYESYLESTIKEPFLEPQRREYIGRK- 341
Query: 922 CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
LVLDLD TL++S+ P+ + E D + + ++ RPG+
Sbjct: 342 TLVLDLDETLIHSS----FQPIRNASFTINIEIDGD----------YYDVYVLKRPGVDK 387
Query: 982 FLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
FL S +FE+ ++T YA + LDP
Sbjct: 388 FLNIVSAIFEVVIFTASLSKYANPLLDRLDP 418
>gi|312072812|ref|XP_003139236.1| SCP small domain phosphatase [Loa loa]
Length = 321
Score = 45.1 bits (105), Expect = 0.29, Method: Composition-based stats.
Identities = 46/168 (27%), Positives = 78/168 (46%), Gaps = 32/168 (19%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
S + LVLDLD TL++ + D PVH F+ +
Sbjct: 133 STPEFSLVLDLDETLVHCSLTELPDASLTFPVH-------------------FQENTYQV 173
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ ++RP + FLER S+ FE+ L+T ++YA ++ +LDP L R+ + F
Sbjct: 174 YVRVRPHLQEFLERLSRSFEIILFTASKRIYADKLLNLLDPGKRLIRHRLFR---EHCVF 230
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
+ KDL +LG + S +IID+S++ + + N I +E + +
Sbjct: 231 VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESWFF 274
>gi|340508012|gb|EGR33824.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 222
Score = 44.7 bits (104), Expect = 0.30, Method: Composition-based stats.
Identities = 35/126 (27%), Positives = 67/126 (53%), Gaps = 17/126 (13%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
L+LDLD TL++S +E +P D ++ +EE + +K + FR +RP F
Sbjct: 61 LLLDLDETLIHSCGLNE-NP--DAVIMAQEEYNSQKQFQIAFR---------IRPYCIEF 108
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 1038
L++ SK ++++++T + YA + LD + + +V++R + + +G D R+
Sbjct: 109 LQQVSKYWDIYVFTASSASYANAIVNYLDSQQE-YIHQVLTRQNCMETKNGFFIKDLRII 167
Query: 1039 KSKDLE 1044
K DL+
Sbjct: 168 KDIDLQ 173
>gi|157870945|ref|XP_001684022.1| nuclear lim interactor-interacting factor-like protein [Leishmania
major strain Friedlin]
gi|68127090|emb|CAJ04515.1| nuclear lim interactor-interacting factor-like protein [Leishmania
major strain Friedlin]
Length = 290
Score = 44.7 bits (104), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 18/160 (11%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
S K+ LVLD+D TL++S D V+D++L E + K R
Sbjct: 109 SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVPSEGK------------TYTVSVKYR 156
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
P + FL S+ FE+ ++T + Y ++ +DP G+L R+ + F
Sbjct: 157 PYLEDFLRFVSRRFEVVIFTASMRAYCDKLMDEIDPHGILGNLRLFR---EHCTFSERSY 213
Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
V KDL + VVI+D+S + + N I ++ +
Sbjct: 214 V---KDLHRLGRDLRRVVILDNSPAAYSFQQRNAIPIKTW 250
>gi|344253634|gb|EGW09738.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Cricetulus griseus]
Length = 354
Score = 44.7 bits (104), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 159 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 214
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 215 GVIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 264
Query: 1016 LFA 1018
A
Sbjct: 265 FRA 267
>gi|123404051|ref|XP_001302356.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
gi|121883637|gb|EAX89426.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
Length = 205
Score = 44.7 bits (104), Expect = 0.32, Method: Composition-based stats.
Identities = 27/102 (26%), Positives = 51/102 (50%), Gaps = 18/102 (17%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
++ + + + LVLDLD TL++++ F P H ++ +F
Sbjct: 21 RRTIVTDSRKALVLDLDETLIHTSTF----PPHSDV--------------ESLKFDDSPD 62
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+ LRP + FL++ S+LFE+ ++T G + YA + +L P+
Sbjct: 63 YVFLRPNVRIFLDKVSELFEVFIFTAGTQNYAERILDLLCPQ 104
>gi|339237973|ref|XP_003380541.1| nuclear envelope morphology protein 1 [Trichinella spiralis]
gi|316976534|gb|EFV59811.1| nuclear envelope morphology protein 1 [Trichinella spiralis]
Length = 281
Score = 44.7 bits (104), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 83/170 (48%), Gaps = 29/170 (17%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E+ KK+++A VLDLD TL++S + DP + +I+ + R
Sbjct: 97 EKSKKLYTA-----VLDLDQTLVHSRSKRKGDPRY-KIVNIPQATRR------------- 137
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA-KVLDPKGVLFAGRVISRGDDG 1028
+T +RP FLE S+ +E+ L+T G YA + +++DP+ F+ D
Sbjct: 138 -FYTAVRPCCAEFLESISEFYEVILFTAGTPRYAAAVIDQLVDPEHKYFSN--FYYRPDC 194
Query: 1029 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYT 1077
P D + KDL +LG + S VI+DD++ + + N I+VE +T
Sbjct: 195 APVDHE----FVKDL-SILGRDLSKTVIMDDNMMSFCCHIDNGILVEPWT 239
>gi|327260340|ref|XP_003214992.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Anolis carolinensis]
Length = 345
Score = 44.7 bits (104), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 55/121 (45%), Gaps = 14/121 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ ++ K R L + K A K+C+V+DLD TL++S+ PV++ E D
Sbjct: 150 EENGSVTKATVRYLLPEIKPQDANKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 205
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL R +LFE L+T YA +A +LD G
Sbjct: 206 GVMHQVYVLKRPHVD----------EFLRRMGELFECVLFTASLAKYADPVADLLDKWGA 255
Query: 1016 L 1016
Sbjct: 256 F 256
>gi|194866038|ref|XP_001971726.1| GG14268 [Drosophila erecta]
gi|190653509|gb|EDV50752.1| GG14268 [Drosophila erecta]
Length = 260
Score = 44.7 bits (104), Expect = 0.34, Method: Composition-based stats.
Identities = 34/120 (28%), Positives = 60/120 (50%), Gaps = 19/120 (15%)
Query: 903 KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP--- 959
++R + +++ RK LVLD+D T++ S L+K ++ R KP
Sbjct: 54 EDRLSPVSKRRLSLVGRK-TLVLDMDETMITSW------------LKKSGKKPRNKPRVA 100
Query: 960 HRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVL 1016
H F P ++ RP + FL+R SK +++ ++T G +LYA+ + LD +G+L
Sbjct: 101 HDFKFYLPAYEATIYVYKRPYLDHFLDRVSKWYDLTVFTAGAELYASPILDFLDRGRGIL 160
>gi|281210104|gb|EFA84272.1| CTD small phosphatase-like protein 2 [Polysphondylium pallidum PN500]
Length = 539
Score = 44.7 bits (104), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 71/148 (47%), Gaps = 37/148 (25%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF-RFPHM--GMWTKLR 976
K+ LVLDLD TL++ + +P+ ++P F F ++ ++ K R
Sbjct: 362 KISLVLDLDETLVHCS----TEPI-------------DEPDLTFFVTFNNVEYKVFAKKR 404
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP----KGVLFAGRVISRGDDGDPFD 1032
P FL +AS LFE+ ++T ++YA ++ ++DP K L+ + D
Sbjct: 405 PFFEDFLSKASSLFELIIFTASQEVYANKLLNMIDPNKHIKYRLYRDSCVC-------VD 457
Query: 1033 GDERVPKSKDLEGVLGME-SAVVIIDDS 1059
G KDL +LG + S VVI+D+S
Sbjct: 458 G----TYLKDL-SILGRDLSQVVIVDNS 480
>gi|302806320|ref|XP_002984910.1| hypothetical protein SELMODRAFT_121210 [Selaginella moellendorffii]
gi|300147496|gb|EFJ14160.1| hypothetical protein SELMODRAFT_121210 [Selaginella moellendorffii]
Length = 198
Score = 44.3 bits (103), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
RPG+ TFL S+++E+ ++T KLYA + LDP G LF R+
Sbjct: 65 RPGVDTFLNEMSQIYEIVVFTRAVKLYADRILDRLDPAGNLFTHRL 110
>gi|403266874|ref|XP_003925585.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Saimiri boliviensis
boliviensis]
Length = 262
Score = 44.3 bits (103), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 67 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 123 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 172
Query: 1016 LFA 1018
A
Sbjct: 173 FRA 175
>gi|390464816|ref|XP_003733289.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Callithrix jacchus]
Length = 260
Score = 44.3 bits (103), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|340504501|gb|EGR30938.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 230
Score = 44.3 bits (103), Expect = 0.43, Method: Composition-based stats.
Identities = 40/157 (25%), Positives = 76/157 (48%), Gaps = 25/157 (15%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
L LDLD TL++S +E +P D IL+ E + P + ++RP F
Sbjct: 49 LFLDLDETLIHSCSLNE-NP--DVILKVGEINE-----------PQFHIGFRIRPYCMDF 94
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 1038
L+ + ++++++T + Y+ + LDP+ G +++R + + +G D R+
Sbjct: 95 LKALVEYWDIYIFTASSSTYSNAIINYLDPERKYING-ILNRSNCMETKNGFFIKDLRIA 153
Query: 1039 KSKDLEGVLGME----SAVVIIDDSVRV--WPHNKLN 1069
K KDL ++ ++ S ID+ + + W HNK +
Sbjct: 154 KGKDLRKIILVDNLSHSFGFQIDNGIPILEWHHNKYD 190
>gi|159485684|ref|XP_001700874.1| hypothetical protein CHLREDRAFT_142839 [Chlamydomonas reinhardtii]
gi|158281373|gb|EDP07128.1| predicted protein, partial [Chlamydomonas reinhardtii]
Length = 418
Score = 44.3 bits (103), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 7/103 (6%)
Query: 975 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
LRPG+ FLE +FE+ L+T + +AT + +DP G +F R+ D
Sbjct: 34 LRPGLRRFLESVRPMFEVVLFTAAGESWATCAMQRIDPDGRIFDTRLYR-----DHTVSH 88
Query: 1035 ERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
+ P KDL LG + A VVI+DD+ ++ + N + V Y
Sbjct: 89 DDWPWVKDLSR-LGRDLARVVIVDDNPLMFMYQPDNALHVAPY 130
>gi|384247176|gb|EIE20663.1| hypothetical protein COCSUDRAFT_30404 [Coccomyxa subellipsoidea
C-169]
Length = 243
Score = 44.3 bits (103), Expect = 0.46, Method: Composition-based stats.
Identities = 51/173 (29%), Positives = 81/173 (46%), Gaps = 22/173 (12%)
Query: 905 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
R RR ++ ++ LVLDLD TL++S L +E D P F
Sbjct: 43 RWRRSLLPRQTRQCKRKTLVLDLDETLVHST------------LDGCDEPDFSFPVA--F 88
Query: 965 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
+ + RP + FL+R ++LFE+ ++T K+YA ++ +LDP L RV
Sbjct: 89 NGREHRVHVRRRPHLQHFLQRCAELFEVVVFTASQKVYAEQLLNILDPTRTLIRHRVFR- 147
Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
D +G+ KDL VLG + A VI+D+S + + + N I +E +
Sbjct: 148 -DSCVFVEGNYL----KDLS-VLGRDLAHTVIVDNSPQAFGYQLPNGIPIESW 194
>gi|209882797|ref|XP_002142834.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
gi|209558440|gb|EEA08485.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
Length = 536
Score = 44.3 bits (103), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 52/107 (48%), Gaps = 15/107 (14%)
Query: 909 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
LE QK + RK LVLDLD TL++S+ P+ E + E + ++ +
Sbjct: 332 LEPQKPEYFGRK-TLVLDLDETLVHSS----FQPIRAASFVISVEIEYEMYNVYVLK--- 383
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
RPG+ FLE S L+E+ ++T YA + LDP+G+
Sbjct: 384 -------RPGVDKFLEVVSSLYEVVIFTASLSKYANPLLDKLDPRGL 423
>gi|335298853|ref|XP_003132160.2| PREDICTED: CTD small phosphatase-like protein-like [Sus scrofa]
Length = 265
Score = 44.3 bits (103), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +QK + L + + K C+V+DLD TL++S+ P+ + E D
Sbjct: 71 EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 126
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD GV
Sbjct: 127 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 176
Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
A V RG+ KDL LG E S V+I+D+S
Sbjct: 177 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 213
>gi|449542561|gb|EMD33540.1| hypothetical protein CERSUDRAFT_160532 [Ceriporiopsis subvermispora
B]
Length = 965
Score = 44.3 bits (103), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 44/99 (44%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
D ++L Q IL C F + L++ A + GA + D +TH+V
Sbjct: 43 DDEDVLGETQASILESCPRPFRGITLCTTGISDKTTLFKQAIELGAQYDSDLRDNITHLV 102
Query: 1191 ANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 1229
A G+ K AL+ ++HP WV S ++ R ++ D
Sbjct: 103 AEVPGSAKYKCALANKIPIMHPSWVTDSYEIWLRGDDVD 141
>gi|290981094|ref|XP_002673266.1| BRCT domain-containing protein [Naegleria gruberi]
gi|284086848|gb|EFC40522.1| BRCT domain-containing protein [Naegleria gruberi]
Length = 706
Score = 44.3 bits (103), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 6/92 (6%)
Query: 1142 KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID-DQVTHVVANSLGTDKVN 1200
KIL G I S G NP L +TA + GA T + TH++A TDK
Sbjct: 239 KILPGVTIAIS-----GIVNPERTTLRETAMKMGAKYTPEFQAGKTTHMIAAFSNTDKTK 293
Query: 1201 WALSTGRFVVHPGWVEASALLYRRANEQDFAI 1232
A++ G FVV WV + L +R +E+ +++
Sbjct: 294 EAINAGIFVVSKQWVYDCSTLSKRMDERRYSM 325
>gi|145538816|ref|XP_001455108.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422896|emb|CAK87711.1| unnamed protein product [Paramecium tetraurelia]
Length = 282
Score = 44.3 bits (103), Expect = 0.49, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 62/132 (46%), Gaps = 23/132 (17%)
Query: 889 LFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL 948
LF+ + K QK + K +S +K+ LVLDLD TL VH E
Sbjct: 6 LFKNCYEHFKGKFQKNYIS--AQTPKQYSQKKV-LVLDLDETL-----------VHCEF- 50
Query: 949 RKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
KE ++ + H L H G ++ K RP + FL+ ASK +E+ ++T G + Y
Sbjct: 51 --KENENFQ--HEVLLEVIHKGQLYTVYLKARPYLNQFLQEASKDYEIFIFTAGYEAYCQ 106
Query: 1005 EMAKVLDPKGVL 1016
E+ +D K ++
Sbjct: 107 EVLSFIDKKKII 118
>gi|326430493|gb|EGD76063.1| hypothetical protein PTSG_00772 [Salpingoeca sp. ATCC 50818]
Length = 840
Score = 44.3 bits (103), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 7/70 (10%)
Query: 1171 AEQFGAVCTKHIDDQVTHVVA----NSL---GTDKVNWALSTGRFVVHPGWVEASALLYR 1223
A+ GA D+ VTHV+A N L T KV +++G+++V P W+ A R
Sbjct: 635 ADAMGATVLSKFDESVTHVLAALDENHLLISNTLKVMMGIASGKWIVSPAWLYACDTENR 694
Query: 1224 RANEQDFAIK 1233
RANE+ F ++
Sbjct: 695 RANEEAFEMQ 704
>gi|340501300|gb|EGR28100.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 306
Score = 44.3 bits (103), Expect = 0.49, Method: Composition-based stats.
Identities = 26/90 (28%), Positives = 47/90 (52%), Gaps = 12/90 (13%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
L LDLD TL++S + +E + ++ K +D +L +F ++RP F
Sbjct: 111 LFLDLDETLIHSCRINE-----NYNVQIKAFEDNNSQQEYLIQF-------RIRPYCMEF 158
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
L++ SK ++++L+T + YA + LDP
Sbjct: 159 LQKISKYWDIYLFTASSTTYANAIVNYLDP 188
>gi|359323950|ref|XP_003640241.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Canis lupus familiaris]
Length = 260
Score = 43.9 bits (102), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ A+ K + L + K A K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAVPKTPVQYLLPEAKAQDADKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|402889397|ref|XP_003908003.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Papio anubis]
Length = 260
Score = 43.9 bits (102), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|395816723|ref|XP_003781843.1| PREDICTED: CTD small phosphatase-like protein isoform 1 [Otolemur
garnettii]
Length = 265
Score = 43.9 bits (102), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +QK + L + + K C+V+DLD TL++S+ P+ + E D
Sbjct: 71 EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 126
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD GV
Sbjct: 127 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 176
Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
A V RG+ KDL LG E S V+I+D+S
Sbjct: 177 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 213
>gi|363736290|ref|XP_003641697.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Gallus gallus]
Length = 275
Score = 43.9 bits (102), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 15/127 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ + K + L + K A KLC+V+DLD TL++S+ PV++ E D
Sbjct: 80 EENGTVPKAAVKHLLPEIKPQDASKLCVVIDLDETLVHSS----FKPVNNADFIIPVEID 135
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 136 GIMHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 185
Query: 1016 LFAGRVI 1022
F R+
Sbjct: 186 -FRARLF 191
>gi|332308973|ref|NP_001193807.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 3 [Homo sapiens]
gi|397495664|ref|XP_003818667.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Pan paniscus]
gi|410036206|ref|XP_003950023.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 [Pan troglodytes]
gi|426338591|ref|XP_004033259.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Gorilla gorilla gorilla]
Length = 260
Score = 43.9 bits (102), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|388579583|gb|EIM19905.1| hypothetical protein WALSEDRAFT_58475 [Wallemia sebi CBS 633.66]
Length = 798
Score = 43.9 bits (102), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 32/58 (55%)
Query: 1165 HPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLY 1222
H L++ A + GA + D VTH++A G+ K N A+ G V+ P ++E S L +
Sbjct: 66 HELYKKAMEMGATYESNFTDNVTHLIAIEFGSPKYNCAIQLGATVLLPAFIEQSYLEW 123
>gi|145544070|ref|XP_001457720.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124425538|emb|CAK90323.1| unnamed protein product [Paramecium tetraurelia]
Length = 659
Score = 43.9 bits (102), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 92/249 (36%), Gaps = 44/249 (17%)
Query: 860 DDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSAR 919
D QT E + +VE L Y D I KE L+ KK R
Sbjct: 193 DSSQTCNHLKIENNYCLICNEKVIRNVESLDLNYSDDISKKISKEIV--LDILKK----R 246
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--------- 970
KL +VLDLD T+L++ K + E K+ + + F +G
Sbjct: 247 KLIMVLDLDQTILHAIKVSTTFNKY-EFCEKQNKMIQADSEAQFNGFQQLGFNIKEHLLD 305
Query: 971 --------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA---- 1018
KLRP F LF++ +YT +K YA + + + F
Sbjct: 306 MTCDQQSKFIIKLRPYFEQFFLTLIPLFDIFIYTKASKSYADFILSFITHRLNEFIPEHK 365
Query: 1019 -----GRVISRGDDGDPFDGDERVPKSKDLEGVL--GMES-AVVIIDDSVRVWPHNKLNL 1070
RV+SR D SK L + G+ + +VI+DD+ +W K NL
Sbjct: 366 PFFPPQRVLSREDTI--------CSNSKSLNRLFYPGIATNLLVILDDNAGMWNQFKENL 417
Query: 1071 IVVERYTYF 1079
I + + YF
Sbjct: 418 IHTKPFVYF 426
>gi|297597243|ref|NP_001043640.2| Os01g0629400 [Oryza sativa Japonica Group]
gi|255673485|dbj|BAF05554.2| Os01g0629400, partial [Oryza sativa Japonica Group]
Length = 177
Score = 43.9 bits (102), Expect = 0.58, Method: Composition-based stats.
Identities = 28/105 (26%), Positives = 57/105 (54%), Gaps = 14/105 (13%)
Query: 963 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
F + ++ + RP + FLE+ +++F++ ++T ++YA ++ LDP G L + R+
Sbjct: 30 FFNMKNHTVYVRQRPHLKMFLEKVAQMFDLVIFTASQRIYAEQLIDRLDPDGRLISHRIY 89
Query: 1023 SRG---DDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
+G +KDL +LG++ A VVI+D++ +V+
Sbjct: 90 RESCIFSEG---------CYTKDLT-ILGVDLAKVVIVDNTPQVF 124
>gi|146089360|ref|XP_001470364.1| nuclear lim interactor-interacting factor-like protein [Leishmania
infantum JPCM5]
gi|134070397|emb|CAM68734.1| nuclear lim interactor-interacting factor-like protein [Leishmania
infantum JPCM5]
Length = 290
Score = 43.9 bits (102), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 18/160 (11%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
S K+ LVLD+D TL++S D V+D++L E + K R
Sbjct: 109 SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVASEGK------------TYTVSVKYR 156
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
P + FL S+ FE+ ++T + Y ++ +DP G+L R+ + F
Sbjct: 157 PYLEDFLRFVSRRFEVVIFTASMRAYCDKLMDEIDPHGILGNLRLFR---EHCTFCERSY 213
Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
V KDL + VVI+D+S + + N I ++ +
Sbjct: 214 V---KDLHRLGRDLRRVVILDNSPAAYSFQQRNAIPIKTW 250
>gi|384250655|gb|EIE24134.1| hypothetical protein COCSUDRAFT_41430 [Coccomyxa subellipsoidea
C-169]
Length = 1029
Score = 43.9 bits (102), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 85/197 (43%), Gaps = 45/197 (22%)
Query: 920 KLCLVLDLDHTLL-----NSAKFH------EVDP-----VHDEILRKKEEQDREKPHRHL 963
+L LVLDLD TLL N + H E+D V ++ K+E+ +E+ + L
Sbjct: 120 RLPLVLDLDETLLEAFTANQLRKHIKDLSAEIDGGNWSNVEKKLQLKREKAFKEEDYNLL 179
Query: 964 FRFPHMGMWTKLRPGI---WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-- 1018
+F T L I W ER FE+++ T ++ YA E + LDP +L
Sbjct: 180 VQFIQTNSVT-LNGQIHKAWPGRER----FEVYVCTTADRSYALEAWRHLDPSALLIPYA 234
Query: 1019 ---GRVISRGDDGDPFDGDERVPKSKDLEGVLGM------------ESAV---VIIDDSV 1060
R + D D D D V KDL V+G+ SA+ VIIDD
Sbjct: 235 DRRKRFHNVHQDKDSKDKDGNVKPVKDLAHVMGLLGHPWSAPCTPPNSAMPLAVIIDDQP 294
Query: 1061 RVW-PHNKLNLIVVERY 1076
VW ++ L VE++
Sbjct: 295 AVWTAESQGQLYQVEKF 311
>gi|296491169|tpg|DAA33242.1| TPA: epithelial cell transforming sequence 2 oncogene [Bos taurus]
Length = 914
Score = 43.9 bits (102), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 1164 LHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 1223
L L G V K + +VTH+VAN +K A+S G ++ P W+ + +
Sbjct: 192 LARLVTLVHHMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIHKA---WE 248
Query: 1224 RANEQDFA 1231
R NEQDF
Sbjct: 249 RRNEQDFC 256
>gi|55740291|gb|AAV63946.1| nuclear LIM factor interactor-interacting protein [Phytophthora
sojae]
gi|348684596|gb|EGZ24411.1| hypothetical protein PHYSODRAFT_325530 [Phytophthora sojae]
Length = 336
Score = 43.5 bits (101), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 45/97 (46%), Gaps = 14/97 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K+CLVLDLD TL++S+ P + E D H ++ RPG
Sbjct: 165 KMCLVLDLDETLVHSS----FRPTPNPDFVIPVEIDGTIHH----------VFVAKRPGA 210
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
FL +K +E+ +YT YA + LDP+GV+
Sbjct: 211 EEFLVEMAKYYEIVIYTASLSKYADPLLDQLDPEGVI 247
>gi|410224860|gb|JAA09649.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1 [Pan troglodytes]
Length = 260
Score = 43.5 bits (101), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|148236343|ref|NP_001091042.1| protein ECT2 [Bos taurus]
gi|146186917|gb|AAI40474.1| ECT2 protein [Bos taurus]
Length = 914
Score = 43.5 bits (101), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 1164 LHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 1223
L L G V K + +VTH+VAN +K A+S G ++ P W+ + +
Sbjct: 192 LARLVTLVHHMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIHKA---WE 248
Query: 1224 RANEQDFA 1231
R NEQDF
Sbjct: 249 RRNEQDFC 256
>gi|119591022|gb|EAW70616.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1, isoform CRA_b [Homo sapiens]
gi|119591023|gb|EAW70617.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1, isoform CRA_b [Homo sapiens]
Length = 255
Score = 43.5 bits (101), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 60 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 115
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 116 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 165
Query: 1016 LFA 1018
A
Sbjct: 166 FRA 168
>gi|55740285|gb|AAV63944.1| nuclear LIM factor interactor-interacting protein [Phytophthora
sojae]
gi|348684603|gb|EGZ24418.1| hypothetical protein PHYSODRAFT_296510 [Phytophthora sojae]
Length = 336
Score = 43.5 bits (101), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 45/97 (46%), Gaps = 14/97 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K+CLVLDLD TL++S+ P + E D H ++ RPG
Sbjct: 165 KMCLVLDLDETLVHSS----FRPTPNPDFVIPVEIDGTIHH----------VFVAKRPGA 210
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
FL +K +E+ +YT YA + LDP+GV+
Sbjct: 211 EEFLVEMAKYYEIVIYTASLSKYADPLLDQLDPEGVI 247
>gi|392338692|ref|XP_003753607.1| PREDICTED: protein ECT2 isoform 1 [Rattus norvegicus]
Length = 882
Score = 43.5 bits (101), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ PGW+ + + R NEQ F
Sbjct: 170 HMGGVIRKECNSKVTHLVANCTQGEKFRVAVSLGTPIMRPGWIYEA---WERRNEQCFC 225
>gi|32813443|ref|NP_872580.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 2 [Homo sapiens]
gi|31074175|gb|AAP34397.1| small CTD phosphatase 1 [Homo sapiens]
gi|410351181|gb|JAA42194.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1 [Pan troglodytes]
Length = 260
Score = 43.5 bits (101), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|398016831|ref|XP_003861603.1| nuclear lim interactor-interacting factor-like protein [Leishmania
donovani]
gi|322499830|emb|CBZ34903.1| nuclear lim interactor-interacting factor-like protein [Leishmania
donovani]
Length = 290
Score = 43.5 bits (101), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 18/160 (11%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
S K+ LVLD+D TL++S D V+D++L E + K R
Sbjct: 109 SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVASEGK------------TYTVSVKYR 156
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
P + FL S+ FE+ ++T + Y ++ +DP G+L R+ + F
Sbjct: 157 PYLEDFLRFVSRRFEVVIFTASMRAYCDKLMDEIDPHGILGNLRLFR---EHCTFCERSY 213
Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
V KDL + VVI+D+S + + N I ++ +
Sbjct: 214 V---KDLHRLGRDLRRVVILDNSPAAYSFQQRNAIPIKTW 250
>gi|395823467|ref|XP_003785008.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 [Otolemur garnettii]
Length = 260
Score = 43.5 bits (101), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|145483633|ref|XP_001427839.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394922|emb|CAK60441.1| unnamed protein product [Paramecium tetraurelia]
Length = 308
Score = 43.5 bits (101), Expect = 0.71, Method: Composition-based stats.
Identities = 34/96 (35%), Positives = 54/96 (56%), Gaps = 15/96 (15%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
ARKLC VLDLD TL++S + D +D +L + LF+ ++ +RP
Sbjct: 54 ARKLC-VLDLDETLVHSQ--FKGDNGYDFLLDIIVQS-------QLFK-----VFVTVRP 98
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
G+ TFLE+ S+ F++ L+T K YA + ++DP+
Sbjct: 99 GVETFLEQLSEHFDIVLWTASLKEYADPVIDIIDPQ 134
>gi|422294377|gb|EKU21677.1| nli interacting factor family protein [Nannochloropsis gaditana
CCMP526]
Length = 378
Score = 43.5 bits (101), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 60/150 (40%), Gaps = 47/150 (31%)
Query: 909 LEEQKKMFSA----RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL- 963
++ Q + FS KL +VLD+D LL+S +F E ++E+ RE H+ L
Sbjct: 109 MQGQGRRFSVYRKREKLTVVLDMDECLLHS-RFEE---------DMRDERGRELAHQLLP 158
Query: 964 ------FRFPHMG----------------MW----------TKLRPGIWTFLERASKLFE 991
F + H W LRPG+ FL+R S +
Sbjct: 159 NGDSESFHYQHQADVGEALGDVRHRSVDYFWLELEEGERVRVNLRPGVEAFLQRLSDEYN 218
Query: 992 MHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+ ++T + YA + LDP G L GR
Sbjct: 219 VFVFTAATETYARPVLDRLDPTGSLLDGRF 248
>gi|45184666|ref|NP_982384.1| AAL158Wp [Ashbya gossypii ATCC 10895]
gi|44980012|gb|AAS50208.1| AAL158Wp [Ashbya gossypii ATCC 10895]
Length = 478
Score = 43.5 bits (101), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 25/105 (23%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRF 966
Q+ F RK CLVLDLD TL++S+ H D PV E D + + ++ +
Sbjct: 301 QRPEFRGRK-CLVLDLDETLVHSSFKYLHTADFVIPV---------EIDNQVHNVYVIK- 349
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
RPG+ FL+R +LFE+ ++T Y + +LD
Sbjct: 350 ---------RPGVDEFLKRVGELFEVVVFTASVSRYGDPLLDILD 385
>gi|255087422|ref|XP_002505634.1| predicted protein [Micromonas sp. RCC299]
gi|226520904|gb|ACO66892.1| predicted protein [Micromonas sp. RCC299]
Length = 548
Score = 43.5 bits (101), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 50/109 (45%), Gaps = 27/109 (24%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMW 972
K LVLDLD TL VH + + EE D F FP H+ +
Sbjct: 354 KNTLVLDLDETL-----------VHSNLEQTIEEAD--------FSFPVTFNGQQHI-VN 393
Query: 973 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
+ RP + F+E A++ FE+ ++T ++YA + +DP VL R+
Sbjct: 394 VRRRPYLTEFMEFAARHFEVVVFTASQRVYAERLLNKIDPNQVLIKHRL 442
>gi|440910832|gb|ELR60586.1| Protein ECT2 [Bos grunniens mutus]
Length = 934
Score = 43.5 bits (101), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIHKA---WERRNEQDFC 256
>gi|395527897|ref|XP_003766073.1| PREDICTED: protein ECT2 isoform 2 [Sarcophilus harrisii]
Length = 915
Score = 43.5 bits (101), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 32/65 (49%), Gaps = 3/65 (4%)
Query: 1167 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 1226
L A G K + +VTH+VAN +K A+S G +V P W+ + + R N
Sbjct: 195 LVTLAHYMGGTIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIVKPEWIFKA---WERRN 251
Query: 1227 EQDFA 1231
EQDF
Sbjct: 252 EQDFC 256
>gi|410970977|ref|XP_003991950.1| PREDICTED: LOW QUALITY PROTEIN: protein ECT2 [Felis catus]
Length = 914
Score = 43.5 bits (101), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRIAVSLGTPIMKPEWIYKA---WERRNEQDFC 256
>gi|358421543|ref|XP_003585009.1| PREDICTED: protein ECT2-like [Bos taurus]
Length = 524
Score = 43.5 bits (101), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIHKA---WERRNEQDFC 256
>gi|401841683|gb|EJT44034.1| PSR2-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 392
Score = 43.5 bits (101), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 49/99 (49%), Gaps = 16/99 (16%)
Query: 919 RKLCLVLDLDHTLLNSA-KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
+K CL+LDLD TL++S+ K+ + + + E D+ ++ RP
Sbjct: 221 QKKCLILDLDETLVHSSFKYMQTA----DFVLPVEIDDQVH-----------NVYVIKRP 265
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
G+ FL R S+++E+ ++T YA + LDP G +
Sbjct: 266 GVDEFLHRVSQVYEVVVFTASVSRYANPLLDTLDPNGTI 304
>gi|2289786|dbj|BAA21667.1| HYA22 [Homo sapiens]
Length = 340
Score = 43.5 bits (101), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +QK + L + + K C+V+DLD TL++S+ P+ + E D
Sbjct: 146 EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 201
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD GV
Sbjct: 202 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 251
Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
A V RG+ KDL LG E S V+I+D+S
Sbjct: 252 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 288
>gi|351709551|gb|EHB12470.1| Protein ECT2, partial [Heterocephalus glaber]
Length = 856
Score = 43.5 bits (101), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 170 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPDWIYKA---WERRNEQDFC 225
>gi|302819617|ref|XP_002991478.1| hypothetical protein SELMODRAFT_133639 [Selaginella moellendorffii]
gi|300140680|gb|EFJ07400.1| hypothetical protein SELMODRAFT_133639 [Selaginella moellendorffii]
Length = 219
Score = 43.5 bits (101), Expect = 0.80, Method: Composition-based stats.
Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 17/102 (16%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K LVLD+D TL+++ H I K + P + RPG+
Sbjct: 2 KPTLVLDMDETLIHA---------HKAIASLKLFSGKTLPLKRYL--------VAKRPGV 44
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
TFL+ SK++E+ ++T K YA + LDP G LF R+
Sbjct: 45 NTFLDEMSKIYEIVVFTRVVKPYADRILDRLDPVGNLFTHRL 86
>gi|242089885|ref|XP_002440775.1| hypothetical protein SORBIDRAFT_09g006400 [Sorghum bicolor]
gi|241946060|gb|EES19205.1| hypothetical protein SORBIDRAFT_09g006400 [Sorghum bicolor]
Length = 319
Score = 43.5 bits (101), Expect = 0.80, Method: Composition-based stats.
Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 30/143 (20%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF-PHMG-----MWTKLR 976
L LDLD TL++S + DP P R F P +G + R
Sbjct: 145 LFLDLDETLIHS----QTDP---------------PPSRFDFTVRPVIGGHAVTFYVVKR 185
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
PG+ FL A+++F++ ++T G + YA+ + LDP G +FA R + RG D GD R
Sbjct: 186 PGVEAFLRAAAEIFDVVVFTAGLQEYASLVLDRLDPDGEVFAHR-LYRGACRDA--GDGR 242
Query: 1037 VPKSKDLEGVLGMESAVVIIDDS 1059
+ KDL VI+DD+
Sbjct: 243 L--VKDLAATGRALDRAVIVDDN 263
>gi|391344643|ref|XP_003746605.1| PREDICTED: phosphatase PSR1-like [Metaseiulus occidentalis]
Length = 239
Score = 43.5 bits (101), Expect = 0.83, Method: Composition-based stats.
Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 12/96 (12%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
A+K+ LVLDLD TL++ + + P +D K Q + M ++ +RP
Sbjct: 45 AKKILLVLDLDETLIHGT--YCMPPKYDFRFELKLPQSKRV----------MNVYVLVRP 92
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+ FLE A K FE+ YT +YA ++ +DPK
Sbjct: 93 YLQDFLEFAHKWFEVMAYTASLPIYADKILDEIDPK 128
>gi|324518550|gb|ADY47137.1| CTD small phosphatase-like protein 2 [Ascaris suum]
Length = 248
Score = 43.5 bits (101), Expect = 0.83, Method: Composition-based stats.
Identities = 46/168 (27%), Positives = 78/168 (46%), Gaps = 32/168 (19%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
S + LVLDLD TL++ + D PVH F+ +
Sbjct: 60 STPEFALVLDLDETLVHCSLTELPDASLTFPVH-------------------FQDNTYQV 100
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ ++RP + FLER S+ FE+ L+T ++YA ++ +LDP L R+ + F
Sbjct: 101 YVRVRPHLHEFLERLSQSFEIILFTASKRVYADKLLNLLDPGKRLIRHRLFR---EHCVF 157
Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
+ KDL +LG + S +IID+S++ + + N I +E + +
Sbjct: 158 VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESWFF 201
>gi|432102147|gb|ELK29956.1| Protein ECT2 [Myotis davidii]
Length = 882
Score = 43.5 bits (101), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 169 HMGGVIRKDFNSKVTHLVANCTQGEKFRIAVSLGTPIMKPEWIYKA---WERRNEQDFC 224
>gi|410258922|gb|JAA17427.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1 [Pan troglodytes]
gi|410290720|gb|JAA23960.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1 [Pan troglodytes]
Length = 260
Score = 43.5 bits (101), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|403333806|gb|EJY66027.1| hypothetical protein OXYTRI_13811 [Oxytricha trifallax]
Length = 509
Score = 43.1 bits (100), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 39/168 (23%), Positives = 76/168 (45%), Gaps = 17/168 (10%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVH--DEILRKKEEQDREKP----HRHLFRFPHMGMWT 973
K LVLD+D TL++ + ++P + E++ QD KP ++ + ++
Sbjct: 297 KKTLVLDMDETLIHCS----LEPFYGYQEVIHVM--QDTYKPISPDSDLIYSQKSLQIYV 350
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV-----ISRGDDG 1028
RP + FLE+ S +E+ ++T +K YA + +DP F+ R+ + +
Sbjct: 351 AYRPYLIHFLEKVSSQYEVVVFTASDKSYADVILDKIDPYHKYFSYRLYRDSCLQVNINA 410
Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ KDL + S +I+D+S++ + + N I + Y
Sbjct: 411 KNSSSQQTTLFVKDLSALGRDLSQTIIVDNSIQAFGYQLSNGIPIPSY 458
>gi|374105582|gb|AEY94493.1| FAAL158Wp [Ashbya gossypii FDAG1]
Length = 478
Score = 43.1 bits (100), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 25/105 (23%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRF 966
Q+ F RK CLVLDLD TL++S+ H D PV E D + + ++ +
Sbjct: 301 QRPEFRGRK-CLVLDLDETLVHSSFKYLHTADFVIPV---------EIDNQVHNVYVIK- 349
Query: 967 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
RPG+ FL+R +LFE+ ++T Y + +LD
Sbjct: 350 ---------RPGVDEFLKRVGELFEVVVFTASVSRYGDPLLDILD 385
>gi|365759524|gb|EHN01307.1| Psr2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 338
Score = 43.1 bits (100), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 16/99 (16%)
Query: 919 RKLCLVLDLDHTLLNSA-KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
K CL+LDLD TL++S+ K+ + + + E D+ ++ RP
Sbjct: 167 HKKCLILDLDETLVHSSFKYMQTA----DFVLPVEIDDQVH-----------NVYVIKRP 211
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
G+ FL R S+++E+ ++T YA + LDP G +
Sbjct: 212 GVDEFLHRVSQVYEVVVFTASVSRYANPLLDTLDPNGTI 250
>gi|154339115|ref|XP_001562249.1| nuclear lim interactor-interacting factor-like protein [Leishmania
braziliensis MHOM/BR/75/M2904]
gi|134062832|emb|CAM39277.1| nuclear lim interactor-interacting factor-like protein [Leishmania
braziliensis MHOM/BR/75/M2904]
Length = 290
Score = 43.1 bits (100), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 53/104 (50%), Gaps = 20/104 (19%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG-MWT-- 973
S K+ LVLD+D TL++S D V+D++L H+ P G M+T
Sbjct: 109 SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVL-------------HV---PSDGRMYTVS 152
Query: 974 -KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
K RP + FL S+ FE+ ++T + Y ++ +DP+G+L
Sbjct: 153 VKYRPYLEDFLRFISRRFEVVVFTASMRAYCDKLMDEIDPQGIL 196
>gi|395527895|ref|XP_003766072.1| PREDICTED: protein ECT2 isoform 1 [Sarcophilus harrisii]
Length = 884
Score = 43.1 bits (100), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 32/65 (49%), Gaps = 3/65 (4%)
Query: 1167 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 1226
L A G K + +VTH+VAN +K A+S G +V P W+ + + R N
Sbjct: 164 LVTLAHYMGGTIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIVKPEWIFKA---WERRN 220
Query: 1227 EQDFA 1231
EQDF
Sbjct: 221 EQDFC 225
>gi|449275333|gb|EMC84205.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1, partial [Columba livia]
Length = 230
Score = 43.1 bits (100), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 58/127 (45%), Gaps = 15/127 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ A+ K R L + K A LC+V+DLD TL++S+ PV++ E D
Sbjct: 35 EENGALPKAAVRHLLPEIKPQDASNLCVVIDLDETLVHSS----FKPVNNADFIIPVEID 90
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 91 GIMHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 140
Query: 1016 LFAGRVI 1022
F R+
Sbjct: 141 -FRARLF 146
>gi|345788882|ref|XP_851254.2| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase-like [Canis lupus
familiaris]
Length = 328
Score = 43.1 bits (100), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 132 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 187
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 188 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 237
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 238 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 276
>gi|410969412|ref|XP_003991189.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 [Felis catus]
Length = 259
Score = 43.1 bits (100), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 56/123 (45%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ A+ K + L + K K+C+V+DLD TL++S+ PV++ E D
Sbjct: 64 EENGAVPKTPVQYLLPEAKAQDVDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 119
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 120 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 169
Query: 1016 LFA 1018
A
Sbjct: 170 FRA 172
>gi|145534239|ref|XP_001452864.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420563|emb|CAK85467.1| unnamed protein product [Paramecium tetraurelia]
Length = 414
Score = 43.1 bits (100), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 64/135 (47%), Gaps = 16/135 (11%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
+ LDLD TL+++ E V L+++ E E M + +RP F
Sbjct: 231 IFLDLDETLIHACHARETPSVK---LKQQNEDGSETDS--------MQVGINVRPYTGYF 279
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 1038
L+ ++ + +++YT ++ YA + LDP +G ++SR + + +G D R+
Sbjct: 280 LQELAQYYTIYIYTASSQQYAQTIVNYLDPLKQYISG-ILSRSNCMETKNGFFIKDLRII 338
Query: 1039 KSKDLEGVLGMESAV 1053
K DL+ L +++ V
Sbjct: 339 KDLDLDRTLIVDNLV 353
>gi|403265922|ref|XP_003925160.1| PREDICTED: protein ECT2 isoform 2 [Saimiri boliviensis boliviensis]
Length = 914
Score = 43.1 bits (100), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 256
>gi|301617231|ref|XP_002938048.1| PREDICTED: CTD small phosphatase-like protein-like [Xenopus
(Silurana) tropicalis]
Length = 276
Score = 43.1 bits (100), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 73/172 (42%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D Q I + L + K+ K C+V+DLD TL++S+ P+++ E
Sbjct: 80 DQTQALTIPSPPAKYLLPELKVSDYGKKCVVIDLDETLVHSS----FKPINNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL++ +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQKMGELFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+IID+S
Sbjct: 186 GVFNARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIIDNS 224
>gi|441633012|ref|XP_004089721.1| PREDICTED: protein ECT2 [Nomascus leucogenys]
Length = 914
Score = 43.1 bits (100), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 256
>gi|402860934|ref|XP_003894870.1| PREDICTED: protein ECT2 isoform 2 [Papio anubis]
Length = 917
Score = 43.1 bits (100), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 256
>gi|403265920|ref|XP_003925159.1| PREDICTED: protein ECT2 isoform 1 [Saimiri boliviensis boliviensis]
Length = 883
Score = 43.1 bits (100), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 170 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 225
>gi|402860932|ref|XP_003894869.1| PREDICTED: protein ECT2 isoform 1 [Papio anubis]
gi|402860936|ref|XP_003894871.1| PREDICTED: protein ECT2 isoform 3 [Papio anubis]
Length = 886
Score = 43.1 bits (100), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 1231
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 170 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 225
>gi|387219521|gb|AFJ69469.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis gaditana
CCMP526]
Length = 268
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 2/98 (2%)
Query: 1131 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 1190
+V L++ +R++LAG I+FS V P +P L A GA + VTH+V
Sbjct: 80 NVAGCLSSVRRQVLAGVTILFSGVLPR-NVDPRRSDLGYMALSLGARIVEDFSPTVTHLV 138
Query: 1191 ANSLGTDKVNWA-LSTGRFVVHPGWVEASALLYRRANE 1227
A + T+KV A G ++V W++ S R +E
Sbjct: 139 AENASTEKVFRARRQGGMWIVARTWLQLSFFHCDRKDE 176
>gi|354547373|emb|CCE44108.1| hypothetical protein CPAR2_503330 [Candida parapsilosis]
Length = 373
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 51/115 (44%), Gaps = 33/115 (28%)
Query: 917 SARKLCLVLDLDHTL-------LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
+A K CLVLDLD TL L SA F V PV E D + H ++ +
Sbjct: 200 TANKKCLVLDLDETLVHSSFKYLRSADF--VIPV---------EIDGQVHHVYVIK---- 244
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
RPG+ FLER KL+E+ ++T Y + LD F+ V+ R
Sbjct: 245 ------RPGVDEFLERVGKLYEVVVFTASVSKYGDPLLNKLD-----FSQSVLHR 288
>gi|159112651|ref|XP_001706554.1| Nuclear LIM interactor-interacting factor 1 [Giardia lamblia ATCC
50803]
gi|157434651|gb|EDO78880.1| Nuclear LIM interactor-interacting factor 1 [Giardia lamblia ATCC
50803]
Length = 432
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 49/97 (50%), Gaps = 8/97 (8%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K LVLDLD TL++S+ F++VD I E+ + H ++ RP +
Sbjct: 258 KKLLVLDLDETLVHSS-FNKVDNADMIIPLSIEDPVSKATISH-------QVYVYKRPYV 309
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
FLE +K +E+ ++T ++Y + + LDP G+
Sbjct: 310 DEFLETMAKYYELAIFTASLRVYCDAVMEKLDPNGLC 346
>gi|146185627|ref|XP_001032201.2| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|146142847|gb|EAR84538.2| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 446
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/107 (24%), Positives = 59/107 (55%), Gaps = 14/107 (13%)
Query: 945 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
D I++ K + D + ++ +G+ ++RP FL++ ++ ++++++T + YA+
Sbjct: 282 DHIIKAKADNDDKVGYQ-------IGL--RVRPYCLEFLQKLAQYWDIYIFTASSPTYAS 332
Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVPKSKDLEGVL 1047
+ K LDP+G G +++R + + +G D R+ K KDL+ +
Sbjct: 333 AIVKFLDPEGKYING-ILNRSNCMETKNGFFIKDLRIVKGKDLKKTV 378
>gi|387539470|gb|AFJ70362.1| CTD small phosphatase-like protein isoform 1 [Macaca mulatta]
Length = 276
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 80 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224
>gi|403336757|gb|EJY67573.1| NLI interacting factor-like phosphatase family protein [Oxytricha
trifallax]
Length = 515
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 3/82 (3%)
Query: 975 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR---GDDGDPF 1031
+RP LE +K FE+ ++T +K YA + +DP G L R+ + +
Sbjct: 249 IRPYTQECLEFVNKYFEVVVFTASHKFYADVILDYIDPTGTLIQHRLYREHCIKTQDNVY 308
Query: 1032 DGDERVPKSKDLEGVLGMESAV 1053
D RV K++DL+ ++ +++AV
Sbjct: 309 IKDLRVFKNRDLKDLIIVDNAV 330
>gi|291392229|ref|XP_002712521.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase 1 [Oryctolagus
cuniculus]
Length = 260
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 56/123 (45%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L + K + K+C+V+DLD TL++S+ PV + E D
Sbjct: 65 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVSNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|344238378|gb|EGV94481.1| CTD small phosphatase-like protein [Cricetulus griseus]
Length = 239
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 74/170 (43%), Gaps = 33/170 (19%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ + +QK + L + + K C+V+DLD TL++S+ P+ + E D
Sbjct: 45 EENSGLQKPPAKSLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 100
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD GV
Sbjct: 101 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 150
Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
A V RG+ KDL LG E S V+I+D+S
Sbjct: 151 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 187
>gi|300122949|emb|CBK23956.2| Tim50 [Blastocystis hominis]
Length = 348
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 74/171 (43%), Gaps = 35/171 (20%)
Query: 909 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
L EQ RK LVLDLD TL++S F D ++ + E D
Sbjct: 163 LGEQSSANQGRKT-LVLDLDETLVHST-FQPTDDC-SYVIPVEIEGDL------------ 207
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV----LFAGRVISR 1024
++ LRPG FL R S+++E+ +YT +YA + +DP + LF +
Sbjct: 208 YNVYVYLRPGTTEFLRRMSEIYEVVVYTASLPVYADPLLDKIDPNNLISARLFRDHCVQS 267
Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDD---SVRVWPHNKLNLI 1071
G KDL G+LG +VV+ID+ S + P+N + +
Sbjct: 268 GG-----------ILVKDL-GLLGRSLDSVVMIDNSAVSFQFQPNNGIECV 306
>gi|426342904|ref|XP_004038068.1| PREDICTED: protein ECT2 isoform 2 [Gorilla gorilla gorilla]
Length = 914
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 255
>gi|300794122|ref|NP_001179369.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Bos taurus]
gi|296490317|tpg|DAA32430.1| TPA: CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1-like [Bos taurus]
Length = 260
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ A+ K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAVPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|164658690|ref|XP_001730470.1| hypothetical protein MGL_2266 [Malassezia globosa CBS 7966]
gi|159104366|gb|EDP43256.1| hypothetical protein MGL_2266 [Malassezia globosa CBS 7966]
Length = 280
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 1/63 (1%)
Query: 1171 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 1229
AE FGA C + ++D +TH+VA S T K A + VV P W+ S + R E
Sbjct: 2 AEDFGARCHRTLNDNITHLVATSPRTSKAQQAFRKQKIQVVWPSWLNDSICRWVRQGEVP 61
Query: 1230 FAI 1232
+ I
Sbjct: 62 YLI 64
>gi|397523977|ref|XP_003831992.1| PREDICTED: protein ECT2 isoform 2 [Pan paniscus]
gi|410037742|ref|XP_003950278.1| PREDICTED: protein ECT2 [Pan troglodytes]
gi|410339813|gb|JAA38853.1| epithelial cell transforming sequence 2 oncogene [Pan troglodytes]
Length = 914
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 255
>gi|380815184|gb|AFE79466.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 2 [Macaca mulatta]
gi|383420375|gb|AFH33401.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 2 [Macaca mulatta]
gi|384948522|gb|AFI37866.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 2 [Macaca mulatta]
Length = 260
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 56/123 (45%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAIPKTPVQYLLPAAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|396479189|ref|XP_003840695.1| similar to phosphoprotein phosphatase [Leptosphaeria maculans JN3]
gi|312217268|emb|CBX97216.1| similar to phosphoprotein phosphatase [Leptosphaeria maculans JN3]
Length = 511
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 81/189 (42%), Gaps = 45/189 (23%)
Query: 845 QIKSGADMKAV-VTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQK 903
QI +MK V ++ HD Q G +A P HP+ V+ +++ A+Q
Sbjct: 263 QIDDDIEMKDVPLSTHDVHQEGEDQSTDAQP--DHPK-----VDLPPPPPLVERQHAVQS 315
Query: 904 ERT---RRLEEQKKM-------FSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
+ T E QK + F +K CLVLDLD TL++S+ +IL
Sbjct: 316 QVTDASEASEPQKYLLGPIAPRFKGKK-CLVLDLDETLVHSSF---------KIL----- 360
Query: 954 QDREKPHRHLFRFP------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA 1007
H+ F P + ++ RPG+ F++R +L+E+ ++T Y +
Sbjct: 361 ------HQADFTIPVEIEGQYHNVYVIKRPGVDQFMKRVGELYEVVVFTASVSKYGDPLL 414
Query: 1008 KVLDPKGVL 1016
LD GV+
Sbjct: 415 DQLDIHGVV 423
>gi|413948975|gb|AFW81624.1| hypothetical protein ZEAMMB73_313891 [Zea mays]
Length = 322
Score = 42.7 bits (99), Expect = 1.3, Method: Composition-based stats.
Identities = 46/143 (32%), Positives = 61/143 (42%), Gaps = 30/143 (20%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF-PHMG-----MWTKLR 976
L LDLD TL++S Q P R F P +G + R
Sbjct: 148 LFLDLDETLIHS-------------------QTEPPPSRFDFTVRPVIGGHAVTFYVVKR 188
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
PG+ FL A+ FE+ ++T G + YA+ + LDP G +FA R + RG D DG
Sbjct: 189 PGVEAFLRAAADAFEVVVFTAGLQEYASLVLDRLDPDGEVFAHR-LYRGACRDAGDGT-- 245
Query: 1037 VPKSKDLEGVLGMESAVVIIDDS 1059
KDL VIIDD+
Sbjct: 246 --LVKDLAATGRALDRAVIIDDN 266
>gi|361067247|gb|AEW07935.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149610|gb|AFG56720.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149612|gb|AFG56721.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149614|gb|AFG56722.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149616|gb|AFG56723.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149618|gb|AFG56724.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149620|gb|AFG56725.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149622|gb|AFG56726.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149624|gb|AFG56727.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149628|gb|AFG56729.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149630|gb|AFG56730.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149632|gb|AFG56731.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149634|gb|AFG56732.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149636|gb|AFG56733.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
gi|383149638|gb|AFG56734.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
Length = 140
Score = 42.7 bits (99), Expect = 1.3, Method: Composition-based stats.
Identities = 34/117 (29%), Positives = 56/117 (47%), Gaps = 14/117 (11%)
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F ++ + RP + F++R + +FE+ ++T +YA ++ VLDPK L RV
Sbjct: 18 FNLKEHTVYVRCRPHLQLFMDRVADMFEIIVFTASQSVYAEQLLNVLDPKRKLIRHRVYR 77
Query: 1024 RG---DDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL VLG + A V IID+S + + N I +E +
Sbjct: 78 ESCVFVEGNYL---------KDLT-VLGRDLAQVAIIDNSPQAFGFQVDNGIPIESW 124
>gi|302794308|ref|XP_002978918.1| hypothetical protein SELMODRAFT_418692 [Selaginella moellendorffii]
gi|300153236|gb|EFJ19875.1| hypothetical protein SELMODRAFT_418692 [Selaginella moellendorffii]
Length = 218
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
RPG+ FL+ SK++E+ ++T K YA + LDP G LFA R+
Sbjct: 81 RPGVDIFLDEMSKIYEIVVFTRAVKPYADRILDRLDPAGNLFAHRL 126
>gi|355746816|gb|EHH51430.1| hypothetical protein EGM_10796, partial [Macaca fascicularis]
Length = 270
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 74 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 129
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 130 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 179
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 180 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 218
>gi|388852402|emb|CCF54017.1| related to DNA topoisomerase II binding protein [Ustilago hordei]
Length = 1166
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 29/49 (59%)
Query: 1167 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 1215
L Q A + GA ++ + VTH++A+ G++K +AL G +V P W+
Sbjct: 105 LTQYARELGARVEGNLTEDVTHLIADRPGSEKYRYALELGMHIVSPNWI 153
>gi|302766621|ref|XP_002966731.1| hypothetical protein SELMODRAFT_85337 [Selaginella moellendorffii]
gi|300166151|gb|EFJ32758.1| hypothetical protein SELMODRAFT_85337 [Selaginella moellendorffii]
Length = 131
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 8/101 (7%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
RP + FLER +KLF++ +T + A + VLDP F+ R+ D +
Sbjct: 5 RPHLSKFLERMAKLFDVVAFTSRAQRRAETILDVLDPAKEFFSRRLY--------LDSCK 56
Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ K KDL + + V+I+DD+ N NL++V R+
Sbjct: 57 KGGKVKDLAVLERPLNRVIIVDDTSSKCVLNPDNLVLVSRF 97
>gi|426342902|ref|XP_004038067.1| PREDICTED: protein ECT2 isoform 1 [Gorilla gorilla gorilla]
gi|426342906|ref|XP_004038069.1| PREDICTED: protein ECT2 isoform 3 [Gorilla gorilla gorilla]
Length = 883
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 170 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 224
>gi|335298855|ref|XP_003358412.1| PREDICTED: CTD small phosphatase-like protein-like [Sus scrofa]
Length = 256
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 60 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 115
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 116 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 165
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 166 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 204
>gi|302808549|ref|XP_002985969.1| hypothetical protein SELMODRAFT_122967 [Selaginella moellendorffii]
gi|300146476|gb|EFJ13146.1| hypothetical protein SELMODRAFT_122967 [Selaginella moellendorffii]
Length = 198
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 50/102 (49%), Gaps = 5/102 (4%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD-GD 1034
RPG+ TFL S+++E+ ++T K YA + LDP G LF R+ D P + G
Sbjct: 65 RPGVDTFLNEMSEIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYR--DSCSPKEVGG 122
Query: 1035 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
+V KDL + VI+DD + + N IV+ +
Sbjct: 123 RKV--VKDLSRLGRDLRHTVIVDDKLESFCLQPSNGIVIRAF 162
>gi|21735572|ref|NP_060568.3| protein ECT2 isoform b [Homo sapiens]
gi|385198081|ref|NP_001245245.1| protein ECT2 isoform b [Homo sapiens]
gi|85566758|gb|AAI12087.1| Epithelial cell transforming sequence 2 oncogene [Homo sapiens]
gi|189069491|dbj|BAG37157.1| unnamed protein product [Homo sapiens]
gi|261859358|dbj|BAI46201.1| Protein ECT2 [synthetic construct]
Length = 883
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 1173 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 1230
G V K + +VTH+VAN +K A+S G ++ P W+ + + R NEQDF
Sbjct: 170 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 224
>gi|402860629|ref|XP_003894728.1| PREDICTED: CTD small phosphatase-like protein [Papio anubis]
Length = 276
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 80 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224
>gi|328779252|ref|XP_391964.4| PREDICTED: CTD nuclear envelope phosphatase 1 homolog [Apis
mellifera]
Length = 233
Score = 42.4 bits (98), Expect = 1.6, Method: Composition-based stats.
Identities = 44/156 (28%), Positives = 70/156 (44%), Gaps = 21/156 (13%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHRHLFRFPHMGMWTKL 975
++ LVLDLD TL++S D V +R D + K RH RF +
Sbjct: 56 KRKVLVLDLDETLIHSHH----DGVARPTVRFGTPPDFILKVKIDRHPVRF-----FVHK 106
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
RP + FL+ S+ +E+ ++T ++Y +A+ LD R I R E
Sbjct: 107 RPHVDFFLDIVSQWYELVVFTASMEIYGAAVAEKLD------NNRGILRRRYYRQHCTPE 160
Query: 1036 RVPKSKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
+KDL + ++V I+D+S R +PHN +
Sbjct: 161 MGSYTKDLSAICSDLASVFILDNSPGAYRAYPHNAI 196
>gi|156848006|ref|XP_001646886.1| hypothetical protein Kpol_2002p100 [Vanderwaltozyma polyspora DSM
70294]
gi|156117567|gb|EDO19028.1| hypothetical protein Kpol_2002p100 [Vanderwaltozyma polyspora DSM
70294]
Length = 477
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 51/103 (49%), Gaps = 15/103 (14%)
Query: 909 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
L ++ +F +K CLVLDLD TL++S+ F +D + ++Q +
Sbjct: 297 LAKKDPVFKNKK-CLVLDLDETLVHSS-FKYIDTADFVLPVTIDDQTHQ----------- 343
Query: 969 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
++ RPG+ FL+R K+FE+ ++T Y + +LD
Sbjct: 344 --VYVIKRPGVDEFLKRVGKIFEVVVFTASVSRYGDPLLDILD 384
>gi|302850140|ref|XP_002956598.1| hypothetical protein VOLCADRAFT_36131 [Volvox carteri f. nagariensis]
gi|300258125|gb|EFJ42365.1| hypothetical protein VOLCADRAFT_36131 [Volvox carteri f. nagariensis]
Length = 119
Score = 42.4 bits (98), Expect = 1.6, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 31/56 (55%)
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
G++ RPG+ FLE + E+ ++T G + YA + LDP G LFA R+ G
Sbjct: 5 GVFVVERPGLQEFLEELASFAEVVIFTAGLEDYAKPIIDALDPSGKLFAHRIYREG 60
>gi|56549681|ref|NP_005799.2| CTD small phosphatase-like protein isoform 2 [Homo sapiens]
gi|31074181|gb|AAP34400.1| small CTD phosphatase 3 [Homo sapiens]
gi|34392245|emb|CAE11804.1| RB serine phosphatase [Homo sapiens]
gi|34596234|gb|AAQ76797.1| HYA22 [Homo sapiens]
gi|187252491|gb|AAI66643.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase-like [synthetic construct]
gi|410228142|gb|JAA11290.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase-like [Pan troglodytes]
gi|410291072|gb|JAA24136.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase-like [Pan troglodytes]
gi|410334183|gb|JAA36038.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase-like [Pan troglodytes]
Length = 265
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +QK + L + + K C+V+DLD TL++S+ P+ + E D
Sbjct: 71 EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 126
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD GV
Sbjct: 127 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 176
Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
A V RG+ KDL LG E S V+I+D+S
Sbjct: 177 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 213
>gi|383149626|gb|AFG56728.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
Length = 140
Score = 42.4 bits (98), Expect = 1.6, Method: Composition-based stats.
Identities = 34/117 (29%), Positives = 56/117 (47%), Gaps = 14/117 (11%)
Query: 964 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
F ++ + RP + F++R + +FE+ ++T +YA ++ VLDPK L RV
Sbjct: 18 FNLKEHTVYVRCRPHLQLFMDRVADMFEIIVFTASQSVYAEQLLNVLDPKRKLIRHRVYR 77
Query: 1024 RG---DDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
+G+ KDL VLG + A V IID+S + + N I +E +
Sbjct: 78 ESCVFVEGNYL---------KDLT-VLGRDLAQVAIIDNSPQAFGFQVDNGIPIESW 124
>gi|335298851|ref|XP_003358411.1| PREDICTED: CTD small phosphatase-like protein-like [Sus scrofa]
Length = 276
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 80 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224
>gi|397610231|gb|EJK60724.1| hypothetical protein THAOC_18872 [Thalassiosira oceanica]
Length = 231
Score = 42.4 bits (98), Expect = 1.7, Method: Composition-based stats.
Identities = 47/166 (28%), Positives = 70/166 (42%), Gaps = 20/166 (12%)
Query: 921 LCLVLDLDHTLLNSAK-----FHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 975
L + DLDHT+L S + VD LR ++ D + P F T L
Sbjct: 31 LDIFFDLDHTILCSISPLPISDNGVDNGVGRTLRWFDQIDDDFP----FEGNSPNTRTFL 86
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
RP + S + +H+YT K Y + KV+DP+ LF G+V+ R D D
Sbjct: 87 RPLSTATIYFCSVIGRVHVYTAAQKSYTDNILKVIDPRRTLF-GQVLHRDDHPDI----- 140
Query: 1036 RVPKSKDL----EGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
V KDL G +G+ +V+ D P N +++ +T
Sbjct: 141 -VRNGKDLLFAGGGEVGLRRSVLFDDKFSNFVPQQYRNGVLIRPFT 185
>gi|350406069|ref|XP_003487644.1| PREDICTED: CTD nuclear envelope phosphatase 1 homolog [Bombus
impatiens]
Length = 243
Score = 42.4 bits (98), Expect = 1.7, Method: Composition-based stats.
Identities = 44/156 (28%), Positives = 70/156 (44%), Gaps = 21/156 (13%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHRHLFRFPHMGMWTKL 975
++ LVLDLD TL++S D V +R D + K RH RF +
Sbjct: 59 KRKVLVLDLDETLIHSHH----DGVARPTVRFGTPPDFILKVKIDRHPVRF-----FVHK 109
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
RP + FL+ S+ +E+ ++T ++Y +A+ LD R I R E
Sbjct: 110 RPHVDFFLDIVSQWYELVVFTASMEIYGAAVAEKLD------NNRGILRRRYYRQHCTPE 163
Query: 1036 RVPKSKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
+KDL + ++V I+D+S R +PHN +
Sbjct: 164 MGSYTKDLSAICSDLASVFILDNSPGAYRAYPHNAI 199
>gi|302792499|ref|XP_002978015.1| hypothetical protein SELMODRAFT_108059 [Selaginella moellendorffii]
gi|300154036|gb|EFJ20672.1| hypothetical protein SELMODRAFT_108059 [Selaginella moellendorffii]
Length = 131
Score = 42.4 bits (98), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 8/101 (7%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
RP + FLER +KLF++ +T + A + VLDP F+ R+ D +
Sbjct: 5 RPHLGKFLERMAKLFDVVAFTSRAQRRAETILDVLDPAKEFFSRRLY--------LDSCK 56
Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
+ K KDL + + V+I+DD+ N NL++V R+
Sbjct: 57 KGGKVKDLAVLERPLNRVIIVDDTSSKCVLNPDNLVLVSRF 97
>gi|297286147|ref|XP_001086442.2| PREDICTED: CTD small phosphatase-like protein-like [Macaca mulatta]
Length = 260
Score = 42.4 bits (98), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +QK + L + + K C+V+DLD TL++S+ P+ + E D
Sbjct: 66 EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 121
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD GV
Sbjct: 122 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 171
Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
A V RG+ KDL LG E S V+I+D+S
Sbjct: 172 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 208
>gi|145527362|ref|XP_001449481.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417069|emb|CAK82084.1| unnamed protein product [Paramecium tetraurelia]
Length = 249
Score = 42.4 bits (98), Expect = 1.7, Method: Composition-based stats.
Identities = 53/184 (28%), Positives = 76/184 (41%), Gaps = 32/184 (17%)
Query: 893 YDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
+DD+ K I ++T + + LVLDLD TL++S E DE + K
Sbjct: 55 FDDECKDKITAKKTEK-----------EFTLVLDLDETLIHSDM--ERTSFLDEEILVKI 101
Query: 953 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
EK + K+RP FL+ S FE+ ++T K YA ++ LDP
Sbjct: 102 GNTIEK------------YYVKIRPFARDFLKALSNYFELVIFTAAIKEYADKVIDYLDP 149
Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV 1072
G F R R D DG KDL V IID+S+ N N I+
Sbjct: 150 SG--FIKRRFYR-DSCTKKDG----VFYKDLTKVNSNLDKTFIIDNSLSGMSLNPQNGIL 202
Query: 1073 VERY 1076
++ +
Sbjct: 203 IKSW 206
>gi|431919455|gb|ELK17974.1| CTD small phosphatase-like protein [Pteropus alecto]
Length = 309
Score = 42.4 bits (98), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 113 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 168
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 169 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 218
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 219 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 257
>gi|355750837|gb|EHH55164.1| hypothetical protein EGM_04316, partial [Macaca fascicularis]
Length = 237
Score = 42.4 bits (98), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 15/127 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ AI K + L K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 42 EENGAIPKTPVQYLLPAAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 97
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 98 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 147
Query: 1016 LFAGRVI 1022
F R+
Sbjct: 148 -FRARLF 153
>gi|395816725|ref|XP_003781844.1| PREDICTED: CTD small phosphatase-like protein isoform 2 [Otolemur
garnettii]
Length = 276
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 80 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224
>gi|148232046|ref|NP_001084286.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase-like [Xenopus laevis]
gi|32396218|gb|AAP43959.1| NIF [Xenopus laevis]
gi|114107822|gb|AAI23152.1| NIF protein [Xenopus laevis]
Length = 276
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 74/172 (43%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D Q I T+ L + K+ K C+V+DLD TL++S+ P+++ E
Sbjct: 80 DQTQALTIPSPPTKYLLPELKVSEYGKKCVVIDLDETLVHSS----FKPINNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL++ ++FE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQKMGEMFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+IID+S
Sbjct: 186 GVFNARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIIDNS 224
>gi|443896177|dbj|GAC73521.1| nucleotide excision repair factor NEF2, RAD4/CUT5 component
[Pseudozyma antarctica T-34]
Length = 1956
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 29/51 (56%)
Query: 1167 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEA 1217
L Q A + GA ++ + VTH++A+ G+ K +AL G +V P W+ A
Sbjct: 100 LTQYARELGARVEGNLTEDVTHLIADRPGSQKYRFALDLGMHIVSPDWIHA 150
>gi|296475139|tpg|DAA17254.1| TPA: small CTD phosphatase 3-like [Bos taurus]
Length = 276
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 80 DQRQIIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224
>gi|330688428|ref|NP_001180010.2| CTD small phosphatase-like protein [Bos taurus]
Length = 276
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 80 DQRQIIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224
>gi|302808547|ref|XP_002985968.1| hypothetical protein SELMODRAFT_5469 [Selaginella moellendorffii]
gi|300146475|gb|EFJ13145.1| hypothetical protein SELMODRAFT_5469 [Selaginella moellendorffii]
Length = 134
Score = 42.4 bits (98), Expect = 1.9, Method: Composition-based stats.
Identities = 46/156 (29%), Positives = 68/156 (43%), Gaps = 24/156 (15%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP-HRHLFRFPHMGMWTKLRPGIWT 981
LVLD+D TL+++ H I K + P R+L RPG+ T
Sbjct: 1 LVLDMDETLIHA---------HKAIASLKLFSGKTLPLQRYL---------VAKRPGVDT 42
Query: 982 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD-GDERVPKS 1040
FL S+++E+ ++T K YA + LDP G LF R+ D P + G +V
Sbjct: 43 FLNEMSEIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYR--DSCSPKEVGGRKV--V 98
Query: 1041 KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
KDL + VI+DD + N IV+ +
Sbjct: 99 KDLSRLGRDLRHTVIVDDKPESFCLQPSNGIVIRAF 134
>gi|145549970|ref|XP_001460664.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428494|emb|CAK93267.1| unnamed protein product [Paramecium tetraurelia]
Length = 346
Score = 42.0 bits (97), Expect = 1.9, Method: Composition-based stats.
Identities = 31/101 (30%), Positives = 48/101 (47%), Gaps = 21/101 (20%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
+K + +DLD TL VH E L KP+R + F ++ T +RP
Sbjct: 161 KKYSIAIDLDETL-----------VHSEEL---------KPNRR-YDFQNLQFGTFIRPY 199
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
FL+ +K + ++T N YAT + ++LDP+ LF G
Sbjct: 200 CLQFLQLLNKHANLFVFTSSNIKYATTIMQILDPQKDLFQG 240
>gi|268565001|ref|XP_002639300.1| C. briggsae CBR-SCPL-1 protein [Caenorhabditis briggsae]
Length = 484
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 15/105 (14%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
++K CLV+DLD TL++S+ PV + E D + ++ + RP
Sbjct: 329 SKKKCLVIDLDETLVHSS----FKPVKNPDFVIPVEIDGVEHQVYVLK----------RP 374
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
+ FL R + FE L+T YA +A +LD K V F GR+
Sbjct: 375 YVDEFLARVGEHFECILFTASLAKYADPVADLLDKKKV-FRGRLF 418
>gi|67588036|ref|XP_665317.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54655944|gb|EAL35087.1| hypothetical protein Chro.80553 [Cryptosporidium hominis]
Length = 364
Score = 42.0 bits (97), Expect = 2.0, Method: Composition-based stats.
Identities = 32/111 (28%), Positives = 55/111 (49%), Gaps = 9/111 (8%)
Query: 972 WTKLRPGIWTFLERASK-LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
+ KLRPG+ L SK +E+++YTMG + +A ++LDP+ F + I ++G
Sbjct: 181 YYKLRPGVINMLRTLSKDKYEIYMYTMGTEYHAYTSLRILDPELRFFHSKRIFYRNNG-- 238
Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
+ K L + + +VI+DD + W +L+ V Y +FP
Sbjct: 239 ----FKETSIKSLNTLFPYDHRTLVILDDIEQAWTDIN-SLLKVYPYNFFP 284
>gi|410971731|ref|XP_003992318.1| PREDICTED: CTD small phosphatase-like protein [Felis catus]
Length = 307
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 111 DQRQVIPIPSPPAKYLLPEATVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 166
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 167 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 216
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 217 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 255
>gi|145533244|ref|XP_001452372.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420060|emb|CAK84975.1| unnamed protein product [Paramecium tetraurelia]
Length = 250
Score = 42.0 bits (97), Expect = 2.1, Method: Composition-based stats.
Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 14/98 (14%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
++ LVLDLD TL++S E + DE + K ++ EK + K+RP
Sbjct: 71 KEFTLVLDLDETLIHSDL--ERTSILDEEIIVKIGENIEK------------YYIKVRPY 116
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
FL+ S+LF++ ++T K YA ++ LDP G +
Sbjct: 117 AREFLQSLSQLFDLVIFTAALKEYADKVIDFLDPCGFI 154
>gi|71026803|ref|XP_763045.1| nuclear LIM interactor-interacting factor 1 [Theileria parva strain
Muguga]
gi|68349998|gb|EAN30762.1| nuclear LIM interactor-interacting factor 1, putative [Theileria
parva]
Length = 254
Score = 42.0 bits (97), Expect = 2.2, Method: Composition-based stats.
Identities = 59/219 (26%), Positives = 98/219 (44%), Gaps = 33/219 (15%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K+ + RK LVLDLD TL++S+ +P ++ + + E+ ++
Sbjct: 64 KVCTVRKKMLVLDLDETLIHSS----FEPSNNSFPMQLMQNGVERT-----------IYI 108
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
RP + FL S +E+ ++T G K YA + +DP GV R + R D ++G
Sbjct: 109 GKRPYLSEFLSVVSNFYEIVIFTAGLKSYADPVIDFIDPDGV--CKRRLFR-DSCKYWNG 165
Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1093
KDLE + VV ID+S + N N I +E + + + + L P L
Sbjct: 166 ----YYIKDLEILNKPLKDVVTIDNSPCCYCLNPENAIPIETW-FNDENDSELCDLVPLL 220
Query: 1094 LEIDHDERSEDGTLASSLGVIERL--HKIFFSHQSLDDV 1130
+ H +ED T +I L H+IFF+ ++ +
Sbjct: 221 RRLAH---TEDVT-----NIIPSLFNHEIFFTKVTISQI 251
>gi|380011615|ref|XP_003689895.1| PREDICTED: LOW QUALITY PROTEIN: CTD nuclear envelope phosphatase 1
homolog [Apis florea]
Length = 233
Score = 42.0 bits (97), Expect = 2.2, Method: Composition-based stats.
Identities = 44/152 (28%), Positives = 68/152 (44%), Gaps = 21/152 (13%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHRHLFRFPHMGMWTKLRPGI 979
LVLDLD TL++S D V +R D + K RH RF + RP +
Sbjct: 60 LVLDLDETLIHSHH----DGVARPTVRFGTPPDFILKVKIDRHPVRF-----FVHKRPHV 110
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039
FL+ S+ +E+ ++T ++Y +A+ LD R I R E
Sbjct: 111 DFFLDIVSQWYELVVFTASMEIYGAAVAEKLD------NNRGILRRRYYRQHCTPEMGSY 164
Query: 1040 SKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
+KDL + ++V I+D+S R +PHN +
Sbjct: 165 TKDLSAICSDLASVFILDNSPGAYRAYPHNAI 196
>gi|301623726|ref|XP_002941162.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Xenopus
(Silurana) tropicalis]
Length = 353
Score = 42.0 bits (97), Expect = 2.3, Method: Composition-based stats.
Identities = 36/133 (27%), Positives = 59/133 (44%), Gaps = 30/133 (22%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNS-------AKFHEVDPVHDEIL 948
QQ Q++R + + + + SA + LVLDLD L++S A F + P D
Sbjct: 153 QQTPCDQRQRGKDIPFKTR--SAPESTLVLDLDEILVDSSLLPLTGADFTFLIPFQDTYY 210
Query: 949 RKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
+ ++ KLRP FLE K++E+ ++T K YA ++
Sbjct: 211 K---------------------VYVKLRPHAMEFLETLCKVYEIFVFTTAKKEYAEKILD 249
Query: 1009 VLDPKGVLFAGRV 1021
+LDP+ L R+
Sbjct: 250 LLDPQKKLIRHRL 262
>gi|195587438|ref|XP_002083469.1| GD13336 [Drosophila simulans]
gi|194195478|gb|EDX09054.1| GD13336 [Drosophila simulans]
Length = 253
Score = 42.0 bits (97), Expect = 2.3, Method: Composition-based stats.
Identities = 37/142 (26%), Positives = 70/142 (49%), Gaps = 22/142 (15%)
Query: 883 WGDVEHLFEGYDDQQKAAIQKE-RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVD 941
W +E ++ + + Q E R + + + ARK LVLD+D+T++ S
Sbjct: 33 WFFLERVYRDFMEYTPIIYQSEDRLSPVSKSRLSLVARK-TLVLDMDNTMITSW------ 85
Query: 942 PVHDEILRKKEEQDREKPH-RHLFRFPHMG-----MWTKLRPGIWTFLERASKLFEMHLY 995
K+ ++ KP H F+F ++G ++ RP + FL+R SK +++ ++
Sbjct: 86 ------FIKRGKKPENKPRIAHDFKF-YLGAYGATIYVYKRPYLDHFLDRVSKWYDLTVF 138
Query: 996 TMGNKLYATEMAKVLDP-KGVL 1016
T G ++YA+ + LD +G+L
Sbjct: 139 TSGAEIYASPILDFLDRGRGIL 160
>gi|338714770|ref|XP_001489080.3| PREDICTED: CTD small phosphatase-like protein-like [Equus caballus]
Length = 268
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 72 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 127
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 128 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 177
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 178 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 216
>gi|390476487|ref|XP_002759760.2| PREDICTED: CTD small phosphatase-like protein-like [Callithrix
jacchus]
Length = 451
Score = 42.0 bits (97), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 255 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 310
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 311 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 360
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 361 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 399
>gi|355559784|gb|EHH16512.1| hypothetical protein EGK_11800, partial [Macaca mulatta]
Length = 250
Score = 42.0 bits (97), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 54 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 109
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 110 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 159
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 160 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 198
>gi|426221551|ref|XP_004004972.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 [Ovis aries]
Length = 260
Score = 42.0 bits (97), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 56/123 (45%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ A+ K + L + K K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAVPKAPVQYLLPEAKAQDLDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|380799053|gb|AFE71402.1| CTD small phosphatase-like protein isoform 1, partial [Macaca
mulatta]
Length = 249
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 53 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 108
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 109 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 158
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 159 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 197
>gi|322779024|gb|EFZ09423.1| hypothetical protein SINV_01392 [Solenopsis invicta]
Length = 216
Score = 41.6 bits (96), Expect = 2.5, Method: Composition-based stats.
Identities = 49/169 (28%), Positives = 73/169 (43%), Gaps = 22/169 (13%)
Query: 906 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHRH 962
+R L E RK+ LVLDLD TL++S D V +R D + RH
Sbjct: 20 SRYLNENVPGIVKRKV-LVLDLDETLIHSHH----DGVARPTVRPGTPPDFVLKVTIDRH 74
Query: 963 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
RF + RP + FL+ S+ +E+ ++T ++Y +A LD R I
Sbjct: 75 PVRF-----FVHKRPHVDFFLDIVSQWYELVVFTASMEIYGAAVADKLD------NNRGI 123
Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
R E +KDL + S+V I+D+S R +PHN +
Sbjct: 124 LRRRYYRQHCTPEMGSYTKDLSAICSDLSSVFILDNSPGAYRAYPHNAI 172
>gi|194752999|ref|XP_001958806.1| GF12569 [Drosophila ananassae]
gi|190620104|gb|EDV35628.1| GF12569 [Drosophila ananassae]
Length = 282
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 94/234 (40%), Gaps = 16/234 (6%)
Query: 875 VGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNS 934
+G++ +S + + E Y I KE E Q+++ + LVLDLD TL++S
Sbjct: 50 LGSYVRSFFSLIATKVESYLRPVTEPIYKEVPLSPESQRRLRQVGRKTLVLDLDETLVHS 109
Query: 935 AKFHEVDPVHDEILRKKEEQDREKPHRHLFR----FPHMGMWTKLRPGIWTFLERASKLF 990
DP +E++ KP L + RP + FL+ ASK +
Sbjct: 110 CY---SDPETNELVGCSLVPQTAKPDYELSVTLEGLDPIAFQVYKRPHVDVFLKFASKWY 166
Query: 991 EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 1050
++ ++T ++YA ++ LD GR + + SKDL V
Sbjct: 167 DLVIFTASLEVYAAQVVDRLD------NGRGMIQKRYYRQHCSSTTSMISKDLTVVNPDM 220
Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL--LEIDHDERS 1102
S IID+S + N I ++ + Y P LL P L L D RS
Sbjct: 221 SGTFIIDNSPNAYRDFPDNAIPIKTFIYDPDDTELLNLL-PFLDALRFTKDVRS 273
>gi|92399505|gb|ABE76501.1| SCP3-like protein [Mustela putorius furo]
Length = 276
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 80 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224
>gi|403333986|gb|EJY66132.1| Serine/threonine-protein phosphatase dullard [Oxytricha trifallax]
Length = 913
Score = 41.6 bits (96), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 84/188 (44%), Gaps = 21/188 (11%)
Query: 912 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
Q+K ++ K LVLDLD TL++S+ P D +L + E + +
Sbjct: 113 QQKPWATGKKTLVLDLDETLVHSSFKPPAKP--DIVLPVEIEGNV------------CNV 158
Query: 972 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
+ +RPG FL+R +K +E+ +YT YA + +LD K + R + F
Sbjct: 159 FVLIRPGTEFFLQRLAKCYEIVIYTASLSKYADPLIDILDNKTQKIIDYRLFR-EHCTFF 217
Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL--L 1089
G K L G L +S +IID+S + ++ N + + + P R F L L
Sbjct: 218 QG--VFIKDMSLPGRLLQDS--IIIDNSPTSYAFHQENALPILSWYDDPKDRCLFELIPL 273
Query: 1090 GPSLLEID 1097
SL E+D
Sbjct: 274 LESLAEVD 281
>gi|440900046|gb|ELR51261.1| CTD small phosphatase-like protein, partial [Bos grunniens mutus]
Length = 250
Score = 41.6 bits (96), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 54 DQRQIIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 109
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 110 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 159
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 160 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 198
>gi|344288137|ref|XP_003415807.1| PREDICTED: CTD small phosphatase-like protein-like [Loxodonta
africana]
Length = 281
Score = 41.6 bits (96), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 85 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 140
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 141 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 190
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 191 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 229
>gi|354467729|ref|XP_003496321.1| PREDICTED: CTD small phosphatase-like protein-like [Cricetulus
griseus]
Length = 342
Score = 41.6 bits (96), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K C+V+DLD TL++S+ P+ + E D ++ + PH+
Sbjct: 172 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 218
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
FL+R +LFE L+T YA +A +LD GV A V RG+
Sbjct: 219 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 269
Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL LG E S V+I+D+S
Sbjct: 270 ----YVKDLSR-LGRELSKVIIVDNS 290
>gi|148677300|gb|EDL09247.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase-like [Mus musculus]
Length = 288
Score = 41.6 bits (96), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q + + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 92 DQRQVIPVPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 147
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 148 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 197
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 198 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 236
>gi|432849192|ref|XP_004066577.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Oryzias latipes]
Length = 262
Score = 41.6 bits (96), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 55/121 (45%), Gaps = 14/121 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ + K + + L K A K+C+V+DLD TL++S+ PV++ E D
Sbjct: 67 EENGTLSKAQVKPLLPPVKSKDAGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA ++ +LD G
Sbjct: 123 GTVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 172
Query: 1016 L 1016
Sbjct: 173 F 173
>gi|9972366|gb|AAG10616.1|AC008030_16 Hypothetical protein [Arabidopsis thaliana]
Length = 247
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 81/163 (49%), Gaps = 24/163 (14%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
+LE+ ++ K ++LDLD TL++ A H HD ++ K E++
Sbjct: 63 KLEDPLTGYTNMKRTIILDLDETLVH-ATTHLPGVKHDFMVMVKMEREI----------- 110
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
M ++ RPG+ FLER + +++ ++T G + YA+++ LD GV+ + R+ D
Sbjct: 111 -MPIFVVKRPGVTEFLERLGENYKVVVFTAGLEEYASQVLDKLDKNGVI-SQRLYR--DS 166
Query: 1028 GDPFDGDERVPKSKDLEGVLG--MESAVVIIDD--SVRVWPHN 1066
+G KDL V+G + SA+++ D+ S + P N
Sbjct: 167 CTEVNG----KYVKDLSLVVGKDLRSALIVDDNPSSYSLQPEN 205
>gi|291234069|ref|XP_002736972.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase 1-like [Saccoglossus
kowalevskii]
Length = 251
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 66/142 (46%), Gaps = 23/142 (16%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
KLC+V+DLD TL++S+ PV + E D ++ + RP
Sbjct: 61 HKLCIVIDLDETLVHSS----FKPVSNADFVVPVEIDGTVHQVYVLK----------RPF 106
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
+ FL++ +LFE L+T YA +A +LD GV F R+ D F R
Sbjct: 107 VDEFLQKMGELFECVLFTASLSKYADPVADLLDKWGV-FRARLFR---DSCVF---HRGN 159
Query: 1039 KSKDLEGVLGME-SAVVIIDDS 1059
KDL G LG + +VI+D+S
Sbjct: 160 YVKDL-GRLGRDLKKIVIVDNS 180
>gi|401423666|ref|XP_003876319.1| nuclear lim interactor-interacting factor-like protein [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322492561|emb|CBZ27838.1| nuclear lim interactor-interacting factor-like protein [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 290
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 52/118 (44%), Gaps = 12/118 (10%)
Query: 899 AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
+Q+ R L S K+ LVLD+D TL++S D V+D++L E
Sbjct: 91 CLVQEAEIRPLLPPLPFASVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVPSEGK--- 147
Query: 959 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+ K RP + FL S+ FE+ ++T + Y ++ +D +G+L
Sbjct: 148 ---------TYTVSVKYRPYLEDFLRFVSRRFEIVVFTASMRAYCDKLMDEIDTQGIL 196
>gi|348575309|ref|XP_003473432.1| PREDICTED: CTD small phosphatase-like protein-like [Cavia porcellus]
Length = 294
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 98 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 153
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 154 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 203
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 204 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 242
>gi|171460950|ref|NP_598471.3| CTD small phosphatase-like protein [Mus musculus]
gi|408360295|sp|P58465.3|CTDSL_MOUSE RecName: Full=CTD small phosphatase-like protein; Short=CTDSP-like;
AltName: Full=Carboxy-terminal domain RNA polymerase II
polypeptide A small phosphatase 3; AltName: Full=NIF-like
protein; AltName: Full=Nuclear LIM interactor-interacting
factor 1; Short=NLI-interacting factor 1; AltName:
Full=Small C-terminal domain phosphatase 3; Short=SCP3;
Short=Small CTD phosphatase 3
gi|62948141|gb|AAH94289.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase-like [Mus musculus]
Length = 276
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q + + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 80 DQRQVIPVPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224
>gi|432100877|gb|ELK29230.1| CTD small phosphatase-like protein [Myotis davidii]
Length = 280
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 84 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 139
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 140 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 189
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 190 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 228
>gi|32812783|emb|CAC69078.2| Mya22 protein [Mus musculus]
Length = 276
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q + + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 80 DQRQVIPVPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224
>gi|145356819|ref|XP_001422622.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582865|gb|ABP00939.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 190
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 63/138 (45%), Gaps = 21/138 (15%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
LVLDLD TL VH + E D P +F + + RP + TF
Sbjct: 17 LVLDLDETL-----------VHSNLENTVERCDFSFPV--VFNGDMHRVNVRKRPHLSTF 63
Query: 983 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
+E SK +E+ ++T ++YA ++ +LDP R+ D DG+ KD
Sbjct: 64 MELVSKQYEIVVFTASQQIYADKLLDILDPSQKWIKHRIFR--DSCVQIDGN----FMKD 117
Query: 1043 LEGVLGME-SAVVIIDDS 1059
L VLG + S +IID+S
Sbjct: 118 LR-VLGRDLSRTIIIDNS 134
>gi|452823685|gb|EME30693.1| putative CTD small phosphatase [Galdieria sulphuraria]
Length = 397
Score = 41.2 bits (95), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 67/156 (42%), Gaps = 21/156 (13%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K LVLDLD TL++S F D +L + E ++ ++ K+RP +
Sbjct: 203 KKTLVLDLDETLVHSG-FEGSRETSDFVLSMQVEN------------TNLQLFVKMRPYL 249
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVL-DPKGVLF---AGRVISRGDDGDPFDGDE 1035
FL+ +K FE+ ++T YA + ++ D GV R+ + DP E
Sbjct: 250 KEFLQEVTKHFEIVIFTASMVTYADPVIDLMFDATGVAHIPETHRLFRESCEYDP----E 305
Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLI 1071
KDL + V+I+D+S + N N I
Sbjct: 306 TCSFHKDLMALGRDIKKVIIVDNSPTAYTKNPYNAI 341
>gi|6572954|gb|AAF17482.1|AF189774_1 NLI-interacting factor isoform T2 [Gallus gallus]
Length = 293
Score = 41.2 bits (95), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 72/170 (42%), Gaps = 33/170 (19%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +QK + L + K C+V+DLD TL++S+ P+ + E D
Sbjct: 70 EENGGLQKPPAKYLLPELTASDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 125
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD GV
Sbjct: 126 GTIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDRWGV 175
Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
A V RG+ KDL LG E S V+I+D+S
Sbjct: 176 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 212
>gi|302806322|ref|XP_002984911.1| hypothetical protein SELMODRAFT_121282 [Selaginella moellendorffii]
gi|300147497|gb|EFJ14161.1| hypothetical protein SELMODRAFT_121282 [Selaginella moellendorffii]
Length = 198
Score = 41.2 bits (95), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
RPG+ TFL S+++E+ ++T K YA + LDP G LF R+
Sbjct: 65 RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 110
>gi|302808565|ref|XP_002985977.1| hypothetical protein SELMODRAFT_123069 [Selaginella moellendorffii]
gi|300146484|gb|EFJ13154.1| hypothetical protein SELMODRAFT_123069 [Selaginella moellendorffii]
Length = 214
Score = 41.2 bits (95), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 27/43 (62%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
RPG+ TFL+ SK++E+ ++T K YA + LDP G LF
Sbjct: 81 RPGVDTFLDEMSKIYEIVVFTRAVKPYADRILDRLDPTGNLFT 123
>gi|375097582|ref|ZP_09743847.1| hypothetical protein SacmaDRAFT_4978 [Saccharomonospora marina
XMU15]
gi|374658315|gb|EHR53148.1| hypothetical protein SacmaDRAFT_4978 [Saccharomonospora marina
XMU15]
Length = 527
Score = 41.2 bits (95), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 44/71 (61%), Gaps = 4/71 (5%)
Query: 150 EIELDLESESNEKVSEQVKEEMKLINVE--SIREALESVLRGDISFEGVCSKLEFTLESL 207
E ELD+E+++ E+V ++ +EE++ + E ++RE+L+++ G++ +E V + T +
Sbjct: 96 EFELDVEAQTRERVEDETQEELQALRSEVTALRESLQALFGGEVLWERVALTAQST--RM 153
Query: 208 RELVNENNVPT 218
R L E V T
Sbjct: 154 RSLAEEPRVVT 164
>gi|145498624|ref|XP_001435299.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402430|emb|CAK67902.1| unnamed protein product [Paramecium tetraurelia]
Length = 417
Score = 41.2 bits (95), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 54/111 (48%), Gaps = 16/111 (14%)
Query: 956 REKPHRHLFRFPHMGMWTKL----RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
RE P + F G K+ RP FL++ ++L+ +++YT + YA + LD
Sbjct: 253 RENPQVTVTAFGEYGEEAKIHFNIRPFCTWFLQQMNQLYTIYVYTASSSAYANAIVNYLD 312
Query: 1012 PKGVLFAGRVISRGDDGDPFDG----DERVPKSKDLEGVLGMESAVVIIDD 1058
PK G ++SRG+ + +G D R+ +K L+ +VI+D+
Sbjct: 313 PKRQWIMG-ILSRGNCMETKNGFFIKDLRIIGNKQLKD-------MVIVDN 355
>gi|417409172|gb|JAA51106.1| Putative ctd carboxy-terminal domain rna polymer, partial [Desmodus
rotundus]
Length = 265
Score = 41.2 bits (95), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K C+V+DLD TL++S+ P+ + E D ++ + PH+
Sbjct: 95 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTTHQVYVLKRPHVD--------- 141
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
FL+R +LFE L+T YA +A +LD GV A V RG+
Sbjct: 142 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 192
Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL LG E S V+I+D+S
Sbjct: 193 ----YVKDLSR-LGRELSKVIIVDNS 213
>gi|417397992|gb|JAA46029.1| Putative carboxy-terminal domain rna polymerase ii polypeptide a
small phosphatase 1 isoform 2 [Desmodus rotundus]
Length = 260
Score = 41.2 bits (95), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ A+ K + L + K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 65 EENGAVPKTPVQYLLPEAKPQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + RP + FL+R +LFE L+T YA +A +LD G
Sbjct: 121 GVVHQVYVLK----------RPYVDEFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFA 1018
A
Sbjct: 171 FRA 173
>gi|410897359|ref|XP_003962166.1| PREDICTED: uncharacterized protein LOC101077160 [Takifugu rubripes]
Length = 934
Score = 41.2 bits (95), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 15/105 (14%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
A+K+C+V+DLD TL++S+ PV D E + ++ + PH+
Sbjct: 761 AKKICVVIDLDETLVHSS----FTPVSDADFIIPVEIEGTVHQVYVLKRPHVD------- 809
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
FL+R +LFE L+T YA ++ +LD G F R+
Sbjct: 810 ---EFLKRMGELFECVLFTASLSKYADPVSDMLDTWGA-FRNRLF 850
>gi|302806326|ref|XP_002984913.1| hypothetical protein SELMODRAFT_5868 [Selaginella moellendorffii]
gi|300147499|gb|EFJ14163.1| hypothetical protein SELMODRAFT_5868 [Selaginella moellendorffii]
Length = 173
Score = 41.2 bits (95), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
RPG+ TFL S+++E+ ++T K YA + LDP G LF R+
Sbjct: 41 RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 86
>gi|354502403|ref|XP_003513276.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Cricetulus griseus]
Length = 342
Score = 41.2 bits (95), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)
Query: 896 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
++ AI K+ + L + K + K+C+V+DLD TL++S+ PV++ E
Sbjct: 146 EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 201
Query: 955 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
D ++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 202 DGVIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 251
Query: 1015 VLFA 1018
A
Sbjct: 252 AFRA 255
>gi|145494426|ref|XP_001433207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400324|emb|CAK65810.1| unnamed protein product [Paramecium tetraurelia]
Length = 223
Score = 41.2 bits (95), Expect = 4.1, Method: Composition-based stats.
Identities = 43/168 (25%), Positives = 77/168 (45%), Gaps = 34/168 (20%)
Query: 903 KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH 962
K R RL+E + ++ LVLDLD TL++S + + I QD ++P
Sbjct: 27 KSRFVRLKESN---NRKQKILVLDLDETLIHSCTHRDFPHITITI------QDNDEPIDI 77
Query: 963 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
F +RP F++ S + ++L+T +++YA + LDPK + ++
Sbjct: 78 AF---------NVRPYCKEFIKEMSNYYTIYLFTASSEMYARAIVNHLDPKR-QYITDIL 127
Query: 1023 SRGDDGDPFDG----DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 1066
R + + +G D R+ ++DL+ +VIID+ PH+
Sbjct: 128 CRNNCFETKNGFFIKDLRIITNRDLKD-------IVIIDN----LPHS 164
>gi|145497555|ref|XP_001434766.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401894|emb|CAK67369.1| unnamed protein product [Paramecium tetraurelia]
Length = 249
Score = 41.2 bits (95), Expect = 4.2, Method: Composition-based stats.
Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 21/160 (13%)
Query: 917 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
+ ++ LVLDLD TL+ S E DE + K EK + K+R
Sbjct: 68 TEKEFTLVLDLDETLIRSEM--ERTSFLDEEIIVKIGNTIEK------------YYVKIR 113
Query: 977 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
P FL+ SK FE+ ++T K YA ++ LDP G F R R D DG
Sbjct: 114 PFARDFLKALSKYFELVIFTAALKEYADKVIDYLDPSG--FIKRRFYR-DSCTKKDG--- 167
Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
KDL V IID+S+ N N ++++ +
Sbjct: 168 -VFYKDLTKVNSNLEKTFIIDNSLSGMSLNPQNGLLIKSW 206
>gi|224044591|ref|XP_002196499.1| PREDICTED: uncharacterized protein LOC100232268 isoform 2
[Taeniopygia guttata]
gi|6572958|gb|AAF17484.1|AF189776_1 NLI-interacting factor isoform R5 [Gallus gallus]
Length = 264
Score = 41.2 bits (95), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 72/170 (42%), Gaps = 33/170 (19%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +QK + L + K C+V+DLD TL++S+ P+ + E D
Sbjct: 70 EENGGLQKPPAKYLLPELTASDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 125
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD GV
Sbjct: 126 GTIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDRWGV 175
Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
A V RG+ KDL LG E S V+I+D+S
Sbjct: 176 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 212
>gi|326921454|ref|XP_003206974.1| PREDICTED: CTD small phosphatase-like protein-like [Meleagris
gallopavo]
Length = 264
Score = 41.2 bits (95), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 72/170 (42%), Gaps = 33/170 (19%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ +QK + L + K C+V+DLD TL++S+ P+ + E D
Sbjct: 70 EENGGLQKPPAKYLLPELTASDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 125
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA +A +LD GV
Sbjct: 126 GTIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDRWGV 175
Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
A V RG+ KDL LG E S V+I+D+S
Sbjct: 176 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 212
>gi|302808545|ref|XP_002985967.1| hypothetical protein SELMODRAFT_123223 [Selaginella moellendorffii]
gi|300146474|gb|EFJ13144.1| hypothetical protein SELMODRAFT_123223 [Selaginella moellendorffii]
Length = 198
Score = 41.2 bits (95), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
RPG+ TFL S+++E+ ++T K YA + LDP G LF R+
Sbjct: 65 RPGVDTFLNEMSEIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 110
>gi|403346652|gb|EJY72729.1| NLI interacting factor-like phosphatase family protein [Oxytricha
trifallax]
Length = 368
Score = 40.8 bits (94), Expect = 4.3, Method: Composition-based stats.
Identities = 18/43 (41%), Positives = 28/43 (65%)
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 1010
M M+ LRPGI+TFL+ S+ FE+ L+ GN+ Y + K++
Sbjct: 179 QMKMFAYLRPGIYTFLDTLSEHFEIVLFNNGNQEYTENLVKLI 221
>gi|433602192|ref|YP_007034561.1| putative membrane protein [Saccharothrix espanaensis DSM 44229]
gi|407880045|emb|CCH27688.1| putative membrane protein [Saccharothrix espanaensis DSM 44229]
Length = 609
Score = 40.8 bits (94), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 45/69 (65%), Gaps = 4/69 (5%)
Query: 150 EIELDLESESNEKVSEQVKEEMKLINVE--SIREALESVLRGDISFEGVCSKLEFTLESL 207
E EL++E+E+ ++V E+ K++++ + E ++RE LE++L G++ E V + E T +
Sbjct: 99 EYELEIEAETRKRVQEEAKDDLEALRGELRALRENLEALLGGEVLVERVALRAESTR--M 156
Query: 208 RELVNENNV 216
R L +++ V
Sbjct: 157 RALSDQSRV 165
>gi|290990355|ref|XP_002677802.1| nuclear lim interactor-interacting protein [Naegleria gruberi]
gi|284091411|gb|EFC45058.1| nuclear lim interactor-interacting protein [Naegleria gruberi]
Length = 332
Score = 40.8 bits (94), Expect = 4.4, Method: Composition-based stats.
Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 14/100 (14%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
K S + LVLDLD TL++ + +P+ D + LF ++
Sbjct: 145 KELSQPDITLVLDLDETLVHCS----TEPIPDP----------DFTFTVLFHGVEYTVYV 190
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
+ RP FLE SK+FE+ ++T +YA ++ +LDP+
Sbjct: 191 RKRPYFVEFLEAVSKIFEVVVFTASQSVYADKLLSILDPE 230
>gi|426339960|ref|XP_004033903.1| PREDICTED: CTD small phosphatase-like protein [Gorilla gorilla
gorilla]
Length = 369
Score = 40.8 bits (94), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K C+V+DLD TL++S+ P+ + E D ++ + PH+
Sbjct: 168 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 214
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
FL+R +LFE L+T YA +A +LD GV A V RG+
Sbjct: 215 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 265
Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL LG E S V+I+D+S
Sbjct: 266 ----YVKDLSR-LGRELSKVIIVDNS 286
>gi|302806328|ref|XP_002984914.1| hypothetical protein SELMODRAFT_121036 [Selaginella moellendorffii]
gi|302806330|ref|XP_002984915.1| hypothetical protein SELMODRAFT_121271 [Selaginella moellendorffii]
gi|300147500|gb|EFJ14164.1| hypothetical protein SELMODRAFT_121036 [Selaginella moellendorffii]
gi|300147501|gb|EFJ14165.1| hypothetical protein SELMODRAFT_121271 [Selaginella moellendorffii]
Length = 198
Score = 40.8 bits (94), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
RPG+ TFL S+++E+ ++T K YA + LDP G LF R+
Sbjct: 65 RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 110
>gi|410906319|ref|XP_003966639.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Takifugu rubripes]
Length = 262
Score = 40.8 bits (94), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 31/121 (25%), Positives = 55/121 (45%), Gaps = 14/121 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ + K + + L K + K+C+V+DLD TL++S+ PV++ E D
Sbjct: 67 EENGTVSKVQVKPLLPPAKSKDSGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA ++ +LD G
Sbjct: 123 GTVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 172
Query: 1016 L 1016
Sbjct: 173 F 173
>gi|403278958|ref|XP_003931046.1| PREDICTED: CTD small phosphatase-like protein [Saimiri boliviensis
boliviensis]
Length = 390
Score = 40.8 bits (94), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K C+V+DLD TL++S+ P+ + E D ++ + PH+
Sbjct: 124 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 170
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
FL+R +LFE L+T YA +A +LD GV A V RG+
Sbjct: 171 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 221
Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL LG E S V+I+D+S
Sbjct: 222 ----YVKDLSR-LGRELSKVIIVDNS 242
>gi|255562534|ref|XP_002522273.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase, putative [Ricinus communis]
gi|223538526|gb|EEF40131.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase, putative [Ricinus communis]
Length = 300
Score = 40.8 bits (94), Expect = 4.9, Method: Composition-based stats.
Identities = 40/148 (27%), Positives = 69/148 (46%), Gaps = 26/148 (17%)
Query: 879 PQSAWGDVEHLFEGYDD---QQKAAIQKERTRRL----EEQKKMFSARKLCLVLDLDHTL 931
P S + + L G D +Q+ ++K+ R+ E+ + S K + LDLD TL
Sbjct: 70 PNSRYKGYKILKNGVKDRKREQEPDVEKDGICRVLFFNEKLPPLISPNKRTVFLDLDETL 129
Query: 932 LNSAKFHEVDP---VHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASK 988
++S + DP V D ++R + + M + RPG+ FLE +
Sbjct: 130 VHS----KADPPPHVFDFVVRPNIDGE------------FMNFYVLKRPGVDEFLEALAA 173
Query: 989 LFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
+E+ ++T G K YA+ + LD KG++
Sbjct: 174 KYEVVVFTAGLKAYASLVLDRLDEKGLI 201
>gi|145545436|ref|XP_001458402.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426222|emb|CAK91005.1| unnamed protein product [Paramecium tetraurelia]
Length = 423
Score = 40.8 bits (94), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 54/111 (48%), Gaps = 16/111 (14%)
Query: 956 REKPHRHLFRFPHMGMWTKL----RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
RE P + F G K+ RP FL++ ++LF ++++T + YA + LD
Sbjct: 259 RENPQVTVTVFGDYGEEAKIHFNIRPFCTWFLQQMNQLFTIYVFTASSSAYANAIVNYLD 318
Query: 1012 PKGVLFAGRVISRGDDGDPFDG----DERVPKSKDLEGVLGMESAVVIIDD 1058
PK G ++SRG+ + +G D R+ +K L+ +VI+D+
Sbjct: 319 PKKQWIMG-ILSRGNCMETKNGFFIKDLRIVGNKQLKD-------MVIVDN 361
>gi|384484378|gb|EIE76558.1| hypothetical protein RO3G_01262 [Rhizopus delemar RA 99-880]
Length = 348
Score = 40.8 bits (94), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 50/175 (28%), Positives = 75/175 (42%), Gaps = 39/175 (22%)
Query: 919 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
RKL LVLDLD TL V V +E R E D K + G L
Sbjct: 113 RKLPLVLDLDDTL--------VRLVGNENGRFVSESDIPKCKDRVAVLKD-GKRVVLTER 163
Query: 979 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-----KGVLFAGR-----VISRGDDG 1028
+ FLE A +L+++ + ++G++ Y + VLDP KG+L++ R + S D G
Sbjct: 164 VREFLEWAQQLYDISICSLGDQNYVDSVIDVLDPTRSWVKGILYSARAEHDYIRSSPDPG 223
Query: 1029 DPFDGDERVPKSKDLEGVLGM-----------ESAVVIIDDSVRVWPHNKLNLIV 1072
P KDL+ + S +I+DD R+WP + + I+
Sbjct: 224 RP---------PKDLQALYSFCALRDQTLGSGFSLPLILDDETRMWPAEQHDNII 269
>gi|348552620|ref|XP_003462125.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Cavia porcellus]
Length = 261
Score = 40.8 bits (94), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)
Query: 896 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
++ AI K+ + L + K + K+C+V+DLD TL++S+ PV++ E
Sbjct: 65 EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120
Query: 955 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
D ++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 DGVIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170
Query: 1015 VLFA 1018
A
Sbjct: 171 AFRA 174
>gi|327274307|ref|XP_003221919.1| PREDICTED: CTD small phosphatase-like protein-like [Anolis
carolinensis]
Length = 340
Score = 40.8 bits (94), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K C+V+DLD TL++S+ P+ + E D ++ + PH+
Sbjct: 170 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 216
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
FL+R +LFE L+T YA +A +LD GV A V RG+
Sbjct: 217 -EFLQRMGELFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 267
Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL LG E S V+I+D+S
Sbjct: 268 ----YVKDLSR-LGRELSKVIIVDNS 288
>gi|351699531|gb|EHB02450.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Heterocephalus glaber]
Length = 261
Score = 40.8 bits (94), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)
Query: 896 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
++ AI K+ + L + K + K+C+V+DLD TL++S+ PV++ E
Sbjct: 65 EENGAIPKQSPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120
Query: 955 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
D ++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170
Query: 1015 VLFA 1018
A
Sbjct: 171 AFRA 174
>gi|10864009|ref|NP_067021.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 1 [Homo sapiens]
gi|397495662|ref|XP_003818666.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Pan paniscus]
gi|402889395|ref|XP_003908002.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Papio anubis]
gi|426338589|ref|XP_004033258.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Gorilla gorilla gorilla]
gi|17865510|sp|Q9GZU7.1|CTDS1_HUMAN RecName: Full=Carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1; AltName: Full=Nuclear LIM
interactor-interacting factor 3; Short=NLI-IF;
Short=NLI-interacting factor 3; AltName: Full=Small
C-terminal domain phosphatase 1; Short=SCP1; Short=Small
CTD phosphatase 1
gi|10257407|gb|AAG15402.1|AF229162_1 nuclear LIM interactor-interacting factor [Homo sapiens]
gi|10257410|gb|AAG15404.1| nuclear LIM interactor-interacting factor [Homo sapiens]
gi|15278033|gb|AAH12977.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1 [Homo sapiens]
gi|119591021|gb|EAW70615.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1, isoform CRA_a [Homo sapiens]
gi|119591024|gb|EAW70618.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1, isoform CRA_a [Homo sapiens]
gi|167773945|gb|ABZ92407.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1 [synthetic construct]
gi|208966090|dbj|BAG73059.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1 [synthetic construct]
Length = 261
Score = 40.8 bits (94), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)
Query: 896 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
++ AI K+ + L + K + K+C+V+DLD TL++S+ PV++ E
Sbjct: 65 EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120
Query: 955 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
D ++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170
Query: 1015 VLFA 1018
A
Sbjct: 171 AFRA 174
>gi|15220552|ref|NP_174271.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
gi|124301096|gb|ABN04800.1| At1g29780 [Arabidopsis thaliana]
gi|332193007|gb|AEE31128.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
Length = 221
Score = 40.8 bits (94), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 81/163 (49%), Gaps = 24/163 (14%)
Query: 908 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
+LE+ ++ K ++LDLD TL++ A H HD ++ K E++
Sbjct: 37 KLEDPLTGYTNMKRTIILDLDETLVH-ATTHLPGVKHDFMVMVKMEREI----------- 84
Query: 968 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
M ++ RPG+ FLER + +++ ++T G + YA+++ LD GV+ + R+ D
Sbjct: 85 -MPIFVVKRPGVTEFLERLGENYKVVVFTAGLEEYASQVLDKLDKNGVI-SQRLYR--DS 140
Query: 1028 GDPFDGDERVPKSKDLEGVLG--MESAVVIIDD--SVRVWPHN 1066
+G KDL V+G + SA+++ D+ S + P N
Sbjct: 141 CTEVNG----KYVKDLSLVVGKDLRSALIVDDNPSSYSLQPEN 179
>gi|392578955|gb|EIW72082.1| hypothetical protein TREMEDRAFT_41494 [Tremella mesenterica DSM 1558]
Length = 193
Score = 40.4 bits (93), Expect = 5.6, Method: Composition-based stats.
Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 14/96 (14%)
Query: 922 CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
CLVLDLD TLL+S+ ++ P D I+ + E ++ RPG+
Sbjct: 27 CLVLDLDETLLHSS--FKMLPSADYIVPVEIEGQVHN------------VYVIKRPGVDR 72
Query: 982 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
FL K++E+ ++T YA + +LDP GV+
Sbjct: 73 FLYEMGKIYEVVVFTASLSKYADPVLDMLDPNGVVL 108
>gi|296205578|ref|XP_002749828.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Callithrix jacchus]
Length = 261
Score = 40.4 bits (93), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)
Query: 896 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
++ AI K+ + L + K + K+C+V+DLD TL++S+ PV++ E
Sbjct: 65 EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120
Query: 955 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
D ++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170
Query: 1015 VLFA 1018
A
Sbjct: 171 AFRA 174
>gi|301757683|ref|XP_002914696.1| PREDICTED: CTD small phosphatase-like protein-like [Ailuropoda
melanoleuca]
Length = 283
Score = 40.4 bits (93), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K C+V+DLD TL++S+ P+ + E D ++ + PH+
Sbjct: 113 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 159
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
FL+R +LFE L+T YA +A +LD GV A V RG+
Sbjct: 160 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 210
Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL LG E S V+I+D+S
Sbjct: 211 ----YVKDLSR-LGRELSKVIIVDNS 231
>gi|332216348|ref|XP_003257311.1| PREDICTED: CTD small phosphatase-like protein [Nomascus leucogenys]
Length = 341
Score = 40.4 bits (93), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K C+V+DLD TL++S+ P+ + E D ++ + PH+
Sbjct: 140 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 186
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
FL+R +LFE L+T YA +A +LD GV A V RG+
Sbjct: 187 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 237
Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL LG E S V+I+D+S
Sbjct: 238 ----YVKDLSR-LGRELSKVIIVDNS 258
>gi|56549683|ref|NP_001008393.1| CTD small phosphatase-like protein isoform 1 [Homo sapiens]
gi|114586014|ref|XP_001170981.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase-like isoform 2 [Pan
troglodytes]
gi|51704233|sp|O15194.2|CTDSL_HUMAN RecName: Full=CTD small phosphatase-like protein; Short=CTDSP-like;
AltName: Full=Carboxy-terminal domain RNA polymerase II
polypeptide A small phosphatase 3; AltName: Full=NIF-like
protein; AltName: Full=Nuclear LIM interactor-interacting
factor 1; Short=NLI-interacting factor 1; AltName:
Full=Protein YA22; Short=hYA22; AltName: Full=RBSP3;
AltName: Full=Small C-terminal domain phosphatase 3;
Short=SCP3; Short=Small CTD phosphatase 3
gi|34392247|emb|CAE11805.1| RB serine phosphatase [Homo sapiens]
gi|410228144|gb|JAA11291.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase-like [Pan troglodytes]
gi|410291074|gb|JAA24137.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase-like [Pan troglodytes]
gi|410334185|gb|JAA36039.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase-like [Pan troglodytes]
Length = 276
Score = 40.4 bits (93), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)
Query: 894 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
D +Q I + L + + K C+V+DLD TL++S+ P+ + E
Sbjct: 80 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
D ++ + PH+ FL+R +LFE L+T YA +A +LD
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185
Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
GV A V RG+ KDL LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224
>gi|302806318|ref|XP_002984909.1| hypothetical protein SELMODRAFT_423987 [Selaginella moellendorffii]
gi|300147495|gb|EFJ14159.1| hypothetical protein SELMODRAFT_423987 [Selaginella moellendorffii]
Length = 214
Score = 40.4 bits (93), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
RPG+ TFL S+++E+ ++T K YA + LDP G LF R+
Sbjct: 81 RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 126
>gi|308321688|gb|ADO27995.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Ictalurus furcatus]
Length = 264
Score = 40.4 bits (93), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 54/121 (44%), Gaps = 14/121 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ I K + L Q K K+C+V+DLD TL++S+ PV++ E D
Sbjct: 69 EENGTISKVPAKPLLPQIKSKDVGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 124
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA ++ +LD G
Sbjct: 125 GAVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 174
Query: 1016 L 1016
Sbjct: 175 F 175
>gi|403266876|ref|XP_003925586.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Saimiri boliviensis
boliviensis]
Length = 248
Score = 40.4 bits (93), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 60/128 (46%), Gaps = 16/128 (12%)
Query: 896 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
++ AI K+ + L + K + K+C+V+DLD TL++S+ PV++ E
Sbjct: 52 EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 107
Query: 955 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
D ++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 108 DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 157
Query: 1015 VLFAGRVI 1022
F R+
Sbjct: 158 A-FRARLF 164
>gi|397512000|ref|XP_003826348.1| PREDICTED: CTD small phosphatase-like protein [Pan paniscus]
Length = 326
Score = 40.4 bits (93), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K C+V+DLD TL++S+ P+ + E D ++ + PH+
Sbjct: 125 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 171
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
FL+R +LFE L+T YA +A +LD GV A V RG+
Sbjct: 172 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 222
Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL LG E S V+I+D+S
Sbjct: 223 ----YVKDLSR-LGRELSKVIIVDNS 243
>gi|302806324|ref|XP_002984912.1| hypothetical protein SELMODRAFT_48489 [Selaginella moellendorffii]
gi|300147498|gb|EFJ14162.1| hypothetical protein SELMODRAFT_48489 [Selaginella moellendorffii]
Length = 171
Score = 40.4 bits (93), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
RPG+ TFL S+++E+ ++T K YA + LDP G LF R+
Sbjct: 40 RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPVGNLFTHRL 85
>gi|302806336|ref|XP_002984918.1| hypothetical protein SELMODRAFT_424005 [Selaginella moellendorffii]
gi|300147504|gb|EFJ14168.1| hypothetical protein SELMODRAFT_424005 [Selaginella moellendorffii]
Length = 199
Score = 40.4 bits (93), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 976 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
RPG+ TFL S+++E+ ++T K YA + LDP G LF R+
Sbjct: 66 RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 111
>gi|301608836|ref|XP_002933982.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 [Xenopus (Silurana) tropicalis]
Length = 260
Score = 40.4 bits (93), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 58/127 (45%), Gaps = 15/127 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ ++ K + L + K A K+C+V+DLD TL++S+ PV++ E +
Sbjct: 65 EENGSVPKSSVKYLLPEVKAQDAGKICVVIDLDETLVHSS----FKPVNNADFIIPVEIE 120
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL R ++FE L+T YA +A +LD G
Sbjct: 121 GTVHQVYVLKRPHVD----------EFLRRMGEMFECVLFTASLAKYADPVADLLDKWGA 170
Query: 1016 LFAGRVI 1022
F R+
Sbjct: 171 -FRSRLF 176
>gi|395734008|ref|XP_002813985.2| PREDICTED: LOW QUALITY PROTEIN: CTD small phosphatase-like protein
[Pongo abelii]
Length = 336
Score = 40.4 bits (93), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K C+V+DLD TL++S+ P+ + E D ++ + PH+
Sbjct: 135 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 181
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
FL+R +LFE L+T YA +A +LD GV A V RG+
Sbjct: 182 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 232
Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL LG E S V+I+D+S
Sbjct: 233 ----YVKDLSR-LGRELSKVIIVDNS 253
>gi|114583310|ref|XP_001156881.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Pan troglodytes]
Length = 261
Score = 40.4 bits (93), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)
Query: 896 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
++ AI K+ + L + K + K+C+V+DLD TL++S+ PV++ E
Sbjct: 65 EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120
Query: 955 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
D ++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 121 DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170
Query: 1015 VLFA 1018
A
Sbjct: 171 AFRA 174
>gi|431917984|gb|ELK17213.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Pteropus alecto]
Length = 261
Score = 40.4 bits (93), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K+C+V+DLD TL++S+ PV++ E D ++ + PH+
Sbjct: 90 KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
FL+R +LFE L+T YA +A +LD G A
Sbjct: 137 -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174
>gi|63101171|gb|AAH95870.1| Zgc:113169 [Danio rerio]
Length = 230
Score = 40.4 bits (93), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 54/121 (44%), Gaps = 14/121 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ I K + L Q K K+C+V+DLD TL++S+ PV++ E D
Sbjct: 70 EENGTISKVPAKPLLPQIKSKDVGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 125
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA ++ +LD G
Sbjct: 126 GTVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 175
Query: 1016 L 1016
Sbjct: 176 F 176
>gi|426249781|ref|XP_004018627.1| PREDICTED: CTD small phosphatase-like protein [Ovis aries]
Length = 255
Score = 40.4 bits (93), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K C+V+DLD TL++S+ P+ + E D ++ + PH+
Sbjct: 85 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 131
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
FL+R +LFE L+T YA +A +LD GV A V RG+
Sbjct: 132 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 182
Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
KDL LG E S V+I+D+S
Sbjct: 183 ----YVKDLSR-LGRELSKVIIVDNS 203
>gi|118371686|ref|XP_001019041.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89300808|gb|EAR98796.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 379
Score = 40.4 bits (93), Expect = 7.1, Method: Composition-based stats.
Identities = 28/103 (27%), Positives = 51/103 (49%), Gaps = 14/103 (13%)
Query: 914 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
KM+ LVLDLD TL++ +E D + + Q+++K + +
Sbjct: 203 KMYEEHPNNLVLDLDETLIHC---NEKSLNDDSSIITVQFQNQQKNY-----------YL 248
Query: 974 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
R + FLE+ + F +++YT + YA E+ K++DP+ V+
Sbjct: 249 HQRGYLQEFLEQCALNFNIYIYTASTRDYAEEVVKIIDPRSVI 291
>gi|340723842|ref|XP_003400297.1| PREDICTED: CTD nuclear envelope phosphatase 1 homolog [Bombus
terrestris]
Length = 286
Score = 40.4 bits (93), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 68/152 (44%), Gaps = 21/152 (13%)
Query: 923 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHRHLFRFPHMGMWTKLRPGI 979
LVLDLD TL++S D V +R D + K RH RF + RP +
Sbjct: 106 LVLDLDETLIHSHH----DGVARPTVRFGTPPDFILKVKIDRHPVRF-----FVHKRPHV 156
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039
FL+ S+ +E+ ++T ++Y +A+ LD R I R E
Sbjct: 157 DFFLDIVSQWYELVVFTASMEIYGAAVAEKLD------NNRGILRRRYYRQHCTPEMGSY 210
Query: 1040 SKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
+KDL + ++V I+D+S R +PHN +
Sbjct: 211 TKDLSAICSDLASVFILDNSPGAYRAYPHNAI 242
>gi|440911023|gb|ELR60752.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Bos grunniens mutus]
Length = 261
Score = 40.0 bits (92), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K+C+V+DLD TL++S+ PV++ E D ++ + PH+
Sbjct: 90 KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
FL+R +LFE L+T YA +A +LD G A
Sbjct: 137 -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174
>gi|399215866|emb|CCF72554.1| unnamed protein product [Babesia microti strain RI]
Length = 248
Score = 40.0 bits (92), Expect = 7.8, Method: Composition-based stats.
Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 15/123 (12%)
Query: 895 DQQKAAIQKERTRRLEEQKKMFSARK-LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
+ + Q + + L +K + S +K LVLDLD TL++S + + ++ E
Sbjct: 30 EANRPTFQTQLKKFLTSEKPVTSGKKKFTLVLDLDETLIHSEFVTDGNHSFSTTIKNDTE 89
Query: 954 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
++ RP FLE+ +KLFE+ ++T G++ YA + +LD
Sbjct: 90 NQT--------------IYVYKRPYADEFLEQVAKLFEVVIFTAGSEPYAKAVIDILDKN 135
Query: 1014 GVL 1016
V+
Sbjct: 136 KVV 138
>gi|126337836|ref|XP_001365381.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Monodelphis domestica]
Length = 346
Score = 40.0 bits (92), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 53/119 (44%), Gaps = 14/119 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ + K + L + K K+C+V+DLD TL++S+ PV + E D
Sbjct: 151 EENGTVPKAPVKYLLPEAKAQDLGKICVVIDLDETLVHSS----FKPVSNADFIIPVEID 206
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
++ + PH+ FL+R +LFE L+T YA +A +LD G
Sbjct: 207 GMVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 255
>gi|344268533|ref|XP_003406112.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Loxodonta africana]
Length = 261
Score = 40.0 bits (92), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K+C+V+DLD TL++S+ PV++ E D ++ + PH+
Sbjct: 90 KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
FL+R +LFE L+T YA +A +LD G A
Sbjct: 137 -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174
>gi|145529824|ref|XP_001450695.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418317|emb|CAK83298.1| unnamed protein product [Paramecium tetraurelia]
Length = 308
Score = 40.0 bits (92), Expect = 8.4, Method: Composition-based stats.
Identities = 27/95 (28%), Positives = 48/95 (50%), Gaps = 15/95 (15%)
Query: 918 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
+RK+C VLDLD TL++S + K E D + + + ++ +RP
Sbjct: 54 SRKVC-VLDLDETLVHS--------------QFKAENDHDFSLDIIVQSQLFKVYVTVRP 98
Query: 978 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
G+ F++ S+ FE+ ++T K YA + ++DP
Sbjct: 99 GVENFIDTLSEYFEVIMWTASLKEYADPVMDIIDP 133
>gi|23346509|ref|NP_694728.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Mus musculus]
gi|17865506|sp|P58466.1|CTDS1_MOUSE RecName: Full=Carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1; AltName: Full=Golli-interacting
protein; Short=GIP; AltName: Full=Nuclear LIM
interactor-interacting factor 3; Short=NLI-interacting
factor 3; AltName: Full=Small C-terminal domain
phosphatase 1; Short=SCP1; Short=Small CTD phosphatase 1
gi|15145799|gb|AAK83555.1| golli-interacting protein [Mus musculus]
gi|40796195|gb|AAH65158.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1 [Mus musculus]
gi|51258970|gb|AAH79638.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1 [Mus musculus]
gi|57169202|gb|AAH49184.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1 [Mus musculus]
gi|74191312|dbj|BAE39480.1| unnamed protein product [Mus musculus]
gi|148667908|gb|EDL00325.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
phosphatase 1, isoform CRA_a [Mus musculus]
Length = 261
Score = 40.0 bits (92), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K+C+V+DLD TL++S+ PV++ E D ++ + PH+
Sbjct: 90 KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
FL+R +LFE L+T YA +A +LD G A
Sbjct: 137 -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174
>gi|189303571|ref|NP_001121551.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Rattus norvegicus]
gi|149016108|gb|EDL75354.1| rCG23761 [Rattus norvegicus]
gi|171846749|gb|AAI61976.1| Ctdsp1 protein [Rattus norvegicus]
Length = 261
Score = 40.0 bits (92), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K+C+V+DLD TL++S+ PV++ E D ++ + PH+
Sbjct: 90 KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
FL+R +LFE L+T YA +A +LD G A
Sbjct: 137 -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174
>gi|318037543|ref|NP_001188083.1| carboxy-terminal domain RNA polymerase II polypeptide a small
phohatase 1 [Ictalurus punctatus]
gi|308323757|gb|ADO29014.1| carboxy-terminal domain RNA polymerase II polypeptide a small
phohatase 1 [Ictalurus punctatus]
Length = 264
Score = 40.0 bits (92), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 54/121 (44%), Gaps = 14/121 (11%)
Query: 896 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
++ I K + L Q K K+C+V+DLD TL++S+ PV++ E D
Sbjct: 69 EENGTISKVPAKPLLPQIKSKDVGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 124
Query: 956 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
++ + PH+ FL+R +LFE L+T YA ++ +LD G
Sbjct: 125 GAVHQVYVLKRPHVD----------EFLKRMGELFERVLFTASLAKYADPVSDLLDKWGA 174
Query: 1016 L 1016
Sbjct: 175 F 175
>gi|15242476|ref|NP_199382.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|9758673|dbj|BAB09212.1| unnamed protein product [Arabidopsis thaliana]
gi|332007902|gb|AED95285.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 272
Score = 40.0 bits (92), Expect = 8.8, Method: Composition-based stats.
Identities = 44/153 (28%), Positives = 70/153 (45%), Gaps = 25/153 (16%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E K F K +VLDLD TL++S+ P +D ++ K + +
Sbjct: 87 ERSGKSFDETKKTIVLDLDETLVHSSMEKPEVP-YDFVVNPKIDGQI------------L 133
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 1026
+ RPG+ FL++ + +++ ++T G + YA+ + LDP+ RVISR D
Sbjct: 134 TFFVIKRPGVDEFLKKIGEKYQIVVFTAGLREYASLVLDKLDPE-----RRVISRSFYRD 188
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 1059
DG KDL V+ VVI+DD+
Sbjct: 189 ACSEIDGR----LVKDLGFVMRDLRRVVIVDDN 217
>gi|297794689|ref|XP_002865229.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297311064|gb|EFH41488.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 272
Score = 40.0 bits (92), Expect = 8.9, Method: Composition-based stats.
Identities = 44/153 (28%), Positives = 70/153 (45%), Gaps = 25/153 (16%)
Query: 910 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
E K F K +VLDLD TL++S+ P +D ++ K + +
Sbjct: 87 ERSDKSFDETKKTIVLDLDETLVHSSMEKPEVP-YDFVVNPKIDGQI------------L 133
Query: 970 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 1026
+ RPG+ FL++ + +++ ++T G + YA+ + LDP+ RVISR D
Sbjct: 134 TFFVIKRPGVDEFLKKIGEKYQIVVFTAGLREYASLVLDKLDPE-----RRVISRSFYRD 188
Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 1059
DG KDL V+ VVI+DD+
Sbjct: 189 ACSEIDGR----LVKDLGFVMRDLRRVVIVDDN 217
>gi|355681366|gb|AER96785.1| CTD small phosphatase 1 [Mustela putorius furo]
Length = 260
Score = 40.0 bits (92), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K+C+V+DLD TL++S+ PV++ E D ++ + PH+
Sbjct: 90 KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
FL+R +LFE L+T YA +A +LD G A
Sbjct: 137 -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174
>gi|281340231|gb|EFB15815.1| hypothetical protein PANDA_001554 [Ailuropoda melanoleuca]
Length = 243
Score = 40.0 bits (92), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K+C+V+DLD TL++S+ PV++ E D ++ + PH+
Sbjct: 72 KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 118
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
FL+R +LFE L+T YA +A +LD G A
Sbjct: 119 -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 156
>gi|224000223|ref|XP_002289784.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974992|gb|EED93321.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 179
Score = 40.0 bits (92), Expect = 9.2, Method: Composition-based stats.
Identities = 32/97 (32%), Positives = 49/97 (50%), Gaps = 14/97 (14%)
Query: 920 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
K CLVLDLD TL++S+ F V P D ++ + E + F ++ RPG+
Sbjct: 12 KKCLVLDLDETLVHSS-FRAV-PGADFVIPVQIED--------VVHF----VYVAKRPGV 57
Query: 980 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
FL +K +E+ +YT YA + +LDP V+
Sbjct: 58 DEFLTEMAKHYEIVVYTASLNKYADPLLDLLDPNRVI 94
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.313 0.130 0.374
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 20,236,606,595
Number of Sequences: 23463169
Number of extensions: 931617807
Number of successful extensions: 2623123
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 700
Number of HSP's successfully gapped in prelim test: 1505
Number of HSP's that attempted gapping in prelim test: 2611979
Number of HSP's gapped (non-prelim): 9547
length of query: 1234
length of database: 8,064,228,071
effective HSP length: 155
effective length of query: 1079
effective length of database: 8,722,404,172
effective search space: 9411474101588
effective search space used: 9411474101588
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 83 (36.6 bits)